Newer Versions

v2.1
v2.0

Older Versions

v1.6
v1.7

LucidWorks Enterprise v1.8

Support Resources

LucidWorks Forum
KnowledgeBase

This is the documentation for LucidWorks Enterprise v1.8. The most current release is v2.1.

compared with
Key
This line was removed.
This word was removed. This word was added.
This line was added.

Changes (2)

View Page History
*lucid.fs / Remote or Pseudo Filesystems*


|| Key || Type || Description ||
| url | string | Root URL formats vary by filesystem type: \\ \\ For CIFS (Windows Shares) filesystems, the root URL includes the protocol ({{smb}}), the host address, and the path to crawl: {{smb://_host_/path/to/crawl}}. \\ \\ For FTP, the root URL is a fully qualified FTP URL, with optional username and password parameters. Credentials can be passed as a part of the URL, or submitted as {{username}} and {{password}} properties. For example, {{ftp://<username>@<password>:<hostname>:<port>/path/to/crawl}}. \\ \\ For HDFS (Hadoop), the root URL is a fully-qualified Hadoop file system URL, including the protocol ({{hdfs}}), host name and port of the namenode, and path of the target resource to crawl: {{hdfs://namenode:9000/path/to/crawl}}. \\ \\ For S3n (Amazon) and S3 (Hadoop over Amazon), the root URL is a fully-qualified URL that starts with the {{s3n}} protocol, the name of the bucket, and the path inside the bucket. Both {{AccessKeyId}} and {{SecretAccessKey}} are needed: submit {{AccessKeyId}} as the username and {{SecretAccessKey}} as the password. You can also pass these credentials as part of the URL in the following format: {{s3n://<username>@<password>:bucket/path}} . However, Amazon S3 credentials often contain characters that are not allowed in URLs. In that case, you must pass these credentials by setting the "username" and "password" properties explicitly. |
|type|string| One of supported data source types, MUST be consistent with the root URL's protocol. The following values are supported: file, smb, hdfs, s3n, s3, kfs|
| follow_links | boolean | Indicates whether to follow symbolic links in the file system. |
| bounds | string | Either "tree" to limit the crawl to a strict subtree, or "none" for no limits. |