Support Resources

LucidWorks Forum
KnowledgeBase

LucidWorks Platform v2.0

PDF Version

Older Versions

LWE Guide 1.8
LWE Guide 1.7
LWE Guide 1.6

This is the documentation for LucidWorks Platform v2.0, the latest release is v2.1.

Skip to end of metadata
Go to start of metadata

Where possible, terms are linked to relevant parts of the documentation for more information. 


Jump to a letter:

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

A


Alerts
An alert allows a user to save searches. There are two types: active, which will send notifications when new results are found, and passive, which do not send notifications.


Auto-Complete
A way to provide users suggestions for possible matching queries before they have finished typing. In LucidWorks Enterprise, this relies on an index of terms to be created on a regular basis by scheduling it as an activity.

B


Boolean Operators
These control the inclusion or exclusion of keywords in a query by using operators such as AND, OR, and NOT.

C


Click Scoring Relevance Framework
A method of changing the relevance ranking of a document based on the number of times other users have clicked on the same document. 


Collection
One or more documents grouped together for the purposes of searching. See also Document.


Component
A part of LucidWorks Enterprise that has been designed to stand alone or can be run independently from other components. LucidWorks Enterprise has two main components: LWE Core, which runs Solr, indexing, and other critical application functions and LWE UI, which runs the Administrative UI, the front-end search interface, and the alerting functionality.


Connector
A connector is a program or piece of code that allows a connection to be made to a data source and content to be extracted from it.


Crawler
Also known as a "spider", this is a program that is able to retrieve web pages from internal or external web servers.

D


Data Source
Defines the metadata required to connect to a location containing content to be indexed. It could be a file system path, a Web URL, a JDBC connection, or some other set of values.


Distributed Index
A distributed index is one where the search index for a collection is spread across more than one shard.


Distributed Search
Distributed search is one where queries are processed across more than one shard.


Document
One or more Fields. See also Field.

F


Field
The content to be indexed/searched along with metadata defining how the content should be processed by LucidWorks Enterprise.

I


Inverse Document Frequency (IDF)
A measure of the general importance of a term. It is calculated as the number of total Documents divided by the number of Documents that a particular word occurs in the collection. See http://en.wikipedia.org/wiki/Tf-idf and http://lucene.apache.org/java/2_3_2/scoring.html for more info on TF-IDF based scoring and Lucene scoring in particular. See also Term Frequency.

Inverted Index
A way of creating a searchable index that lists every word and the documents that contain those words, similar to an index in the back of a book which lists words and the pages on which they can be found. When performing keyword searches, this method is considered more efficient than the alternative, which would be to create a list of documents paired with every word used in each document. Since users search using terms they expect to be in documents, finding the term before the document saves processing resources and time.

M


Metadata
Literally, data about data. Metadata is information about a document, such as it's title, author, or location.

N


Natural Language Query
A search that is entered as a user would normally speak or write, as in, "What is aspirin?"

Q


Query Parser
A query parser processes the terms entered by a user.

R


Recall
The ability of a search engine to retrieve all of the possible matches to a user's query.


Relevance
The appropriateness of a document to the search conducted by the user.


Replication
A method of copying a master index from one server to one or more "slave" or "child" servers. In LucidWorks Enterprise, the master continues to manage updates to the index, while queries are handled by the slaves. This approach enables LucidWorks Enterprise to properly manage query load and ensure responsiveness.


REST API
An alternative way of controlling LucidWorks Enterprise without accessing the user interface.

S


Shard
A method of partitioning a database or search engine to maximize performance and efficiency.


SolrCloud  
Ongoing work within the Solr community to improve Solr's ability to operate in a cloud environment.


Solr Schema (schema.xml)
The Apache Solr index schema. The schema defines the fields to be indexed and the type for the field (text, integers, etc.) The schema is stored in schema.xml and is located in the Solr home conf directory.


Solr Config (solrconfig.xml)
The Apache Solr configuration file. Defines indexing options, RequestHandlers, highlighting, spellchecking and various other configurations. The file, solrconfig.xml is located in the Solr home conf directory.


Spell Check
The ability to suggest alternative spellings of search terms to a user, as a check against spelling errors causing few or zero results.  In LucidWorks Enterprise, effective spell checking requires an index to be built on a regular basis by scheduling it as an [activity].


Stopwords
Generally, words that have little meaning to a user's search but which may have been entered as part of a natural language query. Stopwords are generally very small pronouns, conjunctions and prepositions (such as, "the", "with", or "and")


Synonyms
Synonyms generally are terms which are near to each other in meaning and may substitute for one another. In a search engine implementation, synonyms may be abbreviations as well as words, or terms that are not consistently hyphenated. Examples of synonyms in this context would be "Inc." and "Incorporated" or "iPod" and "i-pod".

T


Term Frequency
The number of times a word occurs in a given document. See http://en.wikipedia.org/wiki/Tf-idf and http://lucene.apache.org/java/2_3_2/scoring.html for more info on TF-IDF based scoring and Lucene scoring in particular.
See also Inverse Document Frequency (IDF).

W


Wildcard
A wildcard allows a substitution of one or more letters of a word to account for possible variations in spelling or tenses. In LucidWorks Enterprise, there are two ways to use them. One is to use an asterisk (*) at the end of a term to find all documents that contain words that start with that pattern. For example, paint* would find paint, painter and painting. A second way is to use a question mark (?) in the middle of a term to substitute for one character in that term. Such as, c?t would find cat, cot and cut. It's also possible to use wildcards at the start of a term in the same way - either to replace a single letter (using the ? symbol) or to find documents that contain words that end with a pattern using a *. For example, *sphere would find ecosphere and stratosphere.

Labels

lwdg lwdg Delete
copy copy Delete
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.