Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform. Apache Lucene is an open source project available for free download.
JanusGraph supports Apache Lucene as a single-machine, embedded index backend. Lucene has a slightly extended feature set and performs better in small-scale applications compared to Elasticsearch, but is limited to single-machine deployments.
Lucene Embedded Configuration
For single machine deployments, Lucene runs embedded with JanusGraph. JanusGraph starts and interfaces with Lucene internally.
To run Lucene embedded, add the following configuration options to the
graph configuration file where
/data/searchindex specifies the
directory where Lucene should store the index data:
In the above configuration, the index backend is named
search by a different name to change the name of the index.
- Full-Text: Supports all
Textpredicates to search for text properties that matches a given word, prefix or regular expression.
- Geo: Supports
Geopredicates to search for geo properties that are intersecting, within, or contained in a given query geometry. Supports points, lines and polygons for indexing. Supports circles and boxes for querying point properties and all shapes for querying non-point properties.
- Numeric Range: Supports all numeric comparisons in
- Collections: Supports indexing SET and LIST cardinality properties, except
- Temporal: Nanosecond granularity temporal indexing.
- Custom Analyzer: Choose to use a custom analyzer
- Not Query-normal-form: Supports queries other than Query-normal-form (QNF). QNF for JanusGraph is a variant of CNF (conjunctive normal form) with negation inlined where possible.
Refer to Configuration Reference for a complete listing of all Lucene specific configuration options in addition to the general JanusGraph configuration options.
Note, that each of the index backend options needs to be prefixed with
[INDEX-NAME] stands for the name of the
index backend. For instance, if the index backend is named search then
these configuration options need to be prefixed with
configure an index backend named search to use Lucene as the index
system, set the following configuration option:
- Please refer to the Apache Lucene homepage and available documentation for more information on Lucene.