Apache Lucene Software Library

From GM-RKB
(Redirected from Lucene system)
Jump to navigation Jump to search

An Apache Lucene Software Library is an Apache Java-based document search software library.



References

2011

2010

  • (McCandless et al., 2010) ⇒ Michael McCandless, Erik Hatcher, and Otis Gospodnetić. (2004). “Lucene in Action, 2nd edition.” Manning Publications. ISBN:1933988177
    • QUOTE:... Lucene itself is just a JAR, with the real fun and power coming from what you build around it. … Although Java is the primary language used with Lucene, the index format is language netural.

2009a

2009b

  • http://lucene.apache.org/java/docs/features.html
    • Scalable, High-Performance Indexing
      • over 20MB/minute on Pentium M 1.5GHz
      • small RAM requirements -- only 1MB heap
      • incremental indexing as fast as batch indexing
      • index size roughly 20-30% the size of text indexed
    • Powerful, Accurate and Efficient Search Algorithms
      • ranked searching -- best results returned first
      • many powerful query types: phrase queries, wildcard queries, proximity queries, range queries and more
      • fielded searching (e.g., title, author, contents)
      • date-range searching
      • sorting by any field
      • multiple-index searching with merged results
      • allows simultaneous update and searching
    • Cross-Platform Solution
      • Available as Open Source software under the Apache License which lets you use Lucene in both commercial and Open Source programs
      • 100%-pure Java
      • Implementations in other programming languages available that are index-compatible

2004