Lucene
From Wikipedia, the free encyclopedia
Developed by | Apache Software Foundation |
---|---|
Latest release | 2.4.1 / 2009-03-09 |
Written in | Java |
Operating system | Cross-platform |
Type | Search and index |
License | Apache License 2.0 |
Website | http://lucene.apache.org |
Lucene is a free/open source information retrieval library, originally created in Java by Doug Cutting. It is supported by the Apache Software Foundation and is released under the Apache Software License.
Lucene has been ported to other programming languages including Delphi, Perl, C#, C++, Python, Ruby and PHP.
Contents |
[edit] Features and common use
While suitable for any application which requires full text indexing and searching capability, Lucene has been widely recognized for its utility in the implementation of Internet search engines and local, single-site searching.
At the core of Lucene's logical architecture is the idea of a document containing fields of text. This flexibility allows Lucene's API to be independent of the file format. Text from PDFs, HTML, Microsoft Word, and OpenDocument documents, as well as many others can all be indexed so long as their textual information can be extracted.
[edit] Lucene-based projects
Lucene itself is just an indexing and search library and does not contain crawling and HTML parsing functionality. The Apache project Nutch is based on Lucene and provides this functionality; the Apache project Solr is a fully-featured search server based on Lucene; Compass is a Java Search Engine Framework built on the top of Lucene.
[edit] See also
[edit] References
- Gospodnetic, Otis; Erik Hatcher (2004). Lucene in Action. Manning Publications. ISBN 978-1-932394-28-3. http://www.manning.com/hatcher2/.
- Gospodnetic, Otis; Erik Hatcher,Michael McCandless (2009). Lucene in Action, 2nd edition. Manning Publications. ISBN 1933988177. http://www.manning.com/hatcher3/.
[edit] External links
- Lucene homepage
- Chris Conrad. "Behind the Scenes of the SourceForge.net Search System". http://blog.dev.sf.net/index.php?/archives/10-Behind-the-Scenes-of-the-SourceForge.net-Search-System.html.[dead link]
- "Lucene Wikipedia indexer". http://schmidt.devlib.org/software/lucene-wikipedia.html.[dead link] — introductory article with Java code for search on Wikipedia data , This page on internet archive
- "Full-Text Search for Database Using Lucene Search Engine". http://blog.scalingweb.com/2007/11/03/full-text-search-for-database-using-lucene-search-engine/.[dead link]
- Compass project
- EB-eye EBI's Lucene based search engine.
- Lucene FAQ
- Luke - Lucene Index Toolbox
- LuSql - Index database records using Lucene
|