Lucene

From Wikipedia, the free encyclopedia

(Redirected from Clucene)
Jump to: navigation, search
Lucene
Lucene logo
Developer Apache Software Foundation
Latest release 2.2.0 / 19 June 2007
OS Cross-platform
Genre Search and index API.
License Apache Software License
Website lucene.apache.org

Lucene is a free/open source information retrieval library, originally implemented in Java by Doug Cutting. It is supported by the Apache Software Foundation and is released under the Apache Software License. Lucene has been ported to programming languages including Delphi, Perl, C#, C++, Python, Ruby and PHP.

While suitable for any application which requires full text indexing and searching capability, Lucene has been widely recognized for its utility in the implementation of Internet search engines and local, single-site searching. Lucene itself is just an indexing and search library and does not contain crawling and HTML parsing functionality. The Apache project Nutch is based on Lucene and provides this functionality; the Apache project Solr is a fully-featured search server based on Lucene.

At the core of Lucene's logical architecture is a notion of a document containing fields of text. This flexibility allows Lucene's API to be agnostic of file format. Text from PDFs, HTML, Microsoft Word documents, as well as many others can all be indexed so long as their textual information can be extracted.

Contents

  • isoHunt Uses Lucene for the site search [1]
  • Gplex Database (homepage) Uses a C# version of Lucene to index your database metadata.
  • EB-eye_EBI's_Search_Engine EMBL-EBI's Biomedical databases search engine (contains than 200 million documents)
  • Joost Internet TV uses Lucene to search for programs.
  • MediaWiki can use Lucene for full-text search.
  • Liferay open source portal, uses Lucene for full-text search.
  • Beagle uses a port of Lucene to C# called Lucene.Net as its indexer.
  • Daisy uses Lucene for site search.
  • Merobase Component finder creates its index with Lucene
  • db4o works in combination with Lucene to support full text search.
  • Digg [2]
  • Docco (homepage) uses Lucene for desktop search.
  • DSpace (homepage) uses Lucene.
  • CNET uses Lucene to search their product category listings.
  • LjFind uses Lucene to search over 110,000,000 LiveJournal posts.
  • Red-Piranha [3] is another Lucene based search engine. It is ready to use, deployable as a GUI, command line or Tomcat web application, and has the ability to "learn" what the user wants.
  • The Flock web browser uses Clucene, a C++ version, to do a full text search of browser history.
  • KnowledgeBase [4] - A service focused CRM platform which uses the Lucene search engine
  • Zimbra groupware incorporates Lucene.
  • Ants P2P is using Lucene for the search option, within this anonymous file sharing program.
  • LIRE - Lucene Image Retrieval [5] CBIR library, which uses the Lucene search engine
  • MMBase has an expansion that uses Lucene for indexing its data.
  • Alfresco,[6] a free/open source Enterprise Content Management system
  • Strigi [7] uses CLucene, a C++ version, to index and search the desktop.
  • Midgard uses Lucene for its indexing and full-text search
  • Nuxeo EP,[8] a free/open source Enterprise Content Management (ECM) platform
  • Local Lucene,[9] a Geographical based searching solution using Lucene
  • Perst, an open source, object-oriented embedded database, integrates with Lucene for full-text database indexing and searching and for ACID-compliant transactional protection of the Lucene index
  • judy's book [10] uses Solr Lucene.
  • MindTouch Deki Wiki,[11] a free open source wiki and application platform, employs dotLucene for indexing wiki pages and file attachments.
  • LoopTeK Search, Internet Video content search.
  • Scalix is using Lucene for their Search and Indexing Service (SIS), available in version 11 of Scalix.
  • panFMP [12] is a generic and flexible framework for building metadata portals independent of metadata formats and protocols. As panFMP was developed specifically for Spatial Data Infrastructures, Lucene was extended by performant trie-based range-queries.
  • Jira [13] is a popular issue tracking system.
  • VYRE Unify [14] Content management platform

A more extensive list of software that uses Lucene is in the PoweredBy page of Lucene's wiki.

Lucene has been ported or is in the process of being ported to various programming languages other than Java:

  • Erik Hatcher and Otis Gospodnetic, "Lucene in Action", Manning, ISBN 1932394281

Advanced Search
Included Web Search Engines


Safe Search

close

Top Matching Results

Occasionally Search.com will highlight specialized results that are based on the context of your query. Examples of specialized results include specific links to news, images, or video.

Top Matching Results may highlight information from other Search.com pages, content from the CNET Network of sites, or third party content. The listings are based purely on relevance. Search.com does not receive payment for listings in this section but our partners that provide this data may get paid for listing these products.

Sponsored Links

This section contains paid listings which have been purchased by companies that want to have their sites appear for specific search terms and related content. These listings are administered, sorted and maintained by a third party and are not endorsed by Search.com.

Search Results

Search.com sends your search query to several search engines at one time and integrates the results into one list which has been sorted by relevance using Search.com's proprietary algorithm. You can customize the list of search engines included in your metasearch from the preferences.

The search engines that are used in your metasearch may allow companies to pay to have their Web sites included within the results. To view the Paid Inclusion policy for a specific search engine, please visit their Web site. Search.com does not accept payment or share revenue with any search engine partner for listings in this section.