Sponsored Links
Directory Sites
Software for indexing and searching text documents, using full text and field based search, relevance ranked results, Boolean queries, and heterogeneous databases. Support for document types such as HTML, SGML, mail folders, and USMARC.
www.etymon.com
Megaputer provides a complete family of unique solutions for Natural Language Text Retrieval and Analysis, Data Mining and Knowledge Discovery in Databases.
www.megaputer.com
A complete world wide web indexing and searching system for a small domain or intranet. Source code (GPL).
www.htdig.org
The tools they use at their site for sale. Demo version available for download.
software.infoseek.com
Zebra is a fulltext and free-text indexing and retrieval system that conforms to ANSI standard Z39.50. It is very good for indexing and searching highly structured data such as MARC records, and GILS records. The Zebra server is freely available for noncommercial applications.
www.indexdata.dk
Thunderstone has a number of full text search related products including their flagship text/relational database, Texis.
www.thunderstone.com
Search engine vendor of BRS/Search, a text based core product, and web enabled products.
www.dataware.com
Information and select sections of a book about indexing and compression techniques for documents and images. Also provides information about open source IR system released with the book.
www.cs.mu.oz.au
Supplier of information retrieval and collaborative software.
www.opentext.com
Combined Computer Resources, Inc. (CCR), a software developer and integrator, specializes in customizing and integrating document imaging, COLD report management and workflow software products.
www.winocular.com
SimpleScan Software, Inc - providing powerful, cost effective, enterprise wide document management software solutions.
www.simplescan.com
High speed, fully featured, multilingual fielded fulltext engine. Available for many platforms including Solaris, BSD, Linux and Windows-NT.
www.bsn.com
Searches all popular file types, with features including hit highlighting, natural language, fuzzy, phonic, boolean, proximity, field, numeric range.
www.dtsearch.com
Provides document scanning, optical character recognition and full-text searching.
www.searchexpress.com
Cheshire II is a "Next-Generation Online Catalog and Full-Text Information Retrieval System." It features advanced IR techniques, including support for Boolean and probabilistic 'best match' ranked searching, SGML/XML as the primary data base format, and a client/server architecture that uses the Z39.50 Information Retrieval Protocol.
cheshire.berkeley.edu