:: EGOTHOR

Preface

Key Features.  EGOTHOR is a high-performance, full-featured text search engine written entirely in Java. It is technology suitable for nearly any application that requires full-text search, especially cross-platform. It can be configured as a standalone engine, metasearcher, peer-to-peer HUB, and, moreover, it can be used as a library for an application that needs full-text search.

  1. Written in JAVA for cross platform compatibility.

  2. Able to recognize many of the most familiar file formats: HTML, PDF, PS, and Microsoft's DOC, and XLS.

  3. Fast! 50 HTML pages per second can be indexed.

  4. The best compression methods (Golomb, Elias-Gamma, Block coding) are used.

  5. Based on the extended Boolean model that can operate as the Boolean model and Vector model.

  6. Universal stemmer that can process almost any (European) language. (No testing has been done for other languages.)

  7. New dynamization algorithm.

  8. Open architecture.

Mailing lists.  There is one public mailing list: - user questions and answers, general questions, technical topics for Egothor developers and advanced users.

History

2002

SM2 is rewritten and SM3 (code-name Egothor, Sheeef) is here. Aim: Composing all our results together.

2001

Finding a true dynamisation of indices. Aim: (weak) Real-time indexing

2000

I/O tests of JAVA platform; compression of small numbers in JAVA. Aim: Could we build an engine for huge collections?

1999

As part of SM2 the multilingual stemmer is developed. Aim: Semi-automatic error free stemmer for any European language.

1998

The Second Search Maestro (SM2) is developed. Aim: Pilot project.

1997

The First Search Maestro was created. Platform: OS/2, C/C++/REXX/OREXX. Aim: Native search engine for OS/2.

Prev Up Next
Installation and Use of ::egothor Home Chapter 1. Installation
© 2003-2004 Egothor Developers