:: EGOTHOR |
Key Features. EGOTHOR is a high-performance, full-featured text search engine written entirely in Java. It is technology suitable for nearly any application that requires full-text search, especially cross-platform. It can be configured as a standalone engine, metasearcher, peer-to-peer HUB, and, moreover, it can be used as a library for an application that needs full-text search.
Written in JAVA for cross platform compatibility.
Able to recognize many of the most familiar file formats: HTML, PDF, PS, and Microsoft's DOC, and XLS.
Fast! 50 HTML pages per second can be indexed.
The best compression methods (Golomb, Elias-Gamma, Block coding) are used.
Based on the extended Boolean model that can operate as the Boolean model and Vector model.
Universal stemmer that can process almost any (European) language. (No testing has been done for other languages.)
New dynamization algorithm.
Open architecture.
Mailing lists. There is one public mailing list: <egothor-tech@egothor.org> - user questions and answers, general questions, technical topics for Egothor developers and advanced users.
History
SM2 is rewritten and SM3 (code-name Egothor, Sheeef) is here. Aim: Composing all our results together.
Finding a true dynamisation of indices. Aim: (weak) Real-time indexing
I/O tests of JAVA platform; compression of small numbers in JAVA. Aim: Could we build an engine for huge collections?
As part of SM2 the multilingual stemmer is developed. Aim: Semi-automatic error free stemmer for any European language.
The Second Search Maestro (SM2) is developed. Aim: Pilot project.
The First Search Maestro was created. Platform: OS/2, C/C++/REXX/OREXX. Aim: Native search engine for OS/2.
Prev | Up | Next |
Installation and Use of ::egothor | Home | Chapter 1. Installation |
© 2003-2004 Egothor Developers |