| Class Summary | |
|---|---|
| DoubleMetaphone | This module implements a "sounds like" algorithm developed by Lawrence Philips and published in the June 2000 issue of C/C++ Users Journal. |
| DupWithoutDiacritics | This filter transforms all (Latin) words to non-diacritical (ASCII), but still keeps the original tokens. |
| Grammer | This class is an N-grammer: it produces N-grams. |
| LowerCase | This filter transforms all words to lower case. |
| Nysiis | This module implements the New York State Identification and Intelligence System (NYSIIS) Phonetic Code. |
| ParagraphFilter | Filter sets the sentence, paragraph and sentenceInParagraph fields in the Token class, just like the ParagraphPunctFilter. |
| ParagraphPunctFilter | Filter sets the sentence, paragraph and sentenceInParagraph fields in the Token class. |
| Phonetics | |
| PunctFilter | |
| RemoveDiacritics | This filter transforms all (Latin) words to non-diacritical (ASCII). |
| Stemmer | The Stemmer object is a filter which transforms all words to their respective stems. |
| StopFilter | This abstract class should be extended by any class wishing to ignore certain tokens while processing all tokens. |
| WordNGrammer | This class produces N-grams of words. |

This package defines objects that filter tokens. Use them when you want to transform tokens, for example to their stems or to lower case.
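
To make the kind of transformation described above concrete, here is a minimal, self-contained sketch of three token transformations: lower-casing, diacritic removal, and word N-grams. It does not use this package's classes, whose method signatures are not shown on this page. The class name `TokenFilterSketch` and its methods are hypothetical, and tokens are assumed to be plain strings rather than `Token` objects.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only: not part of this package's API.
public class TokenFilterSketch {

    // Lower-cases a token, independent of the default locale.
    static String lowerCase(String token) {
        return token.toLowerCase(Locale.ROOT);
    }

    // Strips diacritics by decomposing characters (NFD) and then
    // dropping the combining marks that the decomposition exposes.
    static String removeDiacritics(String token) {
        String decomposed = Normalizer.normalize(token, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    // Builds word N-grams: every run of n consecutive tokens,
    // joined with a single space.
    static List<String> wordNGrams(List<String> tokens, int n) {
        if (n <= 0) {
            throw new IllegalArgumentException("n must be positive");
        }
        List<String> grams = new ArrayList<>();
        for (int i = 0; i + n <= tokens.size(); i++) {
            grams.add(String.join(" ", tokens.subList(i, i + n)));
        }
        return grams;
    }

    public static void main(String[] args) {
        // Chain the transformations the way a filter pipeline might.
        List<String> raw = List.of("Caf\u00e9", "Au", "Lait");
        List<String> filtered = new ArrayList<>();
        for (String token : raw) {
            filtered.add(removeDiacritics(lowerCase(token)));
        }
        System.out.println(filtered);
        System.out.println(wordNGrams(filtered, 2));
    }
}
```

Each method takes a token and returns a new one, so the transformations can be chained in any order. A real filter in this package presumably operates on `Token` objects and may also maintain fields such as sentence and paragraph positions, which this sketch ignores.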