| BuildHTML | org.egothor.parser.Tokenizer | org.egothor.data.Document | bhtml.location : String | 10 |
| TAGGED NOHTMLTAGS | HOME | | via org.egothor.indexer.html.HTMLDocument
bhtml.location is resow as html.base
|
| BuildHTML2 | org.egothor.parser.Tokenizer | org.egothor.data.Document | bhtml.location : String | 15 |
| TAGGED NOHTMLTAGS | HOME SNIPPET | | via org.egothor.indexer.html2.HTMLDocument
bhtml.location is resow as html.base.
|
| DefineCSASCII | java.io.Reader | java.io.Reader | | 2 |
| !CSASCII | CSASCII | | via org.egothor.util.recode.cs_ascii_r |
| DefineHTML | java.io.Reader | java.io.Reader | | 10 |
| HTML !NOHTMLTAGS | NOHTMLTAGS | html.{meta,base,linx,summ,titl} | via org.egothor.analyzer.html.Handler (Swing parser) |
| DefineHTML2 | java.io.Reader | java.io.Reader | | 15 |
| HTML !NOHTMLTAGS | NOHTMLTAGS | html.{meta,base,linx,summ,titl} | via org.egothor.analyzer.html2.Handler (JavaCC parser) |
| DefineLowerCase | org.egothor.parser.Tokenizer | org.egothor.parser.Tokenizer | locale : java.util.Locale | 2 |
| !LOWERCASE | LOWERCASE | | via org.egothor.misc.LowerCase |
| DefineReader | java.lang.String | java.io.Reader | | 1 |
| FILENAME | BUFFERED | | via java.io.BufferedReader and java.io.FileReader |
| DefineEncReader | java.lang.String | java.io.Reader | input.encoding : String | 2 |
| FILENAME | BUFFERED STREAMDECODE | | via java.io.BufferedReader using UTF-8 encoding for the stream (by default)
see input.encoding |
| DefineStem | org.egothor.parser.Tokenizer | org.egothor.parser.Tokenizer | stemmer.table : java.util.Locale | 2 |
| !STEM | STEM | | via org.egothor.parser.misc.Stem |
| DefineStopper | org.egothor.parser.Tokenizer | org.egothor.parser.Tokenizer | stopfilter : org.egothor.parser.misc.StopFilter | 2 |
| !STOPLIST | STOPLIST | | |
| DefineTokenizer | java.io.Reader | org.egothor.parser.Tokenizer | | 10 |
| !TAGGED | TAGGED | | via org.egothor.parser.plain.Simple |