BuildHTML | org.egothor.parser.Tokenizer | org.egothor.data.Document | bhtml.location : String | 10 |
TAGGED NOHTMLTAGS | HOME | | via org.egothor.indexer.html.HTMLDocument
bhtml.location is resow as html.base
|
BuildHTML2 | org.egothor.parser.Tokenizer | org.egothor.data.Document | bhtml.location : String | 15 |
TAGGED NOHTMLTAGS | HOME SNIPPET | | via org.egothor.indexer.html2.HTMLDocument
bhtml.location is resow as html.base
|
DefineCSASCII | java.io.Reader | java.io.Reader | | 2 |
!CSASCII | CSASCII | | via org.egothor.util.recode.cs_ascii_r |
DefineHTML | java.io.Reader | java.io.Reader | | 10 |
HTML !NOHTMLTAGS | NOHTMLTAGS | html.{meta,base,linx,summ,titl} | via org.egothor.analyzer.html.Handler (Swing parser) |
DefineHTML2 | java.io.Reader | java.io.Reader | | 15 |
HTML !NOHTMLTAGS | NOHTMLTAGS | html.{meta,base,linx,summ,titl} | via org.egothor.analyzer.html2.Handler (JavaCC parser) |
DefineLowerCase | org.egothor.parser.Tokenizer | org.egothor.parser.Tokenizer | locale : java.util.Locale | 2 |
!LOWERCASE | LOWERCASE | | via org.egothor.misc.LowerCase |
DefineReader | java.lang.String | java.io.Reader | | 1 |
FILENAME | BUFFERED | | via java.io.BufferedReader and java.io.FileReader |
DefineEncReader | java.lang.String | java.io.Reader | input.encoding : String | 2 |
FILENAME | BUFFERED STREAMDECODE | | via java.io.BufferedReader; the stream is decoded as UTF-8 by default, see input.encoding |
DefineStem | org.egothor.parser.Tokenizer | org.egothor.parser.Tokenizer | stemmer.table : java.util.Locale | 2 |
!STEM | STEM | | via org.egothor.parser.misc.Stem |
DefineStopper | org.egothor.parser.Tokenizer | org.egothor.parser.Tokenizer | stopfilter : org.egothor.parser.misc.StopFilter | 2 |
!STOPLIST | STOPLIST | | |
DefineTokenizer | java.io.Reader | org.egothor.parser.Tokenizer | | 10 |
!TAGGED | TAGGED | | via org.egothor.parser.plain.Simple |
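
The rows above can be read as typed steps with a cost, which suggests how a chain from a file name to a Document might be chosen. The sketch below is only an illustration under stated assumptions, not Egothor's actual planner: it assumes the columns are name, input type, output type, parameters, cost, required tags (a leading ! meaning the tag must be absent), and produced tags, and that tags accumulate along a chain. The class TransformerTable and all of its members are hypothetical names, and only a subset of the rows is encoded.

```java
import java.util.*;

public class TransformerTable {
    static final String STRING = "java.lang.String", READER = "java.io.Reader",
        TOKENIZER = "org.egothor.parser.Tokenizer", DOCUMENT = "org.egothor.data.Document";

    /** One table row; the field meanings are an assumed reading of the columns. */
    record Transformer(String name, String in, String out, int cost,
                       Set<String> required, Set<String> forbidden, Set<String> produced) {
        /** True if the input type matches, all required tags are present, and no forbidden tag is. */
        boolean accepts(State s) {
            return in.equals(s.type()) && s.tags().containsAll(required)
                && Collections.disjoint(s.tags(), forbidden);
        }
        /** Resulting state: output type, existing tags plus the produced ones (assumption). */
        State apply(State s) {
            Set<String> tags = new TreeSet<>(s.tags());
            tags.addAll(produced);
            return new State(out, tags);
        }
    }

    record State(String type, Set<String> tags) {}
    record Plan(State state, int cost, List<String> path) {}

    /** Builds a row from tag cells such as "HTML !NOHTMLTAGS" and "NOHTMLTAGS". */
    static Transformer row(String name, String in, String out, int cost, String needs, String gives) {
        Set<String> required = new TreeSet<>(), forbidden = new TreeSet<>(), produced = new TreeSet<>();
        for (String t : needs.trim().split("\\s+")) {
            if (t.isEmpty()) continue;
            if (t.startsWith("!")) forbidden.add(t.substring(1)); else required.add(t);
        }
        for (String t : gives.trim().split("\\s+")) if (!t.isEmpty()) produced.add(t);
        return new Transformer(name, in, out, cost, required, forbidden, produced);
    }

    /** A subset of the rows listed above, copied by hand. */
    static List<Transformer> demoTable() {
        return List.of(
            row("BuildHTML", TOKENIZER, DOCUMENT, 10, "TAGGED NOHTMLTAGS", "HOME"),
            row("BuildHTML2", TOKENIZER, DOCUMENT, 15, "TAGGED NOHTMLTAGS", "HOME SNIPPET"),
            row("DefineHTML", READER, READER, 10, "HTML !NOHTMLTAGS", "NOHTMLTAGS"),
            row("DefineHTML2", READER, READER, 15, "HTML !NOHTMLTAGS", "NOHTMLTAGS"),
            row("DefineLowerCase", TOKENIZER, TOKENIZER, 2, "!LOWERCASE", "LOWERCASE"),
            row("DefineReader", STRING, READER, 1, "FILENAME", "BUFFERED"),
            row("DefineEncReader", STRING, READER, 2, "FILENAME", "BUFFERED STREAMDECODE"),
            row("DefineTokenizer", READER, TOKENIZER, 10, "!TAGGED", "TAGGED"));
    }

    /** Uniform-cost (Dijkstra) search for the cheapest chain reaching goalType with all goalTags. */
    static Optional<Plan> cheapest(List<Transformer> table, State start,
                                   String goalType, Set<String> goalTags) {
        PriorityQueue<Plan> queue = new PriorityQueue<>(Comparator.comparingInt(Plan::cost));
        Set<State> done = new HashSet<>();
        queue.add(new Plan(start, 0, List.of()));
        while (!queue.isEmpty()) {
            Plan plan = queue.poll();
            if (!done.add(plan.state())) continue; // state already settled at a lower or equal cost
            if (plan.state().type().equals(goalType) && plan.state().tags().containsAll(goalTags)) {
                return Optional.of(plan);
            }
            for (Transformer t : table) {
                if (!t.accepts(plan.state())) continue;
                List<String> path = new ArrayList<>(plan.path());
                path.add(t.name());
                queue.add(new Plan(t.apply(plan.state()), plan.cost() + t.cost(), path));
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        State start = new State(STRING, Set.of("FILENAME", "HTML"));
        cheapest(demoTable(), start, DOCUMENT, Set.of("HOME"))
            .ifPresent(p -> System.out.println(p.path() + " cost=" + p.cost()));
    }
}
```

Under these assumptions, asking for a Document tagged HOME from an HTML file name selects DefineReader, DefineHTML, DefineTokenizer, and BuildHTML, since each is the cheapest row that satisfies the next type and tag requirement. Adding LOWERCASE to the goal tags would insert DefineLowerCase between the tokenizer and the builder.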