:: EGOTHOR

Predefined Filters

Table 1.1. Predefined Filters

| Connector | Input type | Input flags | Output type | Output flags | Saw | Harvest | Price | Note |
|---|---|---|---|---|---|---|---|---|
| BuildHTML | org.egothor.parser.Tokenizer | TAGGED NOHTMLTAGS | org.egothor.data.Document | HOME | bhtml.location : String | | 10 | via org.egothor.indexer.html.HTMLDocument; bhtml.location is resow as html.base |
| BuildHTML2 | org.egothor.parser.Tokenizer | TAGGED NOHTMLTAGS | org.egothor.data.Document | HOME SNIPPET | bhtml.location : String | | 15 | via org.egothor.indexer.html2.HTMLDocument; bhtml.location is resow as html.base |
| DefineCSASCII | java.io.Reader | !CSASCII | java.io.Reader | CSASCII | | | 2 | via org.egothor.util.recode.cs_ascii_r |
| DefineHTML | java.io.Reader | HTML !NOHTMLTAGS | java.io.Reader | NOHTMLTAGS | | html.{meta,base,linx,summ,titl} | 10 | via org.egothor.analyzer.html.Handler (Swing parser) |
| DefineHTML2 | java.io.Reader | HTML !NOHTMLTAGS | java.io.Reader | NOHTMLTAGS | | html.{meta,base,linx,summ,titl} | 15 | via org.egothor.analyzer.html2.Handler (JavaCC parser) |
| DefineLowerCase | org.egothor.parser.Tokenizer | !LOWERCASE | org.egothor.parser.Tokenizer | LOWERCASE | locale : java.util.Locale | | 2 | via org.egothor.misc.LowerCase |
| DefineReader | java.lang.String | FILENAME | java.io.Reader | BUFFERED | | | 1 | via java.io.BufferedReader and java.io.FileReader |
| DefineEncReader | java.lang.String | FILENAME | java.io.Reader | BUFFERED STREAMDECODE | input.encoding : String | | 2 | via java.io.BufferedReader using UTF-8 encoding for the stream (by default); see input.encoding |
| DefineStem | org.egothor.parser.Tokenizer | !STEM | org.egothor.parser.Tokenizer | STEM | stemmer.table : java.util.Locale | | 2 | via org.egothor.parser.misc.Stem |
| DefineStopper | org.egothor.parser.Tokenizer | !STOPLIST | org.egothor.parser.Tokenizer | STOPLIST | stopfilter : org.egothor.parser.misc.StopFilter | | 2 | |
| DefineTokenizer | java.io.Reader | !TAGGED | org.egothor.parser.Tokenizer | TAGGED | | | 10 | via org.egothor.parser.plain.Simple |
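
The DefineReader and DefineEncReader rows describe turning a file name into a buffered java.io.Reader. The following is a minimal sketch of what such readers might look like using only standard java.io classes. The class name ReaderSketch, both method names, and the use of FileInputStream with InputStreamReader for the decoding step are assumptions made for illustration; they are not taken from the Egothor sources.

```java
import java.io.BufferedReader;
import java.io.FileInputStream;
import java.io.FileReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration; not part of the Egothor API.
public class ReaderSketch {

    // Mirrors the DefineReader note: BufferedReader over FileReader,
    // which decodes with the platform default charset.
    public static Reader plainReader(String filename) throws IOException {
        return new BufferedReader(new FileReader(filename));
    }

    // Mirrors the DefineEncReader note: a buffered reader that decodes the
    // byte stream as UTF-8 unless an encoding name is supplied
    // (standing in for the input.encoding parameter).
    public static Reader encodedReader(String filename, String encoding) throws IOException {
        Charset charset = (encoding == null)
                ? StandardCharsets.UTF_8
                : Charset.forName(encoding);
        return new BufferedReader(
                new InputStreamReader(new FileInputStream(filename), charset));
    }
}
```

Passing null as the encoding selects the UTF-8 default; any charset name accepted by Charset.forName, such as "ISO-8859-2", overrides it.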

Flags prefixed with "!" must not be set when the elementary filter is used.
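
The flag rule can be modelled as a simple set check. The sketch below is only an illustration: the class FilterFlags and its methods are hypothetical and not part of the Egothor API. It also assumes two things the table does not state outright: that unprefixed input flags must already be set, and that applying a filter adds its output flags to the current set.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper for illustration; not part of the Egothor API.
public class FilterFlags {

    // True when every "!"-prefixed flag is absent from `current` and,
    // by assumption, every unprefixed flag is present.
    public static boolean canApply(List<String> inputFlags, Set<String> current) {
        for (String flag : inputFlags) {
            if (flag.startsWith("!")) {
                if (current.contains(flag.substring(1))) {
                    return false; // forbidden flag is already set
                }
            } else if (!current.contains(flag)) {
                return false; // assumed: unprefixed flags are required
            }
        }
        return true;
    }

    // Assumed behaviour: applying a filter adds its output flags.
    public static Set<String> apply(List<String> inputFlags,
                                    List<String> outputFlags,
                                    Set<String> current) {
        if (!canApply(inputFlags, current)) {
            throw new IllegalStateException("input flags not satisfied: " + inputFlags);
        }
        Set<String> next = new HashSet<>(current);
        next.addAll(outputFlags);
        return next;
    }

    public static void main(String[] args) {
        // Flag columns of the DefineHTML row.
        List<String> in = List.of("HTML", "!NOHTMLTAGS");
        List<String> out = List.of("NOHTMLTAGS");

        Set<String> flags = new HashSet<>(List.of("HTML"));
        System.out.println(canApply(in, flags)); // true
        flags = apply(in, out, flags);
        System.out.println(canApply(in, flags)); // false
    }
}
```

Under these assumptions, the "!" prefix keeps a filter such as DefineHTML from being applied a second time to data it has already processed.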

© 2003-2004 Egothor Developers