java.lang.Object
  org.egothor.html.HTMLParser
public class HTMLParser
This class is part of the Egothor Project
Constructor Summary | |
---|---|
HTMLParser(boolean clinks) | Constructor for the HTMLParser object. |
HTMLParser(boolean clinks, boolean ilinks) | Constructor for the HTMLParser object. |
Method Summary | |
---|---|
java.net.URI | getBaseURL() - Return the base URL. |
java.util.ArrayList<Anchor> | getImageLinks() - Return an ArrayList of the image (img-src) links obtained by the Handler. |
java.util.ArrayList<Anchor> | getLinks() - Return an ArrayList of the links obtained by the Handler. |
java.util.HashMap<java.lang.String,java.lang.String> | getMeta() - Return a HashMap containing the metadata obtained by the Handler. |
CharStream | getReader(java.io.Reader i, java.lang.String baseURL, EventEncoder encoder) - Return a CharStream that will use the given input stream and read from the given URL. |
java.lang.String | getSummary() - Return the summary. |
java.lang.String | getTitle() - Return the title. |
static void | main(java.lang.String[] args) - Entry point to the HTMLParser application. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public HTMLParser(boolean clinks)
Parameters:
clinks - if set to true the object will collect links from the document
See Also:
getLinks()
public HTMLParser(boolean clinks, boolean ilinks)
Parameters:
clinks - if set to true the object will collect links from the document
ilinks - if set to true the object will collect img-src links from the document
Method Detail |
---|
public java.net.URI getBaseURL()
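Since the base URL is exposed as a java.net.URI, a caller can resolve relative link targets against it with the standard library. The sketch below is only an illustration under stated assumptions: LinkResolver and resolveAll are hypothetical names, not part of Egothor, and the hrefs are plain strings because the accessors of Anchor are not documented on this page. Whether HTMLParser already resolves links internally is not stated here.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Egothor: resolves raw href strings
// against a base URI such as the one returned by getBaseURL().
final class LinkResolver {

    // Resolve each href against the base. Strings that are not valid URIs are skipped.
    static List<URI> resolveAll(URI base, List<String> hrefs) {
        List<URI> resolved = new ArrayList<>();
        for (String href : hrefs) {
            try {
                resolved.add(base.resolve(href.trim()));
            } catch (IllegalArgumentException e) {
                // URI.resolve(String) throws this when the string is not a legal URI.
            }
        }
        return resolved;
    }

    public static void main(String[] args) {
        // Assumed sample values, chosen only for the demonstration.
        URI base = URI.create("http://example.org/docs/index.html");
        List<String> hrefs = Arrays.asList("guide.html", "../img/logo.png", "http://other.example/");
        for (URI u : resolveAll(base, hrefs)) {
            System.out.println(u);
        }
    }
}
```

Skipping malformed hrefs rather than failing is a design choice for crawling untidy HTML; a stricter caller could collect the rejected strings instead.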
public java.util.ArrayList<Anchor> getLinks()
public java.util.ArrayList<Anchor> getImageLinks()
public java.util.HashMap<java.lang.String,java.lang.String> getMeta()
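The metadata comes back as a plain HashMap<String,String>, so lookups need to cope with missing entries. The following is a minimal sketch of such a lookup; MetaLookup and metaOrDefault are hypothetical names, and the keys "description" and "keywords" are assumptions, since this page does not say which keys the parser stores or how they are cased.

```java
import java.util.HashMap;

// Hypothetical helper, not part of Egothor.
final class MetaLookup {

    // Return the trimmed value for the key, or the fallback when the key is absent or blank.
    static String metaOrDefault(HashMap<String, String> meta, String key, String fallback) {
        String value = meta.get(key);
        return (value == null || value.trim().isEmpty()) ? fallback : value.trim();
    }

    public static void main(String[] args) {
        // Stand-in for the map that getMeta() would return; keys and values are assumed.
        HashMap<String, String> meta = new HashMap<>();
        meta.put("description", "  Notes on the Egothor HTML parser ");
        System.out.println(metaOrDefault(meta, "description", "(none)"));
        System.out.println(metaOrDefault(meta, "keywords", "(none)"));
    }
}
```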
public CharStream getReader(java.io.Reader i, java.lang.String baseURL, EventEncoder encoder) throws java.io.IOException
Parameters:
i - the input stream
baseURL - where to read from
Throws:
java.io.IOException - if an I/O error occurs
public java.lang.String getSummary()
public java.lang.String getTitle()
public static void main(java.lang.String[] args)
Parameters:
args - the path to the file to parse