| Package | Description |
|---|---|
| de.l3s.boilerpipe.sax | Classes related to parsing and producing HTML from/to Boilerpipe TextDocuments. |

| Class | Description |
|---|---|
| BoilerpipeHTMLContentHandler | A simple SAX ContentHandler, used by BoilerpipeSAXInput. |
| BoilerpipeHTMLParser | A simple SAX Parser, used by BoilerpipeSAXInput. |
| HTMLDocument | An InputSourceable for HTMLFetcher. |
| HTMLHighlighter | Highlights text blocks in an HTML document that have been marked as "content" in the corresponding TextDocument. |
| InputSourceable | An InputSourceable can return an arbitrary number of new InputSources for a given document. |
| TagAction | Defines an action that is to be performed whenever a particular tag occurs during HTML parsing. |
| TagActionMap | Base class for defining a set of TagActions that are to be used for the HTML parsing process. |
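
The TagAction and TagActionMap entries describe a per-tag dispatch that runs during SAX parsing. The following is a minimal sketch of that idea using only the JDK's built-in SAX parser. The names `SimpleTagAction`, `defaultActions`, `TextCollector`, and `extractText` are hypothetical and chosen for illustration; they are not Boilerpipe's actual interfaces or signatures. The sketch also assumes well-formed XHTML input, because the JDK parser is an XML parser and does not tolerate real-world HTML.

```java
import java.io.StringReader;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

class TagActionSketch {

    /** Hypothetical per-tag action; NOT Boilerpipe's TagAction interface. */
    interface SimpleTagAction {
        /** Returns true if text inside the element should be dropped. */
        boolean ignoresContent();
    }

    static final SimpleTagAction IGNORE = () -> true;
    static final SimpleTagAction KEEP = () -> false;

    /** Hypothetical analogue of a tag-to-action map (assumed defaults). */
    static Map<String, SimpleTagAction> defaultActions() {
        Map<String, SimpleTagAction> actions = new HashMap<>();
        actions.put("script", IGNORE);
        actions.put("style", IGNORE);
        return actions;
    }

    /** SAX handler that looks up an action for every start tag. */
    static class TextCollector extends DefaultHandler {
        private final Map<String, SimpleTagAction> actions;
        private final StringBuilder text = new StringBuilder();
        private int ignoreDepth = 0; // > 0 while inside an ignored element

        TextCollector(Map<String, SimpleTagAction> actions) {
            this.actions = actions;
        }

        @Override
        public void startElement(String uri, String localName, String qName, Attributes atts) {
            String tag = qName.toLowerCase(Locale.ROOT);
            // Tags without an entry in the map default to KEEP.
            if (ignoreDepth > 0 || actions.getOrDefault(tag, KEEP).ignoresContent()) {
                ignoreDepth++;
            }
        }

        @Override
        public void endElement(String uri, String localName, String qName) {
            if (ignoreDepth > 0) {
                ignoreDepth--;
            }
        }

        @Override
        public void characters(char[] ch, int start, int length) {
            if (ignoreDepth == 0) {
                text.append(ch, start, length);
            }
        }

        /** Collected text with runs of whitespace collapsed to single spaces. */
        String text() {
            return text.toString().trim().replaceAll("\\s+", " ");
        }
    }

    /** Parses well-formed XHTML and returns the text outside ignored tags. */
    public static String extractText(String xhtml) throws Exception {
        TextCollector handler = new TextCollector(defaultActions());
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xhtml)), handler);
        return handler.text();
    }

    public static void main(String[] args) throws Exception {
        String page = "<html><body><script>var x = 1;</script>"
                + "<p>Hello <b>world</b></p></body></html>";
        System.out.println(extractText(page)); // prints "Hello world"
    }
}
```

Keeping the tag-to-action mapping in a separate map, rather than hard-coding tag names in the handler, is what lets a different set of actions be swapped in without touching the parsing code.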