org.htmlparser.parserapplications
public class StringExtractor extends Object
StringBean
to accumulate
the user visible text (what a browser would display) into a single string.
Constructor Summary | |
---|---|
StringExtractor(String resource)
Construct a StringExtractor to read from the given resource. |
Method Summary | |
---|---|
String | extractStrings(boolean links)
Extract the text from a page. |
static void | main(String[] args)
Mainline. |
Parameters: resource Either a URL or a file name.
Parameters: links if true
include hyperlinks in output.
Returns: The textual contents of the page.
Throws: ParserException If a parse error occurs.
Parameters: args The command line arguments.
HTML Parser is an open source library released under LGPL. | |