org.htmlparser.visitors

Class TextExtractingVisitor

public class TextExtractingVisitor extends NodeVisitor

Extracts text from a web page. Usage: Parser parser = new Parser(...); TextExtractingVisitor visitor = new TextExtractingVisitor(); parser.visitAllNodesWith(visitor); String textInPage = visitor.getExtractedText();
Constructor Summary
TextExtractingVisitor()
Method Summary
StringgetExtractedText()
voidvisitEndTag(Tag tag)
voidvisitStringNode(Text stringNode)
voidvisitTag(Tag tag)

Constructor Detail

TextExtractingVisitor

public TextExtractingVisitor()

Method Detail

getExtractedText

public String getExtractedText()

visitEndTag

public void visitEndTag(Tag tag)

visitStringNode

public void visitStringNode(Text stringNode)

visitTag

public void visitTag(Tag tag)
HTML Parser is an open source library released under LGPL. SourceForge.net