All classes in this package implement the {@link org.htmlparser.Node Node} interface. Each one also implements the more specific interface for its node type: text, remark, or tag. The three node types map to concrete classes as follows:
Interface | Concrete Implementation |
{@link org.htmlparser.Text Text} | {@link org.htmlparser.nodes.TextNode TextNode} |
{@link org.htmlparser.Remark Remark} | {@link org.htmlparser.nodes.RemarkNode RemarkNode} |
{@link org.htmlparser.Tag Tag} | {@link org.htmlparser.nodes.TagNode TagNode} |
The {@link org.htmlparser.lexer.Lexer Lexer} parses an HTML stream into a contiguous stream of these nodes.
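Because every element of that stream is a Node, a consumer usually tells the three kinds apart by testing each node against the more specific interfaces. The following is a minimal sketch of that dispatch. It deliberately avoids the library itself: every type and method in it (SketchNode, SketchText, SketchRemark, SketchTag, their implementations, describe and describeAll) is a hypothetical stand-in declared inside the example, not part of org.htmlparser.

```java
import java.util.ArrayList;
import java.util.List;

class NodeDispatchSketch {
    // Hypothetical stand-ins for the Node, Text, Remark and Tag interfaces.
    interface SketchNode { String getText(); }
    interface SketchText extends SketchNode {}
    interface SketchRemark extends SketchNode {}
    interface SketchTag extends SketchNode {}

    // Shared storage for the stand-in implementations below.
    static abstract class Base implements SketchNode {
        private final String text;
        Base(String text) { this.text = text; }
        public String getText() { return text; }
    }

    // Hypothetical stand-ins for TextNode, RemarkNode and TagNode.
    static class SketchTextNode extends Base implements SketchText {
        SketchTextNode(String text) { super(text); }
    }
    static class SketchRemarkNode extends Base implements SketchRemark {
        SketchRemarkNode(String text) { super(text); }
    }
    static class SketchTagNode extends Base implements SketchTag {
        SketchTagNode(String text) { super(text); }
    }

    // Label a node according to the specific interface it implements.
    static String describe(SketchNode node) {
        if (node instanceof SketchTag) return "tag:" + node.getText();
        if (node instanceof SketchRemark) return "remark:" + node.getText();
        if (node instanceof SketchText) return "text:" + node.getText();
        return "unknown";
    }

    // Apply describe() to each node, preserving stream order.
    static List<String> describeAll(List<SketchNode> nodes) {
        List<String> labels = new ArrayList<>();
        for (SketchNode node : nodes) {
            labels.add(describe(node));
        }
        return labels;
    }

    public static void main(String[] args) {
        // A hand-built stream standing in for lexer output.
        List<SketchNode> stream = new ArrayList<>();
        stream.add(new SketchTagNode("p"));
        stream.add(new SketchTextNode("hello"));
        stream.add(new SketchRemarkNode(" a comment "));
        for (String label : describeAll(stream)) {
            System.out.println(label);
        }
    }
}
```

With the real library, the same pattern would test against the Text, Remark and Tag interfaces listed in the table above.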
The {@link org.htmlparser.Parser Parser} returns either these nodes or, for tags whose names have been registered via {@link org.htmlparser.PrototypicalNodeFactory#registerTag registerTag()}, specific {@link org.htmlparser.tags Tag} objects, which are subclasses of TagNode.
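The idea of returning a specific subclass only for registered tag names can be illustrated with a small name-keyed registry. This is a sketch under stated assumptions, not the library's implementation: GenericTag, LinkTagSketch, Factory and its two-argument registerTag are hypothetical names, and the use of constructor references and upper-cased keys is an assumption made for the example.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.function.Function;

class TagRegistrySketch {
    // Hypothetical stand-in for the generic tag node.
    static class GenericTag {
        final String name;
        GenericTag(String name) { this.name = name; }
        String kind() { return "generic"; }
    }

    // Hypothetical stand-in for a specific tag subclass.
    static class LinkTagSketch extends GenericTag {
        LinkTagSketch(String name) { super(name); }
        @Override String kind() { return "link"; }
    }

    // Hypothetical factory: registered names yield a specific subclass,
    // every other name falls back to the generic tag.
    static class Factory {
        private final Map<String, Function<String, GenericTag>> registered = new HashMap<>();

        // Assumption: names are matched case-insensitively via upper-casing.
        void registerTag(String name, Function<String, GenericTag> constructor) {
            registered.put(name.toUpperCase(Locale.ROOT), constructor);
        }

        GenericTag createTag(String name) {
            String key = name.toUpperCase(Locale.ROOT);
            Function<String, GenericTag> constructor = registered.get(key);
            return constructor != null ? constructor.apply(key) : new GenericTag(key);
        }
    }

    public static void main(String[] args) {
        Factory factory = new Factory();
        factory.registerTag("a", LinkTagSketch::new);
        // "a" is registered, "div" is not.
        System.out.println(factory.createTag("a").kind());
        System.out.println(factory.createTag("div").kind());
    }
}
```

Either way the caller receives a tag object, so code that only needs generic tag behavior does not have to know which names were registered.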