AbstractNode

java.lang.Object
- org.htmlparser.nodes.AbstractNode

All Implemented Interfaces:

java.io.Serializable, java.lang.Cloneable, Node

Direct Known Subclasses:

RemarkNode, TagNode, TextNode
```
public abstract class AbstractNode
extends java.lang.Object
implements Node, java.io.Serializable
```
The abstract base class for all types of nodes (tags, text, remarks). This class provides the basic functionality to hold the Page, the starting and ending positions in the page, the parent, and the list of children.

See Also:

Serialized Form

Constructor Summary

Constructors
Constructor and Description

AbstractNode(Page page, int start, int end)
Create an abstract node with the page positions given.

Method Summary

Modifier and Type	Method and Description
`abstract void`	`accept(NodeVisitor visitor)` Visit this node.
`java.lang.Object`	`clone()` Clone this object.
`void`	`collectInto(NodeList list, NodeFilter filter)` Collect this node and its child nodes (if applicable) into the `list` parameter, provided the node satisfies the filtering criteria.
`void`	`doSemanticAction()` Perform the meaning of this tag.
`NodeList`	`getChildren()` Get the children of this node.
`int`	`getEndPosition()` Gets the ending position of the node.
`Node`	`getFirstChild()` Get the first child of this node.
`Node`	`getLastChild()` Get the last child of this node.
`Node`	`getNextSibling()` Get the next sibling to this node.
`Page`	`getPage()` Get the page this node came from.
`Node`	`getParent()` Get the parent of this node.
`Node`	`getPreviousSibling()` Get the previous sibling to this node.
`int`	`getStartPosition()` Gets the starting position of the node.
`java.lang.String`	`getText()` Returns the text of the node.
`void`	`setChildren(NodeList children)` Set the children of this node.
`void`	`setEndPosition(int position)` Sets the ending position of the node.
`void`	`setPage(Page page)` Set the page this node came from.
`void`	`setParent(Node node)` Sets the parent of this node.
`void`	`setStartPosition(int position)` Sets the starting position of the node.
`void`	`setText(java.lang.String text)` Sets the string contents of the node.
`java.lang.String`	`toHtml()` Return the HTML for this node.
`abstract java.lang.String`	`toHtml(boolean verbatim)` Return the HTML for this node.
`abstract java.lang.String`	`toPlainTextString()` Returns a string representation of the node.
`abstract java.lang.String`	`toString()` Return a string representation of the node.

Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, wait, wait, wait

- Constructor Detail
  - AbstractNode
```
public AbstractNode(Page page,
                    int start,
                    int end)
```
    Create an abstract node with the page positions given. Remember the page and start & end cursor positions.
    
    Parameters:
    
    page - The page this node was read from.
    
    start - The starting offset of this node within the page.
    
    end - The ending offset of this node within the page.
- Method Detail
  - clone
```
public java.lang.Object clone()
                       throws java.lang.CloneNotSupportedException
```
    Clone this object. Exposes java.lang.Object clone as a public method.
    
    Specified by:
    
    clone in interface Node
    
    Overrides:
    
    clone in class java.lang.Object
    
    Returns:
    
    A clone of this object.
    
    Throws:
    
    java.lang.CloneNotSupportedException - This shouldn't be thrown since the Node interface extends Cloneable.
    
    See Also:
    
    Cloneable
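    As a minimal sketch of the pattern described above, the following shows a class that widens `Object.clone()` to a public method. `CloneableNode` and `CloneSketch` are hypothetical stand-ins written for this illustration, not htmlparser classes, and the shallow-copy behaviour shown is that of `Object.clone()` in general rather than a documented guarantee of this class.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in, not the htmlparser class: it only illustrates the Cloneable pattern.
class CloneableNode implements Cloneable {
    int start;
    int end;
    final List<String> children = new ArrayList<>();

    CloneableNode(int start, int end) {
        this.start = start;
        this.end = end;
    }

    // Widen Object.clone() from protected to public.
    @Override
    public Object clone() throws CloneNotSupportedException {
        return super.clone(); // field-by-field (shallow) copy
    }
}

class CloneSketch {
    // Helper that hides the checked exception, which cannot occur for a Cloneable class.
    static CloneableNode copyOf(CloneableNode node) {
        try {
            return (CloneableNode) node.clone();
        } catch (CloneNotSupportedException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        CloneableNode original = new CloneableNode(0, 10);
        original.children.add("child");
        CloneableNode copy = copyOf(original);
        copy.end = 20; // primitive fields are independent after cloning
        System.out.println(original.end);                        // 10
        System.out.println(copy.children == original.children);  // true: the list is shared
    }
}
```

    Because the copy is shallow in this sketch, code that needs an independent child list would have to copy that list itself after cloning.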
  - toPlainTextString
```
public abstract java.lang.String toPlainTextString()
```
    Returns a string representation of the node. It allows a simple string transformation of a web page, regardless of node type.
    Typical application code (for extracting only the text from a web page) would then be simplified to:
```
 // parser.elements() returns a NodeIterator; iteration may throw ParserException
 for (NodeIterator e = parser.elements(); e.hasMoreNodes(); )
 {
     Node node = e.nextNode();
     System.out.println(node.toPlainTextString());
     // or do whatever processing you wish with the plain text string
 }
```
    Specified by:
    
    toPlainTextString in interface Node
    
    Returns:
    
    The 'browser' content of this node.
  - toHtml
```
public java.lang.String toHtml()
```
    Return the HTML for this node. This should be the sequence of characters the parser encountered that caused this node to be created. The exception is broken nodes (tags and remarks) that were encountered and fixed. Applications reproducing HTML can use this method on nodes which are to be used or transferred as they were received or created.
    
    Specified by:
    
    toHtml in interface Node
    
    Returns:
    
    The sequence of characters that would cause this node to be returned by the parser or lexer.
  - toHtml
```
public abstract java.lang.String toHtml(boolean verbatim)
```
    Return the HTML for this node. This should be the exact sequence of characters the parser encountered that caused this node to be created. The exception is broken nodes (tags and remarks) that were encountered and fixed. Applications reproducing HTML can use this method on nodes which are to be used or transferred as they were received or created.
    
    Specified by:
    
    toHtml in interface Node
    
    Parameters:
    
    verbatim - If true, return text as close to the original page text as possible.
    
    Returns:
    
    The (exact) sequence of characters that would cause this node to be returned by the parser or lexer.
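    The relationship between the two `toHtml` overloads can be sketched as follows. This is an illustration under stated assumptions, not the library's implementation: `HtmlSketchNode`, `RemarkSketch` and `ToHtmlSketch` are hypothetical names, and forwarding `verbatim = false` from the no-argument overload is an assumption made for the example.

```java
// Hypothetical stand-ins; only the overload shapes mirror the signatures above.
abstract class HtmlSketchNode {
    // Concrete convenience overload. Assumption: it forwards verbatim = false.
    public String toHtml() {
        return toHtml(false);
    }

    // Each subclass decides how to render itself.
    public abstract String toHtml(boolean verbatim);
}

class RemarkSketch extends HtmlSketchNode {
    private final String original; // text exactly as it appeared in the page
    private final String body;     // remark body without its delimiters

    RemarkSketch(String original, String body) {
        this.original = original;
        this.body = body;
    }

    @Override
    public String toHtml(boolean verbatim) {
        // verbatim: echo the source text; otherwise emit a repaired remark
        return verbatim ? original : "<!--" + body + "-->";
    }
}

class ToHtmlSketch {
    public static void main(String[] args) {
        // A remark with a malformed terminator, as an example of a "broken" node.
        HtmlSketchNode remark = new RemarkSketch("<!-- note --!>", " note ");
        System.out.println(remark.toHtml());     // <!-- note -->
        System.out.println(remark.toHtml(true)); // <!-- note --!>
    }
}
```

    The malformed remark is the case where the two outputs diverge; for a well-formed node in this sketch both calls would return the same string.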
  - toString
```
public abstract java.lang.String toString()
```
    Return a string representation of the node. Subclasses must define this method, which is typically used when printing a node:
```
System.out.println(node)
```
    Specified by:
    
    toString in interface Node
    
    Overrides:
    
    toString in class java.lang.Object
    
    Returns:
    
    A textual representation of the node suitable for debugging.
  - collectInto
```
public void collectInto(NodeList list,
                        NodeFilter filter)
```
    Collect this node and its child nodes (if applicable) into the `list` parameter, provided the node satisfies the filtering criteria.
    This mechanism allows powerful filtering code to be written very easily, without collecting embedded tags separately. For example, all the links on a page cannot be gathered at the top level alone, because many tags (such as form tags) can contain links embedded in them. The links could be extracted by checking whether the current node is a CompositeTag and walking its children, and this method provides a convenient way to do that.
    Using collectInto(), programs get a lot shorter. The code to extract all links from a page would look like:
```
 NodeList collectionList = new NodeList();
 NodeFilter filter = new TagNameFilter ("A");
 for (NodeIterator e = parser.elements(); e.hasMoreNodes();)
      e.nextNode().collectInto(collectionList, filter);
 
```
    Thus, collectionList will hold all the link nodes, irrespective of how deep the links are embedded.
    Another way to accomplish the same objective is:
```
 NodeList collectionList = new NodeList();
 NodeFilter filter = new TagClassFilter (LinkTag.class);
 for (NodeIterator e = parser.elements(); e.hasMoreNodes();)
      e.nextNode().collectInto(collectionList, filter);
 
```
    This is slightly less specific because the LinkTag class may be registered for more than one node name, e.g. <LINK> tags too.
    Specified by:
    
    collectInto in interface Node
    
    Parameters:
    
    list - The node list to collect acceptable nodes into.
    
    filter - The filter to determine which nodes are retained.
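    To make the "irrespective of how deep" behaviour concrete, here is a self-contained sketch of a recursive collect. `CollectNode` and `CollectIntoSketch` are hypothetical stand-ins, a `java.util.function.Predicate` plays the role of the filter, and placing the recursion in the node itself is an assumption of this example rather than a statement about where the library implements it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical stand-in for a node with children; not the htmlparser API.
class CollectNode {
    final String name;
    final List<CollectNode> children = new ArrayList<>();

    CollectNode(String name, CollectNode... kids) {
        this.name = name;
        for (CollectNode kid : kids) {
            children.add(kid);
        }
    }

    // Add this node if the filter accepts it, then recurse into every child.
    void collectInto(List<CollectNode> list, Predicate<CollectNode> filter) {
        if (filter.test(this)) {
            list.add(this);
        }
        for (CollectNode child : children) {
            child.collectInto(list, filter);
        }
    }
}

class CollectIntoSketch {
    // Builds FORM -> [A, P -> [A]] and collects every node named "A".
    static List<CollectNode> linksInSample() {
        CollectNode form = new CollectNode("FORM",
                new CollectNode("A"),
                new CollectNode("P", new CollectNode("A")));
        List<CollectNode> found = new ArrayList<>();
        form.collectInto(found, node -> node.name.equals("A"));
        return found;
    }

    public static void main(String[] args) {
        System.out.println(linksInSample().size()); // 2
    }
}
```

    Both links are found even though one sits two levels below the form, which is the case a top-level scan would miss.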
  - getPage
```
public Page getPage()
```
    Get the page this node came from.
    
    Specified by:
    
    getPage in interface Node
    
    Returns:
    
    The page that supplied this node.
    
    See Also:
    
    Node.setPage(org.htmlparser.lexer.Page)
  - setPage
```
public void setPage(Page page)
```
    Set the page this node came from.
    
    Specified by:
    
    setPage in interface Node
    
    Parameters:
    
    page - The page that supplied this node.
    
    See Also:
    
    Node.getPage()
  - getStartPosition
```
public int getStartPosition()
```
    Gets the starting position of the node.
    
    Specified by:
    
    getStartPosition in interface Node
    
    Returns:
    
    The start position.
    
    See Also:
    
    Node.setStartPosition(int)
  - setStartPosition
```
public void setStartPosition(int position)
```
    Sets the starting position of the node.
    
    Specified by:
    
    setStartPosition in interface Node
    
    Parameters:
    
    position - The new start position.
    
    See Also:
    
    Node.getStartPosition()
  - getEndPosition
```
public int getEndPosition()
```
    Gets the ending position of the node.
    
    Specified by:
    
    getEndPosition in interface Node
    
    Returns:
    
    The end position.
    
    See Also:
    
    Node.setEndPosition(int)
  - setEndPosition
```
public void setEndPosition(int position)
```
    Sets the ending position of the node.
    
    Specified by:
    
    setEndPosition in interface Node
    
    Parameters:
    
    position - The new end position.
    
    See Also:
    
    Node.getEndPosition()
  - accept
```
public abstract void accept(NodeVisitor visitor)
```
    Visit this node.
    
    Specified by:
    
    accept in interface Node
    
    Parameters:
    
    visitor - The visitor that is visiting this node.
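    Since `accept` is abstract, each subclass decides which visitor callback to invoke. The sketch below shows that double dispatch in isolation; `SketchVisitor`, `VisitNode`, `TextSketch`, `RemarkNodeSketch` and `AcceptSketch` are hypothetical names invented for the example, and the callback names are not taken from the library's NodeVisitor.

```java
// Hypothetical stand-ins illustrating double dispatch; names are not from htmlparser.
interface SketchVisitor {
    void visitText(TextSketch text);
    void visitRemark(RemarkNodeSketch remark);
}

abstract class VisitNode {
    // Each concrete node type routes the visitor to its own callback.
    abstract void accept(SketchVisitor visitor);
}

class TextSketch extends VisitNode {
    final String text;

    TextSketch(String text) {
        this.text = text;
    }

    @Override
    void accept(SketchVisitor visitor) {
        visitor.visitText(this);
    }
}

class RemarkNodeSketch extends VisitNode {
    final String remark;

    RemarkNodeSketch(String remark) {
        this.remark = remark;
    }

    @Override
    void accept(SketchVisitor visitor) {
        visitor.visitRemark(this);
    }
}

class AcceptSketch {
    // Sum the length of all text nodes, ignoring remarks.
    static int textLength(VisitNode... nodes) {
        final int[] total = {0};
        SketchVisitor counter = new SketchVisitor() {
            @Override
            public void visitText(TextSketch text) {
                total[0] += text.text.length();
            }

            @Override
            public void visitRemark(RemarkNodeSketch remark) {
                // remarks contribute nothing to the visible text
            }
        };
        for (VisitNode node : nodes) {
            node.accept(counter);
        }
        return total[0];
    }

    public static void main(String[] args) {
        int length = textLength(new TextSketch("Hello"),
                new RemarkNodeSketch("hidden"),
                new TextSketch("!!"));
        System.out.println(length); // 7
    }
}
```

    The caller never inspects node types; adding a new kind of processing means writing another visitor rather than touching the node classes.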
  - getParent
```
public Node getParent()
```
    Get the parent of this node. This will always return null when parsing without scanners, i.e. if semantic parsing was not performed. The object returned from this method can be safely cast to a CompositeTag.
    
    Specified by:
    
    getParent in interface Node
    
    Returns:
    
    The parent of this node, if it's been set, null otherwise.
    
    See Also:
    
    Node.setParent(org.htmlparser.Node)
  - setParent
```
public void setParent(Node node)
```
    Sets the parent of this node.
    
    Specified by:
    
    setParent in interface Node
    
    Parameters:
    
    node - The node that contains this node. Must be a CompositeTag.
    
    See Also:
    
    Node.getParent()
  - getChildren
```
public NodeList getChildren()
```
    Get the children of this node.
    
    Specified by:
    
    getChildren in interface Node
    
    Returns:
    
    The list of children contained by this node, if it's been set, null otherwise.
    
    See Also:
    
    Node.setChildren(org.htmlparser.util.NodeList)
  - setChildren
```
public void setChildren(NodeList children)
```
    Set the children of this node.
    
    Specified by:
    
    setChildren in interface Node
    
    Parameters:
    
    children - The new list of children this node contains.
    
    See Also:
    
    Node.getChildren()
  - getFirstChild
```
public Node getFirstChild()
```
    Get the first child of this node.
    
    Specified by:
    
    getFirstChild in interface Node
    
    Returns:
    
    The first child in the list of children contained by this node, null otherwise.
  - getLastChild
```
public Node getLastChild()
```
    Get the last child of this node.
    
    Specified by:
    
    getLastChild in interface Node
    
    Returns:
    
    The last child in the list of children contained by this node, null otherwise.
  - getPreviousSibling
```
public Node getPreviousSibling()
```
    Get the previous sibling to this node.
    
    Specified by:
    
    getPreviousSibling in interface Node
    
    Returns:
    
    The previous sibling to this node if one exists, null otherwise.
  - getNextSibling
```
public Node getNextSibling()
```
    Get the next sibling to this node.
    
    Specified by:
    
    getNextSibling in interface Node
    
    Returns:
    
    The next sibling to this node if one exists, null otherwise.
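    The null-returning contracts of the four navigation methods above (first child, last child, previous sibling, next sibling) can be sketched with a parent reference and a child list. `SiblingNode` is a hypothetical stand-in, and deriving siblings from the parent's child list is an assumption of this example, not a description of the library's internals.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in; sibling lookup via the parent's child list is an assumption.
class SiblingNode {
    final String label;
    SiblingNode parent;
    final List<SiblingNode> children = new ArrayList<>();

    SiblingNode(String label) {
        this.label = label;
    }

    // Attach a child and record this node as its parent.
    void add(SiblingNode child) {
        child.parent = this;
        children.add(child);
    }

    SiblingNode getFirstChild() {
        return children.isEmpty() ? null : children.get(0);
    }

    SiblingNode getLastChild() {
        return children.isEmpty() ? null : children.get(children.size() - 1);
    }

    SiblingNode getNextSibling() {
        return sibling(1);
    }

    SiblingNode getPreviousSibling() {
        return sibling(-1);
    }

    // Locate this node in its parent's list and step by the given offset.
    private SiblingNode sibling(int offset) {
        if (parent == null) {
            return null; // a root node has no siblings
        }
        int at = parent.children.indexOf(this);
        if (at < 0) {
            return null; // not registered with the parent
        }
        int target = at + offset;
        boolean inRange = target >= 0 && target < parent.children.size();
        return inRange ? parent.children.get(target) : null;
    }
}

class SiblingSketch {
    public static void main(String[] args) {
        SiblingNode root = new SiblingNode("root");
        SiblingNode first = new SiblingNode("first");
        SiblingNode second = new SiblingNode("second");
        root.add(first);
        root.add(second);
        System.out.println(first.getNextSibling().label);        // second
        System.out.println(second.getPreviousSibling().label);   // first
        System.out.println(first.getPreviousSibling() == null);  // true
    }
}
```

    Callers walking a tree this way need a null check at both ends of every child list and at the root.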
  - getText
```
public java.lang.String getText()
```
    Returns the text of the node.
    
    Specified by:
    
    getText in interface Node
    
    Returns:
    
    The text of this node. The default is null.
    
    See Also:
    
    Node.setText(java.lang.String)
  - setText
```
public void setText(java.lang.String text)
```
    Sets the string contents of the node.
    
    Specified by:
    
    setText in interface Node
    
    Parameters:
    
    text - The new text for the node.
    
    See Also:
    
    Node.getText()
  - doSemanticAction
```
public void doSemanticAction()
                      throws ParserException
```
    Perform the meaning of this tag. The default action is to do nothing.
    
    Specified by:
    
    doSemanticAction in interface Node
    
    Throws:
    
    ParserException - Not used. Provides for subclasses that may want to indicate an exceptional condition.
