How to use
collectInto
method
in
org.htmlparser.Node

Best Java code snippets using org.htmlparser.Node.collectInto (Showing top 8 results out of 315)

/**
 * Search given node and pick up any objects of given type.
 * @param node The node to search.
 * @param type The class to search for.
 * @return A node array with the matching nodes.
 */
public static Node[] findTypeInNode(Node node, Class type)
{
  NodeFilter filter;
  NodeList ret;
  
  ret = new NodeList ();
  filter = new NodeClassFilter (type);
  node.collectInto (ret, filter);
  return (ret.toNodeArray ());
}

/**
 * Search given node and pick up any objects of given type.
 * @param node The node to search.
 * @param type The class to search for.
 * @return A node array with the matching nodes.
 */
public static Node[] findTypeInNode(Node node, Class type)
{
  NodeFilter filter;
  NodeList ret;
  
  ret = new NodeList ();
  filter = new NodeClassFilter (type);
  node.collectInto (ret, filter);
  return (ret.toNodeArray ());
}

/**
 * Extract all nodes matching the given filter.
 * @see Node#collectInto(NodeList, NodeFilter)
 * @param filter The filter to be applied to the nodes.
 * @throws ParserException If a parse error occurs.
 * @return A list of nodes matching the filter criteria,
 * i.e. for which the filter's accept method
 * returned <code>true</code>.
 */
public NodeList extractAllNodesThatMatch (NodeFilter filter)
  throws
    ParserException
{
  NodeIterator e;
  NodeList ret;
  ret = new NodeList ();
  for (e = elements (); e.hasMoreNodes (); )
    e.nextNode ().collectInto (ret, filter);
  return (ret);
}

/**
 * Extract all nodes matching the given filter.
 * @see Node#collectInto(NodeList, NodeFilter)
 * @param filter The filter to be applied to the nodes.
 * @throws ParserException If a parse error occurs.
 * @return A list of nodes matching the filter criteria,
 * i.e. for which the filter's accept method
 * returned <code>true</code>.
 */
public NodeList extractAllNodesThatMatch (NodeFilter filter)
  throws
    ParserException
{
  NodeIterator e;
  NodeList ret;
  ret = new NodeList ();
  for (e = elements (); e.hasMoreNodes (); )
    e.nextNode ().collectInto (ret, filter);
  return (ret);
}

  e.nextNode ().collectInto (list, filter);
if ((null != getEndTag ()) && (this != getEndTag ())) // 2nd guard handles <tag/>
  getEndTag ().collectInto (list, filter);

  node.collectInto (ret, filter);
else
  ret.add (node);

  node.collectInto (ret, filter);
else
  ret.add (node);

  e.nextNode ().collectInto (list, filter);
if ((null != getEndTag ()) && (this != getEndTag ())) // 2nd guard handles <tag/>
  getEndTag ().collectInto (list, filter);

Javadoc

Collect this node and its child nodes into a list, provided the node satisfies the filtering criteria.

This mechanism allows powerful filtering code to be written very easily, without bothering about collection of embedded tags separately. e.g. when we try to get all the links on a page, it is not possible to get it at the top-level, as many tags (like form tags), can contain links embedded in them. We could get the links out by checking if the current node is a org.htmlparser.tags.CompositeTag, and going through its children. So this method provides a convenient way to do this.

Using collectInto(), programs get a lot shorter. Now, the code to extract all links from a page would look like:

 
NodeList list = new NodeList (); 
NodeFilter filter = new TagNameFilter ("A"); 
for (NodeIterator e = parser.elements (); e.hasMoreNodes ();) 
e.nextNode ().collectInto (list, filter);

Thus, list will hold all the link nodes, irrespective of how deep the links are embedded.

Another way to accomplish the same objective is:

 
NodeList list = new NodeList (); 
NodeFilter filter = new TagClassFilter (LinkTag.class); 
for (NodeIterator e = parser.elements (); e.hasMoreNodes ();) 
e.nextNode ().collectInto (list, filter);

This is slightly less specific because the LinkTag class may be registered for more than one node name, e.g. <LINK> tags too.

Popular methods of Node

toHtml
Return the HTML for this node. This should be the exact sequence of characters that were encountered
getText
Returns the text of the node.
accept
Apply the visitor to this node.
getChildren
Get the children of this node.
getEndPosition
Gets the ending position of the node. This is the character (not byte) offset of the character follo
getStartPosition
Gets the starting position of the node. This is the character (not byte) offset of this node in the
setChildren
Set the children of this node.
setParent
Sets the parent of this node.
toPlainTextString
A string representation of the node. This is an important method, it allows a simple string transfor
clone
Allow cloning of nodes. Creates and returns a copy of this object. The precise meaning of "copy" may
doSemanticAction
Perform the meaning of this tag. This is defined by the tag, for example the bold tag may switch
getFirstChild
Get the first child of this node.

Popular in Java

Making http post requests using okhttp
onCreateOptionsMenu (Activity)
notifyDataSetChanged (ArrayAdapter)
setRequestProperty (URLConnection)
Runnable (java.lang)
Represents a command that can be executed. Often used to run code in a different Thread.
System (java.lang)
Provides access to system-related information and resources including standard input and output. Ena
ResultSet (java.sql)
An interface for an object which represents a database table entry, returned as the result of the qu
Handler (java.util.logging)
A Handler object accepts a logging request and exports the desired messages to a target, for example
JFileChooser (javax.swing)
Location (org.springframework.beans.factory.parsing)
Class that models an arbitrary location in a Resource.Typically used to track the location of proble
Top Vim plugins

How to use collectIntomethodin org.htmlparser.Node

Best Java code snippets using org.htmlparser.Node.collectInto (Showing top 8 results out of 315)

How to use
collectInto
method
in
org.htmlparser.Node