org.apache.nutch.parse.html
Class HTMLMetaProcessor
- java.lang.Object
 - org.apache.nutch.parse.html.HTMLMetaProcessor
 
public class HTMLMetaProcessor extends Object
Class for parsing META Directives from DOM trees. This class handles specifically Robots META directives (all, none, nofollow, noindex), finding BASE HREF tags, and HTTP-EQUIV no-cache instructions. All meta directives are stored in a HTMLMetaTags instance.
Constructor Summary
 Constructors   Constructor and Description   HTMLMetaProcessor()   
Method Summary
 Methods   Modifier and Type Method and Description   static void getMetaTags(HTMLMetaTags metaTags,
           Node node,
           URL currURL) 
Sets the indicators in robotsMeta to appropriate values, based on any META tags found under the given node.
-    
Methods inherited from class java.lang.Object
 clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait     
Constructor Detail
-  
HTMLMetaProcessor
public HTMLMetaProcessor()
Method Detail
-  
getMetaTags
public static final void getMetaTags(HTMLMetaTags metaTags, Node node, URL currURL)
Sets the indicators in robotsMeta to appropriate values, based on any META tags found under the given node.
