Uses of Class
org.apache.nutch.parse.HTMLMetaTags
Packages that use HTMLMetaTags (package: description)
org.apache.nutch.analysis.lang: Text document language identifier.
org.apache.nutch.microformats.reltag: A microformats Rel-Tag Parser/Indexer/Querier plugin.
org.apache.nutch.parse: The Parse interface and related classes.
org.apache.nutch.parse.headings: Parse filter to extract headings (h1, h2, etc.) from DOM parse tree.
org.apache.nutch.parse.html: An HTML document parsing plugin.
org.apache.nutch.parse.js: Parser and parse filter plugin to extract all (possible) links from JavaScript files and embedded JavaScript code snippets.
org.apache.nutch.parse.metatags: Parse filter to extract meta tags: keywords, description, etc.
org.apache.nutch.parse.tika: Parse various document formats with help of Apache Tika.
org.creativecommons.nutch: Sample plugins that parse and index Creative Commons metadata.
Uses of HTMLMetaTags in org.apache.nutch.analysis.lang
Methods in org.apache.nutch.analysis.lang with parameters of type HTMLMetaTags
Modifier and Type: ParseResult
Method: HTMLLanguageParser.filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Description: Scan the HTML document looking at possible indications of content language.
Uses of HTMLMetaTags in org.apache.nutch.microformats.reltag
Methods in org.apache.nutch.microformats.reltag with parameters of type HTMLMetaTags
Modifier and Type: ParseResult
Method: RelTagParser.filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Description: Scan the HTML document looking at possible rel-tags.
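As an illustration of what scanning a document for rel-tags can involve, the following is a minimal sketch that uses only JDK DOM classes. It is not the RelTagParser implementation: the class RelTagSketch and its methods are hypothetical, the input is assumed to be well-formed XHTML, and the sketch only collects the href of each anchor whose rel attribute contains the token "tag".

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;

/** Hypothetical helper, not part of Nutch: collects hrefs of rel="tag" anchors. */
final class RelTagSketch {

    /** Recursively visits the subtree under node and appends matching hrefs. */
    static void collect(Node node, List<String> hrefs) {
        if (node.getNodeType() == Node.ELEMENT_NODE
                && "a".equalsIgnoreCase(node.getNodeName())) {
            Element anchor = (Element) node;
            // rel holds a whitespace-separated list of tokens, e.g. "nofollow tag".
            List<String> relTokens = Arrays.asList(
                    anchor.getAttribute("rel").toLowerCase(Locale.ROOT).trim().split("\\s+"));
            String href = anchor.getAttribute("href");
            if (relTokens.contains("tag") && !href.isEmpty()) {
                hrefs.add(href);
            }
        }
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            collect(child, hrefs);
        }
    }

    /** Parses a well-formed XHTML string (an assumption of this sketch) and scans it. */
    static List<String> fromXhtml(String xhtml) {
        try {
            Node root = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xhtml.getBytes(StandardCharsets.UTF_8)))
                    .getDocumentElement();
            List<String> hrefs = new ArrayList<>();
            collect(root, hrefs);
            return hrefs;
        } catch (Exception e) {
            throw new IllegalArgumentException("input is not well-formed XHTML", e);
        }
    }
}
```

A real filter would receive the DocumentFragment passed to filter(...) rather than parsing a string, and would presumably record the tags on the ParseResult; both steps are left out here.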
Uses of HTMLMetaTags in org.apache.nutch.parse
Methods in org.apache.nutch.parse with parameters of type HTMLMetaTags
Modifier and Type: ParseResult
Method: HtmlParseFilters.filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Description: Run all defined filters.
Modifier and Type: ParseResult
Method: HtmlParseFilter.filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Description: Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page.
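The two methods above share one signature: a single filter takes a parse result and returns a (possibly modified) parse result, and HtmlParseFilters runs all defined filters. The sketch below shows that chaining pattern in isolation. It is not Nutch code: SimpleHtmlFilter and FilterChainSketch are hypothetical names, a plain Map stands in for ParseResult, and the Content, HTMLMetaTags and DocumentFragment arguments are omitted. How Nutch orders filters or handles a failed parse is not covered.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical, simplified analogue of a single parse filter. */
interface SimpleHtmlFilter {
    /** Receives the current parse metadata and returns the metadata to pass on. */
    Map<String, String> filter(String url, Map<String, String> parseMeta);
}

/** Hypothetical analogue of a "run all defined filters" helper. */
final class FilterChainSketch {
    private final List<SimpleHtmlFilter> filters = new ArrayList<>();

    /** Registers a filter; filters run in registration order. */
    FilterChainSketch add(SimpleHtmlFilter filter) {
        filters.add(filter);
        return this;
    }

    /** Feeds the output of each filter into the next one. */
    Map<String, String> filter(String url, Map<String, String> parseMeta) {
        Map<String, String> current = parseMeta;
        for (SimpleHtmlFilter f : filters) {
            current = f.filter(url, current);
        }
        return current;
    }

    /** Small usage example: two filters that each add one metadata entry. */
    static Map<String, String> demo() {
        FilterChainSketch chain = new FilterChainSketch()
                .add((url, meta) -> { meta.put("source", url); return meta; })
                .add((url, meta) -> { meta.put("length", String.valueOf(url.length())); return meta; });
        return chain.filter("http://example.org/", new LinkedHashMap<>());
    }
}
```

Returning the result from each filter, rather than mutating a shared object only, lets a filter replace the result entirely, which matches the ParseResult-in, ParseResult-out shape of the documented signature.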
Uses of HTMLMetaTags in org.apache.nutch.parse.headings
Methods in org.apache.nutch.parse.headings with parameters of type HTMLMetaTags
Modifier and Type: ParseResult
Method: HeadingsParseFilter.filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Uses of HTMLMetaTags in org.apache.nutch.parse.html
Methods in org.apache.nutch.parse.html with parameters of type HTMLMetaTags
Modifier and Type: static void
Method: HTMLMetaProcessor.getMetaTags(HTMLMetaTags metaTags, Node node, URL currURL)
Description: Sets the indicators in robotsMeta to appropriate values, based on any META tags found under the given node.
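To make "sets the indicators based on any META tags found under the given node" more concrete, here is a minimal sketch using only JDK DOM classes. It is not the HTMLMetaProcessor implementation: RobotsFlags is a hypothetical stand-in for HTMLMetaTags that holds just two flags, the currURL parameter is ignored, the input is assumed to be well-formed XHTML, and only the robots directives noindex, nofollow and none are recognized.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;

/** Hypothetical stand-in for HTMLMetaTags: holds only two robots indicators. */
final class RobotsFlags {
    boolean noIndex;
    boolean noFollow;
}

/** Hypothetical helper, not part of Nutch. */
final class RobotsMetaSketch {

    /** Recursively visits the subtree under node and records robots directives. */
    static void scan(RobotsFlags flags, Node node) {
        if (node.getNodeType() == Node.ELEMENT_NODE
                && "meta".equalsIgnoreCase(node.getNodeName())) {
            Element meta = (Element) node;
            if ("robots".equalsIgnoreCase(meta.getAttribute("name"))) {
                // content is a comma-separated list, e.g. "noindex, follow".
                String content = meta.getAttribute("content").toLowerCase(Locale.ROOT);
                for (String raw : content.split(",")) {
                    String directive = raw.trim();
                    if (directive.equals("noindex") || directive.equals("none")) {
                        flags.noIndex = true;
                    }
                    if (directive.equals("nofollow") || directive.equals("none")) {
                        flags.noFollow = true;
                    }
                }
            }
        }
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            scan(flags, child);
        }
    }

    /** Parses a well-formed XHTML string (an assumption of this sketch) and scans it. */
    static RobotsFlags scanXhtml(String xhtml) {
        try {
            Node root = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xhtml.getBytes(StandardCharsets.UTF_8)))
                    .getDocumentElement();
            RobotsFlags flags = new RobotsFlags();
            scan(flags, root);
            return flags;
        } catch (Exception e) {
            throw new IllegalArgumentException("input is not well-formed XHTML", e);
        }
    }
}
```

As in the documented method, the flags object is passed in and mutated rather than returned, so a caller can pre-populate defaults before the scan.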
Uses of HTMLMetaTags in org.apache.nutch.parse.js
Methods in org.apache.nutch.parse.js with parameters of type HTMLMetaTags
Modifier and Type: ParseResult
Method: JSParseFilter.filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Uses of HTMLMetaTags in org.apache.nutch.parse.metatags
Methods in org.apache.nutch.parse.metatags with parameters of type HTMLMetaTags
Modifier and Type: ParseResult
Method: MetaTagsParser.filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Uses of HTMLMetaTags in org.apache.nutch.parse.tika
Methods in org.apache.nutch.parse.tika with parameters of type HTMLMetaTags
Modifier and Type: static void
Method: HTMLMetaProcessor.getMetaTags(HTMLMetaTags metaTags, Node node, URL currURL)
Description: Sets the indicators in robotsMeta to appropriate values, based on any META tags found under the given node.
Uses of HTMLMetaTags in org.creativecommons.nutch
Methods in org.creativecommons.nutch with parameters of type HTMLMetaTags
Modifier and Type: ParseResult
Method: CCParseFilter.filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Description: Adds metadata or otherwise modifies a parse of an HTML document, given the DOM tree of a page.