
Uses of Interface

org.apache.nutch.parse.HtmlParseFilter

Uses of HtmlParseFilter in org.apache.nutch.analysis.lang

Classes in org.apache.nutch.analysis.lang that implement HtmlParseFilter

  Modifier and Type   Class and Description
  class               HTMLLanguageParser

Uses of HtmlParseFilter in org.apache.nutch.microformats.reltag

Classes in org.apache.nutch.microformats.reltag that implement HtmlParseFilter

  Modifier and Type   Class and Description
  class               RelTagParser
                      Adds microformat rel-tags of document if found.

Uses of HtmlParseFilter in org.apache.nutch.parse.headings

Classes in org.apache.nutch.parse.headings that implement HtmlParseFilter

  Modifier and Type   Class and Description
  class               HeadingsParseFilter
                      HtmlParseFilter to retrieve h1 and h2 values from the DOM.
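
To illustrate what retrieving h1 and h2 values from a DOM can look like, here is a minimal, self-contained sketch that uses only the JDK's org.w3c.dom API. It is not the Nutch implementation and does not implement the HtmlParseFilter interface; the class name HeadingsSketch and its methods are hypothetical, and the input is assumed to be well-formed XHTML so the JDK's XML parser can read it.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration only: not part of Nutch.
class HeadingsSketch {

  // Parse a well-formed XHTML string into a DOM document (assumption: input is valid XML).
  static Document parse(String xhtml) throws Exception {
    return DocumentBuilderFactory.newInstance()
        .newDocumentBuilder()
        .parse(new InputSource(new StringReader(xhtml)));
  }

  // Collect the text of every h1 and h2 element, in document order.
  static List<String> extractHeadings(Node root) {
    List<String> headings = new ArrayList<>();
    collect(root, headings);
    return headings;
  }

  // Depth-first walk; a heading's text is taken whole, so its children are not revisited.
  private static void collect(Node node, List<String> out) {
    if (node.getNodeType() == Node.ELEMENT_NODE) {
      String name = node.getNodeName();
      if ("h1".equalsIgnoreCase(name) || "h2".equalsIgnoreCase(name)) {
        out.add(node.getTextContent().trim());
        return;
      }
    }
    NodeList children = node.getChildNodes();
    for (int i = 0; i < children.getLength(); i++) {
      collect(children.item(i), out);
    }
  }

  public static void main(String[] args) throws Exception {
    Document doc = parse(
        "<html><body><h1>Title</h1><p>text</p><h2>Section</h2></body></html>");
    System.out.println(extractHeadings(doc)); // prints [Title, Section]
  }
}
```

A real parse filter would receive the DOM from the parser plugin and write the collected values into the parse metadata rather than returning a list.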

Uses of HtmlParseFilter in org.apache.nutch.parse.js

Classes in org.apache.nutch.parse.js that implement HtmlParseFilter

  Modifier and Type   Class and Description
  class               JSParseFilter
                      This class is a heuristic link extractor for JavaScript files and code snippets.

Uses of HtmlParseFilter in org.apache.nutch.parse.metatags

Classes in org.apache.nutch.parse.metatags that implement HtmlParseFilter

  Modifier and Type   Class and Description
  class               MetaTagsParser
                      Parse HTML meta tags (keywords, description) and store them in the parse metadata so that they can be indexed with the index-metadata plugin with the prefix 'metatag.'.
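
The idea of storing meta tags under a 'metatag.' prefix can be sketched with the JDK's DOM API alone. The following is a hypothetical illustration, not the MetaTagsParser source: the class MetaTagsSketch, the use of a plain Map in place of Nutch's parse metadata, and the lowercasing of tag names are all assumptions made for this example. The input is assumed to be well-formed XHTML with lowercase meta element names.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration only: not part of Nutch.
class MetaTagsSketch {

  static final String PREFIX = "metatag.";

  // Parse a well-formed XHTML string into a DOM document (assumption: input is valid XML).
  static Document parse(String xhtml) throws Exception {
    return DocumentBuilderFactory.newInstance()
        .newDocumentBuilder()
        .parse(new InputSource(new StringReader(xhtml)));
  }

  // Map each <meta name="..." content="..."> to "metatag.<name>" -> content.
  // A plain Map stands in for the parse metadata here (assumption).
  static Map<String, String> extractMetaTags(Document doc) {
    Map<String, String> metadata = new LinkedHashMap<>();
    NodeList metas = doc.getElementsByTagName("meta");
    for (int i = 0; i < metas.getLength(); i++) {
      Element meta = (Element) metas.item(i);
      String name = meta.getAttribute("name");       // "" when the attribute is absent
      String content = meta.getAttribute("content"); // "" when the attribute is absent
      if (!name.isEmpty() && !content.isEmpty()) {
        // Lowercasing the name is a choice of this sketch, not a documented behavior.
        metadata.put(PREFIX + name.toLowerCase(Locale.ROOT), content);
      }
    }
    return metadata;
  }

  public static void main(String[] args) throws Exception {
    Document doc = parse(
        "<html><head>"
            + "<meta name=\"Keywords\" content=\"nutch, crawler\"/>"
            + "<meta name=\"description\" content=\"A demo page\"/>"
            + "</head><body/></html>");
    extractMetaTags(doc).forEach((k, v) -> System.out.println(k + " = " + v));
  }
}
```

Keeping the prefix in one constant makes it easy for a downstream indexing step to select exactly these entries by key.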

Uses of HtmlParseFilter in org.creativecommons.nutch

Classes in org.creativecommons.nutch that implement HtmlParseFilter

  Modifier and Type   Class and Description
  class               CCParseFilter
                      Adds metadata identifying the Creative Commons license used, if any.


Copyright © 2014 The Apache Software Foundation