[TOC]
org.apache.nutch.parse
Class HtmlParseFilters
- java.lang.Object
- org.apache.nutch.parse.HtmlParseFilters
public class HtmlParseFilters extends Object
Creates and caches HtmlParseFilter
implementing plugins.
Field Summary
Fields Modifier and Type Field and Description static String
HTMLPARSEFILTER_ORDER
Constructor Summary
Constructors Constructor and Description HtmlParseFilters(org.apache.hadoop.conf.Configuration conf)
Method Summary
Methods Modifier and Type Method and Description ParseResult
filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Run all defined filters.
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Field Detail
-
HTMLPARSEFILTER_ORDER
public static final String HTMLPARSEFILTER_ORDER
- See Also:
- [Constant Field Values](../../../../constant-values.html#org.apache.nutch.parse.HtmlParseFilters.HTMLPARSEFILTER_ORDER)
Constructor Detail
-
HtmlParseFilters
public HtmlParseFilters(org.apache.hadoop.conf.Configuration conf)
Method Detail
-
filter
public ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Run all defined filters.