[TOC]

org.apache.nutch.parse

Class HtmlParseFilters


public class HtmlParseFilters
extends Object

Creates and caches HtmlParseFilter implementing plugins.

Field Summary

Fields Modifier and Type Field and Description static String HTMLPARSEFILTER_ORDER

Constructor Summary

Constructors Constructor and Description HtmlParseFilters(org.apache.hadoop.conf.Configuration conf)

Method Summary

Methods Modifier and Type Method and Description ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc) Run all defined filters.

-    

Methods inherited from class java.lang.Object

clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Field Detail

-  

HTMLPARSEFILTER_ORDER

public static final String HTMLPARSEFILTER_ORDER
  - See Also:
  - [Constant Field Values](../../../../constant-values.html#org.apache.nutch.parse.HtmlParseFilters.HTMLPARSEFILTER_ORDER)       

Constructor Detail

-  

HtmlParseFilters

public HtmlParseFilters(org.apache.hadoop.conf.Configuration conf)

Method Detail

-  

filter

public ParseResult filter(Content content,
                 ParseResult parseResult,
                 HTMLMetaTags metaTags,
                 DocumentFragment doc)

Run all defined filters.

Copyright © 2014 The Apache Software Foundation