- Prev Class
- Next Class
org.apache.nutch.microformats.reltag
Class RelTagParser
- java.lang.Object
- org.apache.nutch.microformats.reltag.RelTagParser
- All Implemented Interfaces:
- org.apache.hadoop.conf.Configurable, HtmlParseFilter, Pluggable
public class RelTagParser extends Object implements HtmlParseFilter
Adds microformat rel-tags of document if found.
- See Also:
- http://www.microformats.org/wiki/rel-tag
Field Summary
Fields Modifier and Type Field and Description static org.slf4j.Logger
LOG
static String
REL_TAG
-
Fields inherited from interface org.apache.nutch.parse.HtmlParseFilter
X_POINT_ID
Constructor Summary
Constructors Constructor and Description RelTagParser()
Method Summary
Methods Modifier and Type Method and Description ParseResult
filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Scan the HTML document looking at possible rel-tags
org.apache.hadoop.conf.Configuration
getConf()
void
setConf(org.apache.hadoop.conf.Configuration conf)
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Field Detail
-
LOG
public static final org.slf4j.Logger LOG
-
REL_TAG
public static final String REL_TAG
- See Also:
- [Constant Field Values](../../../../../constant-values.html#org.apache.nutch.microformats.reltag.RelTagParser.REL_TAG)
Constructor Detail
-
RelTagParser
public RelTagParser()
Method Detail
-
filter
public ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Scan the HTML document looking at possible rel-tags
- Specified by:
- <code>filter</code> in interface <code>HtmlParseFilter</code>
-
setConf
public void setConf(org.apache.hadoop.conf.Configuration conf)
- Specified by:
- <code>setConf</code> in interface <code>org.apache.hadoop.conf.Configurable</code>
-
getConf
public org.apache.hadoop.conf.Configuration getConf()
- Specified by:
- <code>getConf</code> in interface <code>org.apache.hadoop.conf.Configurable</code>
- Prev Class
- Next Class