- Prev Class
- Next Class
org.apache.nutch.urlfilter.prefix
Class PrefixURLFilter
- java.lang.Object
- org.apache.nutch.urlfilter.prefix.PrefixURLFilter
public class PrefixURLFilter extends Object implements URLFilter
Filters URLs based on a file of URL prefixes. The file is named by (1) property "urlfilter.prefix.file" in ./conf/nutch-default.xml, and (2) attribute "file" in plugin.xml of this plugin Attribute "file" has higher precedence if defined. The format of this file is one URL prefix per line.
Field Summary
-
Fields inherited from interface org.apache.nutch.net.URLFilter
X_POINT_ID
Constructor Summary
Constructors Constructor and Description PrefixURLFilter()
PrefixURLFilter(String stringRules)
Method Summary
Methods Modifier and Type Method and Description String
filter(String url)
org.apache.hadoop.conf.Configuration
getConf()
static void
main(String[] args)
void
setConf(org.apache.hadoop.conf.Configuration conf)
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Constructor Detail
-
PrefixURLFilter
public PrefixURLFilter() throws IOException
- Throws:
- <code>IOException</code>
-
PrefixURLFilter
public PrefixURLFilter(String stringRules) throws IOException
- Throws:
- <code>IOException</code>
Method Detail
-
filter
public String filter(String url)
- Specified by:
- <code>filter</code> in interface <code>URLFilter</code>
-
main
public static void main(String[] args) throws IOException
- Throws:
- <code>IOException</code>
-
setConf
public void setConf(org.apache.hadoop.conf.Configuration conf)
- Specified by:
- <code>setConf</code> in interface <code>org.apache.hadoop.conf.Configurable</code>
-
getConf
public org.apache.hadoop.conf.Configuration getConf()
- Specified by:
- <code>getConf</code> in interface <code>org.apache.hadoop.conf.Configurable</code>
- Prev Class
- Next Class