
org.apache.nutch.urlfilter.prefix

Class PrefixURLFilter


public class PrefixURLFilter
extends Object
implements URLFilter

Filters URLs based on a file of URL prefixes. The file is named by (1) the property "urlfilter.prefix.file" in ./conf/nutch-default.xml, and (2) the attribute "file" in the plugin.xml of this plugin. The attribute "file" takes precedence if it is defined. The file format is one URL prefix per line.
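To make the "one URL prefix per line" format concrete, here is a minimal, self-contained sketch of prefix-based filtering. It is not the Nutch implementation: the class name `PrefixFilterSketch` is hypothetical, skipping blank lines and `#` comment lines is an assumption, and returning the URL on a match and `null` otherwise is assumed to mirror the usual `URLFilter` convention.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for prefix-based URL filtering; not the Nutch class. */
final class PrefixFilterSketch {
    private final List<String> prefixes = new ArrayList<>();

    /** Parses rules given as one URL prefix per line. */
    PrefixFilterSketch(String rules) {
        for (String line : rules.split("\\R")) {
            String trimmed = line.trim();
            // Assumption: blank lines and '#' comment lines are ignored.
            if (trimmed.isEmpty() || trimmed.startsWith("#")) {
                continue;
            }
            prefixes.add(trimmed);
        }
    }

    /** Returns the URL if it starts with any configured prefix, otherwise null. */
    String filter(String url) {
        for (String prefix : prefixes) {
            if (url.startsWith(prefix)) {
                return url;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        PrefixFilterSketch f = new PrefixFilterSketch(
            "# allowed sites\nhttp://example.org/docs/\nhttps://example.com/\n");
        System.out.println(f.filter("http://example.org/docs/index.html")); // accepted
        System.out.println(f.filter("http://example.org/blog/"));           // rejected
    }
}
```

A linear scan keeps the sketch short; a real filter with many prefixes would more likely use a trie or similar structure for matching.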

Field Summary

Fields inherited from interface org.apache.nutch.net.URLFilter

X_POINT_ID

Constructor Summary

Constructors

Constructor and Description
PrefixURLFilter()
PrefixURLFilter(String stringRules)

Method Summary

Methods

Modifier and Type                       Method and Description
String                                  filter(String url)
org.apache.hadoop.conf.Configuration    getConf()
static void                             main(String[] args)
void                                    setConf(org.apache.hadoop.conf.Configuration conf)

Methods inherited from class java.lang.Object

clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Constructor Detail

PrefixURLFilter

public PrefixURLFilter()
                throws IOException
Throws:
    IOException

PrefixURLFilter

public PrefixURLFilter(String stringRules)
                throws IOException
Throws:
    IOException

Method Detail

filter

public String filter(String url)
Specified by:
    filter in interface URLFilter

main

public static void main(String[] args)
                 throws IOException
Throws:
    IOException

setConf

public void setConf(org.apache.hadoop.conf.Configuration conf)
Specified by:
    setConf in interface org.apache.hadoop.conf.Configurable

getConf

public org.apache.hadoop.conf.Configuration getConf()
Specified by:
    getConf in interface org.apache.hadoop.conf.Configurable
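
The class description states that the attribute "file" in plugin.xml takes precedence over the property "urlfilter.prefix.file". The following is a minimal sketch of that precedence rule only, assuming the file name is resolved when the configuration is applied. `PrefixFileResolver`, its `resolve` method, and the use of a plain `Map` in place of a Hadoop `Configuration` are all hypothetical, and the file names in the usage note are illustrative.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical helper illustrating the documented precedence; not part of Nutch. */
final class PrefixFileResolver {
    static final String PROPERTY = "urlfilter.prefix.file";

    /**
     * @param pluginAttributeFile value of attribute "file" from plugin.xml, or null if undefined
     * @param properties          stand-in for the configuration properties
     * @return the rules file name to use, or null if neither source defines one
     */
    static String resolve(String pluginAttributeFile, Map<String, String> properties) {
        // The plugin.xml attribute wins whenever it is defined and non-empty.
        if (pluginAttributeFile != null && !pluginAttributeFile.trim().isEmpty()) {
            return pluginAttributeFile.trim();
        }
        // Otherwise fall back to the configuration property.
        return properties.get(PROPERTY);
    }

    public static void main(String[] args) {
        Map<String, String> props = new HashMap<>();
        props.put(PROPERTY, "from-property.txt");
        System.out.println(resolve("from-attribute.txt", props));
        System.out.println(resolve(null, props));
    }
}
```

With both sources defined, the sketch returns the attribute value; with the attribute missing, it returns the property value.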

Copyright © 2014 The Apache Software Foundation