[TOC]
org.apache.nutch.crawl
Class DeduplicationJob.DBFilter
- java.lang.Object
- org.apache.nutch.crawl.DeduplicationJob.DBFilter
- All Implemented Interfaces:
- Closeable, AutoCloseable, org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.Mapper
- Enclosing class:
- DeduplicationJob
public static class DeduplicationJob.DBFilter extends Object implements org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.Text,CrawlDatum,org.apache.hadoop.io.BytesWritable,CrawlDatum>
Constructor Summary
Constructors Constructor and Description DeduplicationJob.DBFilter()
Method Summary
Methods Modifier and Type Method and Description void
close()
void
configure(org.apache.hadoop.mapred.JobConf arg0)
void
map(org.apache.hadoop.io.Text key,
CrawlDatum value,
org.apache.hadoop.mapred.OutputCollector
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Constructor Detail
-
DeduplicationJob.DBFilter
public DeduplicationJob.DBFilter()
Method Detail
-
configure
public void configure(org.apache.hadoop.mapred.JobConf arg0)
- Specified by:
- <code>configure</code> in interface <code>org.apache.hadoop.mapred.JobConfigurable</code>
-
close
public void close() throws IOException
- Specified by:
- <code>close</code> in interface <code>Closeable</code>
- Specified by:
- <code>close</code> in interface <code>AutoCloseable</code>
- Throws:
- <code>IOException</code>
-
map
public void map(org.apache.hadoop.io.Text key, CrawlDatum value, org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.BytesWritable,CrawlDatum> output, org.apache.hadoop.mapred.Reporter reporter) throws IOException
- Specified by:
- <code>map</code> in interface <code>org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.text,crawldatum,org.apache.hadoop.io.byteswritable,crawldatum></org.apache.hadoop.io.text,crawldatum,org.apache.hadoop.io.byteswritable,crawldatum></code>
- Throws:
- <code>IOException</code>