org.apache.nutch.crawl
Class DeduplicationJob.DedupReducer
- java.lang.Object
- org.apache.nutch.crawl.DeduplicationJob.DedupReducer
- All Implemented Interfaces:
- Closeable, AutoCloseable, org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.Reducer
- Enclosing class:
- DeduplicationJob
public static class DeduplicationJob.DedupReducer extends Object implements org.apache.hadoop.mapred.Reducer<org.apache.hadoop.io.BytesWritable,CrawlDatum,org.apache.hadoop.io.Text,CrawlDatum>
Constructor Summary
Constructors
Constructor and Description
DeduplicationJob.DedupReducer()
Method Summary
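Methods
Modifier and Type, Method and Description
void close()
void configure(org.apache.hadoop.mapred.JobConf arg0)
void reduce(org.apache.hadoop.io.BytesWritable key, Iterator<CrawlDatum> values, org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,CrawlDatum> output, org.apache.hadoop.mapred.Reporter reporter)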
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Constructor Detail
-
DeduplicationJob.DedupReducer
public DeduplicationJob.DedupReducer()
Method Detail
-
reduce
public void reduce(org.apache.hadoop.io.BytesWritable key, Iterator<CrawlDatum> values, org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,CrawlDatum> output, org.apache.hadoop.mapred.Reporter reporter) throws IOException
- Specified by:
- <code>reduce</code> in interface <code>org.apache.hadoop.mapred.Reducer<org.apache.hadoop.io.BytesWritable,CrawlDatum,org.apache.hadoop.io.Text,CrawlDatum></code>
- Throws:
- <code>IOException</code>
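The signature above tells us only that reduce receives every CrawlDatum sharing one BytesWritable key and may emit (Text, CrawlDatum) pairs. This page does not describe the selection logic, so the following is a minimal sketch under stated assumptions rather than the class's actual implementation. It assumes the key is a content signature, that the record with the highest score is kept, and that every other record is flagged and emitted under its URL. `Datum`, `Collector`, and `DedupReduceSketch` are hypothetical stand-ins so that the example runs without Hadoop or Nutch on the classpath.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class DedupReduceSketch {

    /** Hypothetical stand-in for CrawlDatum: only the fields this sketch needs. */
    static final class Datum {
        final String url;
        final float score;
        boolean duplicate; // assumed equivalent of a "duplicate" status

        Datum(String url, float score) {
            this.url = url;
            this.score = score;
        }
    }

    /** Hypothetical stand-in for OutputCollector<Text, CrawlDatum>. */
    interface Collector {
        void collect(String url, Datum datum);
    }

    /**
     * Mirrors the shape of reduce(key, values, output, reporter): all values
     * share one key. The keep-highest-score rule is an assumption made for
     * illustration, not something this page documents.
     */
    static void reduce(byte[] signature, Iterator<Datum> values, Collector output) {
        Datum kept = null;
        while (values.hasNext()) {
            Datum current = values.next();
            if (kept == null) {
                kept = current; // first record in the group is the initial survivor
                continue;
            }
            Datum loser;
            if (current.score > kept.score) {
                loser = kept;   // the newcomer wins, the previous survivor is a duplicate
                kept = current;
            } else {
                loser = current;
            }
            loser.duplicate = true;
            output.collect(loser.url, loser); // emit only the records flagged as duplicates
        }
    }

    public static void main(String[] args) {
        List<String> emitted = new ArrayList<>();
        List<Datum> group = Arrays.asList(
            new Datum("http://example.com/a", 1.0f),
            new Datum("http://example.com/b", 2.5f),
            new Datum("http://example.com/c", 0.5f));
        reduce(new byte[] {0x01}, group.iterator(), (url, d) -> emitted.add(url));
        System.out.println(emitted); // [http://example.com/a, http://example.com/c]
    }
}
```

With a real Hadoop iterator the value object may be reused between calls to next(), so an implementation following this pattern would likely need to copy the record it keeps instead of holding a reference to it.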
-
configure
public void configure(org.apache.hadoop.mapred.JobConf arg0)
- Specified by:
- <code>configure</code> in interface <code>org.apache.hadoop.mapred.JobConfigurable</code>
-
close
public void close() throws IOException
- Specified by:
- <code>close</code> in interface <code>Closeable</code>
- Specified by:
- <code>close</code> in interface <code>AutoCloseable</code>
- Throws:
- <code>IOException</code>