org.apache.nutch.crawl
Class DeduplicationJob.DedupReducer
- java.lang.Object
- org.apache.nutch.crawl.DeduplicationJob.DedupReducer
- All Implemented Interfaces:
- Closeable, AutoCloseable, org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.Reducer
- Enclosing class:
- DeduplicationJob
public static class DeduplicationJob.DedupReducer extends Object implements org.apache.hadoop.mapred.Reducer<org.apache.hadoop.io.BytesWritable,CrawlDatum,org.apache.hadoop.io.Text,CrawlDatum>
Constructor Summary
Constructor and Description
- DeduplicationJob.DedupReducer()
Method Summary
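Modifier and Type, Method and Description
- void close()
- void configure(org.apache.hadoop.mapred.JobConf arg0)
- void reduce(org.apache.hadoop.io.BytesWritable key, Iterator<CrawlDatum> values, org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,CrawlDatum> output, org.apache.hadoop.mapred.Reporter reporter)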
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Constructor Detail
DeduplicationJob.DedupReducer
public DeduplicationJob.DedupReducer()
Method Detail
reduce
public void reduce(org.apache.hadoop.io.BytesWritable key,
Iterator<CrawlDatum> values,
org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,CrawlDatum> output,
org.apache.hadoop.mapred.Reporter reporter)
throws IOException
- Specified by:
- <code>reduce</code> in interface <code>org.apache.hadoop.mapred.Reducer&lt;org.apache.hadoop.io.BytesWritable,CrawlDatum,org.apache.hadoop.io.Text,CrawlDatum&gt;</code>
- Throws:
- <code>IOException</code>
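The signature above groups every CrawlDatum that shares one BytesWritable key and emits Text/CrawlDatum pairs. This page does not describe how the reducer chooses among them, so the following is only a minimal sketch of that shape, using plain Java rather than Hadoop types. `Page`, `duplicatesOf`, and the "highest score wins" rule are all hypothetical stand-ins assumed for illustration, not the class's documented behavior.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class DedupSketch {

    /** Hypothetical stand-in for CrawlDatum: only a URL and a score. */
    static final class Page {
        final String url;
        final float score;

        Page(String url, float score) {
            this.url = url;
            this.score = score;
        }
    }

    /**
     * Mirrors the shape of reduce(key, values, output, reporter): all values
     * are assumed to share one key, and the returned list plays the role of
     * the output collector. Assumed rule: the highest score is kept and every
     * other URL is reported as a duplicate.
     */
    static List<String> duplicatesOf(Iterator<Page> values) {
        List<String> duplicates = new ArrayList<>();
        Page best = null;
        while (values.hasNext()) {
            Page current = values.next();
            if (best == null) {
                best = current;               // first record seen for this key
            } else if (current.score > best.score) {
                duplicates.add(best.url);     // previous best loses
                best = current;
            } else {
                duplicates.add(current.url);  // current record loses
            }
        }
        return duplicates;
    }

    public static void main(String[] args) {
        List<Page> group = Arrays.asList(
            new Page("http://example.com/a", 0.5f),
            new Page("http://example.com/b", 0.9f),
            new Page("http://example.com/c", 0.1f));
        // prints [http://example.com/a, http://example.com/c]
        System.out.println(duplicatesOf(group.iterator()));
    }
}
```

Consuming the values through a single pass over the Iterator, as here, matches how the old `org.apache.hadoop.mapred` API hands grouped values to a reducer; the real class would write to the OutputCollector instead of returning a list.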
configure
public void configure(org.apache.hadoop.mapred.JobConf arg0)
- Specified by:
- <code>configure</code> in interface <code>org.apache.hadoop.mapred.JobConfigurable</code>
close
public void close()
throws IOException
- Specified by:
- <code>close</code> in interface <code>Closeable</code>
- Specified by:
- <code>close</code> in interface <code>AutoCloseable</code>
- Throws:
- <code>IOException</code>
