- Prev
- Next
Uses of Class
org.apache.nutch.metadata.MetaWrapper
Packages that use MetaWrapper Package Description org.apache.nutch.segment
A segment stores all data from on generate/fetch/update cycle: fetch list, protocol status, raw content, parsed content, and extracted outgoing links.
Uses of MetaWrapper in org.apache.nutch.segment
Methods in org.apache.nutch.segment that return types with arguments of type MetaWrapper Modifier and Type Method and Description org.apache.hadoop.mapred.RecordReader
SegmentMerger.ObjectInputFormat.getRecordReader(org.apache.hadoop.mapred.InputSplit split,
org.apache.hadoop.mapred.JobConf job,
org.apache.hadoop.mapred.Reporter reporter)
org.apache.hadoop.mapred.RecordWriter
SegmentMerger.SegmentOutputFormat.getRecordWriter(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.mapred.JobConf job,
String name,
org.apache.hadoop.util.Progressable progress)
Methods in org.apache.nutch.segment with parameters of type MetaWrapper Modifier and Type Method and Description void
SegmentMerger.map(org.apache.hadoop.io.Text key,
MetaWrapper value,
org.apache.hadoop.mapred.OutputCollector
Method parameters in org.apache.nutch.segment with type arguments of type MetaWrapper Modifier and Type Method and Description void
SegmentMerger.map(org.apache.hadoop.io.Text key,
MetaWrapper value,
org.apache.hadoop.mapred.OutputCollector
void
SegmentMerger.reduce(org.apache.hadoop.io.Text key,
Iterator
NOTE: in selecting the latest version we rely exclusively on the segment name (not all segment data contain time information).
void
SegmentMerger.reduce(org.apache.hadoop.io.Text key,
Iterator
NOTE: in selecting the latest version we rely exclusively on the segment name (not all segment data contain time information).
- Prev
- Next