org.apache.nutch.crawl
Interfaces
- FetchSchedule
Classes
- AbstractFetchSchedule
- AdaptiveFetchSchedule
- CrawlDatum
- CrawlDatum.Comparator
- CrawlDb
- CrawlDbFilter
- CrawlDbMerger
- CrawlDbMerger.Merger
- CrawlDbReader
- CrawlDbReader.CrawlDatumCsvOutputFormat
- CrawlDbReader.CrawlDatumCsvOutputFormat.LineRecordWriter
- CrawlDbReader.CrawlDbDumpMapper
- CrawlDbReader.CrawlDbStatCombiner
- CrawlDbReader.CrawlDbStatMapper
- CrawlDbReader.CrawlDbStatReducer
- CrawlDbReader.CrawlDbTopNMapper
- CrawlDbReader.CrawlDbTopNReducer
- CrawlDbReducer
- DeduplicationJob
- DeduplicationJob.DBFilter
- DeduplicationJob.DedupReducer
- DeduplicationJob.StatusUpdateReducer
- DefaultFetchSchedule
- FetchScheduleFactory
- Generator
- Generator.CrawlDbUpdater
- Generator.DecreasingFloatComparator
- Generator.GeneratorOutputFormat
- Generator.HashComparator
- Generator.PartitionReducer
- Generator.Selector
- Generator.SelectorEntry
- Generator.SelectorInverseMapper
- Injector
- Injector.InjectMapper
- Injector.InjectReducer
- Inlink
- Inlinks
- LinkDb
- LinkDbFilter
- LinkDbMerger
- LinkDbReader
- MapWritable
- MD5Signature
- MimeAdaptiveFetchSchedule
- NutchWritable
- Signature
- SignatureComparator
- SignatureFactory
- TextProfileSignature
- URLPartitioner