nutch 中文文档帮助手册教程

白天 夜间 首页 下载 阅读记录
  我的书签   添加书签   移除书签

org.apache.nutch.urlfilter.validator

来源 CrawlScript 浏览 118 扫码 分享 2022-05-03 16:21:17
  • org.apache.nutch.urlfilter.validator
    • Classes

    org.apache.nutch.urlfilter.validator

    Classes

    • UrlValidator

    若有收获,就点个赞吧

    0 人点赞

    上一篇:
    下一篇:
    • 书签
    • 添加书签 移除书签
    • Package org.apache.nutch.analysis.lang
    • nutcher
    • 此中文注释由社区”Nutch开发者” nutcher.org提供,作者是”逼格DATA”,未经允许,禁止转载
    • Nutch教程——导入Nutch工程,执行完整爬取
    • Nutch教程——URLNormalizer源码详解 by 逼格DATA
    • Nutch教程——URLNormalizer源码详解
    • Deprecated API
    • org.apache.nutch.analysis.lang
    • Uses of Packageorg.apache.nutch.analysis.lang
    • org.apache.nutch.collection
    • Uses of Packageorg.apache.nutch.collection
    • Class CrawlDatum.Comparator
    • Class CrawlDatum
    • Class CrawlDb
    • Class CrawlDbFilter
    • Class CrawlDbMerger.Merger
    • Class CrawlDbMerger
    • Class CrawlDbReader.CrawlDbDumpMapper
    • Class CrawlDbReader.CrawlDbStatMapper
    • Class CrawlDbReader.CrawlDbStatReducer
    • Class CrawlDbReader.CrawlDbTopNMapper
    • Class CrawlDbReader.CrawlDbTopNReducer
    • Class CrawlDbReader
    • Class CrawlDbReducer
    • Class DeduplicationJob.DBFilter
    • Class DeduplicationJob.DedupReducer
    • Class DeduplicationJob
    • Class DefaultFetchSchedule
    • Interface FetchSchedule
    • Class FetchScheduleFactory
    • Class Generator.CrawlDbUpdater
    • Class Generator.GeneratorOutputFormat
    • Class Generator.HashComparator
    • Class Generator.PartitionReducer
    • Class Generator.Selector
    • Class Generator.SelectorEntry
    • Class Generator.SelectorInverseMapper
    • Class Generator
    • Class Injector.InjectMapper
    • Class Injector.InjectReducer
    • Class Injector
    • Class Inlink
    • Class Inlinks
    • Class LinkDb
    • Class LinkDbFilter
    • Class LinkDbMerger
    • Class LinkDbReader
    • Class MD5Signature
    • Class MapWritable
    • Class MimeAdaptiveFetchSchedule
    • Class NutchWritable
    • Class Signature
    • Class SignatureComparator
    • Class SignatureFactory
    • Class TextProfileSignature
    • Class URLPartitioner
    • Uses of Classorg.apache.nutch.crawl.CrawlDb
    • Uses of Classorg.apache.nutch.crawl.CrawlDbFilter
    • Uses of Classorg.apache.nutch.crawl.CrawlDbMerger.Merger
    • Uses of Classorg.apache.nutch.crawl.CrawlDbMerger
    • Uses of Classorg.apache.nutch.crawl.CrawlDbReader
    • Uses of Classorg.apache.nutch.crawl.CrawlDbReducer
    • Uses of Classorg.apache.nutch.crawl.DeduplicationJob
    • Uses of Classorg.apache.nutch.crawl.DefaultFetchSchedule
    • Uses of Interfaceorg.apache.nutch.crawl.FetchSchedule
    • Uses of Classorg.apache.nutch.crawl.FetchScheduleFactory
    • Uses of Classorg.apache.nutch.crawl.Generator.Selector
    • Uses of Classorg.apache.nutch.crawl.Generator
    • Uses of Classorg.apache.nutch.crawl.Injector.InjectMapper
    • Uses of Classorg.apache.nutch.crawl.Injector.InjectReducer
    • Uses of Classorg.apache.nutch.crawl.Injector
    • Uses of Classorg.apache.nutch.crawl.Inlink
    • Uses of Classorg.apache.nutch.crawl.Inlinks
    • Uses of Classorg.apache.nutch.crawl.LinkDb
    • Uses of Classorg.apache.nutch.crawl.LinkDbFilter
    • Uses of Classorg.apache.nutch.crawl.LinkDbMerger
    • Uses of Classorg.apache.nutch.crawl.LinkDbReader
    • Uses of Classorg.apache.nutch.crawl.MD5Signature
    • Uses of Classorg.apache.nutch.crawl.MapWritable
    • Uses of Classorg.apache.nutch.crawl.NutchWritable
    • Uses of Classorg.apache.nutch.crawl.Signature
    • Uses of Classorg.apache.nutch.crawl.SignatureComparator
    • Uses of Classorg.apache.nutch.crawl.SignatureFactory
    • Uses of Classorg.apache.nutch.crawl.TextProfileSignature
    • Uses of Classorg.apache.nutch.crawl.URLPartitioner
    • org.apache.nutch.crawl
    • Hierarchy For Package org.apache.nutch.crawl
    • Uses of Packageorg.apache.nutch.crawl
    • Class Fetcher.InputFormat
    • Class Fetcher
    • Class FetcherOutputFormat
    • Class OldFetcher.InputFormat
    • Class OldFetcher
    • Uses of Classorg.apache.nutch.fetcher.Fetcher.InputFormat
    • Uses of Classorg.apache.nutch.fetcher.Fetcher
    • Uses of Classorg.apache.nutch.fetcher.FetcherOutputFormat
    • Uses of Classorg.apache.nutch.fetcher.OldFetcher
    • org.apache.nutch.fetcher
    • Hierarchy For Package org.apache.nutch.fetcher
    • Uses of Packageorg.apache.nutch.fetcher
    • Class CleaningJob.DBFilter
    • Class CleaningJob.DeleterReducer
    • Class CleaningJob
    • Interface IndexWriter
    • Class IndexWriters
    • Class IndexerMapReduce
    • Class IndexerOutputFormat
    • Class IndexingException
    • Interface IndexingFilter
    • Class IndexingFilters
    • Class IndexingFiltersChecker
    • Class IndexingJob
    • Class NutchDocument
    • Class NutchField
    • Class NutchIndexAction
    • Class AnchorIndexingFilter
    • org.apache.nutch.indexer.anchor
    • Hierarchy For Package org.apache.nutch.indexer.anchor
    • Uses of Packageorg.apache.nutch.indexer.anchor
    • Class BasicIndexingFilter
    • org.apache.nutch.indexer.basic
    • Hierarchy For Package org.apache.nutch.indexer.basic
    • Uses of Packageorg.apache.nutch.indexer.basic
    • Uses of Classorg.apache.nutch.indexer.CleaningJob.DBFilter
    • Uses of Classorg.apache.nutch.indexer.CleaningJob
    • Uses of Interfaceorg.apache.nutch.indexer.IndexWriter
    • Uses of Classorg.apache.nutch.indexer.IndexWriters
    • Uses of Classorg.apache.nutch.indexer.IndexerMapReduce
    • Uses of Classorg.apache.nutch.indexer.IndexerOutputFormat
    • Uses of Classorg.apache.nutch.indexer.IndexingException
    • Uses of Interfaceorg.apache.nutch.indexer.IndexingFilter
    • Uses of Classorg.apache.nutch.indexer.IndexingFilters
    • Uses of Classorg.apache.nutch.indexer.IndexingJob
    • Uses of Classorg.apache.nutch.indexer.NutchDocument
    • Uses of Classorg.apache.nutch.indexer.NutchField
    • Uses of Classorg.apache.nutch.indexer.NutchIndexAction
    • Class FeedIndexingFilter
    • org.apache.nutch.indexer.feed
    • Hierarchy For Package org.apache.nutch.indexer.feed
    • Uses of Packageorg.apache.nutch.indexer.feed
    • Class MetadataIndexer
    • org.apache.nutch.indexer.metadata
    • Hierarchy For Package org.apache.nutch.indexer.metadata
    • Uses of Packageorg.apache.nutch.indexer.metadata
    • Class MoreIndexingFilter
    • org.apache.nutch.indexer.more
    • Hierarchy For Package org.apache.nutch.indexer.more
    • Uses of Packageorg.apache.nutch.indexer.more
    • org.apache.nutch.indexer
    • Hierarchy For Package org.apache.nutch.indexer
    • Uses of Packageorg.apache.nutch.indexer
    • Class StaticFieldIndexer
    • org.apache.nutch.indexer.staticfield
    • Hierarchy For Package org.apache.nutch.indexer.staticfield
    • Uses of Packageorg.apache.nutch.indexer.staticfield
    • org.apache.nutch.indexer.subcollection
    • Hierarchy For Package org.apache.nutch.indexer.subcollection
    • Uses of Packageorg.apache.nutch.indexer.subcollection
    • Class TLDIndexingFilter
    • org.apache.nutch.indexer.tld
    • Hierarchy For Package org.apache.nutch.indexer.tld
    • Uses of Packageorg.apache.nutch.indexer.tld
    • Class URLMetaIndexingFilter
    • org.apache.nutch.indexer.urlmeta
    • Hierarchy For Package org.apache.nutch.indexer.urlmeta
    • Uses of Packageorg.apache.nutch.indexer.urlmeta
    • Class DummyIndexWriter
    • org.apache.nutch.indexwriter.dummy
    • Hierarchy For Package org.apache.nutch.indexwriter.dummy
    • Uses of Packageorg.apache.nutch.indexwriter.dummy
    • Interface ElasticConstants
    • Class ElasticIndexWriter
    • org.apache.nutch.indexwriter.elastic
    • Hierarchy For Package org.apache.nutch.indexwriter.elastic
    • Uses of Packageorg.apache.nutch.indexwriter.elastic
    • Interface SolrConstants
    • Class SolrIndexWriter
    • Class SolrMappingReader
    • Class SolrUtils
    • Uses of Classorg.apache.nutch.indexwriter.solr.SolrUtils
    • org.apache.nutch.indexwriter.solr
    • Hierarchy For Package org.apache.nutch.indexwriter.solr
    • Uses of Packageorg.apache.nutch.indexwriter.solr
    • Interface CreativeCommons
    • Interface DublinCore
    • Interface Feed
    • Interface HttpHeaders
    • Class MetaWrapper
    • Class Metadata
    • Interface Nutch
    • Class SpellCheckedMetadata
    • Uses of Interfaceorg.apache.nutch.metadata.CreativeCommons
    • Uses of Interfaceorg.apache.nutch.metadata.DublinCore
    • Uses of Interfaceorg.apache.nutch.metadata.Feed
    • Uses of Interfaceorg.apache.nutch.metadata.HttpHeaders
    • Uses of Classorg.apache.nutch.metadata.MetaWrapper
    • Uses of Classorg.apache.nutch.metadata.Metadata
    • Uses of Interfaceorg.apache.nutch.metadata.Nutch
    • org.apache.nutch.metadata
    • Hierarchy For Package org.apache.nutch.metadata
    • Uses of Packageorg.apache.nutch.metadata
    • Class RelTagParser
    • org.apache.nutch.microformats.reltag
    • Hierarchy For Package org.apache.nutch.microformats.reltag
    • Uses of Packageorg.apache.nutch.microformats.reltag
    • Interface URLFilter
    • Class URLFilterChecker
    • Class URLFilterException
    • Class URLFilters
    • Interface URLNormalizer
    • Class URLNormalizerChecker
    • Class URLNormalizers
    • Uses of Interfaceorg.apache.nutch.net.URLFilter
    • Uses of Classorg.apache.nutch.net.URLFilterChecker
    • Uses of Classorg.apache.nutch.net.URLFilterException
    • Uses of Classorg.apache.nutch.net.URLFilters
    • Uses of Interfaceorg.apache.nutch.net.URLNormalizer
    • Uses of Classorg.apache.nutch.net.URLNormalizerChecker
    • Uses of Classorg.apache.nutch.net.URLNormalizers
    • org.apache.nutch.net
    • Hierarchy For Package org.apache.nutch.net
    • Uses of Packageorg.apache.nutch.net
    • Class HttpDateFormat
    • Class ProtocolException
    • Interface Response
    • Uses of Classorg.apache.nutch.net.protocols.HttpDateFormat
    • Uses of Interfaceorg.apache.nutch.net.protocols.Response
    • org.apache.nutch.net.protocols
    • Hierarchy For Package org.apache.nutch.net.protocols
    • Uses of Packageorg.apache.nutch.net.protocols
    • org.apache.nutch.net.urlnormalizer.basic
    • Hierarchy For Package org.apache.nutch.net.urlnormalizer.basic
    • Uses of Packageorg.apache.nutch.net.urlnormalizer.basic
    • org.apache.nutch.net.urlnormalizer.host
    • Hierarchy For Package org.apache.nutch.net.urlnormalizer.host
    • Uses of Packageorg.apache.nutch.net.urlnormalizer.host
    • org.apache.nutch.net.urlnormalizer.pass
    • Hierarchy For Package org.apache.nutch.net.urlnormalizer.pass
    • Uses of Packageorg.apache.nutch.net.urlnormalizer.pass
    • org.apache.nutch.net.urlnormalizer.regex
    • Hierarchy For Package org.apache.nutch.net.urlnormalizer.regex
    • Uses of Packageorg.apache.nutch.net.urlnormalizer.regex
    • Class HTMLMetaTags
    • Interface HtmlParseFilter
    • Class HtmlParseFilters
    • Class Outlink
    • Class OutlinkExtractor
    • Interface Parse
    • Class ParseData
    • Class ParseException
    • Class ParseImpl
    • Class ParseOutputFormat
    • Class ParseResult
    • Class ParseSegment
    • Class ParseStatus
    • Class ParseText
    • Class ParseUtil
    • Interface Parser
    • Class ParserChecker
    • Class ParserFactory
    • Class ParserNotFound
    • Uses of Classorg.apache.nutch.parse.HTMLMetaTags
    • Uses of Interfaceorg.apache.nutch.parse.HtmlParseFilter
    • Uses of Classorg.apache.nutch.parse.HtmlParseFilters
    • Uses of Classorg.apache.nutch.parse.Outlink
    • Uses of Classorg.apache.nutch.parse.OutlinkExtractor
    • Uses of Interfaceorg.apache.nutch.parse.Parse
    • Uses of Classorg.apache.nutch.parse.ParseData
    • Uses of Classorg.apache.nutch.parse.ParseException
    • Uses of Classorg.apache.nutch.parse.ParseImpl
    • Uses of Classorg.apache.nutch.parse.ParseOutputFormat
    • Uses of Classorg.apache.nutch.parse.ParseResult
    • Uses of Classorg.apache.nutch.parse.ParseSegment
    • Uses of Classorg.apache.nutch.parse.ParseStatus
    • Uses of Classorg.apache.nutch.parse.ParseText
    • Uses of Classorg.apache.nutch.parse.ParseUtil
    • Uses of Interfaceorg.apache.nutch.parse.Parser
    • Uses of Classorg.apache.nutch.parse.ParserChecker
    • Uses of Classorg.apache.nutch.parse.ParserFactory
    • Uses of Classorg.apache.nutch.parse.ParserNotFound
    • Class ExtParser
    • Uses of Classorg.apache.nutch.parse.ext.ExtParser
    • org.apache.nutch.parse.ext
    • Hierarchy For Package org.apache.nutch.parse.ext
    • Uses of Packageorg.apache.nutch.parse.ext
    • Class FeedParser
    • Uses of Classorg.apache.nutch.parse.feed.FeedParser
    • org.apache.nutch.parse.feed
    • Hierarchy For Package org.apache.nutch.parse.feed
    • Uses of Packageorg.apache.nutch.parse.feed
    • Class HeadingsParseFilter
    • org.apache.nutch.parse.headings
    • Hierarchy For Package org.apache.nutch.parse.headings
    • Uses of Packageorg.apache.nutch.parse.headings
    • Class DOMBuilder
    • Class DOMContentUtils.LinkParams
    • Class DOMContentUtils
    • Class HTMLMetaProcessor
    • Class HtmlParser
    • Class XMLCharacterRecognizer
    • Uses of Classorg.apache.nutch.parse.html.DOMBuilder
    • Uses of Classorg.apache.nutch.parse.html.DOMContentUtils
    • Uses of Classorg.apache.nutch.parse.html.HTMLMetaProcessor
    • Uses of Classorg.apache.nutch.parse.html.HtmlParser
    • org.apache.nutch.parse.html
    • Hierarchy For Package org.apache.nutch.parse.html
    • Uses of Packageorg.apache.nutch.parse.html
    • Class JSParseFilter
    • Uses of Classorg.apache.nutch.parse.js.JSParseFilter
    • org.apache.nutch.parse.js
    • Hierarchy For Package org.apache.nutch.parse.js
    • Uses of Packageorg.apache.nutch.parse.js
    • Class MetaTagsParser
    • org.apache.nutch.parse.metatags
    • Hierarchy For Package org.apache.nutch.parse.metatags
    • Uses of Packageorg.apache.nutch.parse.metatags
    • org.apache.nutch.parse
    • Hierarchy For Package org.apache.nutch.parse
    • Uses of Packageorg.apache.nutch.parse
    • Class SWFParser
    • Uses of Classorg.apache.nutch.parse.swf.SWFParser
    • org.apache.nutch.parse.swf
    • Hierarchy For Package org.apache.nutch.parse.swf
    • Uses of Packageorg.apache.nutch.parse.swf
    • Class DOMContentUtils
    • Class HTMLMetaProcessor
    • Class TikaParser
    • Uses of Classorg.apache.nutch.parse.tika.DOMContentUtils
    • Uses of Classorg.apache.nutch.parse.tika.HTMLMetaProcessor
    • Uses of Classorg.apache.nutch.parse.tika.TikaParser
    • org.apache.nutch.parse.tika
    • Hierarchy For Package org.apache.nutch.parse.tika
    • Uses of Packageorg.apache.nutch.parse.tika
    • Class ZipParser
    • Class ZipTextExtractor
    • Uses of Classorg.apache.nutch.parse.zip.ZipParser
    • Uses of Classorg.apache.nutch.parse.zip.ZipTextExtractor
    • org.apache.nutch.parse.zip
    • Hierarchy For Package org.apache.nutch.parse.zip
    • Uses of Packageorg.apache.nutch.parse.zip
    • Class CircularDependencyException
    • Class Extension
    • Class ExtensionPoint
    • Class MissingDependencyException
    • Interface Pluggable
    • Class Plugin
    • Class PluginClassLoader
    • Class PluginDescriptor
    • Class PluginManifestParser
    • Class PluginRepository
    • Class PluginRuntimeException
    • Uses of Classorg.apache.nutch.plugin.Extension
    • Uses of Classorg.apache.nutch.plugin.ExtensionPoint
    • Uses of Interfaceorg.apache.nutch.plugin.Pluggable
    • Uses of Classorg.apache.nutch.plugin.Plugin
    • Uses of Classorg.apache.nutch.plugin.PluginClassLoader
    • Uses of Classorg.apache.nutch.plugin.PluginDescriptor
    • Uses of Classorg.apache.nutch.plugin.PluginManifestParser
    • Uses of Classorg.apache.nutch.plugin.PluginRepository
    • org.apache.nutch.plugin
    • Hierarchy For Package org.apache.nutch.plugin
    • Uses of Packageorg.apache.nutch.plugin
    • Class Content
    • Interface Protocol
    • Class ProtocolException
    • Class ProtocolFactory
    • Class ProtocolNotFound
    • Class ProtocolOutput
    • Class ProtocolStatus
    • Interface RobotRules
    • Class RobotRulesParser
    • Uses of Classorg.apache.nutch.protocol.Content
    • Uses of Interfaceorg.apache.nutch.protocol.Protocol
    • Uses of Classorg.apache.nutch.protocol.ProtocolException
    • Uses of Classorg.apache.nutch.protocol.ProtocolFactory
    • Uses of Classorg.apache.nutch.protocol.ProtocolNotFound
    • Uses of Classorg.apache.nutch.protocol.ProtocolOutput
    • Uses of Classorg.apache.nutch.protocol.ProtocolStatus
    • Uses of Interfaceorg.apache.nutch.protocol.RobotRules
    • Uses of Classorg.apache.nutch.protocol.RobotRulesParser
    • Class File
    • Class FileError
    • Class FileException
    • Class FileResponse
    • Uses of Classorg.apache.nutch.protocol.file.File
    • Uses of Classorg.apache.nutch.protocol.file.FileError
    • Uses of Classorg.apache.nutch.protocol.file.FileException
    • Uses of Classorg.apache.nutch.protocol.file.FileResponse
    • org.apache.nutch.protocol.file
    • Hierarchy For Package org.apache.nutch.protocol.file
    • Uses of Packageorg.apache.nutch.protocol.file
    • Class Client
    • Class Ftp
    • Class FtpError
    • Class FtpException
    • Class FtpResponse
    • Class FtpRobotRulesParser
    • Class PrintCommandListener
    • Uses of Classorg.apache.nutch.protocol.ftp.Client
    • Uses of Classorg.apache.nutch.protocol.ftp.Ftp
    • Uses of Classorg.apache.nutch.protocol.ftp.FtpError
    • Uses of Classorg.apache.nutch.protocol.ftp.FtpException
    • Uses of Classorg.apache.nutch.protocol.ftp.FtpResponse
    • org.apache.nutch.protocol.ftp
    • Hierarchy For Package org.apache.nutch.protocol.ftp
    • Uses of Packageorg.apache.nutch.protocol.ftp
    • Class Http
    • Enum HttpResponse.Scheme
    • Class HttpResponse
    • Class BlockedException
    • Class HttpBase
    • Class HttpException
    • Class HttpRobotRulesParser
    • Uses of Classorg.apache.nutch.protocol.http.api.HttpBase
    • org.apache.nutch.protocol.http.api
    • Hierarchy For Package org.apache.nutch.protocol.http.api
    • Uses of Packageorg.apache.nutch.protocol.http.api
    • Uses of Classorg.apache.nutch.protocol.http.Http
    • Uses of Classorg.apache.nutch.protocol.http.HttpResponse
    • org.apache.nutch.protocol.http
    • Hierarchy For Package org.apache.nutch.protocol.http
    • Uses of Packageorg.apache.nutch.protocol.http
    • Class Http
    • Interface HttpAuthentication
    • Class HttpResponse
    • Uses of Classorg.apache.nutch.protocol.httpclient.Http
    • org.apache.nutch.protocol.httpclient
    • Hierarchy For Package org.apache.nutch.protocol.httpclient
    • Uses of Packageorg.apache.nutch.protocol.httpclient
    • org.apache.nutch.protocol
    • Hierarchy For Package org.apache.nutch.protocol
    • Uses of Packageorg.apache.nutch.protocol
    • Class AbstractScoringFilter
    • Interface ScoringFilter
    • Class ScoringFilterException
    • Class ScoringFilters
    • Uses of Interfaceorg.apache.nutch.scoring.ScoringFilter
    • Uses of Classorg.apache.nutch.scoring.ScoringFilters
    • Class DepthScoringFilter
    • org.apache.nutch.scoring.depth
    • Hierarchy For Package org.apache.nutch.scoring.depth
    • Uses of Packageorg.apache.nutch.scoring.depth
    • Class LinkAnalysisScoringFilter
    • org.apache.nutch.scoring.link
    • Hierarchy For Package org.apache.nutch.scoring.link
    • Uses of Packageorg.apache.nutch.scoring.link
    • Class OPICScoringFilter
    • org.apache.nutch.scoring.opic
    • Hierarchy For Package org.apache.nutch.scoring.opic
    • Uses of Packageorg.apache.nutch.scoring.opic
    • org.apache.nutch.scoring
    • Hierarchy For Package org.apache.nutch.scoring
    • Uses of Packageorg.apache.nutch.scoring
    • Class TLDScoringFilter
    • Uses of Classorg.apache.nutch.scoring.tld.TLDScoringFilter
    • org.apache.nutch.scoring.tld
    • Hierarchy For Package org.apache.nutch.scoring.tld
    • Uses of Packageorg.apache.nutch.scoring.tld
    • Class URLMetaScoringFilter
    • org.apache.nutch.scoring.urlmeta
    • Hierarchy For Package org.apache.nutch.scoring.urlmeta
    • Uses of Packageorg.apache.nutch.scoring.urlmeta
    • Class LinkDatum
    • Class LinkDumper.Inverter
    • Class LinkDumper.LinkNode
    • Class LinkDumper.LinkNodes
    • Class LinkDumper.Merger
    • Class LinkDumper.Reader
    • Class LinkDumper
    • Class LinkRank
    • Class LoopReader
    • Class Loops.Finalizer
    • Class Loops.Initializer
    • Class Loops.LoopSet
    • Class Loops.Looper
    • Class Loops.Route
    • Class Loops
    • Class Node
    • Class NodeDumper.Dumper
    • Class NodeDumper.Sorter
    • Class NodeDumper
    • Class NodeReader
    • Class ScoreUpdater
    • Class WebGraph.OutlinkDb
    • Class WebGraph
    • Uses of Classorg.apache.nutch.scoring.webgraph.LinkDatum
    • Uses of Classorg.apache.nutch.scoring.webgraph.LinkDumper
    • Uses of Classorg.apache.nutch.scoring.webgraph.LinkRank
    • Uses of Classorg.apache.nutch.scoring.webgraph.LoopReader
    • Uses of Classorg.apache.nutch.scoring.webgraph.Loops.Route
    • Uses of Classorg.apache.nutch.scoring.webgraph.Loops
    • Uses of Classorg.apache.nutch.scoring.webgraph.Node
    • Uses of Classorg.apache.nutch.scoring.webgraph.NodeDumper
    • Uses of Classorg.apache.nutch.scoring.webgraph.NodeReader
    • Uses of Classorg.apache.nutch.scoring.webgraph.WebGraph
    • org.apache.nutch.scoring.webgraph
    • Hierarchy For Package org.apache.nutch.scoring.webgraph
    • Uses of Packageorg.apache.nutch.scoring.webgraph
    • Class ContentAsTextInputFormat
    • Interface SegmentMergeFilter
    • Class SegmentMergeFilters
    • Class SegmentMerger
    • Class SegmentPart
    • Class SegmentReader.TextOutputFormat
    • Class SegmentReader
    • Uses of Interfaceorg.apache.nutch.segment.SegmentMergeFilter
    • Uses of Classorg.apache.nutch.segment.SegmentMergeFilters
    • Uses of Classorg.apache.nutch.segment.SegmentMerger
    • Uses of Classorg.apache.nutch.segment.SegmentPart
    • Uses of Classorg.apache.nutch.segment.SegmentReader
    • org.apache.nutch.segment
    • Hierarchy For Package org.apache.nutch.segment
    • Uses of Packageorg.apache.nutch.segment
    • Class Benchmark.BenchmarkResults
    • Class Benchmark
    • Class DmozParser
    • Class FreeGenerator.FG
    • Class FreeGenerator
    • Class ResolveUrls
    • Class ArcInputFormat
    • Class ArcRecordReader
    • Class ArcSegmentCreator
    • Uses of Classorg.apache.nutch.tools.arc.ArcInputFormat
    • Uses of Classorg.apache.nutch.tools.arc.ArcRecordReader
    • Uses of Classorg.apache.nutch.tools.arc.ArcSegmentCreator
    • org.apache.nutch.tools.arc
    • Hierarchy For Package org.apache.nutch.tools.arc
    • Uses of Packageorg.apache.nutch.tools.arc
    • Uses of Classorg.apache.nutch.tools.Benchmark
    • Uses of Classorg.apache.nutch.tools.DmozParser
    • Uses of Classorg.apache.nutch.tools.FreeGenerator.FG
    • Uses of Classorg.apache.nutch.tools.FreeGenerator
    • Uses of Classorg.apache.nutch.tools.ResolveUrls
    • org.apache.nutch.tools
    • Hierarchy For Package org.apache.nutch.tools
    • Uses of Packageorg.apache.nutch.tools
    • Class RegexRule
    • Class RegexURLFilterBase
    • Uses of Classorg.apache.nutch.urlfilter.api.RegexRule
    • org.apache.nutch.urlfilter.api
    • Hierarchy For Package org.apache.nutch.urlfilter.api
    • Uses of Packageorg.apache.nutch.urlfilter.api
    • Class AutomatonURLFilter
    • org.apache.nutch.urlfilter.automaton
    • Hierarchy For Package org.apache.nutch.urlfilter.automaton
    • Uses of Packageorg.apache.nutch.urlfilter.automaton
    • Class DomainURLFilter
    • org.apache.nutch.urlfilter.domain
    • Hierarchy For Package org.apache.nutch.urlfilter.domain
    • Uses of Packageorg.apache.nutch.urlfilter.domain
    • Hierarchy For Package org.apache.nutch.urlfilter.domainblacklist
    • Uses of Packageorg.apache.nutch.urlfilter.domainblacklist
    • Class PrefixURLFilter
    • org.apache.nutch.urlfilter.prefix
    • Hierarchy For Package org.apache.nutch.urlfilter.prefix
    • Uses of Packageorg.apache.nutch.urlfilter.prefix
    • Class RegexURLFilter
    • org.apache.nutch.urlfilter.regex
    • Hierarchy For Package org.apache.nutch.urlfilter.regex
    • Uses of Packageorg.apache.nutch.urlfilter.regex
    • Class SuffixURLFilter
    • org.apache.nutch.urlfilter.suffix
    • Hierarchy For Package org.apache.nutch.urlfilter.suffix
    • Uses of Packageorg.apache.nutch.urlfilter.suffix
    • Class UrlValidator
    • org.apache.nutch.urlfilter.validator
    • Hierarchy For Package org.apache.nutch.urlfilter.validator
    • Uses of Packageorg.apache.nutch.urlfilter.validator
    • Class CommandRunner
    • Class DeflateUtils
    • Class DomUtil
    • Class EncodingDetector
    • Class FSUtils
    • Class GZIPUtils
    • Class GenericWritableConfigurable
    • Class HadoopFSUtil
    • Class LockUtil
    • Class MimeUtil
    • Class NodeWalker
    • Class NutchConfiguration
    • Class NutchJob
    • Class ObjectCache
    • Class PrefixStringMatcher
    • Class StringUtil
    • Class SuffixStringMatcher
    • Class TimingUtil
    • Class TrieStringMatcher.TrieNode
    • Class TrieStringMatcher
    • Class URLUtil
    • Uses of Classorg.apache.nutch.util.CommandRunner
    • Uses of Classorg.apache.nutch.util.DeflateUtils
    • Uses of Classorg.apache.nutch.util.DomUtil
    • Uses of Classorg.apache.nutch.util.EncodingDetector
    • Uses of Classorg.apache.nutch.util.FSUtils
    • Uses of Classorg.apache.nutch.util.GZIPUtils
    • Uses of Classorg.apache.nutch.util.HadoopFSUtil
    • Uses of Classorg.apache.nutch.util.LockUtil
    • Uses of Classorg.apache.nutch.util.MimeUtil
    • Uses of Classorg.apache.nutch.util.NodeWalker
    • Uses of Classorg.apache.nutch.util.NutchConfiguration
    • Uses of Classorg.apache.nutch.util.NutchJob
    • Uses of Classorg.apache.nutch.util.ObjectCache
    • Uses of Classorg.apache.nutch.util.PrefixStringMatcher
    • Uses of Classorg.apache.nutch.util.StringUtil
    • Uses of Classorg.apache.nutch.util.SuffixStringMatcher
    • Uses of Classorg.apache.nutch.util.TimingUtil
    • Uses of Classorg.apache.nutch.util.TrieStringMatcher
    • Uses of Classorg.apache.nutch.util.URLUtil
    • Enum DomainStatistics.MyCounter
    • Class DomainStatistics
    • Enum DomainSuffix.Status
    • Class DomainSuffix
    • Class DomainSuffixes
    • Enum TopLevelDomain.Type
    • Class TopLevelDomain
    • Uses of Classorg.apache.nutch.util.domain.DomainStatistics
    • Uses of Classorg.apache.nutch.util.domain.DomainSuffix
    • Uses of Classorg.apache.nutch.util.domain.DomainSuffixes
    • Uses of Classorg.apache.nutch.util.domain.TopLevelDomain
    • org.apache.nutch.util.domain
    • Hierarchy For Package org.apache.nutch.util.domain
    • Uses of Packageorg.apache.nutch.util.domain
    • org.apache.nutch.util
    • Hierarchy For Package org.apache.nutch.util
    • Uses of Packageorg.apache.nutch.util
    • Class CCIndexingFilter
    • Class CCParseFilter.Walker
    • Class CCParseFilter
    • Uses of Classorg.creativecommons.nutch.CCIndexingFilter
    • Uses of Classorg.creativecommons.nutch.CCParseFilter
    • org.creativecommons.nutch
    • Hierarchy For Package org.creativecommons.nutch
    • Uses of Packageorg.creativecommons.nutch
    • Packages
    • Hierarchy For All Packages
    • Serialized Form
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • Classes for domain name analysis.
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • Welcome to Nutch!
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    • 空标题文档
    暂无相关搜索结果!

      让时间为你证明

      展开/收起文章目录

      分享,让知识传承更久远

      文章二维码

      手机扫一扫,轻松掌上读

      文档下载

      请下载您需要的格式的文档,随时随地,享受汲取知识的乐趣!
      PDF文档 EPUB文档 MOBI文档

      书签列表

        阅读记录

        阅读进度: 0.00% ( 0/0 ) 重置阅读进度

          思维导图备注