- Prev
- Next
Uses of Class
org.apache.nutch.parse.ParseResult
Packages that use ParseResult Package Description org.apache.nutch.analysis.lang
Text document language identifier. org.apache.nutch.microformats.reltag
A microformats Rel-Tag Parser/Indexer/Querier plugin. org.apache.nutch.parse
TheParseinterface and related classes. org.apache.nutch.parse.ext
Parse wrapper to run external command to do the parsing. org.apache.nutch.parse.feed
Parse RSS feeds. org.apache.nutch.parse.headings
Parse filter to extract headings (h1, h2, etc.) from DOM parse tree. org.apache.nutch.parse.html
An HTML document parsing plugin. org.apache.nutch.parse.js
Parser and parse filter plugin to extract all (possible) links from JavaScript files and embedded JavaScript code snippets. org.apache.nutch.parse.metatags
Parse filter to extract meta tags: keywords, description, etc. org.apache.nutch.parse.swf
Parse Flash SWF files. org.apache.nutch.parse.tika
Parse various document formats with help of Apache Tika. org.apache.nutch.parse.zip
Parse ZIP files: embedded files are recursively passed to appropriate parsers. org.creativecommons.nutch
Sample plugins that parse and index Creative Commons medadata.
Uses of ParseResult in org.apache.nutch.analysis.lang
Methods in org.apache.nutch.analysis.lang that return ParseResult Modifier and Type Method and Description ParseResult HTMLLanguageParser.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Scan the HTML document looking at possible indications of content language
1.
Methods in org.apache.nutch.analysis.lang with parameters of type ParseResult Modifier and Type Method and Description ParseResult HTMLLanguageParser.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Scan the HTML document looking at possible indications of content language
1.
Uses of ParseResult in org.apache.nutch.microformats.reltag
Methods in org.apache.nutch.microformats.reltag that return ParseResult Modifier and Type Method and Description ParseResult RelTagParser.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Scan the HTML document looking at possible rel-tags
Methods in org.apache.nutch.microformats.reltag with parameters of type ParseResult Modifier and Type Method and Description ParseResult RelTagParser.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Scan the HTML document looking at possible rel-tags
Uses of ParseResult in org.apache.nutch.parse
Methods in org.apache.nutch.parse that return ParseResult Modifier and Type Method and Description static ParseResult ParseResult.createParseResult(String url,
Parse parse)
Convenience method for obtaining ParseResult from a single Parse output.
ParseResult HtmlParseFilters.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Run all defined filters.
ParseResult HtmlParseFilter.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page.
ParseResult ParseStatus.getEmptyParseResult(String url,
org.apache.hadoop.conf.Configuration conf)
A convenience method.
ParseResult Parser.getParse(Content c)
This method parses the given content and returns a map of ParseResult ParseUtil.parse(Content content)
Performs a parse by iterating through a List of preferred Parsers until a successful parse is performed and a Parse object is returned.
ParseResult ParseUtil.parseByExtensionId(String extId,
Content content)
Method parses a Content object using the Parser specified by the parameter extId, i.e., the Parser's extension ID.
Methods in org.apache.nutch.parse with parameters of type ParseResult Modifier and Type Method and Description ParseResult HtmlParseFilters.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Run all defined filters.
ParseResult HtmlParseFilter.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page.
Uses of ParseResult in org.apache.nutch.parse.ext
Methods in org.apache.nutch.parse.ext that return ParseResult Modifier and Type Method and Description ParseResult ExtParser.getParse(Content content)
Uses of ParseResult in org.apache.nutch.parse.feed
Methods in org.apache.nutch.parse.feed that return ParseResult Modifier and Type Method and Description ParseResult FeedParser.getParse(Content content)
Parses the given feed and extracts out and parsers all linked items within the feed, using the underlying ROME feed parsing library.
Uses of ParseResult in org.apache.nutch.parse.headings
Methods in org.apache.nutch.parse.headings that return ParseResult Modifier and Type Method and Description ParseResult HeadingsParseFilter.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Methods in org.apache.nutch.parse.headings with parameters of type ParseResult Modifier and Type Method and Description ParseResult HeadingsParseFilter.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Uses of ParseResult in org.apache.nutch.parse.html
Methods in org.apache.nutch.parse.html that return ParseResult Modifier and Type Method and Description ParseResult HtmlParser.getParse(Content content)
Uses of ParseResult in org.apache.nutch.parse.js
Methods in org.apache.nutch.parse.js that return ParseResult Modifier and Type Method and Description ParseResult JSParseFilter.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc) ParseResult JSParseFilter.getParse(Content c)
Methods in org.apache.nutch.parse.js with parameters of type ParseResult Modifier and Type Method and Description ParseResult JSParseFilter.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Uses of ParseResult in org.apache.nutch.parse.metatags
Methods in org.apache.nutch.parse.metatags that return ParseResult Modifier and Type Method and Description ParseResult MetaTagsParser.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Methods in org.apache.nutch.parse.metatags with parameters of type ParseResult Modifier and Type Method and Description ParseResult MetaTagsParser.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Uses of ParseResult in org.apache.nutch.parse.swf
Methods in org.apache.nutch.parse.swf that return ParseResult Modifier and Type Method and Description ParseResult SWFParser.getParse(Content content)
Uses of ParseResult in org.apache.nutch.parse.tika
Methods in org.apache.nutch.parse.tika that return ParseResult Modifier and Type Method and Description ParseResult TikaParser.getParse(Content content)
Uses of ParseResult in org.apache.nutch.parse.zip
Methods in org.apache.nutch.parse.zip that return ParseResult Modifier and Type Method and Description ParseResult ZipParser.getParse(Content content)
Uses of ParseResult in org.creativecommons.nutch
Methods in org.creativecommons.nutch that return ParseResult Modifier and Type Method and Description ParseResult CCParseFilter.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Adds metadata or otherwise modifies a parse of an HTML document, given the DOM tree of a page.
Methods in org.creativecommons.nutch with parameters of type ParseResult Modifier and Type Method and Description ParseResult CCParseFilter.filter(Content content,
ParseResult parseResult,
HTMLMetaTags metaTags,
DocumentFragment doc)
Adds metadata or otherwise modifies a parse of an HTML document, given the DOM tree of a page.
- Prev
- Next
