- Prev
- Next
Uses of Class
org.apache.nutch.parse.Outlink
Packages that use Outlink Package Description org.apache.nutch.parse
TheParseinterface and related classes. org.apache.nutch.parse.html
An HTML document parsing plugin. org.apache.nutch.parse.tika
Parse various document formats with help of Apache Tika. org.apache.nutch.parse.zip
Parse ZIP files: embedded files are recursively passed to appropriate parsers.
Uses of Outlink in org.apache.nutch.parse
Methods in org.apache.nutch.parse that return Outlink Modifier and Type Method and Description Outlink[] ParseData.getOutlinks()
The outlinks of the page.
static Outlink[] OutlinkExtractor.getOutlinks(String plainText,
org.apache.hadoop.conf.Configuration conf)
Extracts Outlink from given plain text.
static Outlink[] OutlinkExtractor.getOutlinks(String plainText,
String anchor,
org.apache.hadoop.conf.Configuration conf)
Extracts Outlink from given plain text and adds anchor to the extracted Outlinks
static Outlink Outlink.read(DataInput in)
Methods in org.apache.nutch.parse with parameters of type Outlink Modifier and Type Method and Description void ParseData.setOutlinks(Outlink[] outlinks)
Constructors in org.apache.nutch.parse with parameters of type Outlink Constructor and Description ParseData(ParseStatus status,
String title,
Outlink[] outlinks,
Metadata contentMeta) ParseData(ParseStatus status,
String title,
Outlink[] outlinks,
Metadata contentMeta,
Metadata parseMeta)
Uses of Outlink in org.apache.nutch.parse.html
Method parameters in org.apache.nutch.parse.html with type arguments of type Outlink Modifier and Type Method and Description void DOMContentUtils.getOutlinks(URL base,
ArrayList
This method finds all anchors below the supplied DOM node, and creates appropriate Outlink records for each (relative to the supplied base URL), and adds them to the outlinks ArrayList.
Uses of Outlink in org.apache.nutch.parse.tika
Method parameters in org.apache.nutch.parse.tika with type arguments of type Outlink Modifier and Type Method and Description void DOMContentUtils.getOutlinks(URL base,
ArrayList
This method finds all anchors below the supplied DOM node, and creates appropriate Outlink records for each (relative to the supplied base URL), and adds them to the outlinks ArrayList.
Uses of Outlink in org.apache.nutch.parse.zip
Method parameters in org.apache.nutch.parse.zip with type arguments of type Outlink Modifier and Type Method and Description String ZipTextExtractor.extractText(InputStream input,
String url,
List
- Prev
- Next
