- Prev
- Next
Uses of Class
org.apache.nutch.parse.Outlink
Packages that use Outlink Package Description org.apache.nutch.parse
TheParse
interface and related classes. org.apache.nutch.parse.html
An HTML document parsing plugin. org.apache.nutch.parse.tika
Parse various document formats with help of Apache Tika. org.apache.nutch.parse.zip
Parse ZIP files: embedded files are recursively passed to appropriate parsers.
Uses of Outlink in org.apache.nutch.parse
Methods in org.apache.nutch.parse that return Outlink Modifier and Type Method and Description Outlink[]
ParseData.getOutlinks()
The outlinks of the page.
static Outlink[]
OutlinkExtractor.getOutlinks(String plainText,
org.apache.hadoop.conf.Configuration conf)
Extracts Outlink
from given plain text.
static Outlink[]
OutlinkExtractor.getOutlinks(String plainText,
String anchor,
org.apache.hadoop.conf.Configuration conf)
Extracts Outlink
from given plain text and adds anchor to the extracted Outlink
s
static Outlink
Outlink.read(DataInput in)
Methods in org.apache.nutch.parse with parameters of type Outlink Modifier and Type Method and Description void
ParseData.setOutlinks(Outlink[] outlinks)
Constructors in org.apache.nutch.parse with parameters of type Outlink Constructor and Description ParseData(ParseStatus status,
String title,
Outlink[] outlinks,
Metadata contentMeta)
ParseData(ParseStatus status,
String title,
Outlink[] outlinks,
Metadata contentMeta,
Metadata parseMeta)
Uses of Outlink in org.apache.nutch.parse.html
Method parameters in org.apache.nutch.parse.html with type arguments of type Outlink Modifier and Type Method and Description void
DOMContentUtils.getOutlinks(URL base,
ArrayList
This method finds all anchors below the supplied DOM node
, and creates appropriate Outlink
records for each (relative to the supplied base
URL), and adds them to the outlinks
ArrayList
.
Uses of Outlink in org.apache.nutch.parse.tika
Method parameters in org.apache.nutch.parse.tika with type arguments of type Outlink Modifier and Type Method and Description void
DOMContentUtils.getOutlinks(URL base,
ArrayList
This method finds all anchors below the supplied DOM node
, and creates appropriate Outlink
records for each (relative to the supplied base
URL), and adds them to the outlinks
ArrayList
.
Uses of Outlink in org.apache.nutch.parse.zip
Method parameters in org.apache.nutch.parse.zip with type arguments of type Outlink Modifier and Type Method and Description String
ZipTextExtractor.extractText(InputStream input,
String url,
List
- Prev
- Next