org.apache.nutch.crawl
Class LinkDbReader
- java.lang.Object
- org.apache.hadoop.conf.Configured
- org.apache.nutch.crawl.LinkDbReader
- All Implemented Interfaces:
- Closeable, AutoCloseable, org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool
public class LinkDbReader extends org.apache.hadoop.conf.Configured implements org.apache.hadoop.util.Tool, Closeable
Field Summary
Fields (modifier and type, then field):
- static org.slf4j.Logger LOG
Constructor Summary
Constructors:
- LinkDbReader()
- LinkDbReader(org.apache.hadoop.conf.Configuration conf, org.apache.hadoop.fs.Path directory)
Method Summary
Methods (modifier and type, then method):
- void close()
- String[] getAnchors(org.apache.hadoop.io.Text url)
- Inlinks getInlinks(org.apache.hadoop.io.Text url)
- void init(org.apache.hadoop.fs.Path directory)
- static void main(String[] args)
- void processDumpJob(String linkdb, String output)
- int run(String[] args)
-
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
Field Detail
-
LOG
public static final org.slf4j.Logger LOG
Constructor Detail
-
LinkDbReader
public LinkDbReader()
-
LinkDbReader
public LinkDbReader(org.apache.hadoop.conf.Configuration conf, org.apache.hadoop.fs.Path directory) throws Exception
- Throws:
- <code>Exception</code>
Method Detail
-
init
public void init(org.apache.hadoop.fs.Path directory) throws Exception
- Throws:
- <code>Exception</code>
-
getAnchors
public String[] getAnchors(org.apache.hadoop.io.Text url) throws IOException
- Throws:
- <code>IOException</code>
-
getInlinks
public Inlinks getInlinks(org.apache.hadoop.io.Text url) throws IOException
- Throws:
- <code>IOException</code>
-
close
public void close() throws IOException
- Specified by:
- <code>close</code> in interface <code>Closeable</code>
- Specified by:
- <code>close</code> in interface <code>AutoCloseable</code>
- Throws:
- <code>IOException</code>
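Because the class implements Closeable and AutoCloseable, an instance can be managed with try-with-resources so that close() runs even when a lookup throws. The following is a minimal sketch of that pattern only. It does not use Nutch or Hadoop: StandInLinkReader, its String-keyed getAnchors, the null return for an unknown URL, and TryWithResourcesDemo are hypothetical stand-ins invented for illustration, not the real LinkDbReader behavior.

```java
import java.io.Closeable;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for LinkDbReader: same Closeable contract, in-memory data.
final class StandInLinkReader implements Closeable {
    private final Map<String, List<String>> anchorsByUrl;
    private boolean closed = false;

    StandInLinkReader(Map<String, List<String>> anchorsByUrl) {
        this.anchorsByUrl = anchorsByUrl;
    }

    // Mirrors the shape of getAnchors(Text url); returning null for an
    // unknown URL is an assumption made for this sketch.
    String[] getAnchors(String url) throws IOException {
        if (closed) {
            throw new IOException("reader is closed");
        }
        List<String> anchors = anchorsByUrl.get(url);
        return anchors == null ? null : anchors.toArray(new String[0]);
    }

    @Override
    public void close() throws IOException {
        closed = true;
    }

    boolean isClosed() {
        return closed;
    }
}

final class TryWithResourcesDemo {
    // Counts anchors for a URL and always closes the reader afterwards.
    static int countAnchors(StandInLinkReader reader, String url) {
        try (StandInLinkReader r = reader) {
            String[] anchors = r.getAnchors(url);
            return anchors == null ? 0 : anchors.length;
        } catch (IOException e) {
            // Wrap the checked exception so callers need no throws clause.
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Map<String, List<String>> data = Collections.singletonMap(
                "http://example.org/", Arrays.asList("home", "start"));
        StandInLinkReader reader = new StandInLinkReader(data);
        System.out.println(countAnchors(reader, "http://example.org/")); // prints 2
        System.out.println(reader.isClosed());                           // prints true
    }
}
```

With the real class, the same try block would presumably wrap a reader built with the two-argument constructor, which is declared to throw Exception, so the surrounding code would need to handle that as well.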
-
processDumpJob
public void processDumpJob(String linkdb, String output) throws IOException
- Throws:
- <code>IOException</code>
-
main
public static void main(String[] args) throws Exception
- Throws:
- <code>Exception</code>
-
run
public int run(String[] args) throws Exception
- Specified by:
- <code>run</code> in interface <code>org.apache.hadoop.util.Tool</code>
- Throws:
- <code>Exception</code>
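The run method comes from the Tool interface and returns an int, which a command-line entry point conventionally treats as the process exit code. The sketch below illustrates that convention without Hadoop on the classpath. MiniTool, StandInReaderTool, ToolDriver, the argument-count rule, and the specific return values are all hypothetical and chosen for illustration; they are not taken from the LinkDbReader source.

```java
// Hypothetical copy of the single method declared by org.apache.hadoop.util.Tool.
interface MiniTool {
    int run(String[] args) throws Exception;
}

// Hypothetical tool: accepts any call with at least two arguments.
final class StandInReaderTool implements MiniTool {
    @Override
    public int run(String[] args) {
        if (args.length < 2) {
            System.err.println("usage: <linkdb> <command> ...");
            return -1; // non-zero signals a usage error (assumed convention)
        }
        return 0; // zero signals success
    }
}

final class ToolDriver {
    // Runs the tool and converts any exception into a non-zero exit code.
    static int execute(MiniTool tool, String[] args) {
        try {
            return tool.run(args);
        } catch (Exception e) {
            e.printStackTrace();
            return 1;
        }
    }

    public static void main(String[] args) {
        String[] sample = {"crawl/linkdb", "some-command"};
        int code = execute(new StandInReaderTool(), sample);
        // A real entry point would typically pass this value to System.exit.
        System.out.println("exit code: " + code); // prints "exit code: 0"
    }
}
```

Keeping the exit-code decision in run and leaving main as a thin wrapper makes the tool callable from other Java code, which can inspect the returned int instead of having the JVM terminate.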