org.apache.nutch.fetcher
Class OldFetcher
- java.lang.Object
- org.apache.hadoop.conf.Configured
- org.apache.nutch.fetcher.OldFetcher
- All Implemented Interfaces:
- org.apache.hadoop.conf.Configurable, org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.MapRunnable
,org.apache.hadoop.io.Writable,org.apache.hadoop.io.Text,NutchWritable>, org.apache.hadoop.util.Tool
public class OldFetcher extends org.apache.hadoop.conf.Configured implements org.apache.hadoop.util.Tool, org.apache.hadoop.mapred.MapRunnable<org.apache.hadoop.io.WritableComparable<?>,org.apache.hadoop.io.Writable,org.apache.hadoop.io.Text,NutchWritable>
The fetcher. Most of the work is done by plugins.
Nested Class Summary
Nested Classes Modifier and Type Class and Description static class
OldFetcher.InputFormat
Field Summary
Fields Modifier and Type Field and Description static String
CONTENT_REDIR
static org.slf4j.Logger
LOG
static int
PERM_REFRESH_TIME
static String
PROTOCOL_REDIR
Constructor Summary
Constructors Constructor and Description OldFetcher()
OldFetcher(org.apache.hadoop.conf.Configuration conf)
Method Summary
Methods Modifier and Type Method and Description void
close()
void
configure(org.apache.hadoop.mapred.JobConf job)
void
fetch(org.apache.hadoop.fs.Path segment,
int threads)
static boolean
isParsing(org.apache.hadoop.conf.Configuration conf)
static boolean
isStoringContent(org.apache.hadoop.conf.Configuration conf)
static void
main(String[] args)
Run the fetcher.
void
run(org.apache.hadoop.mapred.RecordReader
int
run(String[] args)
-
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
Field Detail
-
LOG
public static final org.slf4j.Logger LOG
-
PERM_REFRESH_TIME
public static final int PERM_REFRESH_TIME
- See Also:
- [Constant Field Values](../../../../constant-values.html#org.apache.nutch.fetcher.OldFetcher.PERM_REFRESH_TIME)
-
CONTENT_REDIR
public static final String CONTENT_REDIR
- See Also:
- [Constant Field Values](../../../../constant-values.html#org.apache.nutch.fetcher.OldFetcher.CONTENT_REDIR)
-
PROTOCOL_REDIR
public static final String PROTOCOL_REDIR
- See Also:
- [Constant Field Values](../../../../constant-values.html#org.apache.nutch.fetcher.OldFetcher.PROTOCOL_REDIR)
Constructor Detail
-
OldFetcher
public OldFetcher()
-
OldFetcher
public OldFetcher(org.apache.hadoop.conf.Configuration conf)
Method Detail
-
configure
public void configure(org.apache.hadoop.mapred.JobConf job)
- Specified by:
- <code>configure</code> in interface <code>org.apache.hadoop.mapred.JobConfigurable</code>
-
close
public void close()
-
isParsing
public static boolean isParsing(org.apache.hadoop.conf.Configuration conf)
-
isStoringContent
public static boolean isStoringContent(org.apache.hadoop.conf.Configuration conf)
-
run
public void run(org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.WritableComparable<?>,org.apache.hadoop.io.Writable> input, org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,NutchWritable> output, org.apache.hadoop.mapred.Reporter reporter) throws IOException
- Specified by:
- <code>run</code> in interface <code>org.apache.hadoop.mapred.MapRunnable<org.apache.hadoop.io.writablecomparable<?>,org.apache.hadoop.io.Writable,org.apache.hadoop.io.Text,NutchWritable></org.apache.hadoop.io.writablecomparable<?></code>
- Throws:
- <code>IOException</code>
-
fetch
public void fetch(org.apache.hadoop.fs.Path segment, int threads) throws IOException
- Throws:
- <code>IOException</code>
-
main
public static void main(String[] args) throws Exception
Run the fetcher.
- Throws:
- <code>Exception</code>
-
run
public int run(String[] args) throws Exception
- Specified by:
- <code>run</code> in interface <code>org.apache.hadoop.util.Tool</code>
- Throws:
- <code>Exception</code>