org.apache.nutch.parse

Interface Parse


public interface Parse

The result of parsing a page's raw content.

Method Summary

Modifier and Type    Method and Description
ParseData            getData()
                     Other data extracted from the page.
String               getText()
                     The textual content of the page.
boolean              isCanonical()
                     Indicates whether the parse comes from a URL or a sub-URL.

Method Detail

getText

String getText()

The textual content of the page. This is indexed, searched, and used when generating snippets.

getData

ParseData getData()

Other data extracted from the page.

isCanonical

boolean isCanonical()

Indicates whether the parse comes from a URL or a sub-URL.
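
The three methods above can be exercised with a small sketch like the one below. It is illustrative only and does not use the Nutch classes: `ParseSketch` is a local mirror of the documented method signatures so the example compiles without Nutch on the classpath, `ParseDataSketch` is a hypothetical stand-in for `ParseData` that carries just a title, and `SimpleParse`, `ParseDemo`, `snippet`, and `describe` are names invented for this example. The snippet logic is a plain truncation, assumed here for demonstration; it is not how Nutch generates snippets.

```java
// Hypothetical stand-in for org.apache.nutch.parse.ParseData.
// The real class is not reproduced here; only a title is modeled.
final class ParseDataSketch {
    private final String title;

    ParseDataSketch(String title) {
        this.title = title;
    }

    String getTitle() {
        return title;
    }
}

// Local mirror of the three documented methods (an assumption made so the
// sketch is self-contained; real code would implement the Nutch interface).
interface ParseSketch {
    String getText();

    ParseDataSketch getData();

    boolean isCanonical();
}

// Minimal immutable implementation that simply stores the three values.
final class SimpleParse implements ParseSketch {
    private final String text;
    private final ParseDataSketch data;
    private final boolean canonical;

    SimpleParse(String text, ParseDataSketch data, boolean canonical) {
        this.text = text;
        this.data = data;
        this.canonical = canonical;
    }

    @Override
    public String getText() {
        return text;
    }

    @Override
    public ParseDataSketch getData() {
        return data;
    }

    @Override
    public boolean isCanonical() {
        return canonical;
    }
}

final class ParseDemo {
    // Returns the trimmed text, cut to at most maxChars characters with
    // "..." appended when it had to be shortened.
    static String snippet(ParseSketch parse, int maxChars) {
        String text = parse.getText().trim();
        return text.length() <= maxChars ? text : text.substring(0, maxChars) + "...";
    }

    // Combines the extracted data, the canonical flag, and a text snippet.
    static String describe(ParseSketch parse) {
        String origin = parse.isCanonical() ? "canonical" : "non-canonical";
        return parse.getData().getTitle() + " [" + origin + "]: " + snippet(parse, 20);
    }

    public static void main(String[] args) {
        ParseSketch parse = new SimpleParse(
                "Apache Nutch is an extensible web crawler.",
                new ParseDataSketch("Nutch"),
                true);
        System.out.println(describe(parse));
    }
}
```

A consumer reads the text through `getText()`, everything else through `getData()`, and can branch on `isCanonical()` when parses for a URL and its sub-URLs need different handling.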

Copyright © 2014 The Apache Software Foundation