Uses of Interface

org.apache.nutch.net.URLNormalizer

Uses of URLNormalizer in org.apache.nutch.net.urlnormalizer.basic

Classes in org.apache.nutch.net.urlnormalizer.basic that implement URLNormalizer

Modifier and Type   Class and Description
class               BasicURLNormalizer
                    Converts URLs to a normal form:
                      • remove dot segments in path: /./ or /../
                      • remove default ports, e.g.
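The two steps named in the BasicURLNormalizer description (removing dot segments and dropping a default port) can be illustrated with a minimal standalone sketch. It relies only on java.net.URI and is not the Nutch implementation; the class name BasicNormalizeSketch and its normalize method are hypothetical, and the sketch assumes that only port 80 for http and port 443 for https count as default ports.

```java
import java.net.URI;
import java.net.URISyntaxException;

final class BasicNormalizeSketch {

    /** Returns the URL without dot segments or a default port, or the input if it cannot be handled. */
    static String normalize(String url) {
        URI uri;
        try {
            // URI.normalize() collapses "." segments and "segment/.." pairs in the path.
            uri = new URI(url).normalize();
        } catch (URISyntaxException e) {
            return url; // leave unparseable input untouched
        }
        String scheme = uri.getScheme();
        String host = uri.getHost();
        if (scheme == null || host == null) {
            return url; // this sketch only handles absolute URLs with a plain host
        }
        int port = uri.getPort();
        // Assumption: 80 and 443 are the only default ports considered here.
        boolean isDefaultPort = ("http".equalsIgnoreCase(scheme) && port == 80)
                || ("https".equalsIgnoreCase(scheme) && port == 443);

        // Rebuild from the raw (still percent-encoded) components so escapes are preserved.
        StringBuilder out = new StringBuilder();
        out.append(scheme).append("://");
        if (uri.getRawUserInfo() != null) {
            out.append(uri.getRawUserInfo()).append('@');
        }
        out.append(host);
        if (port != -1 && !isDefaultPort) {
            out.append(':').append(port);
        }
        if (uri.getRawPath() != null) {
            out.append(uri.getRawPath());
        }
        if (uri.getRawQuery() != null) {
            out.append('?').append(uri.getRawQuery());
        }
        if (uri.getRawFragment() != null) {
            out.append('#').append(uri.getRawFragment());
        }
        return out.toString();
    }
}
```

With this sketch, normalize("http://example.com:80/a/./b/../c") returns http://example.com/a/c, while a URL with a non-default port such as https://example.com:8443/x is returned unchanged.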

Uses of URLNormalizer in org.apache.nutch.net.urlnormalizer.host

Classes in org.apache.nutch.net.urlnormalizer.host that implement URLNormalizer

Modifier and Type   Class and Description
class               HostURLNormalizer
                    URL normalizer for mapping hosts to their desired form.
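As an illustration of what "mapping hosts to their desired form" can look like, the following standalone sketch rewrites the host of a URL using a lookup table. It is an assumption-laden example rather than the HostURLNormalizer code: the class HostMappingSketch, the mapHost method, and the idea of passing the mapping as a java.util.Map are all hypothetical, and how the real plugin is configured is not described on this page.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.Locale;
import java.util.Map;

final class HostMappingSketch {

    /** Replaces the host of the URL if the (lowercased) host has an entry in hostMap. */
    static String mapHost(String url, Map<String, String> hostMap) {
        URI uri;
        try {
            uri = new URI(url);
        } catch (URISyntaxException e) {
            return url; // leave unparseable input untouched
        }
        String host = uri.getHost();
        if (uri.getScheme() == null || host == null) {
            return url; // this sketch only handles absolute URLs with a plain host
        }
        // Hypothetical lookup: keys are lowercase source hosts, values are desired hosts.
        String desired = hostMap.get(host.toLowerCase(Locale.ROOT));
        if (desired == null) {
            return url; // no mapping for this host
        }
        // Rebuild the URL around the desired host, keeping all other raw components.
        StringBuilder out = new StringBuilder();
        out.append(uri.getScheme()).append("://");
        if (uri.getRawUserInfo() != null) {
            out.append(uri.getRawUserInfo()).append('@');
        }
        out.append(desired);
        if (uri.getPort() != -1) {
            out.append(':').append(uri.getPort());
        }
        if (uri.getRawPath() != null) {
            out.append(uri.getRawPath());
        }
        if (uri.getRawQuery() != null) {
            out.append('?').append(uri.getRawQuery());
        }
        if (uri.getRawFragment() != null) {
            out.append('#').append(uri.getRawFragment());
        }
        return out.toString();
    }
}
```

Given the mapping example.com to www.example.com, the call mapHost("http://example.com/index.html?x=1", map) returns http://www.example.com/index.html?x=1, and URLs on other hosts pass through unchanged.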

Uses of URLNormalizer in org.apache.nutch.net.urlnormalizer.pass

Classes in org.apache.nutch.net.urlnormalizer.pass that implement URLNormalizer

Modifier and Type   Class and Description
class               PassURLNormalizer
                    This URLNormalizer doesn't change URLs.

Uses of URLNormalizer in org.apache.nutch.net.urlnormalizer.querystring

Classes in org.apache.nutch.net.urlnormalizer.querystring that implement URLNormalizer

Modifier and Type   Class and Description
class               QuerystringURLNormalizer
                    URL normalizer plugin for normalizing query strings by sorting query string parameters.
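Sorting query string parameters makes URLs that differ only in parameter order compare equal. A minimal sketch of the idea follows; it is not the plugin's code, the names QuerySortSketch and sortQuery are hypothetical, and it assumes parameters are separated by "&" and sorted as whole name=value strings in natural String order.

```java
import java.util.Arrays;

final class QuerySortSketch {

    /** Returns the URL with its "&"-separated query parameters sorted; other parts are untouched. */
    static String sortQuery(String url) {
        // Split off the fragment first so a '?' inside it is not mistaken for the query start.
        int hash = url.indexOf('#');
        String fragment = hash >= 0 ? url.substring(hash) : "";
        String base = hash >= 0 ? url.substring(0, hash) : url;

        int q = base.indexOf('?');
        if (q < 0) {
            return url; // no query string, nothing to sort
        }
        String[] params = base.substring(q + 1).split("&");
        Arrays.sort(params); // natural String order on "name=value"
        return base.substring(0, q + 1) + String.join("&", params) + fragment;
    }
}
```

For example, sortQuery("http://example.com/p?b=2&a=1&c=3") returns http://example.com/p?a=1&b=2&c=3, so both orderings of the same parameters map to one URL.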

Uses of URLNormalizer in org.apache.nutch.net.urlnormalizer.regex

Classes in org.apache.nutch.net.urlnormalizer.regex that implement URLNormalizer

Modifier and Type   Class and Description
class               RegexURLNormalizer
                    Allows users to do regex substitutions on all/any URLs that are encountered, which is useful for stripping session IDs from URLs.
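To make the session-ID use case concrete, here is a small standalone sketch of a single regex substitution using java.util.regex. The pattern is an assumption chosen for illustration (a ";jsessionid=..." path parameter) and is not taken from the RegexURLNormalizer rules; the class SessionIdStripSketch and its strip method are likewise hypothetical.

```java
import java.util.regex.Pattern;

final class SessionIdStripSketch {

    // Assumed pattern: a ";jsessionid=<value>" path parameter, matched case-insensitively,
    // running up to the start of the query ('?') or fragment ('#').
    private static final Pattern JSESSIONID = Pattern.compile("(?i);jsessionid=[^?#]*");

    /** Removes every match of the session-ID pattern from the URL. */
    static String strip(String url) {
        return JSESSIONID.matcher(url).replaceAll("");
    }
}
```

Here strip("http://example.com/page.jsp;jsessionid=ABC123?x=1") returns http://example.com/page.jsp?x=1, and a URL without a session ID is returned as-is.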

Copyright © 2014 The Apache Software Foundation