Apache Nutch is a highly extensible and scalable open-source web crawler software project.

Nutch is a project of the Apache Software Foundation and is part of the larger Apache community of developers and users.