Harvest-NG
Harvest-NG is a Perl-based set of tools for building a standards-compliant web crawler.
More information about them, from the website:
Harvest-NG is a set of tools for building a standards-compliant web crawler. It is implemented in Perl, taking advantage of many of the existing perl tools, and can provide a complete harvesting solution for a web site.
Visit Harvest-NG


