FeedHall

nutch

Nutch (https://nutch.apache.org) is an open source web crawler, which means that the nutch user agent could be pretty much anything running from an unconfigured nutch-instance. The crawler does obey robots.txt

However, if you disallow nutch, you will be disallowing all nutch-based crawlers according to https://nutch.apache.org/bot.html .

Info

Regex: ^nutch