Apache Project Website Checks

Checking Project Websites for required and disallowed content

This script periodically crawls all Apache project and podling websites to check them for a few specific links or text blocks that all projects are expected to have. The checks verify that all required links appear on a project's homepage, and an "image" check reports whether the project's logo files are in apache.org/img.
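As a rough illustration of the first step such a check needs, the sketch below gathers every link on a homepage as an (href, link text) pair using only the Python standard library. It is not the crawler's actual code; the names `LinkCollector` and `collect_links` are hypothetical, and fetching the page is left out.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, link text) pairs from <a> tags. Hypothetical helper."""

    def __init__(self):
        super().__init__()
        self.links = []     # finished (href, text) pairs
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # Anchors without an href are ignored (href stays None).
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace so the text compares cleanly.
            text = " ".join("".join(self._text).split())
            self.links.append((self._href, text))
            self._href = None


def collect_links(html):
    """Return all (href, text) pairs found in an HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links
```

Each pair can then be compared against the link and text patterns that the individual checks expect.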

The script also checks for third-party resource references that might conflict with our privacy policy.
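One way such a resource check could work is sketched below: scan the tags that make a browser fetch something and group any URLs that point away from apache.org by host. This is only an assumption about the approach. The tag list, the rule that any `*.apache.org` host counts as internal, and the names `RESOURCE_ATTRS`, `ResourceCollector`, and `find_external_resources` are all hypothetical, and the real crawler's rules may differ.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse

# Assumption: tags (and their URL attribute) that trigger a resource fetch.
RESOURCE_ATTRS = {"script": "src", "img": "src", "iframe": "src", "link": "href"}


def is_apache_host(host):
    """Assumption: apache.org and its subdomains are not third parties."""
    return host == "apache.org" or host.endswith(".apache.org")


class ResourceCollector(HTMLParser):
    """Group externally hosted resource URLs by host name."""

    def __init__(self):
        super().__init__()
        self.external = {}  # host -> list of URLs as written in the page

    def handle_starttag(self, tag, attrs):
        attr = RESOURCE_ATTRS.get(tag)
        if attr is None:
            return
        url = dict(attrs).get(attr)
        if not url:
            return
        host = urlparse(url).hostname
        # Relative URLs have no host and are served by the site itself.
        if host and not is_apache_host(host):
            self.external.setdefault(host, []).append(url)


def find_external_resources(html):
    """Return a dict of external hosts to the resource URLs they serve."""
    parser = ResourceCollector()
    parser.feed(html)
    parser.close()
    return parser.external
```

A page that loads only its own or apache.org assets yields an empty dict.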

View the crawler code, website display code, validation check details, and raw JSON data.
Last crawl time: Mon, 30 Dec 2024 16:12:22 GMT over 218 websites.

Site Check For Project - Nutch

Results for Project Nutch.
The Check Results column is the actual text or URL found on the homepage for this check (when applicable).
Check Type: Uri
Check Results: https://nutch.apache.org/

Check Type: Foundation
Check Description: Text of a link expected to match regular expression: apache|asf|foundation
    All projects must feature some prominent link back to the main ASF homepage at http://www.apache.org/

Check Type: Events
Check Results: https://www.apachecon.com/event-images/snippet.js

Check Type: License
Check Description: URL expected to match regular expression: ^https?://.*apache.org/licenses/?$
    There should be a "License" (*not* "Licenses") navigation link which points to: http[s]://www.apache.org/licenses[/]. (Do not link to sub-pages)

Check Type: Thanks
Check Description: URL expected to match regular expression: ^https?://.*apache.org/foundation/(thanks|sponsors)
    "Sponsors", "Thanks" or "Thanks to our Sponsors" should link to: http://www.apache.org/foundation/thanks.html or sponsors.html

Check Type: Security
Check Description: URL expected to match regular expression: ^https?://.*apache.org/.*[Ss]ecurity
    "Security" should link either to a project-specific page [...], or to the main http://www.apache.org/security/ page.

Check Type: Sponsorship
Check Description: URL expected to match regular expression: ^https?://.*apache.org/foundation/sponsorship
    "Sponsorship", "Sponsor Apache", or "Donate" should link to: http://www.apache.org/foundation/sponsorship.html

Check Type: Trademarks
Check Results: . Apache Nutch, Nutch, Apache, the Apache feather logo, and the Apache Nutch project logo are trademarks of The Apache Software Foundation.

Check Type: Copyright
Check Results: © 2004-2024 The Apache Software Foundation.

Check Type: Privacy
Check Description: URL expected to match regular expression: \Ahttps://privacy\.apache\.org/policies/privacy-policy-public\.html\z | \Ahttps?://(?:www\.)?apache\.org/foundation/policies/privacy\.html\z
    All websites must link to the Privacy Policy.

Check Type: Resources
Check Results: Found 0 external resources: {}

Check Type: Image
Check Results: nutch.svg
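
The link checks listed above can be tried out with a short Python sketch. The patterns are copied from the check descriptions, with two adaptations that are assumptions: the Privacy pattern's `\z` anchors are written as Python's `\Z` and the spaces around its `|` are dropped, and the Foundation text match is made case-insensitive. The function name `run_checks` and the result layout are hypothetical and are not the crawler's own code.

```python
import re

# URL patterns taken from the check descriptions.
URL_CHECKS = {
    "license": re.compile(r"^https?://.*apache.org/licenses/?$"),
    "thanks": re.compile(r"^https?://.*apache.org/foundation/(thanks|sponsors)"),
    "security": re.compile(r"^https?://.*apache.org/.*[Ss]ecurity"),
    "sponsorship": re.compile(r"^https?://.*apache.org/foundation/sponsorship"),
    # Assumption: \z becomes Python's \Z (end of string only).
    "privacy": re.compile(
        r"\Ahttps://privacy\.apache\.org/policies/privacy-policy-public\.html\Z"
        r"|\Ahttps?://(?:www\.)?apache\.org/foundation/policies/privacy\.html\Z"
    ),
}

# Assumption: the Foundation link text is matched case-insensitively.
FOUNDATION_TEXT = re.compile(r"apache|asf|foundation", re.IGNORECASE)


def run_checks(links):
    """Apply each check to (href, text) pairs.

    Returns a dict mapping check name to the first matching URL or link
    text, or None when nothing on the page satisfied that check.
    """
    results = {name: None for name in URL_CHECKS}
    results["foundation"] = None
    for href, text in links:
        for name, pattern in URL_CHECKS.items():
            if results[name] is None and pattern.search(href):
                results[name] = href
        if results["foundation"] is None and FOUNDATION_TEXT.search(text):
            results["foundation"] = text
    return results
```

Called with the links found on a homepage, the function returns one entry per check, and a `None` entry marks a check that no link satisfied. A link to a licenses sub-page such as `LICENSE-2.0` does not satisfy the License pattern, which matches the "Do not link to sub-pages" rule.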