This script periodically crawls all Apache project and podling websites to check them for a few specific links or text blocks that all projects are expected to have. The checks include verifying that all required links appear on a project homepage, along with an "image" check if project logo files are in apache.org/img
The script also checks for 3rd party resource references that might be in conflict with our privacy policy.
View the crawler code, website display code, validation checks details, and raw JSON data.
Last crawl time: Thu, 03 Oct 2024 12:12:01 GMT over 218 websites.
Check Type | Check Results | Check Description |
---|---|---|
Uri | https://gobblin.apache.org | |
Foundation | Foundation | |
Events | https://www.apache.org/events/current-event.html | |
License | https://www.apache.org/licenses/ | |
Thanks | https://www.apache.org/foundation/thanks.html | |
Security | https://www.apache.org/security | |
Sponsorship | https://www.apache.org/foundation/sponsorship.html | |
Trademarks | Apache, Apache Gobblin, the Apache feather and the Gobblin logo are trademarks of The Apache Software Foundation | |
Copyright | Copyright © 2021 The Apache Software Foundation | |
Privacy |
URL expected to match regular expression:
\Ahttps://privacy\.apache\.org/policies/privacy-policy-public\.html\z
|
\Ahttps?://(?:www\.)?apache\.org/foundation/policies/privacy\.html\z
All websites must link to the Privacy Policy. |
|
Resources | Found 3 external resources: {"netdna.bootstrapcdn.com"=>1, "code.jquery.com"=>1, "fonts.googleapis.com"=>1} |
Text of a link expected to match regular expression:
Found \d+ external resources
Websites must not link to externally hosted resources |
Image | gobblin.svg |