Apache Project Website Checks

Checking Project Websites For required content

This script periodically crawls all Apache project and podling websites to check them for a few specific links or text blocks that all projects are expected to have. The checks include verifying that all required links appear on a project homepage, along with an "image" check if project logo files are in apache.org/img

View the crawler code, website display code, validation checks details, and raw JSON data.
Last crawl time: Wed, 28 Sep 2022 17:00:00 GMT over 206 websites.

Site Check For Project - Impala

Results for Project Impala .
Check Results column is the actual text or URL found on the homepage for this check (when applicable).
Check Type Check Results Check Description
Uri https://impala.apache.org/
Foundation Apache Software Foundation
Events http://www.apache.org/events/current-event-234x60.png
License https://www.apache.org/licenses/
Thanks https://www.apache.org/foundation/thanks.html
Security https://www.apache.org/security/
Sponsorship https://www.apache.org/foundation/sponsorship.html
Trademarks Apache Impala, Impala, Apache, the Apache feather logo, and the Apache Impala project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.
Copyright Text of a link expected to match regular expression: ((Copyright|©).*apache|apache.*(Copyright|©))
All website content SHOULD include a copyright notice for the ASF.
Privacy URL expected to match regular expression: \A(https://privacy\.apache\.org/policies/privacy-policy-public.html|https?://(www\.)?apache\.org/foundation/policies/privacy\.html)\z
All websites must link to the Privacy Policy.
Resources Found 2 external resources: {"ajax.googleapis.com"=>1, "maxcdn.bootstrapcdn.com"=>1} Text of a link expected to match regular expression: Found \d+ external resources
Websites must not link to externally hosted resources
Image impala.svg