Apache Project Website Checks

Checking Project Websites for required and disallowed content

This script periodically crawls all Apache project and podling websites and checks each one for specific links and text blocks that every project is expected to have. The checks verify that all required links appear on the project homepage, and an "Image" check looks for the project's logo files in apache.org/img.
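As a rough illustration of the kind of homepage link check described above, the following Python sketch collects every anchor on a page and looks one up by its link text. It is not the crawler's actual code: the `LinkCollector` class, the `find_link` helper, and the sample HTML are all hypothetical, and the real checks may locate links differently.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (text, href) pairs for every <a href=...> element on a page."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (text, href) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def find_link(html, wanted_text):
    """Return the href of the first link whose text equals wanted_text, else None."""
    collector = LinkCollector()
    collector.feed(html)
    for text, href in collector.links:
        if text.lower() == wanted_text.lower():
            return href
    return None


# Hypothetical homepage fragment, for illustration only.
SAMPLE = '<a href="https://www.apache.org/licenses/">License</a> <a href="/docs">Docs</a>'
print(find_link(SAMPLE, "License"))   # https://www.apache.org/licenses/
print(find_link(SAMPLE, "Security"))  # None
```

A link that is present but points somewhere unexpected would still be returned here; deciding whether the URL is acceptable is a separate step.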

The script also checks for third-party resource references that might conflict with our privacy policy.
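One way to approximate such a check is a static scan of the page's HTML for resource URLs whose host is outside apache.org. The Python sketch below does only that. It is an assumption-laden illustration, not the crawler's method: the names `is_external`, `ResourceScanner`, and `external_resources` are hypothetical, treating every non-apache.org host as third-party is probably stricter than the real check, and the real crawler may detect resources differently (for example by loading the page in a browser).

```python
from html.parser import HTMLParser
from urllib.parse import urlparse

# Assumption: hosts at or under apache.org count as first-party.
FIRST_PARTY_SUFFIX = "apache.org"

# Tags whose attribute usually points at a loaded resource.
# Simplification: <link> also covers relations that load nothing.
RESOURCE_ATTRS = {"script": "src", "img": "src", "iframe": "src", "link": "href"}


def is_external(url):
    """True if url names a host that is not apache.org or one of its subdomains."""
    host = urlparse(url).hostname
    if host is None:  # relative URL, so it is served from the same site
        return False
    return not (host == FIRST_PARTY_SUFFIX or host.endswith("." + FIRST_PARTY_SUFFIX))


class ResourceScanner(HTMLParser):
    """Record external URLs referenced by resource-loading tags."""

    def __init__(self):
        super().__init__()
        self.external = []

    def handle_starttag(self, tag, attrs):
        attr = RESOURCE_ATTRS.get(tag)
        if attr is None:
            return
        url = dict(attrs).get(attr)
        if url and is_external(url):
            self.external.append(url)


def external_resources(html):
    """Return the external resource URLs found in html, in document order."""
    scanner = ResourceScanner()
    scanner.feed(html)
    return scanner.external


# Hypothetical page fragment, for illustration only.
SAMPLE = (
    '<script src="https://cdn.example.com/lib.js"></script>'
    '<img src="/img/logo.png">'
    '<link rel="stylesheet" href="https://stormcrawler.apache.org/css/site.css">'
)
print(external_resources(SAMPLE))  # ['https://cdn.example.com/lib.js']
```

A static scan like this misses anything a script loads at run time, which is one reason a real checker might prefer to observe a rendered page.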

View the crawler code, website display code, validation check details, and raw JSON data.
Last crawl time: Thu, 26 Jun 2025 14:12:58 GMT over 219 websites.

Site Check For Project - StormCrawler

Results for Project StormCrawler.
The Check Results column shows the actual text or URL found on the homepage for each check (when applicable).
| Check Type | Check Results | Check Description |
|---|---|---|
| Uri | https://stormcrawler.apache.org | |
| Foundation | The Apache Software Foundation | |
| Events | https://www.apachecon.com/event-images/snippet.js | |
| License | | URL expected to match regular expression: ^https?://.*apache.org/licenses/?$ |
| | | There should be a "License" (*not* "Licenses") navigation link which points to: http[s]://www.apache.org/licenses[/]. (Do not link to sub-pages) |
| Thanks | https://www.apache.org/foundation/sponsors | |
| Security | https://www.apache.org/security/ | |
| Sponsorship | https://www.apache.org/foundation/sponsors | URL expected to match regular expression: ^https?://.*apache.org/foundation/sponsorship |
| | | "Sponsorship", "Sponsor Apache", or "Donate" should link to: http://www.apache.org/foundation/sponsorship.html |
| Trademarks | Apache StormCrawler, StormCrawler, the Apache feather logo are trademarks of The Apache Software Foundation. | |
| Copyright | © 2025 The Apache Software Foundation | |
| Privacy | https://privacy.apache.org/policies/privacy-policy-public.html | |
| Resources | Found 3 external resources: {"LOG Loaded ApacheCon Events Planner"=>1, "LOG rendering a wide format event banner for communityovercode2025 event..."=>1, "LOG Setting banner URL to https://communityovercode.org/"=>1} | Text of a link expected to match regular expression: Found \d+ external resources |
| | | Websites must not link to externally hosted resources |
| Image | | URL expected to match regular expression: . |
| | | Projects SHOULD add a copy of their logo to https://www.apache.org/logos/ to be included in ASF homepage. |
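
To see how the URL patterns quoted in the Check Description column relate to the values found, the short Python sketch below applies the License, Sponsorship, and Image patterns to the corresponding Check Results. It only illustrates the regular expressions and is not the crawler's validation code. The `matches` helper is hypothetical, and using an empty string for License and Image is an assumption based on those rows showing no found value.

```python
import re

# Patterns copied from the Check Description column.
EXPECTED = {
    "License": r"^https?://.*apache.org/licenses/?$",
    "Sponsorship": r"^https?://.*apache.org/foundation/sponsorship",
    "Image": r".",
}

# Values from the Check Results column.
# Assumption: an empty string stands in for rows where no value is shown.
FOUND = {
    "License": "",
    "Sponsorship": "https://www.apache.org/foundation/sponsors",
    "Image": "",
}


def matches(check, value):
    """True if value satisfies the pattern listed for the named check."""
    return re.search(EXPECTED[check], value) is not None


for check, value in FOUND.items():
    print(check, matches(check, value))  # prints False for all three rows

# URLs that would satisfy the same patterns:
print(matches("License", "https://www.apache.org/licenses/"))                      # True
print(matches("Sponsorship", "https://www.apache.org/foundation/sponsorship.html"))  # True
# A licenses sub-page is rejected because the pattern is anchored with $:
print(matches("License", "https://www.apache.org/licenses/LICENSE-2.0"))           # False
```

The Sponsorship value found, https://www.apache.org/foundation/sponsors, does not match because the pattern requires the path to begin with /foundation/sponsorship.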