Apache Project Website Checks

Checking Project Websites For required content

This script periodically crawls all Apache project and podling websites to check them for a few specific links or text blocks that all projects are expected to have. The checks include verifying that all required links appear on a project homepage, along with an "image" check if project logo files are in apache.org/img

View the crawler code, website display code, validation checks details, and raw JSON data.
Last crawl time: Tue, 19 Mar 2024 06:09:22 GMT over 216 websites.

Site Check For Project - PDFBox

Results for Project PDFBox .
Check Results column is the actual text or URL found on the homepage for this check (when applicable).
Check Type Check Results Check Description
Uri https://pdfbox.apache.org/
Foundation The Apache Software Foundation
Events https://www.apache.org/events/current-event.html
License https://www.apache.org/licenses
Thanks https://www.apache.org/foundation/thanks.html
Security https://pdfbox.apache.org/security.html
Sponsorship https://www.apache.org/foundation/sponsorship.html
Trademarks Apache PDFBox, PDFBox, Apache, the Apache feather logo and the Apache PDFBox project logos are trademarks of The Apache Software Foundation.
Copyright Copyright © 2009–2024 The Apache Software Foundation.
Privacy URL expected to match regular expression: \Ahttps://privacy\.apache\.org/policies/privacy-policy-public\.html\z | \Ahttps?://(?:www\.)?apache\.org/foundation/policies/privacy\.html\z
All websites must link to the Privacy Policy.
Resources Found 2 external resources: {"fonts.googleapis.com"=>2} Text of a link expected to match regular expression: Found \d+ external resources
Websites must not link to externally hosted resources
Image pdfbox.svg