This software package provides metrics for performance analysis. The metrics used in the package are described in detail.
At the core of this package lies the spider that fetches all the pages referred to in a web-site. For each page fetched, several metrics are checked. Here is a brief explanation of the kwalitee metrics used in this package:
- Return status: This metric is the most basic check for a web-page and evaluates if it can be fetched. The HTTP return-status should be 200 OK. The SCORE for this metric is 0 for a return status other than 200 and 1 for a return status of 200.
- Title: The next metric checks if the title tag-pair has content. The SCORE for this metric is 0 if there is no content and 1 if there is content.
- Valid: The next metric is to check if the (X)HTML in the page validates. The default behavior is to use the validator available on. The SCORE for this metric is 0 if it's not valid, 1 for valid, or validation disabled.
- Links: This metric ensures that web-pages do not contain "dead links." All hyperlinks are checked with an HTTP HEAD request to see if they can be "followed." URLs that have the same origin as the primary URL will also be put on the "to-fetch-list" of the spider. The MAX SCORE for this metric is 1 (excluding URLs specified by robot-rules/exclude pattern).
- Images: This metric checks if the web-page contains "dead images." All images are checked with an HTTP HEAD request to verify if they exist on the server. If the Image::Info module is available, the image is fetched from the server, and a basic sanity test on the image is done. The MAX SCORE for this metric is 1 (excluding images specified by robot-rules/exclude pattern).
- Styles: The next metric is to check if the web-page contains "dead style references." All styles referenced in are fetched. If validation is switched on, they will be sent to the CSS-validator at: TODO: Extract inline styles and send them off for validation. The MAX SCORE for this metric is 1.
Every individual page can have a maximum of 6 kwalitee points that lead to a kwalitee of 1.00. For the complete web-site, the mean of the page scores is taken and presented as a fraction of 1.00.
Version 0.018: N/A