Build Integrity Report

Version: v0.1 Status: Draft Published: 2026-05-26 Authors: HVTracker

1. Purpose

The build integrity report provides a machine-readable summary of each HVTracker data pipeline run. It answers: "What exactly was generated, from what source, how many succeeded, and what failed?"

If HVTracker is to serve as a trust registry, its own data pipeline must be transparent. This report is the self-audit.

2. Endpoint

GET /data/build_report.json

Generated during every cron run by fetch_and_build.py. Refreshes daily at 06:00 UTC alongside all other data endpoints.

3. Fields

FieldTypeDescription
generated_atstring (ISO 8601)Timestamp when the report was generated
data_timestampstringHuman-readable timestamp shown on the leaderboard
schema_versionstringData schema version (e.g. "v0.1")
methodology_versionstringMethodology version (e.g. "v2.0")
configured_agentsintegerTotal entries in agents.json (active + legacy)
active_agentsintegerSuccessfully fetched and scored agents
legacy_agentsintegerLegacy agents (inactive ≥365 days, rendered separately)
total_generatedintegerTotal agent profile pages generated (active + legacy)
categoriesobjectMap of category name → agent count
warningsarrayEligibility violations (criterion, repo, detail)
warning_countintegerNumber of eligibility warnings
failed_fetchesarrayRepos that could not be fetched (404, rate limit, etc.)
missing_repos_countintegerCount of failed fetches
package_failuresarrayRepos with configured package but no download data
package_failure_countintegerCount of package lookup failures
scorecard_unavailable_countintegerAgents without OSSF Scorecard data
fingerprint_agentsarrayRepos with fingerprint-based action tracking configured
fingerprint_agent_countintegerCount of fingerprint-tracked agents

4. Usage

Consumers can use this report to:

  • Monitor pipeline health (failed fetch count trending up = API issue)
  • Verify data freshness (compare generated_at against expectations)
  • Audit signal coverage (scorecard unavailable count, package failures)
  • Track growth (configured vs active agents over time)

This report is public. It is part of HVTracker's commitment to transparent data sourcing.