An independent, evidence-based trust comparison of DeepEval and Promptfoo, two Observability & Evaluation projects in the HVTracker registry. Scores come from public, checkable signals — supply-chain provenance, OSSF Scorecard, maintenance, and adoption — not popularity.
| Signal | DeepEvalconfident-ai/deepeval | Promptfoopromptfoo/promptfoo |
|---|---|---|
| HVTrust score | 100.0 | 100.0 |
| Evidence grade | A | A |
| Overall rank | #18 | #14 |
| Rank in Observability & Evaluation | #2 | #1 |
| GitHub stars | 16.6k | 22.9k |
| Last updated | 2d ago | today |
| Build provenance | No | Yes |
| OSSF Scorecard | 4.0 / 10 | — |
| License | Apache-2.0 | MIT |
| Downloads | 2.2M/wk | 384k/wk |
| Trust dimensions (points earned) | ||
| Safety / integrity / 25 | 6.1 | 7.5 |
| Identity & provenance / 20 | 10.8 | 18.0 |
| Transparency / 17 | 11.9 | 8.5 |
| Maintenance / 20 | 19.9 | 20.0 |
| Adoption / 20 | 18.6 | 17.9 |
How to read this: HVTrust (0–100) weighs supply-chain signals (provenance, OSSF Scorecard, signed commits, open license) alongside real-world adoption, scaled by an evidence-confidence factor. Grade bands: A ≥ 80, B ≥ 65, C ≥ 50, D < 50. Signals refresh daily. Full methodology v4.0 →