An independent, evidence-based trust comparison of DeepEval and Evidently, two Observability & Evaluation projects in the HVTracker registry. Scores come from public, checkable signals — supply-chain provenance, OSSF Scorecard, maintenance, and adoption — not popularity.
| Signal | DeepEval (confident-ai/deepeval) | Evidently (evidentlyai/evidently) |
|---|---|---|
| HVTrust score | 100.0 | 100.0 |
| Evidence grade | A | A |
| Overall rank | #17 | #29 |
| Rank in Observability & Evaluation | #2 | #3 |
| GitHub stars | 16.6k | 7.7k |
| Last updated | 2d ago | 63d ago |
| Build provenance | No | Yes |
| OSSF Scorecard | 4.0 / 10 | 3.5 / 10 |
| License | Apache-2.0 | Apache-2.0 |
| Downloads | 2.2M/wk | 296k/wk |
| Trust dimensions (points earned) | | |
| Safety / integrity (max 25) | 6.1 | 16.4 |
| Identity & provenance (max 20) | 10.8 | 18.0 |
| Transparency (max 17) | 11.9 | 11.5 |
| Maintenance (max 20) | 19.9 | 7.8 |
| Adoption (max 20) | 18.6 | 16.6 |
How to read this: HVTrust (0–100) weighs supply-chain signals (provenance, OSSF Scorecard, signed commits, open license) alongside real-world adoption, scaled by an evidence-confidence factor. Grade bands: A ≥ 80, B ≥ 65, C ≥ 50, D < 50. Signals refresh daily. Full methodology: v4.0.
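The grade bands and the per-dimension maxima can be read mechanically. The following is a minimal sketch, not HVTracker's implementation: it assumes the bands apply to a value on the 0–100 scale, and the names `grade_band` and `dimension_share` are hypothetical helpers introduced only for illustration. It does not attempt to reproduce the HVTrust score itself, because the evidence-confidence factor is not specified here.

```python
# Thresholds copied from the grade bands quoted above (A >= 80, B >= 65, C >= 50, D < 50).
# The list and function names are hypothetical, not part of any HVTracker API.
GRADE_BANDS = [(80.0, "A"), (65.0, "B"), (50.0, "C")]


def grade_band(score: float) -> str:
    """Map a 0-100 value to a letter grade using the quoted bands."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"score must be within 0-100, got {score}")
    # Bands are ordered from highest threshold to lowest, so the first match wins.
    for threshold, grade in GRADE_BANDS:
        if score >= threshold:
            return grade
    return "D"


def dimension_share(points: float, maximum: float) -> float:
    """Express points earned in one trust dimension as a percentage of its maximum."""
    if maximum <= 0:
        raise ValueError("maximum must be positive")
    return round(100.0 * points / maximum, 1)


if __name__ == "__main__":
    # Values taken from the comparison table above.
    print(grade_band(100.0))
    print(dimension_share(6.1, 25))    # DeepEval, Safety / integrity
    print(dimension_share(16.4, 25))   # Evidently, Safety / integrity
```

Expressing each dimension as a share of its own maximum makes rows with different caps (25, 20, 17) directly comparable, which the raw point values are not.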