RegistryCompare › DeepEval vs Promptfoo

DeepEval vs Promptfoo

An independent, evidence-based trust comparison of DeepEval and Promptfoo, two Observability & Evaluation projects in the HVTracker registry. Scores come from public, checkable signals — supply-chain provenance, OSSF Scorecard, maintenance, and adoption — not popularity.

DeepEval and Promptfoo are tied on HVTrust at 100.0/100. Full breakdown below.
Signal DeepEvalconfident-ai/deepeval Promptfoopromptfoo/promptfoo
HVTrust score 100.0 100.0
Evidence grade A A
Overall rank #18 #14
Rank in Observability & Evaluation #2 #1
GitHub stars 16.6k 22.9k
Last updated 2d ago today
Build provenance No Yes
OSSF Scorecard 4.0 / 10
License Apache-2.0 MIT
Downloads 2.2M/wk 384k/wk
Trust dimensions (points earned)
Safety / integrity / 25 6.1 7.5
Identity & provenance / 20 10.8 18.0
Transparency / 17 11.9 8.5
Maintenance / 20 19.9 20.0
Adoption / 20 18.6 17.9
Open in the live compare tool → DeepEval profile Promptfoo profile More Observability & Evaluation →

How to read this: HVTrust (0–100) weighs supply-chain signals (provenance, OSSF Scorecard, signed commits, open license) alongside real-world adoption, scaled by an evidence-confidence factor. Grade bands: A ≥ 80, B ≥ 65, C ≥ 50, D < 50. Signals refresh daily. Full methodology v4.0 →