An independent, evidence-weighted trust comparison of two observability & evaluation — ranked on verifiable signals, not popularity.
| Signal | Weights & Biases Weave | Promptfoo |
|---|---|---|
| HVTrust score | 84.9 | 73.8 |
| Evidence grade | B | B |
| Safety / Integrity (30) | 22.6 | 15.0 |
| Transparency (20) | 15.2 | 10.0 |
| Maintenance (20) | 19.9 | 20.0 |
| Adoption (10) | 7.2 | 8.8 |
| GitHub stars | 1.1k | 21.7k |
| Weekly downloads | 218,176 | 251,066 |
| Last push | 2026-05-29 | 2026-05-30 |
| Language | Python | TypeScript |
| OSSF Scorecard | 5.2 | — |