An independent, evidence-based trust comparison of Langfuse and MLflow, two Observability & Evaluation projects in the HVTracker registry. Scores come from public, checkable signals — supply-chain provenance, OSSF Scorecard, maintenance, and adoption — not popularity.
| Signal | Langfuselangfuse/langfuse | MLflowmlflow/mlflow |
|---|---|---|
| HVTrust score | 77.2 | 90.5 |
| Evidence grade | B | A |
| Overall rank | #43 | #7 |
| Rank in Observability & Evaluation | #3 | #1 |
| GitHub stars | 30.2k | 26.8k |
| Last updated | today | today |
| Build provenance | No | Yes |
| OSSF Scorecard | 6.4 / 10 | 5.5 / 10 |
| License | NOASSERTION | Apache-2.0 |
| Downloads | 6.9M/wk | 8.8M/wk |
| Trust dimensions (points earned) | ||
| Safety / integrity / 25 | 12.6 | 19.4 |
| Identity & provenance / 20 | 10.8 | 18.0 |
| Transparency / 17 | 13.9 | 13.2 |
| Maintenance / 20 | 20.0 | 20.0 |
| Adoption / 20 | 19.9 | 19.9 |
How to read this: HVTrust (0–100) weighs supply-chain signals (provenance, OSSF Scorecard, signed commits, open license) alongside real-world adoption, scaled by an evidence-confidence factor. Grade bands: A ≥ 80, B ≥ 65, C ≥ 50, D < 50. Signals refresh daily. Full methodology v3.2 →