Registry › Compare › Langfuse vs MLflow

Langfuse vs MLflow

An independent, evidence-based trust comparison of Langfuse and MLflow, two Observability & Evaluation projects in the HVTracker registry. Scores come from public, checkable signals — supply-chain provenance, OSSF Scorecard, maintenance, and adoption — not popularity.

MLflow leads on trust — 90.5/100 (Grade A) vs 77.2/100 (Grade B), a 13.3-point gap. Full breakdown below.

Signal	Langfuselangfuse/langfuse	MLflowmlflow/mlflow
HVTrust score	77.2	90.5
Evidence grade	B	A
Overall rank	#43	#7
Rank in Observability & Evaluation	#3	#1
GitHub stars	30.2k	26.8k
Last updated	today	today
Build provenance	No	Yes
OSSF Scorecard	6.4 / 10	5.5 / 10
License	NOASSERTION	Apache-2.0
Downloads	6.9M/wk	8.8M/wk
Trust dimensions (points earned)
Safety / integrity / 25	12.6	19.4
Identity & provenance / 20	10.8	18.0
Transparency / 17	13.9	13.2
Maintenance / 20	20.0	20.0
Adoption / 20	19.9	19.9

Open in the live compare tool → Langfuse profile MLflow profile More Observability & Evaluation →

How to read this: HVTrust (0–100) weighs supply-chain signals (provenance, OSSF Scorecard, signed commits, open license) alongside real-world adoption, scaled by an evidence-confidence factor. Grade bands: A ≥ 80, B ≥ 65, C ≥ 50, D < 50. Signals refresh daily. Full methodology v3.2 →