You're Not Installing What You Think You Are

June 2, 2026 · 6 min read · HVTracker Research

Run pip install litellm right now. You'll get a package with 122 million weekly downloads. You can read every line of the source code on GitHub. Everything looks legitimate.

But here's what you can't do: prove that the package you just installed was actually built from that source code.

There is no cryptographic attestation, no build provenance, no SLSA signature — nothing that links the binary on PyPI to the commit on GitHub. The install probably matches. But "probably" is doing a lot of heavy lifting when the thing you're installing can execute shell commands, call APIs, and access your files.

83%
of the 178 AI agents we track ship without build provenance

Downloads are not trust

This is the part most people get wrong. They see "122M weekly downloads" and think that's a safety signal. It's not. Downloads tell you how popular something is. They tell you absolutely nothing about whether the package on the registry was built from the source you're reading.

Here are the most-downloaded AI agent packages that ship with zero provenance:

AgentWeekly DownloadsStarsProvenance
LiteLLM122M48.9kNone
LangChain77M138.2kNone
Weaviate35M16.3kNone
Browser Use8.8M96.5kNone
Claude Code8.7M129.3kNone
Google ADK7.9M19.9kNone
Flowise6.4M53.3kNone
Qdrant5.1M31.7kNone
Langfuse4.9M28.3kNone
Composio3.6M10.6kNone

Combined: over 280 million weekly installs with no way to verify the build.

Think about what these tools do. LiteLLM proxies your API keys to LLM providers. Browser Use controls your browser. Claude Code executes shell commands. These aren't passive libraries — they're active agents with real permissions. The gap between "source on GitHub" and "binary on the registry" matters more here than anywhere.

The ones who actually do it

31 out of 178 agents we track publish build provenance. And here's the pattern that's hard to ignore: they dominate the top of our trust rankings.

AgentWeekly DownloadsStarsProvenanceHVTrust Rank
n8n218M190.6kVerified#6
LangGraph15.6M33.6kVerified#1
Codex13.6M87.6kVerified#3
Vercel AI SDK13.0M24.6kVerified#4
PydanticAI9.6M17.4kVerified#2
OpenAI Agents SDK6.1M26.8kVerified#5
Cline4.2M62.6kVerified#7
CopilotKit240k31.9kVerified#38

The top 7 agents on HVTracker all have verified provenance. That's not a coincidence — it's a signal. Projects that bother with build attestations also tend to sign commits, run OSSF Scorecard, and maintain security policies. Trust signals cluster.

What's actually in the gap

When a package ships without provenance, here's what sits between the source code and your machine — unverified:

The maintainer's local machine or CI pipeline → their registry credentials → the registry's build storage → your pip install.

At any of those steps, the package could be modified. A compromised CI secret. A hijacked npm/PyPI token. A supply chain attack on a build dependency. Without provenance attestation, you have no way to detect any of these after the fact.

With provenance, you can run npm audit signatures or check pip inspect attestations and get a cryptographic chain: this package was built at this commit, by this CI workflow, from this repo. If anything doesn't match, you know.

Why AI agents are different

When someone compromises a date-formatting library, the worst case is data exfiltration through a dependency. Bad, but bounded.

When someone compromises an AI agent, the worst case is whatever that agent was authorized to do:

The blast radius of a compromised AI agent is categorically larger than a compromised utility library. And yet 83% of them ship with the same level of supply chain verification as a left-pad clone: none.

It's not hard to fix

Both npm and PyPI support provenance attestation today. If your project uses GitHub Actions (and most do), enabling it is a configuration change — not a rewrite:

CopilotKit added provenance and their trust score jumped from the 60s to 75. It's one of the highest-leverage trust improvements a maintainer can make.

Not an accusation. Missing provenance doesn't mean a project is compromised. It means you can't independently verify that it isn't. For most packages, that's an acceptable risk. For an AI agent with tool access and API keys? You should at least know the gap exists.

Check if your agent has provenance

We track build provenance, OSSF Scorecard, signed commits, and 10+ other trust signals across hundreds of AI agents. Updated daily.

Browse the trust registry

Data from HVTracker signals as of June 2, 2026. Provenance is checked via npm registry attestations and PyPI PEP 740 metadata. Download counts from npm and PyPI APIs. Full methodology.