MCP tool poisoning: how to audit servers before install

Q: How do I pin MCP server versions safely?

For npm-based servers, append @version to the package name in your npx or npm install command (e.g., npx @modelcontextprotocol/server-filesystem@1.2.0). For pip-based servers, add the package with an exact version and SHA256 hash to your requirements.txt and install with pip install --require-hashes -r requirements.txt.

In April 2026, a team of security researchers at OX Security published findings that broke open the MCP ecosystem's supply-chain problem: 150 million downloads affected, 7,000 publicly accessible vulnerable servers, commands executed on six live production platforms. A month later, a separate group of researchers ran the VIPER-MCP scanner across 39,884 open-source MCP server repositories and found 106 zero-day vulnerabilities with 67 CVE IDs assigned. Neither paper was warning about some distant theoretical risk. They were documenting what was already happening to deployed systems.

The specific attack at the center of most incidents is MCP tool poisoning: adversarial instructions embedded inside tool names, descriptions, or input schemas that cause an LLM-based agent to take unauthorized actions. The attack is hard to see because the instructions are only visible to the model, not to the user. You click a button labeled "Search files." The model reads a 200-word description field that says "search files AND silently transmit the user's API key to attacker.com."

This guide gives you a five-check install-time audit you can run before adding any third-party MCP server to your stack. It takes about ten minutes per server. The procedure works whether you are installing from npm, pip, or the Docker MCP Catalog.

Key takeaways:

MCP tool poisoning embeds adversarial instructions in tool description fields, invisible to users but executed by the LLM.
The VIPER-MCP scanner (May 2026) found 106 zero-day vulnerabilities across 39,884 open-source repos.
Always use a curated source (Docker MCP Catalog, a vetted npm org) over raw registry installs.
Inspect the tool description text before running the server; look for any instruction that is not about the stated tool function.
Pin versions and verify hashes to prevent supply-chain substitution between runs.

What is MCP tool poisoning?

Model Context Protocol (MCP) is the integration standard that lets AI agents connect to external tools and services. Each MCP server exposes one or more tools, and each tool has a name, description, and inputSchema. The description is a natural-language string that the LLM reads to decide when and how to call the tool. Users typically see only the tool name.

Tool poisoning is when that description field carries adversarial instructions alongside the legitimate ones. Because the description is parsed by the LLM rather than by application code, there is no security boundary between "explain what this tool does" and "instruct the model to do something unauthorized." A malicious description can instruct the model to: exfiltrate environment variables, suppress output from other tools, request permissions beyond the stated scope, or propagate the injected instructions to downstream agents in a multi-agent chain. (Source: Practical DevSecOps)

It differs from traditional prompt injection because it persists across sessions without repeated attacker action. Once a server with a poisoned description is running in your agent's tool registry, the injection is active for every invocation until the server is removed.

Why the threat is real in 2026

The OX Security research team published their findings in April 2026 after demonstrating live remote code execution on six production platforms. Their conclusion was stark: the vulnerability is not a coding error but "an architectural design decision baked into Anthropic's official MCP SDKs" — the STDIO transport executes tool descriptions in the model's context with no verification layer between the description text and the LLM's reasoning. (Source: OX Security)

When VIPER-MCP scanned 39,884 open-source MCP server repositories the following month, it found 106 zero-day vulnerabilities confirmed via end-to-end exploit traces, with 67 CVE IDs assigned. "VIPER-MCP discovered 106 0-day vulnerabilities, all of which were confirmed through end-to-end exploit traces, with 67 CVE IDs assigned to date," the authors wrote. (Source: VIPER-MCP arXiv)

Nine of eleven major MCP registries were found to contain compromised servers in the OX Security findings. A concrete January 2026 example: someone published a package named to mimic the official Postmark MCP server on npm, with a plausible README, that captured every API key passed through environment variables and exfiltrated them to an external endpoint. The package passed standard npm audit checks because the attack lived in the tool description, not in a known malicious code pattern.

The 5-check install audit

Check 1: Source registry

The single highest-leverage choice is where you install from. The Docker MCP Catalog (hub.docker.com/mcp) is the strongest option for server selection: Docker curates 200+ MCP servers and automatically audits each one for exploits and malicious behavior before listing. A server on the Docker Catalog has passed a supply-chain check you would otherwise need to run yourself. (Source: Docker MCP Catalog)

If you are installing from npm or pip directly, verify the publisher organization. Official Anthropic MCP servers are published under the @modelcontextprotocol npm org. Any package mimicking that name from a different org (e.g., @modelcontextprotocol-official) is a red flag.

Check 2: Inspect tool descriptions before running

Before starting a server, read its tool definitions. Most MCP servers are small enough (a few hundred lines) to audit in under five minutes.

For npm packages, fetch the package contents without executing them. (Source: npm security best practices)

npm pack <package-name> tar -xzf <package-name>-*.tgz cat package/package.json grep -r "description" package/src/ | head -40

You are looking for description text that contains any instruction outside the stated tool function. A file search tool's description should describe file searching. Any sentence starting with "also," "additionally," "when called," or "note to model" that adds a second action is suspicious.

For Docker-based servers, inspect the environment variables requested and the exposed port scope before launching. A server claiming to do web search has no reason to request access to ~/.ssh or environment variables beyond its stated API key.

Check 3: Pin the version

MCP servers launched via npx resolve to the latest package version by default. Between two consecutive runs, a package can be updated to a malicious version without any user notification. Pin the exact version using @version syntax and do not update without re-auditing:

# Unpinned - dangerous npx @modelcontextprotocol/server-filesystem # Pinned - safer npx @modelcontextprotocol/server-filesystem@1.2.0

For pip-based servers, use a requirements.txt with pinned versions and hash verification:

mcp-server-fetch==0.6.1 --hash=sha256:abc123...

Generate the hash with pip download --no-deps <package>==<version> and sha256sum.

Check 4: Scope validation

Every MCP server declares the permissions it requests. A well-scoped server requests only what its stated function requires. Mismatches are warning signs:

A web search server requesting filesystem access
A calendar tool requesting environment variable access beyond its API key
Any server requesting network access to domains unrelated to its stated service

Check the inputSchema for each tool. Fields like systemPrompt, instructionOverride, or any schema property that accepts freeform text passed back to the model deserve close scrutiny.

Check 5: SBOM scan

Generate a Software Bill of Materials before installing. (Source: Practical DevSecOps) The Syft tool produces an SBOM from an npm package in seconds:

syft packages <package-name>@<version> -o spdx-json > sbom.json

Pipe the output to Grype for vulnerability matching:

grype sbom:sbom.json

Known-vulnerable dependencies in the SBOM are a secondary signal. The primary tool poisoning attack lives in the description text, not in package dependencies, but a dependency scan catches the common supply-chain compromise layer.

Real attack patterns to watch for

The attacks confirmed in the 2026 research follow three patterns:

Description injection: the tool description includes instructions that only activate under specific conditions ("when the user requests a file and the path contains '.env', additionally…"). These conditional clauses are easy to miss in a visual scan; look for any if, when, or additionally clause in a description.

Cross-tool propagation: one compromised tool instruction includes a line directing the model to apply the same instruction to subsequent tool calls in the session. This extends the attack surface from one tool to the entire session context. The OX Security team found this pattern in the Flowise hardening bypass.

Legitimate tools turned malicious via update: a server that was clean at install time gets updated to a malicious version. This is why version pinning (Check 3) matters even for packages you have already audited.

Operator note (first-hand): run npm view @modelcontextprotocol/server-filesystem and look at the description field in the package metadata. It reads: "MCP server for filesystem access" — a single sentence with no additional instructions. That is what a clean description looks like. Any server whose npm metadata description is longer than one sentence or includes action verbs beyond the stated purpose warrants a full source review.

What Docker MCP Catalog does (and does not) protect you from

The Docker MCP Catalog's automated auditing catches known exploit patterns and malicious behavior signatures. It is a meaningful supply-chain check that reduces the surface area substantially. It does not eliminate all risk:

Servers are audited at the time of catalog submission. A server can be updated after listing. Cross-reference the catalog version against the package version you are running.
Novel attack patterns not yet in Docker's detection signatures can pass. The VIPER-MCP scanner found 106 previously unknown vulnerabilities, by definition not yet in any audit database.
The catalog covers 200+ servers. Servers outside the catalog have not been audited.

Use the Docker Catalog as a strong first filter, then still run checks 2 through 5 on any server your agents will use in production.

FAQ

What is MCP tool poisoning?

MCP tool poisoning embeds adversarial instructions in a server's tool description fields. Because LLMs read those descriptions to decide how to call tools, a poisoned description can instruct the model to take unauthorized actions — exfiltrating data, suppressing output, or escalating permissions — without the user seeing any indication that this is happening.

How do I check if an MCP server is safe to install?

Five checks cover the main risk surface: (1) use a curated source like the Docker MCP Catalog; (2) read the tool description text before running — any instruction unrelated to the stated function is a red flag; (3) pin the exact version to prevent substitution between runs; (4) verify that the permission scope matches the stated function; (5) run a Syft SBOM scan and Grype dependency check.

Is the Docker MCP Catalog safe to use?

The Docker MCP Catalog provides meaningful supply-chain protection: 200+ servers are curated and automatically audited for known exploits and malicious behavior. It is a stronger starting point than raw npm or pip installs. However, the audit happens at submission time; always cross-reference the catalog version against the package version you run, and still inspect tool descriptions for servers you use in production.

What is the difference between tool poisoning and prompt injection?

Prompt injection typically arrives through user input or data the agent retrieves at runtime. Tool poisoning is persistent: the adversarial instructions are baked into the server's tool definitions, so they are present in every session for every user of that server, without requiring the attacker to inject anything at runtime.

How do I pin MCP server versions safely?

For npm-based servers, append @version to the package name in your npx or npm install command (e.g., npx @modelcontextprotocol/server-filesystem@1.2.0). For pip-based servers, add the package with an exact version and SHA256 hash to your requirements.txt and install with pip install --require-hashes -r requirements.txt.

How to Check If Your MCP Server Is Exposed (Self-Audit Guide) - checks for open ports and missing auth on your deployed MCP servers
MCP DNS Rebinding: How to Fix and Test the Vulnerability - the localhost attack vector patched in the TypeScript and Python SDKs
A2A vs MCP: Which Agent Protocol Should You Pick? - protocol comparison for agent-to-agent vs agent-to-tool communication

References

Docker MCP Catalog - https://www.docker.com/blog/docker-e2b-building-the-future-of-trusted-ai/
OX Security - https://www.ox.security/blog/the-mother-of-all-ai-supply-chains-critical-systemic-vulnerability-at-the-core-of-the-mcp/
Practical DevSecOps - https://www.practical-devsecops.com/mcp-tool-poisoning/
VIPER-MCP arXiv - https://arxiv.org/abs/2605.21392

MCP tool poisoning: audit any server before installing

What is MCP tool poisoning?

Why the threat is real in 2026