toolfence — security scanner for MCP servers

The other 80%

Auth answers one question. The attack surface has seven.

The MCP spec settled authentication — OAuth 2.1 for HTTP transports. That's roughly 20% of the risk. The rest lives in what the tools do and what their definitions say:

Tool poisoning

Malicious tool descriptions

A server ships a tool that tells the agent to read your secrets or ignore prior instructions. The agent complies. OAuth is happy.

Prompt injection

Adversarial tool metadata

Instructions smuggled into descriptions and schemas, fed verbatim into the agent's context window.

Tool drift

Definitions that change

A server you trusted silently redefines a tool after the fact. No one notices.

Scope explosion

Over-broad capability

Tools that touch the filesystem, run code, or reach the network with no per-call scoping.

Cost runaway

Catalog bloat

A 60K-token tool catalog prepended to every agent turn — inflating cost, latency, and mis-selection.

No audit

Invisible behavior

OAuth logs the login. It doesn't log the 47 tool calls the agent actually made, with what arguments.

12 checks · one command

Point it at a server. Get a severity-ranked report.

Works over Streamable HTTP, SSE, and local stdio. Markdown, JSON, and CI-friendly exit codes. Every detector is covered by a test that proves it fires on real attacks and stays quiet on benign servers.

Authentication posture
Server that lists tools with no credentials

Transport security
Plaintext HTTP for non-local endpoints

Prompt-injection signatures
Adversarial instructions in tool definitions

Known-bad signatures
Documented MCP abuse patterns, community-extensible

Tool integrity / drift
Definitions that changed since the last scan

Context cost
Catalogs large enough to inflate every turn

Rate-limit posture
No server-side ceiling on call volume

Naming hygiene
Duplicate or collision-prone tool names

Sensitive capability
Filesystem, code-exec, or network reach

Schema strength
Missing, untyped, or unsealed input schemas

Safety annotations
Missing readOnlyHint / destructiveHint

Unicode hygiene
Invisible / bidi / homoglyph characters hiding instructions

Where this goes

The scanner is the front door.

It tells you what's wrong. The hosted gateway stops it in production — output sanitization, per-call scope reduction, behavioral guardrails, and replayable audit, in front of every MCP server you run.

SHIPPING NOW

Open-source scanner

Find the risk before an agent connects. Free, forever.

Hosted gateway

Govern tool calls at runtime. Sanitize, scope, rate-limit, audit.

LATER

Tool marketplace

Curated, scanned, metered MCP servers — with monetization rails for authors.

Get started on GitHub Contribute a signature