Toolsmith · build your own MCP server

VCN #34 / Toolsmith / 2026-05-27 / Frontier Tower Floor 9

● live / build night

no SDK shortcuts / vcn #34 / your repo is an MCP server by 10pm

Rayyan Zahid · w/ Michalis Vasileiadis · Eric Mockler · Devinder Sodhi

> system boot

> loading model context protocol v.2025-06-18

> forging four tools · one server

● 200 OK · room F9 · welcome to the forge

● why now / the supply gap

MCP SERVER CARDS APPEAR ON FEWER THAN 15 SITES ON EARTH.

<15sites globally ship an MCP Server Card

4%

of the 200,000 top domains are agent-ready

3.9%

support Markdown content negotiation

The supply gap: Cloudflare's April 2026 study found MCP Server Cards on fewer than 15 sites. Shipping one real MCP server puts your repo in the top one percent of the agentic web.

Cloudflare Agent Readiness study, April 2026 / 200,000 most-visited domains analyzed. the supply side of the agentic web is nearly empty.

● the honest counter-signal

SUPPLY IS EXPLODING. DISCOVERY IS WHAT LAGS.

~4x

remote MCP deployments since May 2025. of the 20 most-searched servers, 80% offer remote deployment.

10,000+

active MCP servers as of early 2026. the protocol is past experiment, into production.

97M

monthly SDK downloads. the servers exist. the discoverable Server Cards are what nobody ships.

mcp_remote_growth / srinivasan_production. the gap is not "no servers." it is "no front door an agent can find." you fix the front door tonight.

● the promise

THE PROMISE.

you walk out tonight with your own MCP server. three of your functions, agent-callable. by 10pm.

7:30a six-line server that boots.

8:15the same server on a public URL.

9:00three of your tools wired in.

9:40the OAuth handshake you skip in dev and pay for in prod.

ship one working MCP server and you are, measurably, in the top ~1% of the agentic web. that is the receipt. now the foundation.

● what MCP is / one sentence

ONE PROTOCOL. ONE SENTENCE.

MCP is a JSON-RPC client-server interface for secure tool invocation and typed data exchange between an LLM application and external tools, data, and prompts.

MCP architecture: a host app runs a client that connects over JSON-RPC 2.0 to a server, which exposes three primitives -- Tools (actions), Resources (context), Prompts (templates).

ehtesham_survey (arXiv:2505.02279). that is the whole idea. everything tonight is an implementation detail of this one line.

● governance / who owns the spec

MCP IS NOT ANTHROPIC'S ANYMORE. IT IS A FOUNDATION PROJECT.

Anthropic donated MCP to the Agentic AI Foundation (AAIF) on 2025-12-09. AAIF is a directed fund under the Linux Foundation, co-founded by Anthropic, Block, and OpenAI.

MCP joins goose (Block) and AGENTS.md (OpenAI) as anchor projects. Platinum members include AWS, Anthropic, Block, Bloomberg, Cloudflare, Google, Microsoft, OpenAI. Each project keeps full autonomy over technical direction.

we teach (baseline)

2025-06-18

current revision (May 2026)

2025-11-25

2025-11-25 shipped on MCP's one-year public anniversary and adds an experimental Tasks primitive (any request can become call-now or fetch-later). We teach the first stable model and name its successor so nobody is misinformed.

aaif / mcp_spec_2025_06_18 / mcp_spec_2025_11_25.

● the surface / six features over JSON-RPC

THREE FEATURES THE SERVER OFFERS. THREE THE CLIENT OFFERS.

server features (you build these tonight)

Toolsactions the agent can take. model controlled.

Resourcescontext and data the agent can read. app controlled.

Promptsreusable templates the server offers. user controlled.

client features (the host provides these)

Samplingthe server asks the client's model to generate.

Rootsthe client tells the server which URI roots are in scope.

Elicitationthe server asks the user for more input mid-flow.

The three MCP server primitives: Tools are model controlled and act, Resources are app controlled and inform, Prompts are user controlled and guide.

mcp_spec_2025_06_18. tonight we ship Tools and one Resource. the other four are real, just not on the critical path.

● the wire / JSON-RPC 2.0

ONE REQUEST. ONE RESPONSE. AN ID TO CORRELATE.

initialize

opening handshake. params: protocolVersion, capabilities, clientInfo.

tools/list

discovery. returns a tools array, each with name, description, inputSchema.

tools/call

invocation. params name + arguments. reply is a content array.

// the call. ASCII-safe, copy it.
{"jsonrpc":"2.0","id":2,
 "method":"tools/call",
 "params":{
   "name":"add",
   "arguments":{"a":2,"b":3}}}

// the reply.
{"jsonrpc":"2.0","id":2,
 "result":{
   "content":[
     {"type":"text","text":"5"}],
   "isError":false}}

JSON-RPC 2.0 lifecycle: the client sends a request with jsonrpc 2.0, an id, a method, and params; the server replies with the same id and a result, or an error object. The id pairs response to request.

mcp_python_sdk. you already know this if you have used any RPC. the id pairs the reply to the request. that is the whole protocol.

● transports / the one hard rule

THREE WAYS TO SPEAK. ONLY ONE BELONGS IN PROD.

local dev

stdio

one process, one client, local only. what Claude Desktop and CLI tools expect. single-process glue.

production default

Streamable HTTP

network reachable, multi-client, bidirectional. one /mcp endpoint carrying a session.

streaming-heavy

SSE / WebSocket

streaming-style transport for the cases that need a long-lived push channel.

Transports compared: stdio runs a local subprocess for one client and caps out past about five clients; Streamable HTTP serves many remote clients over a single /mcp endpoint with sessions. Production rule: Streamable HTTP, never stdio.

The hard rule: production MCP is Streamable HTTP, never stdio. The spec even says STDIO implementations SHOULD NOT follow the HTTP authorization spec. stdio reads credentials from the environment, which is exactly why it does not belong in a multi-tenant deployment.

fastmcp / mcp_spec_2025_06_18. Segment 2 is the one-line pivot from the left card to the middle card.

● the layer map / where MCP sits

MCP IS HOW AN AGENT TALKS TO TOOLS. IT IS LAYER ONE OF FOUR.

MCP
tool access. JSON-RPC client-server. tonight.
VCN #34 / here
ACP
structured messaging. REST-native, multimodal.
then
A2A
agents talk to each other. capability-based Agent Cards.
VCN #35 / #36
ANP
open-network discovery. DIDs plus JSON-LD. agent marketplaces.
VCN #39

ehtesham_survey. the survey's phased roadmap is the exact arc this VCN series walks: MCP for tools, then ACP, then A2A, then ANP. WebMCP pushes MCP into the browser.

● the agent's-eye view

AN AGENT NEVER SEES YOUR CODE. IT SEES THIS.

The model gets back one thing from tools/list. A name. A description. An input schema. That prose IS the signal it acts on.

design rule, carried into Segment 3: verbose is fine. ambiguous is fatal. the description is written for the model, not the human reader.

// what the agent actually receives
{
  "name": "add",
  "description":
    "Add two integers a and b.
     Returns their sum.",
  "inputSchema": {
    "type": "object",
    "properties": {
      "a": {"type":"integer"},
      "b": {"type":"integer"}}}
}

mcp_python_sdk / tscg. this is the hinge of the whole night. the schema is the product, not the code behind it.

● clone-and-run / the reference set

SEVEN FIRST-PARTY SERVERS. ALL CLONE-AND-RUN.

Everything

reference and test server exercising prompts, resources, and tools.

Fetch

web content fetching and conversion for efficient LLM use.

Filesystem

secure file operations with configurable access controls.

Git

read, search, and manipulate Git repositories.

Memory

knowledge-graph-based persistent memory.

Sequential Thinking

dynamic, reflective step-by-step problem solving.

Time

time and timezone conversion.

+ yours

the eighth server is the one you ship tonight.

mcp_reference_servers (modelcontextprotocol/servers). intentionally minimal. read Everything first when you want to see all six features at once.

● the two pythonic paths

FROM SCRATCH? FASTMCP. ALREADY HAVE FASTAPI? FASTAPI_MCP.

FastMCP

"write a server from scratch."

decorator per tool. auto-schema from type hints. mcp.run() boots it.

PrefectHQ / Jeremiah Lowin. v3.3.1 (2026-05-15). Apache-2.0, Python >=3.10.

powers ~70% of MCP servers across all languages. ~1M downloads a day.

fastapi_mcp

"make an existing FastAPI product agent-callable."

wraps your FastAPI app via ASGI. mounts MCP at /mcp.

reuses Depends() for auth. OAuth 2.1 per spec.

Tadata. the path for builders already running a FastAPI service. no rewrite.

fastmcp / fastapi_mcp. we use FastMCP tonight because we are building from zero. neither is evangelized. the choice is about your starting point.

● the empirical reality / 177,000 tools

AGENTS USED TO READ. NOW THEY ACT.

177,436 MCP tools studied / Nov 2024 to Feb 2026

action share, start27%

action share, now65%

Tools split into perception (read), reasoning (analyze), and action (modify the world). Software development is 67% of all agent tools and 90% of MCP server downloads. The room tonight is the dead center of the distribution.

stein_177k (arXiv:2603.23802, UK AISI + Bank of England). higher stakes are arriving: payment-execution servers grew from 47 to 1,578 in thirteen months.

● the reality check / MCP-Universe

FRONTIER MODELS FAIL THE MAJORITY OF REAL MCP TASKS.

GPT-5 / best in class

43.72%

strongest in Financial Analysis (67.50%) and 3D Design (52.63%).

Claude-4.0-Sonnet

29.44%

benchmarked across 6 domains spanning 11 real-world MCP servers.

long-context. the catalog grows past what the model holds well.

unknown tools. the model cannot tell which tool to call.

mcp_universe (Salesforce AI Research, arXiv:2508.14704). both failure modes are server-design problems, not just model problems. the gap is your opportunity.

● the flipped constraint / TSCG

THE BOTTLENECK IS NOT THE MODEL. IT IS YOUR SCHEMA.

The old question was does a server even exist. Stein plus TSCG show the binding constraint flipped to do small models reliably use the schema. Production frameworks ship tool schemas as JSON, a format built for machine parsing, not for model interpretation.

0%

|

84.4%

Phi-4 14B tool-use accuracy at 20 tools / only the schema changed

tscg (arXiv:2605.04107) / stein_177k. for 4B to 14B models this protocol mismatch is the majority of tool-use failure. a server existing is necessary, not sufficient. we return to this hard at the discord slide.

● tonight / the four stops

FOUR SEGMENTS. EACH ONE LEAVES YOU WITH SOMETHING THAT RUNS.

1

The six-line hello world

FastMCP add(a, b) over stdio. The minimum legal MCP server. 90 seconds.

2

From stdio to Streamable HTTP

Same server, production transport, behind ngrok. One argument changes.

3

Bring your own tools

Wire three of your existing functions in. Add a Resource for context.

4

The OAuth handshake

RFC 9728, the well-known file. Skipped in dev, paid for in prod.

fastmcp / mcp_spec_2025_06_18. talk, then hands-on, per segment. you build alongside, not after.

● run of show / the next three hours

THE NEXT THREE HOURS.

7:00

Doorsgrab a seat, get on wifi, run the prereq check.

7:30

Talks + buildthe four segments, talk then hands-on, you build alongside.

9:00

Hands-on pushwire your own tools, get unstuck, ship the server.

9:45

Socialdemos, what you shipped, who is building what.

Frontier Tower Floor 9. ask out loud the moment you are stuck. a broken demo in the room beats a clean one you watched.

● setup / run this before 7:30

THREE LINES OF SETUP. THE HANDOUT HAS THE REST.

GET READY TO FORGE.python 3.11+, fastmcp, and ngrok for the public-url segment. that is all four stops need.

3 minterminal · one fresh folder

Open the labs ->

python --version # need 3.11+ (FastMCP needs >=3.10)
pip install fastmcp # the only dependency for stops 1 to 3
ngrok version # for stop 2, the public URL. install if missing.
# clone the handout repo. each of the four stops is one folder.
git clone the-handout-repo # 01_hello_stdio 02_http_ngrok 03_byo_tools 04_oauth

fastmcp. every handout server actually boots. the run-receipts are pasted in the repo so you can diff your terminal against ours.

● the five numbers / hold these all night

THE NUMBERS THAT MAKE THE CASE.

<15

sites on earth ship an MCP Server Card. one tonight puts you in the top ~1%.

43.72% / 29.44%

GPT-5 and Claude-4.0-Sonnet on real MCP tasks. frontier models fail most of it.

0 to 84.4%

Phi-4 14B tool-use, schema rewrite only. the bottleneck is the schema, not the model.

177,436

MCP tools studied. action share rose from 27% to 65%. agents do, not just read.

~70%

of all MCP servers run on FastMCP. ~1M downloads a day. six lines is a real server.

97M

monthly SDK downloads. 10,000+ active servers. past experiment, into production.

cloudflare_agent_readiness / mcp_universe / tscg / stein_177k / fastmcp / srinivasan_production.

● the receipt / why tonight is rare company

A RUNNING SERVER IS A RECEIPT.

with Server Cards on fewer than 15 sites and 4% of top domains agent-ready, the builder who walks out with a running, agent-callable MCP server is measurably in the top ~1% of the agentic web.

Not a slide you watched. A server you can curl. Three of your functions an agent can call. The supply side of the agentic web is nearly empty. Tonight you put something on it.

cloudflare_agent_readiness. we close on this exact receipt at the end. first we build it.

● end of foundation / now we build

segment 1 of 4

THE SIX-LINE HELLO WORLD.

Enough context. FastMCP add(a, b) over stdio. The minimum legal MCP server, in about ninety seconds. Open your terminal.

segment 1 / 4

The six line hello world.

FastMCP turns one plain Python function into an agent callable tool. No SDK boilerplate, no schema written by hand. Six lines in a file, one run command, and a real MCP server is alive on your laptop. This is the minimum legal server, and it is the spine of everything that follows tonight.

target 90 seconds transport stdio lab 01

● segment 1 · the whole server

● server.pypython

from fastmcp import FastMCP

mcp = FastMCP("hello")

@mcp.tool
def add(a: int, b: int) -> int:
    """Add two integers and return the sum."""
    return a + b

if __name__ == "__main__":
    mcp.run()

That is a complete MCP server. Six lines that matter. There is no seventh.

FastMCP, PrefectHQ. powers ~70% of MCP servers, ~1M downloads a day. [fastmcp]

● segment 1 · every line is load bearing

importFastMCP is the server. It owns the registry of tools, resources, and prompts, and it knows how to speak the wire protocol.
mcp = FastMCPThe string "hello" is the server name an agent sees on connect. The first piece of identity the agent reads. Make it specific.
@mcp.toolThe decorator is the whole trick. It reads your type hints and docstring and auto generates the JSON schema. You never write the schema by hand.
def add(...)The signature becomes the input schema (a: int, b: int). The docstring becomes the description, which is the agent's primary signal for when to call.
return a + bPlain Python. Do the work. The return value is marshalled back across the wire into the tool result.
mcp.run()Boot it. With no argument this defaults to stdio. Reads JSON-RPC on stdin, writes responses on stdout. That is all stdio is.

type hints become the schema; the docstring becomes the description. [fastmcp][mcp_python_sdk]

● segment 1 · boot it

Two commands. It is alive.

terminalany os

# install once
pip install fastmcp

# run the server
python server.py

It prints a startup banner, then sits there silently waiting for a client on stdin. That is correct. A bare stdio server has no terminal UI. It is a pipe, not an app. Stop it with Ctrl-C.

mcp.run() with no argument defaults to stdio transport. [fastmcp]

● segment 1 · why six lines is enough

A server that advertises even one tool over stdio satisfies the full call loop.

initializeClient opens. Server and client negotiate protocolVersion and capabilities. The handshake.

tools/listServer returns your add tool with its auto built inputSchema. Discovery.

tools/callClient invokes add with arguments. Server runs your Python, returns the result. Action.

all three over JSON-RPC 2.0. that loop is the whole protocol. [mcp_python_sdk]

● segment 1 · inspect it

A stdio server is meant to be driven, not typed at.

MCP Inspector

The official local web UI. See the tool list, fire add from a form, read the raw frames.

Claude Desktop

Point its MCP config at python /full/path/server.py, then ask it to add two numbers.

Cline

Same config shape inside VS Code. All three speak the same JSON-RPC loop over the same pipe.

launch the inspector

fastmcp dev server.py

Claude Desktop, Cline, MCP Inspector. one loop, any client. [mcp_python_sdk]

● segment 1 · call the tool

client sends · tools/call

{"jsonrpc":"2.0","id":2,
 "method":"tools/call",
 "params":{
   "name":"add",
   "arguments":{"a":2,"b":3}
 }}

->

server replies · id 2

{"jsonrpc":"2.0","id":2,
 "result":{
   "content":[
     {"type":"text","text":"5"}
   ],
   "isError":false
 }}

One request. One response. The id pairs them. The "text":"5" is your function's return value, marshalled back across the wire. That is an MCP call.

tools/call params name + arguments; result is a content array. [mcp_python_sdk]

lab 01 · ~5 min · segment 1

Six lines. The server lives.

Install FastMCP, write the six lines, boot it, then call the tool with a raw JSON-RPC ping. Copy buttons on every block. macOS, Linux, and Windows PowerShell variants. Do it now on your own laptop.

open lab 01 · six line server ->

handout: handouts/01_hello_stdio/ · receipts in RUN-RECEIPTS.md

● segment 1 · the trap that kills the demo

cp1252 will murder your server on connect.

UnicodeEncodeError: 'charmap' codec can't encode character

arrow glyphuse -> in any string a Windows stdout will print

curly quotesuse straight ' and " only

em dashuse a hyphen or a period, never the long dash

check markswrite PASS and FAIL in plain ASCII

Windows default stdout is cp1252. One fancy character in a print() or a docstring crashes the process. And stdio IS stdout, so a stray print to stdout corrupts the JSON-RPC stream. Keep all logging on stderr. If you genuinely need Unicode IO: import sys; sys.stdout.reconfigure(encoding="utf-8").

caught live on every Windows attendee in a prior lab. [self: manifest][mcp_spec_2025_06_18]

checkpoint 1 / 4

You have a running server.

PASSa file that boots and speaks MCP over stdio
PASSone tool an agent can discover and call
PASSa schema you never wrote by hand

next: this exact server, production transport, a public URL.

segment 2 / 4

stdio to Streamable HTTP.

stdio is a local pipe. It works for one client on your machine and dies the moment you want more. The good news: the server does not change. You flip one argument in run(), and the same tools answer HTTP at /mcp. Then ngrok puts that endpoint on the public internet.

change one line transport http lab 02

● segment 2 · why stdio dies in prod

stdio is local glue. Production is not local.

one client

One process pair, one stdin/stdout pipe. No concurrency. It does not scale to many concurrent clients.

no addressing

There is no URL and no network. Nothing remote can reach it. A pipe has no front door.

the spec agrees

stdio implementations SHOULD NOT follow the HTTP auth spec and read credentials from the environment instead. By design, not multi tenant.

MCP-Universe names long-context and unknown-tools as failure modes that compound under real load. [fastmcp][mcp_universe][mcp_spec_2025_06_18]

● segment 2 · the pivot

The whole production move is one argument.

server.py · the diffpython

    mcp.run()
    mcp.run(transport="http", host="127.0.0.1", port=8000)

Everything above the run call is byte for byte identical to Segment 1. transport="http" is FastMCP's Streamable HTTP transport. The string "streamable-http" is an accepted alias for the exact same thing, so you will see both in the wild.

same server, same tools, one argument. the most memorable lesson of the night. [fastmcp]

● segment 2 · the http shape

An endpoint at /mcp that carries a session.

response headers + first frame200 ok

HTTP/1.1 200 OK
content-type: text/event-stream
mcp-session-id: 40d353dd9f414e749b462b15fe160a91

event: message
data: {"jsonrpc":"2.0","id":1,"result":{
  "protocolVersion":"2025-06-18", ... ,
  "serverInfo":{"name":"hello-http","version":"3.3.1"}}}

endpointthe MCP endpoint is mounted at /mcp/. A request to /mcp 307-redirects to it. A 307 keeps the POST body, but curl needs -L to follow.
sessionthe server returns an mcp-session-id header. A real client reads it and sends it back on every following request.
streamthe reply is a text/event-stream frame. That is the Streamable half. The server can push notifications down the same stream.

verbatim from the booted server in RUN-RECEIPTS.md. [fastapi_mcp]

● segment 2 · go public

ngrok hands localhost a public https URL.

one time setup, then a second terminal

# register your token once
ngrok config add-authtoken YOUR_TOKEN

# with server.py running on 8000, tunnel it
ngrok http 8000

what ngrok prints

Forwarding  https://a1b2-203-0-113-7.ngrok-free.app -> http://localhost:8000

Your public MCP endpoint is that URL with /mcp on the end. Hand it to any remote client, anywhere. The server you wrote in Lab 01 is now on the internet.

standard ngrok usage; no special MCP claim. [self]

● segment 2 · the silent failure

No CORS, and browser agents fail silently.

The moment a browser based agent reaches your server, the browser sends a preflight. Miss the CORS headers and the request dies with no useful error in your logs, just a dead client. ngrok forwards faithfully, so this bites in prod, not in your terminal tests. Set CORS on both paths:

/mcpthe tool surface every agent hits

/.well-known/*discovery + the OAuth metadata from Segment 4

Allow your agent's origin, the POST and GET methods, and the Mcp-Session-Id header. FastMCP lets you pass CORS middleware when you build the HTTP app.

browser-side agents silently fail without CORS on both paths. [cloudflare_remote_mcp]

● segment 2 · prove it from anywhere

One curl, and a remote client talks to your server.

from any machineinitialize

curl -s -L -X POST https://a1b2-203-0-113-7.ngrok-free.app/mcp \
  -H "Content-Type: application/json" \
  -H "Accept: application/json, text/event-stream" \
  -d '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2025-06-18","capabilities":{},"clientInfo":{"name":"remote","version":"0"}}}'

Streamable HTTP needs both content types in the Accept header. The -L is load bearing: without it curl will not follow the 307 and you get an empty body. Point any remote MCP client at the same URL and it speaks the same loop.

the trailing-slash 307 trap, verbatim from the handout. [fastapi_mcp]

lab 02 · ~7 min · segment 2

One argument moves you to production.

Flip the transport, watch the server bind a port, confirm the endpoint answers, then tunnel it with ngrok and curl it from anywhere. The trailing-slash 307 trap and the stdio-vs-HTTP comparison are both in here.

open lab 02 · streamable http ->

handout: handouts/02_http_ngrok/ · receipts in RUN-RECEIPTS.md

● segment 2 · the two transports

Same tools. Two ways to reach them.

The hard rule, one more time: production MCP is Streamable HTTP, never stdio. stdio is the local dev pipe. HTTP is the front door.

[fastmcp][mcp_spec_2025_06_18]

checkpoint 2 / 4

It is deployable.

PASSthe same server, now on Streamable HTTP
PASSa real /mcp endpoint carrying a session
PASSa public URL a remote agent can reach
PASSyou avoided the number one prod mistake: stdio in production

next: hardening the posture, then wiring your own tools.

● segment 2 · production posture

The recipe: one session per tenant, isolated.

You do not need this tonight. You need to know it exists. On Cloudflare, McpAgent gives a Durable Object per client session with built in state and both SSE and Streamable HTTP transports. Multi tenant isolation is a trustedIssuers resolver that reads identity per tenant without a redeploy.

contractsstable server contracts the agent can rely on

contextper user context, scoped and isolated

timeoutsexplicit timeouts and error shapes

observabilityyou can see what the agent did

the five design dimensions to harden once you are remote. [cloudflare_remote_mcp][srinivasan_production]

segment 2 done · up next, segment 3

Your server is reachable. Now make it worth reaching.

A toy add over a public URL proves the protocol. The next move is the one you came for: wrap three of your own functions, and give the agent the context it needs to call them well.

segment 3: bring your own tools.

SEGMENT 3 / 4

Bring your
own tools.

You already have functions. This stop makes three of them agent-callable, then hands the agent context to ground on.

stop 1 hello world DONE stop 2 streamable HTTP DONE stop 3 your tools stop 4 oauth

Same decorator. Three real functions.

Wrapping is the move you already know. @mcp.tool over a function. The type hints become the schema. No rewrite.

from fastmcp import FastMCP

mcp = FastMCP("byo-tools")

@mcp.tool
def convert_length(value: float, from_unit: str, to_unit: str) -> float:
    """Convert a length between metric and imperial units.

    Supported units (case-insensitive): mm, cm, m, km, in, ft, yd, mi.
    Returns the converted value as a float. Raises ValueError on an
    unknown unit so the agent gets a clear, actionable error.
    """
    ...

@mcp.tool
def password_strength(password: str) -> dict:
    """Estimate the strength of a password and explain the score. ..."""
    ...

@mcp.tool
def text_stats(text: str) -> dict:
    """Return word count, character count, and a reading-time estimate. ..."""
    ...

convert_length password_strength text_stats

The agent never sees your code.

Verbose is fine.
Ambiguous is fatal.

Name the units and the return shape in the docstring. The agent reads prose, not your function body.
Raise a clear error, never a silent wrong answer. An agent recovers from "unknown to_unit: foo". It cannot recover from a plausible wrong number.
Keep the parameter list tight. Small models fail on bloated schemas before they fail on logic.

The description prose IS the agent's primary signal for whether and how to call. (TSCG)

Tools act. Resources inform.

Tool / a verb

Do this.

An action the agent invokes. It computes or changes the world. It can have side effects. convert_length is a tool.

Resource / a noun

Know this.

Read-only context the agent pulls in, addressed by a URI like a file. No side effects. guide://tool-design is a Resource.

Mixing the two is the classic beginner error. "Get the current config" that only reads is a Resource. "Restart the service" is a Tool.

Hand the agent context to ground on.

A Resource is just a function with @mcp.resource and a URI. The agent reads it. It does not call it like an action.

@mcp.resource(
    "guide://tool-design",
    name="tool_design_guide",
    description="House rules for writing agent-callable tool descriptions.",
    mime_type="text/markdown",
)
def tool_design_guide() -> str:
    """Serve the tool-design cheat sheet as a Resource."""
    return (
        "# Tool design for agents\n\n"
        "1. The description is the agent's only signal. "
        "Verbose is fine, ambiguous is fatal.\n"
        "2. Name the units and the return shape in the docstring.\n"
        "..."
    )

In a real client: resources/list shows guide://tool-design, then resources/read pulls the document into context. Booted in the handout, receipt is real.

What the agent sees when your schema is bad.

tools/list -> "do_thing"
  desc: "performs the operation
         on the input as configured"
  params: { x, y, z, opts, flags, ctx,
            mode, extra, meta, ... }

small model picks: nothing
                   wrong tool
                   empty args
result: FAIL

# tighten one line:
  desc: "Convert length between
         mm,cm,m,km,in,ft,yd,mi.
         Returns a float."

small model picks: convert_length
result: PASS

Same model. Same task. Only the schema changed.

0%->84.4%

Phi-4 14B tool-use accuracy at 20 tools, deterministic schema rewrite. 90.3% at 50 tools.

The constraint flipped. Not "does a server exist." Now: "do small models actually USE the schema."

TSCG, arXiv 2605.04107. JSON schemas are parser-shaped, not model-shaped. For 4B to 14B models, this mismatch is the majority of tool-use failure.

LAB 3 · make it yours

Three sample tools and one Resource, ready to run. Delete the samples, paste your own functions, ship.

pip install fastmcp
python server.py (stdio, 3 tools + 1 resource)
list_tools -> convert_length, password_strength, text_stats
swap in a function you already wrote

Lab 3 · wire 3 functions + a resource →

Checkpoint · segment 3

Your repo is now agent-callable.

Real functions, exposed as tools. A Resource for grounding. Schemas a small model can read. That is a working server other agents can pick up and use.

SEGMENT 4 / 4

The OAuth
handshake.

In dev you skip auth. In prod you pay for it. The thing you actually ship is one JSON file at a fixed URL.

stop 1 DONE stop 2 DONE stop 3 DONE stop 4 oauth

Auth is not optional theater.

The MCP Safety Audit coerced Claude 3.7 and Llama-3.3-70B, via tools on DEFAULT servers, into three attack classes.

code execmalicious code execution through an exposed tool
remote controlremote access control of the host
cred theftcredential theft from the environment

The spec's named risk is the confused-deputy problem. A server acting as a middleman to third-party APIs gets tricked into using stolen authorization codes to obtain tokens without user consent.

The audit ships MCPSafetyScanner, the first agentic tool to audit an arbitrary MCP server before deployment.

RFC 9728

OAuth Protected Resource Metadata.

Under spec 2025-06-18 a protected MCP server is an OAuth 2.1 Resource Server. Auth is optional, but when present over HTTP it follows this exactly.

SERVER MUST
implement RFC 9728 and include an authorization_servers field with at least one auth server.
SERVER MUST
return WWW-Authenticate on a 401, pointing at the resource metadata URL.
CLIENT MUST
send a resource parameter (RFC 8707) binding the token to ONE server, so a malicious server cannot reuse it.
SERVER MUST
reject tokens that do not name it in the audience claim, and never pass a client token upstream.

The whole thing you ship is this file.

One route. Two load-bearing fields. It says WHERE to authenticate. It does not mint tokens.

@mcp.custom_route("/.well-known/oauth-protected-resource", methods=["GET"])
async def oauth_protected_resource(request: Request) -> JSONResponse:
    metadata = {
        "resource": RESOURCE_BASE + "/mcp/",
        "authorization_servers": [AUTH_SERVER],
        "bearer_methods_supported": ["header"],
        "scopes_supported": ["mcp.read", "mcp.write"],
        "resource_documentation": RESOURCE_BASE + "/docs",
    }
    # CORS so browser-side agents can read it cross-origin.
    headers = {"Access-Control-Allow-Origin": "*"}
    return JSONResponse(metadata, headers=headers)

Your server is a resource server, not an authorization server. resource names you. authorization_servers is where the client goes. The CORS header is not optional for browser agents.

The handshake, end to end.

RFC 9728 OAuth handshake: client calls with no token and gets 401, fetches /.well-known/oauth-protected-resource to learn the authorization server, gets a token there, then retries with a bearer token and succeeds. The server only publishes metadata, never mints tokens.

401 with WWW-Authenticate, fetch the well-known doc, run OAuth against the auth server it names, retry with Bearer. stdio dev skips all of this. The bill comes due in prod, where you are a resource server on the public internet.

Do NOT roll your own auth server.

It is the #1 production MCP failure.

The easy 10%

The metadata stub you just saw. One JSON document. You can hand-write it.

The hard 90%

Token issuance, PKCE, refresh, key rotation, revocation. Get it wrong and you get confused-deputy attacks and token leakage.

Use a managed provider. Cloudflare's workers-oauth-provider wraps your Worker as a spec-compliant OAuth provider to clients and an OAuth client to your upstream. Or point authorization_servers at Auth0, Clerk, Okta, and just validate the JWT.

The hard 90%, delegated

Hand it to a provider. One argument.

FastMCP ships native auth providers (Clerk, Auth0, WorkOS, Descope). Wire one in, and the server runs the OAuth flow to it and validates every token. You write none of the crypto.

from fastmcp import FastMCP
from fastmcp.server.auth.providers.clerk import ClerkProvider

auth = ClerkProvider(
    domain="your-app.clerk.accounts.dev",
    client_id="...",
    client_secret="...",
    base_url="https://your-server.com",
)

mcp = FastMCP("hello-toolsmith", auth=auth)

Now FastMCP serves the /.well-known metadata, returns the 401 with WWW-Authenticate, and validates each bearer token against Clerk by introspection. The stub you just wrote collapses into one argument. Clerk holds the tokens, the keys, the rotation.

LAB 4 · see the wire shape

A real MCP HTTP server plus the well-known route. Boot it, curl the metadata, read the two fields.

python server.py
curl -s http://127.0.0.1:8000/.well-known/oauth-protected-resource

{"resource":"http://127.0.0.1:8000/mcp/",
 "authorization_servers":["https://auth.example.com"],
 "bearer_methods_supported":["header"],
 "scopes_supported":["mcp.read","mcp.write"]}

Lab 4 · RFC 9728 well-known + handshake →

Four traps that pass green and still break.

TRAP 1stdio in prod. Single-process local glue. It does not survive many concurrent clients. Production is Streamable HTTP.
TRAP 2missing CORS. No Access-Control-Allow-Origin on /mcp and /.well-known/* means browser-side agents fail silently.
TRAP 3inline auth. Rolling your own authorization server. Use a managed provider and validate the token.
TRAP 4human-shaped descriptions. Docstrings written for a human reader, not a model. Verbose is fine, ambiguous is fatal.

Where this goes next.

MCP is Layer 4, tool execution. The rest of the stack, and the open problems, are the rest of this series.

Schema reliability

Model-shaped schemas, not parser-shaped. TSCG is an active direction, not a settled tool. Expect this to become first-class.
Observability

Agents skip needed tool calls or fire ones they should not. Sparse autoencoders plus probes read model state before each action.teaser for VCN #38
The agentic-web stack

MCP for tools, then A2A for agent-to-agent via Agent Cards, then ANP for decentralized discovery.VCN #35 / #36 / #37 / #39

The receipt.

MCP Server Cards on fewer than 15 sites. Only 4% of the top 200,000 domains are agent-ready. You walked in with an idea. You walk out with a running, agent-callable MCP server. That is, measurably, the top ~1% of the agentic web.

Take the repo home.

Four servers, all booted, all with paste-the-output receipts. The labs run on a stranger's laptop with python and fastmcp.

STOP 1

01_hello_stdio

the six-line server over stdio

STOP 2

02_http_ngrok

same server, Streamable HTTP, behind ngrok

STOP 3

03_byo_tools

3 functions + a Resource

STOP 4

04_oauth

RFC 9728 well-known stub

Keep building: swap your own functions into 03_byo_tools, then publish a /.well-known/ai-agent.json so agents can find you. Both patterns are in the handout.

This was Layer 4. The series walks the rest.

Vibe Coding Nights runs weekly at Frontier Tower. The agentic-web stack, one layer at a time.

Jun 3#35 Well-Known the agent card, /.well-known discovery
Jun 10#36 Crosstalk A2A, agents talking to each other
Jun 17#37 the agentic-web stack continues
later#38 Observability reading model state before each action

Find every event on Luma. Get the weekly intel in THE SIGNAL, Immersive Commons' AI dispatch. Both links on the handout.

One more thing.

This deck is itself an MCP-discoverable surface.

GET /.well-known/ai-agent.json   -> the agent card for this deck
navigator.modelContext.registerTool("next_slide")
navigator.modelContext.registerTool("goto_slide")

It ships its own agent card and registers two live WebMCP tools. An agent in this room, in a WebMCP browser, could have advanced these very slides. The same resource-server, well-known, discovery posture segment 4 just taught. The thesis, run on itself.

Hosted by Rayyan Zahid, Michalis Vasileiadis, Eric Mockler, and Devinder Sodhi.

Vibe Coding Nights #34 · Toolsmith · Frontier Tower Floor 9.

Licensed CC BY-SA 4.0. Take it, fork it, run your own. Now go ship a server.