Chapter 1: MCP Architecture — Hosts, Clients, and Servers


Before writing a single line of code, it helps to understand what MCP actually is at the protocol level. This chapter covers the three roles in an MCP system, the three primitive types servers can expose, and how messages flow between components.


The Three Roles

Every MCP interaction involves three components:

Host

The Host is the application the user interacts with. Claude Desktop is a host. So are Claude Code, Cursor, and Windsurf. The host is responsible for:

  - Launching and managing connections to MCP servers
  - Enforcing security policy and obtaining user consent for tool use
  - Mediating between the user, the AI model, and connected servers

The host owns the overall experience. It decides which MCP servers are available and presents their capabilities to the AI model.

Client

The Client lives inside the Host. It is the component that speaks the MCP protocol on the host’s behalf. Each host maintains one client per server connection.

The client:

  - Establishes and maintains a stateful, one-to-one connection with a server
  - Performs the initialization handshake and capability negotiation
  - Routes requests, responses, and notifications between host and server

In practice, when you configure Claude Desktop with an MCP server, Claude Desktop spawns a client connection to that server automatically. You rarely interact with the client directly.

Server

The Server is what you build. It is a process that:

  - Exposes tools, resources, and/or prompts to the client
  - Responds to JSON-RPC requests with JSON-RPC responses
  - Runs either locally (communicating over stdio) or remotely (over HTTP)

Your server can be a local Python script, a remote HTTP service, or anything in between. It is isolated from the AI model — it communicates only with the client, never directly with the LLM.


The Communication Flow

Here is what happens when an AI assistant calls a tool on your server:

User ──► Host (e.g. Claude Desktop)
           │
           ▼
         Client ──► Server (your MCP server)
           │              │
           │    request   │
           │ ─────────────►
           │              │
           │    response  │
           │ ◄────────────
           ▼
         AI Model ◄── Host presents result

  1. The user asks a question or gives an instruction
  2. The AI model (inside the Host) decides to call a tool
  3. The Host/Client sends a tools/call request to your server
  4. Your server executes the tool and returns the result
  5. The result is passed back to the AI model
  6. The model incorporates the result into its response
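
The request/response leg of this flow can be sketched with the standard library alone. Everything here is a toy for illustration: the get_weather tool and its canned reply are invented, and a real server would rely on the SDK rather than hand-rolled dispatch.

```python
import json

# Toy tool table: "get_weather" and its canned reply are invented for this sketch.
TOOLS = {
    "get_weather": lambda args: f"{args['city']}: 18°C, partly cloudy",
}

def handle_request(raw: str) -> str:
    """Dispatch one JSON-RPC 2.0 request string and return the response string."""
    req = json.loads(raw)
    if req.get("method") == "tools/call":
        tool = TOOLS[req["params"]["name"]]
        text = tool(req["params"].get("arguments", {}))
        result = {"content": [{"type": "text", "text": text}]}
        return json.dumps({"jsonrpc": "2.0", "id": req["id"], "result": result})
    # Anything else: JSON-RPC error -32601, "Method not found"
    return json.dumps({
        "jsonrpc": "2.0", "id": req.get("id"),
        "error": {"code": -32601, "message": "Method not found"},
    })

request = json.dumps({
    "jsonrpc": "2.0", "id": 1, "method": "tools/call",
    "params": {"name": "get_weather", "arguments": {"city": "Paris"}},
})
response = json.loads(handle_request(request))
```

Note that the response carries the same id as the request; that is how the client pairs them up.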

The Protocol: JSON-RPC 2.0

MCP messages are JSON-RPC 2.0. Every message is a JSON object with:

  - a "jsonrpc": "2.0" version field
  - an "id" that pairs each request with its response (notifications carry no id)
  - either a "method" and "params" (for requests and notifications) or a "result" or "error" (for responses)

A tool call looks like this:

{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "get_weather",
    "arguments": { "city": "Paris" }
  }
}

And the server responds:

{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "content": [
      { "type": "text", "text": "Paris: 18°C, partly cloudy" }
    ]
  }
}

You typically never write these JSON messages by hand — the Python SDK handles serialization for you.
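
Under the hood, responses are matched to requests via that shared id. A minimal sketch of the client side, assuming nothing beyond the standard library (the class and its method names are invented for illustration):

```python
import itertools
import json

class JsonRpcClient:
    """Toy sketch of how a client pairs JSON-RPC responses to requests by id."""

    def __init__(self):
        self._ids = itertools.count(1)
        self.pending = {}  # id -> method name, for requests still in flight

    def build_request(self, method: str, params: dict) -> str:
        req_id = next(self._ids)
        self.pending[req_id] = method
        return json.dumps({"jsonrpc": "2.0", "id": req_id,
                           "method": method, "params": params})

    def accept_response(self, raw: str):
        resp = json.loads(raw)
        method = self.pending.pop(resp["id"])  # KeyError on an unknown id
        return method, resp.get("result"), resp.get("error")

client = JsonRpcClient()
client.build_request("tools/call",
                     {"name": "get_weather", "arguments": {"city": "Paris"}})
raw_resp = json.dumps({"jsonrpc": "2.0", "id": 1,
                       "result": {"content": [{"type": "text",
                                               "text": "Paris: 18°C, partly cloudy"}]}})
method, result, error = client.accept_response(raw_resp)
```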


The Three Primitives

MCP defines three types of capabilities a server can expose:

Tools

Tools are functions the AI can call. They take structured input and return structured output. Examples:

  - Query a database
  - Send an email
  - Create or update a file

Tools are the most commonly used primitive. They are how the AI takes action in the world.
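
When a client asks for tools/list, each tool comes back as a name, a description, and a JSON Schema describing its input. A sketch of what one entry might look like (the tool name and schema here are illustrative, not from any real server):

```python
# Hypothetical tools/list entry. The name and schema are invented; the
# name / description / inputSchema shape is how MCP advertises a tool.
get_weather_tool = {
    "name": "get_weather",
    "description": "Return the current weather for a city.",
    "inputSchema": {  # plain JSON Schema
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}

tools_list_result = {"tools": [get_weather_tool]}
```

The inputSchema is what lets the model construct valid arguments before it ever calls the tool.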

Resources

Resources are data the AI can read. They are identified by URIs and can be static or dynamic. Examples:

  - The contents of a file
  - A database schema
  - The latest entries in a log

Resources are analogous to GET endpoints in a REST API. They return data without side effects.
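
One way to picture this is a registry mapping URIs to side-effect-free reader functions. The URIs and their contents below are invented for illustration:

```python
# Toy resource registry: URIs map to read-only reader functions.
# Both the URIs and their contents are invented for this sketch.
RESOURCES = {
    "file:///app/config.json": lambda: '{"debug": false}',
    "db://schema/users":       lambda: "users(id INTEGER, name TEXT)",
}

def list_resources() -> list[str]:
    """Roughly what a resources/list response enumerates."""
    return sorted(RESOURCES)

def read_resource(uri: str) -> str:
    """Roughly what a resources/read handler returns; no side effects."""
    return RESOURCES[uri]()
```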

Prompts

Prompts are reusable message templates. They accept arguments and return a sequence of messages the AI can use as context or instructions. Examples:

  - A code-review checklist parameterized by language
  - A bug-report triage workflow
  - A summarization template for a given document type

Prompts are optional but useful for standardizing common workflows.
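
Conceptually, a prompt is a function from arguments to messages. A toy template (the name and wording are invented for illustration):

```python
# Hypothetical prompt template; the name and wording are invented.
def code_review_prompt(language: str, code: str) -> list[dict]:
    """Expand a reusable template into the message list a prompt returns."""
    return [{
        "role": "user",
        "content": {"type": "text",
                    "text": f"Review this {language} code for bugs and style:\n{code}"},
    }]

messages = code_review_prompt("python", "print('hi')")
```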


Capability Negotiation

When a client connects to a server, they perform a handshake:

  1. Client sends initialize with its protocol version and capabilities
  2. Server responds with its protocol version and what it supports
  3. Client sends initialized to confirm
  4. Both sides now know what the other can do

After initialization, the client can call tools/list, resources/list, and prompts/list to discover exactly what the server offers. The AI model uses this information to decide which tools are available in a given conversation.
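
The handshake itself is three small messages. A sketch of their shapes, with abbreviated capability payloads and an illustrative protocol version string:

```python
# Sketch of the three handshake messages. Capability payloads are abbreviated,
# and the protocol version string is illustrative.
initialize_request = {
    "jsonrpc": "2.0", "id": 1, "method": "initialize",
    "params": {
        "protocolVersion": "2025-03-26",
        "capabilities": {},
        "clientInfo": {"name": "example-client", "version": "0.1"},
    },
}
initialize_response = {
    "jsonrpc": "2.0", "id": 1,
    "result": {
        "protocolVersion": "2025-03-26",
        "capabilities": {"tools": {}},  # this server supports tools
        "serverInfo": {"name": "example-server", "version": "0.1"},
    },
}
initialized_notification = {  # notifications carry no id
    "jsonrpc": "2.0", "method": "notifications/initialized",
}
```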


Server Lifecycle

A typical server lifecycle:

  1. Startup — the host launches your server process (or connects to a running HTTP server)
  2. Initialization — capability handshake
  3. Discovery — client lists tools/resources/prompts
  4. Operation — client makes requests as the AI calls tools or reads data
  5. Shutdown — the host closes the transport (for stdio, closing the server’s stdin and terminating the process)

For stdio servers, startup and shutdown are tied to the process lifecycle. For HTTP servers, the connection is managed separately from the process.


Key Takeaways

  - Hosts own the user experience, clients speak the protocol, and servers expose capabilities.
  - Every MCP message is JSON-RPC 2.0, but the SDK handles serialization for you.
  - Servers expose three primitives: tools (actions), resources (read-only data), and prompts (message templates).
  - Capabilities are negotiated at initialization and discovered via tools/list, resources/list, and prompts/list.
