diff --git a/skills/README.md b/skills/README.md
new file mode 100644
index 0000000..5195868
--- /dev/null
+++ b/skills/README.md
@@ -0,0 +1,45 @@
+# MCPEngine Skills
+
+These are agent skills (SKILL.md files) that guide AI agents through the MCP development pipeline. Each skill encodes the exact process, patterns, and standards for a specific phase of MCP server/app development.
+
+**These are the "secret sauce" — the encoded knowledge that lets agents build production-quality MCP servers autonomously.**
+
+---
+
+## Pipeline Skills (in order)
+
+| # | Skill | Size | Purpose |
+|---|-------|------|---------|
+| 1 | **mcp-api-analyzer** | 43KB | Analyze API docs → structured analysis doc. Always the FIRST step. |
+| 2 | **mcp-server-builder** | 88KB | Build a complete MCP server from the analysis doc. Every pattern and template. |
+| 3 | **mcp-server-development** | 31KB | TypeScript MCP server patterns, best practices, error handling. |
+| 4 | **mcp-app-designer** | 85KB | Design and build visual HTML apps for each MCP server. |
+| 5 | **mcp-apps-integration** | 20KB | Add rich UI (structuredContent) to MCP tool results. |
+| 6 | **mcp-apps-official** | 48KB | Official MCP Apps SDK patterns and host integration. |
+| 7 | **mcp-apps-merged** | 39KB | Combined/merged MCP Apps reference. |
+| 8 | **mcp-localbosses-integrator** | 61KB | Wire MCP servers + apps into LocalBosses Next.js app. |
+| 9 | **mcp-qa-tester** | 113KB | Full QA framework — protocol, visual, functional, live API testing. |
+| 10 | **mcp-deployment** | 17KB | Package and deploy — Docker, Railway, GitHub, production. |
+
+## Utility Skills
+
+| Skill | Purpose |
+|-------|---------|
+| **mcp-skill** | Exa MCP integration (web search, deep research) |
+
+---
+
+## How agents use these
+
+1. Agent receives task (e.g., "build a Stripe MCP server")
+2. Agent reads `mcp-api-analyzer/SKILL.md` → produces analysis doc
+3. Agent reads `mcp-server-builder/SKILL.md` → builds the server
+4. Agent reads `mcp-app-designer/SKILL.md` → builds UI apps
+5. Agent reads `mcp-qa-tester/SKILL.md` → runs full test suite
+6. Agent reads `mcp-deployment/SKILL.md` → packages for production
+
+Each skill is self-contained — an agent can pick up any step independently.
+
+---
+
+## Total encoded knowledge: ~550KB of structured agent instructions
diff --git a/skills/mcp-api-analyzer/SKILL.md b/skills/mcp-api-analyzer/SKILL.md
new file mode 100644
index 0000000..7c6434c
--- /dev/null
+++ b/skills/mcp-api-analyzer/SKILL.md
@@ -0,0 +1,869 @@
+# MCP API Analyzer — Phase 1: API Discovery & Analysis
+
+**When to use this skill:** You have API documentation (URLs, OpenAPI specs, user guides) for a service and need to produce a structured analysis document that feeds into the MCP Factory pipeline. This is always the FIRST step before building anything.
+
+**What this covers:** Reading API docs efficiently, cataloging endpoints, designing tool groups, naming tools, identifying app candidates, documenting auth flows and rate limits. Output is a single `{service}-api-analysis.md` file.
+
+**Pipeline position:** Phase 1 of 6 → Output feeds into `mcp-server-builder` (Phase 2) and `mcp-app-designer` (Phase 3)
+
+---
+
+## 1. Inputs
+
+| Input | Required | Description |
+|-------|----------|-------------|
+| API documentation URL(s) | **Yes** | Primary reference docs |
+| OpenAPI/Swagger spec | Preferred | Machine-readable endpoint catalog |
+| User guides / tutorials | Nice-to-have | Helps understand real-world usage |
+| Marketing / pricing page | Nice-to-have | Tier limits, feature gates |
+| Existing SDK examples | Nice-to-have | Reveals common patterns |
+
+## 2. Output
+
+A single file: **`{service}-api-analysis.md`**
+
+Place it in the workspace root or alongside the future server directory:
+```
+~/.clawdbot/workspace/{service}-api-analysis.md
+```
+
+This file is the sole input for Phase 2 (server build) and Phase 3 (app design).
+
+---
+
+## 3. How to Read API Docs Efficiently
+
+### Step 0: API Style Detection
+
+**Identify the API style FIRST.** This determines how you read the docs and how tools are designed.
+
+| Style | Detection Signals | Tool Mapping |
+|-------|-------------------|--------------|
+| **REST** | Multiple URL paths, standard HTTP verbs (GET/POST/PUT/DELETE), resource-oriented URLs | 1 endpoint → 1 tool (standard) |
+| **GraphQL** | Single `/graphql` endpoint, `query`/`mutation` in request body, schema introspection | Queries → read tools, Mutations → write tools, Subscriptions → skip (note for future) |
+| **SOAP/XML** | WSDL file, XML request/response, `Content-Type: text/xml`, `.asmx` endpoints | Each WSDL operation → 1 tool, note XML→JSON transform needed |
+| **gRPC** | `.proto` files, binary protocol, service/method definitions | Each RPC method → 1 tool, note HTTP/gRPC gateway if available |
+| **WebSocket** | `ws://` or `wss://` URLs, persistent connections, event-based messaging | Message types → tools, note connection lifecycle management |
+
+**Adaptation notes for non-REST APIs:**
+
+- **GraphQL:** Download the schema (`{ __schema { types { name fields { name } } } }`). Group by query vs mutation. Each meaningful query/mutation becomes a tool. Combine related queries if they share variables. The server's API client sends POST requests with `{ query, variables }` — document the query string per tool.
+- **SOAP:** Locate the WSDL. Each `<operation>` maps to a tool. Note the SOAPAction header. The server must transform XML responses to JSON — document the response mapping per tool.
+- **gRPC:** Check for an HTTP/JSON gateway (many gRPC services expose one). If available, treat as REST. If not, the server needs a gRPC client — document the `.proto` service and method names.
+- **WebSocket:** These are usually event-driven, not request/response. Map "send message" events to write tools. For incoming events, note them for future resource/subscription support. The server must manage a persistent connection.
+
+### What to READ (priority order):
+
+1. **Authentication page** — Read FIRST, completely. Auth determines everything.
+   - What type? (OAuth2, API key, JWT, session token, basic auth)
+   - Where does the token go? (Authorization header, query param, cookie)
+   - Token refresh flow? (Expiry, refresh tokens, re-auth)
+   - Scopes/permissions model?
+
+2. **Rate limits page** — Read SECOND. This constrains tool design.
+   - Requests per minute/hour/day?
+   - Per-endpoint limits vs global limits?
+   - Burst allowance?
+   - Rate limit headers? (X-RateLimit-Remaining, Retry-After)
+
+3. **API overview / getting started** — Skim for architecture patterns.
+   - REST vs GraphQL vs RPC?
+   - Base URL pattern (versioned? regional?)
+   - Common response envelope (data wrapper, pagination shape)
+   - Error response format
+
+4. **Endpoint reference** — Systematic scan, don't deep-dive yet.
+   - Group endpoints by resource/domain (contacts, deals, invoices, etc.)
+   - Note HTTP methods per endpoint (GET=read, POST=create, PUT=update, DELETE=delete)
+   - Flag endpoints with complex input (nested objects, file uploads, webhooks)
+   - Count total endpoints per group
+
+5. **Pagination docs** — Find the pagination pattern.
+   - Cursor-based vs offset-based vs page-based?
+   - What params? (page, limit, offset, cursor, startAfter)
+   - Max page size?
+   - How to detect "no more pages"?
+
+6. **Webhooks / events** — Note but don't deep-dive.
+   - Available webhook events (for future reference)
+   - Delivery format
+
+7. **Version & deprecation info** — Check for sunset timelines.
+   - Current stable version
+   - Any deprecated endpoints still in use
+   - Version header requirements (e.g., `API-Version: 2024-01-01`)
+   - Breaking changes in recent versions
+
+### What to SKIP (or skim very lightly):
+
+- SDK-specific guides (Python, Ruby, etc.) — we build our own client
+- UI/dashboard tutorials — we only care about the API
+- Community forums / blog posts — too noisy
+- Deprecated endpoints — unless no replacement exists
+- Webhook setup instructions — we consume the API, not webhooks (usually)
+
+### Speed technique for large APIs (50+ endpoints):
+
+1. If OpenAPI spec exists, download it and parse programmatically
+2. Extract all paths + methods into a spreadsheet/list
+3. Group by URL prefix (e.g., `/contacts/*`, `/deals/*`, `/invoices/*`)
+4. Count endpoints per group
+5. Read the 2-3 most important endpoints per group in detail
+6. Note the pattern — most groups follow identical CRUD patterns
+
+### Pagination Pattern Catalog
+
+Different APIs use different pagination strategies. Identify which pattern(s) the API uses and document per the table below.
+
+| Pattern | How It Works | Request Next Page | Detect Last Page | Total Count | Example APIs |
+|---------|-------------|-------------------|------------------|-------------|-------------|
+| **Offset/Limit** | Skip N records, return M | `?offset=25&limit=25` | Results < limit, or offset ≥ total | Usually available | Most REST APIs |
+| **Page Number** | Request page N of size M | `?page=2&pageSize=25` | Empty results, or page ≥ totalPages | Usually available | GHL, HubSpot |
+| **Cursor (opaque)** | Server returns an opaque cursor string | `?cursor=abc123&limit=25` | Cursor is null/absent in response | Rarely available | Slack, Facebook |
+| **Keyset (Stripe-style)** | Use last item's ID as boundary | `?starting_after=obj_xxx&limit=25` | `has_more: false` in response | Rarely available | Stripe, Intercom |
+| **Link Header** | Server returns `Link: <url>; rel="next"` in headers | Follow the `rel="next"` URL directly | No `rel="next"` link in response | Sometimes via `rel="last"` | GitHub, many REST APIs |
+| **Scroll/Search-After** | Server returns a sort-value array to continue from | `?search_after=[timestamp, id]` | Empty results | Via separate count query | Elasticsearch |
+| **Composite Cursor** | Base64-encoded JSON with multiple sort fields | `?cursor=eyJpZCI6MTIzLCJ...}` | Decoded cursor has `done: true`, or results empty | Rarely available | Internal APIs, GraphQL relay |
+| **Token-Based (AWS-style)** | Server returns a `NextToken` / `NextContinuationToken` | Pass `NextToken` in next request body/params | `NextToken` is absent in response | Sometimes via separate field | AWS (S3, DynamoDB, SQS) |
+
+**For each pattern, document:**
+- How to request the next page
+- How to detect the last page (no more data)
+- Whether total count is available
+- Whether backwards pagination is supported
+- Max page size allowed
+
+---
+
+## 4. Analysis Document Template
+
+Use this EXACT template. Every section is required.
+
+````markdown
+# {Service Name} — MCP API Analysis
+
+**Date:** {YYYY-MM-DD}
+**API Version:** {version}
+**Base URL:** `{base_url}`
+**Documentation:** {docs_url}
+**OpenAPI Spec:** {spec_url or "Not available"}
+
+---
+
+## 1. Service Overview
+
+**What it does:** {1-2 sentence description}
+**Target users:** {Who uses this product}
+**Pricing tiers:** {Free / Starter / Pro / Enterprise — note API access level per tier}
+**API access:** {Which tiers include API access, any costs per call}
+
+---
+
+## 2. Authentication
+
+**Method:** {OAuth2 / API Key / JWT / Basic Auth / Custom}
+
+### Auth Flow:
+```
+{Step-by-step auth flow}
+1. {First step}
+2. {Second step}
+3. {How to get/refresh token}
+```
+
+### OAuth2 Details (if applicable):
+- **Grant type:** {authorization_code / client_credentials / PKCE / device_code}
+- **Authorization URL:** `{url}`
+- **Token URL:** `{url}`
+- **Redirect URI requirements:** {localhost allowed? specific paths?}
+- **Scopes required:** {list scopes and what they grant}
+- **PKCE required?** {yes/no — required for public clients}
+
+### Headers:
+```
+Authorization: {Bearer {token} / Basic {base64} / X-API-Key: {key}}
+Content-Type: application/json
+{Any other required headers, e.g., X-Account-ID}
+```
+
+### Environment Variables Needed:
+```bash
+{SERVICE}_API_KEY=
+{SERVICE}_API_SECRET=        # If OAuth2
+{SERVICE}_BASE_URL=          # If configurable/sandbox
+{SERVICE}_ACCOUNT_ID=        # If multi-tenant
+```
+
+### Token Lifecycle:
+- **Token type:** {access token / API key / JWT}
+- **Expiry:** {duration or "never" for API keys}
+- **Refresh mechanism:** {refresh token endpoint / re-auth / N/A}
+- **Refresh token expiry:** {duration or "never"}
+- **Caching strategy:** {Cache token, refresh 5 min before expiry}
+- **Storage for long-running server:** {Token stored in memory, refresh before expiry. For OAuth2 auth code flow: initial token obtained via browser flow, server stores refresh token and auto-refreshes.}
+
+### Key Rotation / Compromise:
+- **Rotation procedure:** {How to generate new keys/secrets}
+- **Revocation endpoint:** {URL to revoke compromised tokens, or "manual via dashboard"}
+- **Grace period:** {Does old key continue working after rotation? For how long?}
+
+---
+
+## 3. API Patterns
+
+**Style:** {REST / GraphQL / SOAP / gRPC / WebSocket}
+**Non-REST adaptation notes:** {If non-REST, note how tools map — see API Style Detection above}
+**Response envelope:**
+```json
+{
+  "data": [...],
+  "meta": { "total": 100, "page": 1, "pageSize": 25 }
+}
+```
+
+**Pagination:**
+- **Type:** {cursor / offset / page-based / keyset / link-header / token-based}
+- **Parameters:** {page, pageSize / limit, offset / cursor, limit / starting_after}
+- **Max page size:** {number}
+- **End detection:** {empty array / hasMore field / next cursor is null / no Link rel="next"}
+- **Total count available:** {yes — in meta.total / no / separate count endpoint}
+- **Backwards pagination:** {supported / not supported}
+
+**Error format:**
+```json
+{
+  "error": { "code": "NOT_FOUND", "message": "Resource not found" }
+}
+```
+
+**Rate limits:**
+- **Global:** {X requests per Y}
+- **Per-endpoint:** {Any specific limits}
+- **Burst allowance:** {Token bucket / leaky bucket / simple counter}
+- **Rate limit scope:** {per-key / per-endpoint / per-user}
+- **Exceeded penalty:** {429 response / temporary ban / throttled response}
+- **Headers:** {X-RateLimit-Remaining, Retry-After}
+- **Strategy:** {Exponential backoff / fixed delay / queue}
+
+**Sandbox / Test Environment:**
+- **Available:** {yes / no}
+- **Sandbox base URL:** `{sandbox_url or "N/A"}`
+- **How to access:** {Separate API key / toggle in dashboard / different subdomain}
+- **Limitations:** {Rate limits differ? Data resets? Feature parity with production?}
+- **QA impact:** {Can QA use sandbox for live API testing? Any endpoints unavailable in sandbox?}
+
+> **Why this matters:** If a sandbox exists, QA testing (Phase 5) can run against it safely without affecting production data. If no sandbox, QA must use mocks or test carefully with real data. Document this early — it directly affects the testing strategy.
+
+---
+
+## 4. Version & Deprecation
+
+- **Current stable version:** {e.g., v2, 2024-01-01}
+- **Version mechanism:** {URL path (/v2/), header (API-Version: 2024-01-01), query param}
+- **Version header requirements:** {Required header name and format, if any}
+- **Deprecation timeline:** {Any endpoints or versions being sunset — with dates}
+- **Breaking changes in recent versions:** {Notable changes that affect tool design}
+- **Changelog URL:** {Link to changelog/migration guide for reference}
+
+---
+
+## 5. Endpoint Catalog
+
+### Group: {Domain Name} ({count} endpoints)
+
+| Method | Path | Description | Notes |
+|--------|------|-------------|-------|
+| GET | `/resource` | List resources | Paginated, filterable |
+| GET | `/resource/{id}` | Get single resource | |
+| POST | `/resource` | Create resource | Required: name, email |
+| PUT | `/resource/{id}` | Update resource | Partial update supported |
+| DELETE | `/resource/{id}` | Delete resource | Soft delete |
+
+{Repeat for each domain group}
+
+### Group: {Next Domain} ({count} endpoints)
+...
+
+**Total endpoints:** {count}
+
+---
+
+## 6. Tool Groups (for Lazy Loading)
+
+Tools are organized into groups that load on-demand. Each group maps to a domain.
+
+| Group Name | Tools | Load Trigger | Description |
+|------------|-------|--------------|-------------|
+| `contacts` | {count} | User asks about contacts | Contact CRUD, search, tags |
+| `deals` | {count} | User asks about deals/pipeline | Deal management, stages |
+| `invoicing` | {count} | User asks about invoices/payments | Invoice CRUD, payments |
+| `calendar` | {count} | User asks about scheduling | Appointments, availability |
+| `analytics` | {count} | User asks for reports/metrics | Dashboards, KPIs |
+| `admin` | {count} | User asks about settings/config | Users, permissions, webhooks |
+
+**Target:** 5-15 groups, 3-15 tools per group. No group should exceed 20 tools.
+
+---
+
+## 7. Tool Inventory
+
+### Group: {group_name}
+
+#### `list_{resources}`
+- **Title:** List {Resources}
+- **Icon:** `{service-cdn-url}/list-icon.svg` *(or omit if no suitable icon — SVG preferred)*
+- **Description:** List {resources} with optional filters and pagination. Returns `{key_field_1, key_field_2, key_field_3, status}` for each {resource}. Use when the user wants to browse, filter, or get an overview of multiple {resources}. Do NOT use when searching by specific keyword (use `search_{resources}` instead) or for getting full details of one {resource} (use `get_{resource}` instead).
+- **HTTP:** GET `/resource`
+- **Annotations:** `readOnlyHint: true`, `destructiveHint: false`, `idempotentHint: true`, `openWorldHint: false`
+- **Parameters:**
+  | Param | Type | Required | Description |
+  |-------|------|----------|-------------|
+  | page | number | No | Page number (default 1) |
+  | pageSize | number | No | Results per page (default 25, max 100) |
+  | query | string | No | Search by name, email, or phone |
+  | status | string | No | Filter: active, inactive, all |
+  | sortBy | string | No | Sort field: created, updated, name |
+- **Output Schema:** `{ data: Resource[], meta: { total: number, page: number, pageSize: number } }`
+- **Content Annotations:** `audience: ["user", "assistant"]`, `priority: 0.7`
+- **Response shape:** `{ data: Resource[], meta: { total, page, pageSize } }`
+
+#### `get_{resource}`
+- **Title:** Get {Resource} Details
+- **Icon:** `{service-cdn-url}/detail-icon.svg` *(optional)*
+- **Description:** Get complete details for a single {resource} by ID. Returns all fields including `{notable_field_1, notable_field_2, notable_field_3}`. Use when the user references a specific {resource} by name/ID or needs detailed information about one {resource}. Do NOT use when the user wants to browse multiple {resources} (use `list_{resources}` instead).
+- **HTTP:** GET `/resource/{id}`
+- **Annotations:** `readOnlyHint: true`, `destructiveHint: false`, `idempotentHint: true`, `openWorldHint: false`
+- **Parameters:**
+  | Param | Type | Required | Description |
+  |-------|------|----------|-------------|
+  | {resource}_id | string | **Yes** | {Resource} ID |
+- **Output Schema:** `Resource` (full object with all fields)
+- **Content Annotations:** `audience: ["user"]`, `priority: 0.8`
+- **Response shape:** `Resource`
+
+#### `create_{resource}`
+- **Title:** Create New {Resource}
+- **Icon:** `{service-cdn-url}/create-icon.svg` *(optional)*
+- **Description:** Create a new {resource}. Returns the created {resource} with its assigned ID. Use when the user wants to add, create, or set up a new {resource}. Do NOT use when updating an existing {resource} (use `update_{resource}` instead). Side effect: creates a permanent record in the system.
+- **HTTP:** POST `/resource`
+- **Annotations:** `readOnlyHint: false`, `destructiveHint: false`, `idempotentHint: false`, `openWorldHint: false`
+- **Parameters:**
+  | Param | Type | Required | Description |
+  |-------|------|----------|-------------|
+  | name | string | **Yes** | {Resource} name |
+  | email | string | No | Email address |
+  | {etc.} | | | |
+- **Output Schema:** `Resource` (created object with ID)
+- **Content Annotations:** `audience: ["user"]`, `priority: 0.9`
+- **Response shape:** `Resource`
+
+#### `update_{resource}`
+- **Title:** Update {Resource}
+- **Icon:** `{service-cdn-url}/edit-icon.svg` *(optional)*
+- **Description:** Update an existing {resource}. Only include fields to change — omitted fields remain unchanged. Returns the updated {resource}. Use when the user wants to modify, change, or edit a {resource}. Do NOT use when creating a new {resource} (use `create_{resource}` instead). Side effect: modifies the existing record.
+- **HTTP:** PUT `/resource/{id}`
+- **Annotations:** `readOnlyHint: false`, `destructiveHint: false`, `idempotentHint: true`, `openWorldHint: false`
+- **Parameters:**
+  | Param | Type | Required | Description |
+  |-------|------|----------|-------------|
+  | {resource}_id | string | **Yes** | {Resource} ID |
+  | {fields...} | | No | Fields to update |
+- **Output Schema:** `Resource` (updated object)
+- **Content Annotations:** `audience: ["user"]`, `priority: 0.9`
+- **Response shape:** `Resource`
+
+#### `delete_{resource}`
+- **Title:** Delete {Resource}
+- **Icon:** `{service-cdn-url}/delete-icon.svg` *(optional)*
+- **Description:** Delete a {resource} permanently. This cannot be undone. Use only when the user explicitly asks to delete or remove a {resource}. Do NOT use for archiving, deactivating, or hiding (use `update_{resource}` with status change instead, if available). Side effect: permanently removes the record.
+- **HTTP:** DELETE `/resource/{id}`
+- **Annotations:** `readOnlyHint: false`, `destructiveHint: true`, `idempotentHint: true`, `openWorldHint: false`
+- **Parameters:**
+  | Param | Type | Required | Description |
+  |-------|------|----------|-------------|
+  | {resource}_id | string | **Yes** | {Resource} ID |
+- **Output Schema:** `{ success: boolean }`
+- **Content Annotations:** `audience: ["user"]`, `priority: 1.0`
+- **Response shape:** `{ success: true }`
+
+{Repeat for each tool in each group}
+
+### Disambiguation Table (per group)
+
+For each tool group, produce a disambiguation matrix to guide tool routing:
+
+| User says... | Correct tool | Why not others |
+|---|---|---|
+| "Show me all {resources}" | `list_{resources}` | Not `search_` (no keyword), not `get_` (not one specific item) |
+| "Find {name}" | `search_{resources}` | Not `list_` (specific name = search), not `get_` (no ID provided) |
+| "What's {name}'s email?" | `get_{resource}` | Not `list_`/`search_` (asking about a specific known {resource}) |
+| "Add a new {resource}" | `create_{resource}` | Not `update_` (new, not existing) |
+| "Change {name}'s phone number" | `update_{resource}` | Not `create_` (modifying existing) |
+| "Remove {name}" | `delete_{resource}` | Not `update_` (user said remove/delete, not deactivate) |
+
+### Common User Intent Clustering
+
+For each disambiguation entry, consider **diverse phrasings** real users would type. Cluster by intent to ensure the tool description handles all variants:
+
+| Intent | Common Phrasings | Target Tool |
+|--------|-----------------|-------------|
+| Browse/overview | "show me", "list", "what are my", "pull up", "let me see", "give me all" | `list_{resources}` |
+| Search/find | "find", "search for", "look up", "where is", "do I have a" | `search_{resources}` |
+| Detail/inspect | "tell me about", "what's the status of", "show me details for", "more info on" | `get_{resource}` |
+| Create/add | "add", "create", "new", "set up", "register", "make a" | `create_{resource}` |
+| Modify/edit | "change", "update", "edit", "modify", "fix", "set X to Y" | `update_{resource}` |
+| Remove/delete | "delete", "remove", "get rid of", "cancel", "drop" | `delete_{resource}` |
+
+> **Tip:** When writing tool descriptions, ensure the "When to use" clause covers the most common phrasings for that intent. The "When NOT to use" clause should address the top misrouting risk (e.g., `list_` vs `search_` is the most common confusion).
+
+---
+
+## 8. App Candidates
+
+### Dashboard Apps
+| App ID | Name | Data Source Tools | Description |
+|--------|------|-------------------|-------------|
+| `{svc}-dashboard` | {Service} Dashboard | `get_analytics`, `list_*` | Overview KPIs, recent activity |
+
+### Data Grid Apps
+| App ID | Name | Data Source Tools | Description |
+|--------|------|-------------------|-------------|
+| `{svc}-contact-grid` | Contacts | `list_contacts`, `search_contacts` | Searchable contact list |
+
+### Detail Card Apps
+| App ID | Name | Data Source Tools | Description |
+|--------|------|-------------------|-------------|
+| `{svc}-contact-card` | Contact Card | `get_contact` | Single contact deep-dive |
+
+### Form/Wizard Apps
+| App ID | Name | Data Source Tools | Description |
+|--------|------|-------------------|-------------|
+| `{svc}-contact-creator` | New Contact | `create_contact` | Contact creation form |
+
+### Specialized Apps
+| App ID | Name | Type | Data Source Tools | Description |
+|--------|------|------|-------------------|-------------|
+| `{svc}-calendar` | Calendar | calendar | `list_appointments` | Appointment calendar |
+| `{svc}-pipeline` | Pipeline | funnel | `list_deals` | Deal pipeline kanban |
+| `{svc}-timeline` | Activity | timeline | `get_activity` | Activity feed |
+
+---
+
+## 9. Elicitation Candidates
+
+Identify flows where the MCP server should request user input mid-operation using the MCP Elicitation capability (`elicitation/create`). These are interactions where the server needs information or confirmation from the user before proceeding.
+
+### When to flag a flow for elicitation:
+
+- **OAuth account selection** — API supports multiple connected accounts; server needs user to choose which one
+- **Destructive operation confirmation** — DELETE or irreversible actions should confirm before executing
+- **Ambiguous input resolution** — User says "delete the contact" but there are 3 matches; server asks which one
+- **Multi-step wizards** — Creating a complex resource that requires sequential input (e.g., create event → pick calendar → set time → invite attendees)
+- **Scope/permission escalation** — Action requires additional OAuth scopes the user hasn't granted
+- **Payment/billing actions** — Any action that costs money should confirm amount and target
+
+### Elicitation Candidate Template:
+
+| Flow | Trigger | Elicitation Type | User Input Needed | Fallback (if elicitation unsupported) |
+|------|---------|-----------------|--------------------|-----------------------------------------|
+| Delete {resource} | `delete_{resource}` called | Confirmation | "Confirm delete {name}? (yes/no)" | Return warning text, require second call |
+| Connect account | First API call with OAuth | Selection | "Which account? (list options)" | Use default/first account |
+| Bulk action | `bulk_update` with >10 items | Confirmation | "Update {N} records? (yes/no)" | Cap at 10, warn about limit |
+| {Describe flow} | {What triggers it} | {Confirmation / Selection / Form} | {What the user sees} | {What happens if client doesn't support elicitation} |
+
+**Important:** Always plan a fallback for clients that don't support elicitation. The server should still function — it just may require the user to provide the information in their original message or via a follow-up tool call.
+
+---
+
+## 10. Task Candidates (Async Operations)
+
+Identify tools where the operation may take >10 seconds and should be executed asynchronously using MCP Tasks (spec 2025-11-25, experimental SEP-1686).
+
+### When to flag a tool for async/task support:
+- **Report generation** — compiling analytics, PDFs, exports
+- **Bulk operations** — updating 100+ records, mass imports
+- **External processing** — waiting on third-party webhooks, payment processing
+- **Data migration** — moving large datasets between systems
+- **File generation** — creating CSVs, spreadsheets, archives
+
+### Task Candidate Template:
+
+| Tool | Typical Duration | Task Support | Recommended Polling Interval |
+|------|-----------------|-------------|------------------------------|
+| `export_report` | 30-120s | required | 5000ms |
+| `bulk_update` | 10-60s | optional | 3000ms |
+| `generate_invoice_pdf` | 5-15s | optional | 2000ms |
+| `{tool_name}` | {duration} | {required/optional/forbidden} | {interval} |
+
+> **Note:** Most tools should be `forbidden` for task support — only flag tools that genuinely need async execution. If the operation completes in <5 seconds, don't use tasks.
+
+---
+
+## 11. Data Shape Contracts
+
+For each app candidate, define the exact mapping from tool `outputSchema` to what the app's `render()` function expects. This contract prevents silent data shape mismatches.
+
+### Contract Template:
+
+| App | Source Tool | Tool outputSchema Key Fields | App Expected Fields | Transform Notes |
+|-----|------------|------------------------------|---------------------|-----------------|
+| `{svc}-contact-grid` | `list_contacts` | `data[].{name,email,status}`, `meta.{total,page,pageSize}` | `data[].{name,email,status}`, `meta.{total,page,pageSize}` | Direct pass-through |
+| `{svc}-dashboard` | `get_analytics` | `{revenue,contacts,deals}` | `metrics.{revenue,contacts,deals}`, `recent[]` | LLM restructures into metrics + recent |
+| `{svc}-{type}` | `{tool}` | `{fields}` | `{fields}` | `{notes}` |
+
+### Contract Rules:
+1. **Direct pass-through** — When tool output matches app input exactly. Preferred.
+2. **LLM transform** — When the LLM must restructure data (via APP_DATA). Document the mapping explicitly so system prompts can reference it.
+3. **Aggregation** — When an app needs data from multiple tools. List all source tools and how their outputs combine.
+
+### Validation:
+- The builder should set `outputSchema` to match the contract
+- The designer should set `validateData()` to check for the contracted fields
+- The integrator's `systemPromptAddon` should reference these contracts for APP_DATA generation
+
+---
+
+## 12. Naming Conventions
+
+### Tool names: `{verb}_{noun}`
+- `list_contacts`, `get_contact`, `create_contact`, `update_contact`, `delete_contact`
+- `search_contacts` (if separate from list)
+- `send_message`, `schedule_appointment`, `export_report`
+
+### Semantic Clustering — Verb Prefix Conventions
+
+Use consistent verb prefixes to signal intent. This helps the LLM distinguish between tools with related names and reduces misrouting.
+
+| Prefix | Intent | Maps to HTTP | Examples |
+|--------|--------|-------------|----------|
+| `browse_` or `list_` | List/overview of multiple items | GET (collection) | `list_contacts`, `browse_invoices` |
+| `inspect_` or `get_` | Deep-dive into a single item | GET (single) | `get_contact`, `inspect_deal` |
+| `modify_` or `create_` / `update_` | Create or change a resource | POST / PUT | `create_contact`, `update_deal` |
+| `remove_` or `delete_` | Delete a resource | DELETE | `delete_contact`, `remove_tag` |
+| `search_` | Full-text or keyword search | GET (with query) | `search_contacts` |
+| `send_` | Dispatch a message/notification | POST (side effect) | `send_email`, `send_sms` |
+| `export_` | Generate a report/file | GET or POST | `export_report` |
+
+**Guidelines:**
+- Pick ONE prefix style per server and be consistent (either `list_`/`get_` or `browse_`/`inspect_`, not both)
+- The standard `list_`/`get_`/`create_`/`update_`/`delete_` is recommended for most APIs
+- Use `browse_`/`inspect_`/`modify_`/`remove_` only if you need to avoid ambiguity with existing tool names or if the API's language uses these verbs naturally
+- For mutually exclusive tools, add "INSTEAD OF" notes in descriptions (e.g., "Use `search_contacts` INSTEAD OF `list_contacts` when the user provides a keyword")
+
+### App IDs: `{service}-{type}-{optional-qualifier}`
+- `{svc}-dashboard`, `{svc}-contact-grid`, `{svc}-contact-card`
+- `{svc}-pipeline-kanban`, `{svc}-calendar-view`, `{svc}-activity-timeline`
+
+### Tool group names: lowercase, domain-based
+- `contacts`, `deals`, `invoicing`, `calendar`, `analytics`, `admin`
+
+---
+
+## 13. Quirks & Gotchas
+
+{List any API-specific issues discovered during analysis}
+
+- {e.g., "Delete endpoint returns 200 with empty body, not 204"}
+- {e.g., "Pagination starts at 0, not 1"}
+- {e.g., "Date fields use Unix timestamps, not ISO 8601"}
+- {e.g., "Rate limit resets at midnight UTC, not rolling window"}
+- {e.g., "Sandbox environment has different base URL"}
+
+---
+
+## 14. Implementation Priority
+
+### Phase 1 (Core — build first):
+1. {most-used-group} — {why}
+2. {second-group} — {why}
+
+### Phase 2 (Important — build second):
+3. {third-group} — {why}
+4. {fourth-group} — {why}
+
+### Phase 3 (Nice-to-have — build if time):
+5. {remaining-groups}
+
+### App Priority:
+1. {svc}-dashboard — Always build the dashboard first
+2. {svc}-{most-used-grid} — Most common data view
+3. {svc}-{most-used-detail} — Detail for most common entity
+
+---
+
+## 5. Tool Description Best Practices
+
+Tool descriptions are the #1 factor in whether an LLM correctly routes to the right tool. Follow these rules:
+
+### The Description Formula (6-part):
+
+```
+{What it does}. {What it returns — include 2-3 key field names}. 
+{When to use it — specific user intents}. {When NOT to use it — disambiguation}.
+{Side effects — if any}.
+```
+
+Every tool description MUST include the "When NOT to use" clause. Research shows this single addition reduces tool misrouting by ~30%.
+
+### Before/After Example:
+
+**❌ BEFORE (too vague, no disambiguation):**
+```
+"List contacts with optional filters. Returns paginated results including name, email, phone, 
+and status. Use when the user wants to see, search, or browse their contact list."
+```
+
+**✅ AFTER (specific, disambiguated, actionable):**
+```
+"List contacts with optional filters and pagination. Returns {name, email, phone, status, 
+created_date} for each contact, plus {total, page, pageSize} metadata. Use when the user 
+wants to browse, filter, or get an overview of multiple contacts. Do NOT use when searching 
+by specific keyword (use search_contacts instead) or for getting full details of one contact 
+(use get_contact instead). Read-only, no side effects."
+```
+
+### For similar tools, differentiate clearly:
+```
+list_contacts: "...browse, filter, or get an overview of multiple contacts. 
+    Do NOT use when searching by keyword (use search_contacts) or looking up one contact (use get_contact)."
+search_contacts: "...full-text search across all contact fields by keyword. 
+    Do NOT use when browsing without a search term (use list_contacts) or when the user has a specific ID (use get_contact)."
+get_contact: "...get complete details for one contact by ID. 
+    Do NOT use when the user wants multiple contacts (use list_contacts) or is searching by name (use search_contacts)."
+```
+
+### Token Budget Awareness
+
+Tool descriptions consume context window tokens. Every tool definition averages 50-200 tokens depending on schema complexity. With 50+ tools, this is 10,000+ tokens before any work begins.
+
+**Targets:**
+- **Total tool definition tokens per server:** Under 5,000 tokens
+- **Per-tool target:** ~200 tokens (description + schema combined)
+- **Active tools per interaction:** Cap at 15-20 via lazy loading
+
+**Optimization techniques:**
+- Be concise — every word must earn its place
+- Eliminate redundant descriptions between the tool description and parameter descriptions
+- Use field name lists (`{name, email, phone}`) instead of prose descriptions of return values
+- Combine overlapping tools when the distinction is minor (e.g., `list_contacts` with optional `query` param instead of separate `list_contacts` + `search_contacts`)
+
+### Tool Count Optimization
+
+If a tool group exceeds 15 tools, consider combining:
+
+| Instead of... | Combine into... | How |
+|---------------|-----------------|-----|
+| `list_contacts` + `search_contacts` | `list_contacts` with optional `query` param | Add `query` as optional filter |
+| `get_contact_email` + `get_contact_phone` + `get_contact_address` | `get_contact` (returns all fields) | Single tool, all fields returned |
+| `create_contact` + `create_lead` + `create_prospect` | `create_contact` with `type` param | Use enum parameter for type |
+| `get_report_daily` + `get_report_weekly` + `get_report_monthly` | `get_report` with `period` param | Use enum parameter for period |
+
+**Rule of thumb:** If two tools share >80% of their parameters and the same endpoint pattern, they should be one tool with a distinguishing parameter.
+
+---
+
+## 6. MCP Annotation Rules
+
+Every tool MUST have annotations. Use this decision tree:
+
+```
+Is it a GET/read operation?
+  → readOnlyHint: true, destructiveHint: false
+
+Is it a DELETE operation?
+  → readOnlyHint: false, destructiveHint: true
+
+Is it a POST/create operation?
+  → readOnlyHint: false, destructiveHint: false, idempotentHint: false
+
+Is it a PUT/upsert operation?
+  → readOnlyHint: false, destructiveHint: false, idempotentHint: true
+
+Does it affect external systems outside this API?
+  → openWorldHint: true (rare — most API tools are openWorldHint: false)
+```
+
+---
+
+## 7. Content Annotations Planning
+
+MCP content blocks can carry `audience` and `priority` annotations that control how tool outputs are routed. Plan these during analysis — they feed directly into the server builder.
+
+### Audience Annotation:
+- `["user"]` — Output is for the end user (show in UI/app, don't feed back to LLM for reasoning)
+- `["assistant"]` — Output is for the LLM (feed into context for multi-step reasoning, don't show to user)
+- `["user", "assistant"]` — Both (show to user AND available for LLM reasoning — the default)
+
+### Priority Annotation (0.0 to 1.0):
+- `1.0` — Critical, always show prominently (destructive operation results, errors, confirmations)
+- `0.7-0.9` — Important, show normally (most tool results)
+- `0.3-0.6` — Supplementary, can be collapsed/summarized (metadata, pagination info)
+- `0.0-0.2` — Low priority, assistant-only (debug info, internal state)
+
+### Planning Guidelines:
+
+| Tool Type | Audience | Priority | Rationale |
+|-----------|----------|----------|-----------|
+| `list_*` | `["user", "assistant"]` | 0.7 | User sees data, LLM may use for follow-up |
+| `get_*` | `["user"]` | 0.8 | Primarily for user display |
+| `create_*` / `update_*` | `["user"]` | 0.9 | User needs confirmation of changes |
+| `delete_*` | `["user"]` | 1.0 | Critical — user must see result |
+| `search_*` | `["user", "assistant"]` | 0.7 | User sees results, LLM may refine |
+| Analytics/aggregation | `["user"]` | 0.8 | Dashboard-type data, primarily visual |
+| Internal/helper tools | `["assistant"]` | 0.3 | LLM uses for reasoning, user doesn't need to see |
+
+---
+
+## 8. App Candidate Selection Criteria
+
+Not every endpoint deserves an app. Use this checklist:
+
+### BUILD an app when:
+- ✅ The data is a **list** that benefits from search/filter UI (data grid)
+- ✅ The data is **complex** with many fields (detail card)
+- ✅ There are **aggregate metrics** or KPIs (dashboard)
+- ✅ The data is **date-based** and benefits from calendar layout (calendar)
+- ✅ The data has **stages/phases** (funnel/kanban)
+- ✅ The data is **chronological events** (timeline)
+- ✅ There's a **multi-step creation flow** (form/wizard)
+
+### SKIP an app when:
+- ❌ It's a simple CRUD with 2-3 fields (just use the tool directly)
+- ❌ The response is a simple success/fail (no visual benefit)
+- ❌ It's a settings/config endpoint (rarely needed in UI)
+- ❌ It's a batch/background operation (status check is enough)
+
+### App count targets:
+- **Small API (10-20 endpoints):** 3-5 apps
+- **Medium API (20-50 endpoints):** 5-10 apps
+- **Large API (50+ endpoints):** 10-20 apps
+- **Never exceed 25 apps** for a single service — diminishing returns
+
+---
+
+## 9. Quality Gate Checklist
+
+Before passing the analysis doc to Phase 2, verify:
+
+### Core Completeness:
+- [ ] **API style identified** — REST/GraphQL/SOAP/gRPC/WebSocket documented with adaptation notes if non-REST
+- [ ] **Every endpoint is cataloged** — no missing endpoints from the API reference
+- [ ] **Tool groups are balanced** — no group with 50+ tools, aim for 3-15 per group
+- [ ] **Active tool count is manageable** — total tools ≤ 60, each lazy-loaded group ≤ 20, active per interaction ≤ 15-20
+
+### Tool Quality:
+- [ ] **Tool descriptions follow 6-part formula** — What / Returns (field names) / When to use / When NOT to use / Side effects
+- [ ] **Every tool has a `title` field** — Human-readable display name separate from machine name
+- [ ] **Every tool has an `outputSchema` planned** — Expected response structure documented
+- [ ] **Every tool has annotations planned** — readOnlyHint, destructiveHint, idempotentHint, openWorldHint
+- [ ] **Content annotations planned** — audience and priority assigned per tool type
+- [ ] **Disambiguation tables exist** — For each tool group with similar tools, "User says X → Correct tool → Why not others"
+- [ ] **Semantic verb prefixes are consistent** — list_/get_/create_/update_/delete_ (or chosen alternative) used uniformly
+
+### Auth & Infrastructure:
+- [ ] **Auth flow is complete** — Step-by-step, env vars listed, refresh strategy documented
+- [ ] **OAuth2 subtype identified** — If OAuth2: grant type, PKCE, scopes, token lifetime documented
+- [ ] **Token lifecycle documented** — Expiry, refresh, storage strategy for long-running server, key rotation procedure
+- [ ] **Pagination pattern identified** — Type, params, max size, end detection, total count availability
+- [ ] **Rate limits are documented** — Global + per-endpoint, burst behavior, scope, penalty
+
+### Planning:
+- [ ] **Version & deprecation documented** — Current version, sunset timelines, version header requirements
+- [ ] **App candidates have clear data sources** — Each app maps to specific tool(s)
+- [ ] **Data shape contracts defined** — Tool outputSchema → app expected input mapped per app candidate
+- [ ] **Elicitation candidates identified** — Destructive operations, ambiguous inputs, multi-step flows, account selection
+- [ ] **Task candidates identified** — Long-running operations flagged with polling intervals
+- [ ] **Icon planning noted per tool** — SVG preferred, at least noted even if deferred
+- [ ] **Sandbox/test environment documented** — Availability, URL, QA impact
+- [ ] **Error format is documented** — Response shape, common error codes
+- [ ] **Naming follows conventions** — verb_noun tools, service-type app IDs, consistent verb prefixes
+- [ ] **User intent clustering done** — Diverse phrasings per disambiguation entry
+- [ ] **Quirks & gotchas captured** — API-specific oddities that affect implementation
+
+---
+
+## 10. Example: Completed Analysis (abbreviated)
+
+```markdown
+# Calendly — MCP API Analysis
+
+**Date:** 2026-02-04
+**API Version:** v2
+**Base URL:** `https://api.calendly.com`
+**Documentation:** https://developer.calendly.com/api-docs
+
+## 1. Service Overview
+**What it does:** Scheduling automation platform
+**API Style:** REST
+
+## 2. Authentication
+**Method:** OAuth2 (Personal Access Token also available)
+**OAuth2 Grant Type:** authorization_code (PKCE recommended for public clients)
+**Token Expiry:** 2 hours (refresh token: 30 days)
+Headers: `Authorization: Bearer {token}`
+
+## 4. Version & Deprecation
+**Current Version:** v2 (v1 sunset: 2024-06-01)
+**Version Mechanism:** URL path (/api/v2/)
+
+## 6. Tool Groups
+| Group | Tools | Description |
+|-------|-------|-------------|
+| `scheduling` | 8 | Event types, scheduling links |
+| `events` | 6 | Scheduled events, invitees |
+| `users` | 4 | User profiles, org membership |
+| `webhooks` | 3 | Webhook subscriptions |
+
+## 7. Tool Inventory (example tool)
+### `list_events`
+- **Title:** List Scheduled Events
+- **Description:** List scheduled events with date range and status filters. Returns {name, start_time, end_time, status, invitee_count} per event. Use when user wants to see upcoming or past events. Do NOT use for event type management (use list_event_types) or single event details (use get_event). Read-only.
+- **Output Schema:** `{ collection: Event[], pagination: { count, next_page_token } }`
+- **Content Annotations:** `audience: ["user", "assistant"]`, `priority: 0.7`
+
+## 8. App Candidates
+- calendly-dashboard (Dashboard) — event counts, upcoming schedule
+- calendly-event-grid (Data Grid) — list scheduled events
+- calendly-event-detail (Detail Card) — single event with invitee info
+- calendly-calendar (Calendar) — visual calendar of events
+- calendly-availability (Form) — set availability preferences
+
+## 9. Elicitation Candidates
+| Flow | Trigger | Type | User Input | Fallback |
+|------|---------|------|------------|----------|
+| Cancel event | `cancel_event` | Confirmation | "Cancel event with {invitee}?" | Require explicit confirmation in message |
+| Connect calendar | Initial setup | Selection | "Which calendar provider?" | Default to primary calendar |
+```
+
+---
+
+## 11. Execution Workflow
+
+```
+1. Receive API docs URL(s) from user
+2. Identify API style (REST/GraphQL/SOAP/gRPC/WebSocket)
+3. Read auth page → Document auth flow (including OAuth2 subtype, token lifecycle, key rotation)
+4. Read rate limits → Document constraints (including burst, scope, penalty)
+5. Check sandbox/test environment → Document availability, URL, and QA impact
+6. Check version/deprecation → Document current version and sunset timelines
+7. Scan all endpoints → Build endpoint catalog
+8. Group endpoints by domain → Define tool groups (cap at 15-20 active per interaction)
+9. Name each tool → Write 6-part descriptions with annotations, title, outputSchema, content annotations, icon
+10. Build disambiguation tables with user intent clustering for each tool group
+11. Identify elicitation candidates (destructive ops, ambiguous inputs, multi-step flows)
+12. Identify task candidates (long-running operations >10s)
+13. Identify app candidates → Map to data source tools
+14. Define data shape contracts (tool outputSchema → app expected input)
+15. Document quirks/gotchas
+16. Set implementation priority
+17. Run quality gate checklist
+18. Output: {service}-api-analysis.md
+```
+
+**Estimated time:** 30-60 minutes for small APIs, 1-2 hours for large APIs (50+ endpoints)
+
+**Agent model recommendation:** Opus — requires deep reading comprehension and strategic judgment for tool grouping and app candidate selection.
+
+---
+
+*This skill is Phase 1 of the MCP Factory pipeline. The analysis document it produces is the single source of truth for all subsequent phases.*
diff --git a/skills/mcp-app-designer/SKILL.md b/skills/mcp-app-designer/SKILL.md
new file mode 100644
index 0000000..b35fd05
--- /dev/null
+++ b/skills/mcp-app-designer/SKILL.md
@@ -0,0 +1,2170 @@
+# MCP App Designer — Phase 3: Design & Build HTML Apps
+
+**When to use this skill:** You have a `{service}-api-analysis.md` (specifically the App Candidates section) and optionally a built MCP server, and need to create the visual HTML apps that render in LocalBosses. Each app is a single self-contained HTML file.
+
+**What this covers:** Dark theme design specs, 9 app type patterns (including Interactive Data Grid), data visualization primitives, accessibility fundamentals, micro-interactions, bidirectional communication, the exact HTML template with data reception, responsive design, three-state rendering (loading/empty/data), and data flow architecture.
+
+**Pipeline position:** Phase 3 of 6 → Input from `mcp-api-analyzer` (Phase 1), can run parallel with `mcp-server-builder` (Phase 2). Output feeds `mcp-localbosses-integrator` (Phase 4).
+
+---
+
+## 1. Inputs & Outputs
+
+**Inputs:**
+- `{service}-api-analysis.md` — App Candidates section (which apps to build, data sources)
+- Tool definitions (from Phase 2 server or analysis doc) — what data shapes to expect
+
+**Output:** HTML app files in `{service}-mcp/app-ui/`:
+```
+{service}-mcp/
+└── app-ui/
+    ├── dashboard.html
+    ├── contact-grid.html
+    ├── contact-card.html
+    ├── contact-creator.html
+    ├── calendar-view.html
+    ├── pipeline-kanban.html
+    ├── activity-timeline.html
+    ├── data-explorer.html      ← Interactive Data Grid (new)
+    └── ...
+```
+
+Each file is a **single, self-contained HTML file** with all CSS and JS inline. Zero external dependencies.
+
+---
+
+## 2. Design System — LocalBosses Dark Theme
+
+### Color Palette
+
+> **WCAG AA Compliance Note:** All text colors must maintain a minimum contrast ratio of **4.5:1** against their background for normal text (under 18px/14px bold), and **3:1** for large text. The secondary text color `#b0b2b8` achieves **5.0:1** on `#1a1d23` and **4.3:1** on `#2b2d31`, meeting AA for normal text. The previous value `#96989d` (3.7:1) failed this requirement and must not be used.
+
+| Token | Hex | Usage |
+|-------|-----|-------|
+| `--bg-primary` | `#1a1d23` | Page/body background |
+| `--bg-secondary` | `#2b2d31` | Cards, panels, containers |
+| `--bg-tertiary` | `#232529` | Nested elements, table rows alt |
+| `--bg-hover` | `#35373c` | Hover states on interactive elements |
+| `--bg-input` | `#1e2024` | Form inputs, text areas |
+| `--accent` | `#ff6d5a` | Primary accent, buttons, active states |
+| `--accent-hover` | `#ff8574` | Accent hover state |
+| `--accent-subtle` | `rgba(255, 109, 90, 0.15)` | Accent backgrounds, badges |
+| `--text-primary` | `#dcddde` | Primary text |
+| `--text-secondary` | `#b0b2b8` | Muted/secondary text, labels (WCAG AA 5.0:1 on #1a1d23) |
+| `--text-heading` | `#ffffff` | Headings, emphasis |
+| `--border` | `#3a3c41` | Borders, dividers |
+| `--success` | `#43b581` | Success states, positive metrics |
+| `--warning` | `#faa61a` | Warning states, caution |
+| `--danger` | `#f04747` | Error states, destructive actions |
+| `--info` | `#5865f2` | Info states, links |
+
+### Typography
+
+```css
+font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, sans-serif;
+```
+
+| Element | Size | Weight | Color |
+|---------|------|--------|-------|
+| Page title | 18px | 700 | #ffffff |
+| Section heading | 14px | 600 | #ffffff |
+| Body text | 13px | 400 | #dcddde |
+| Small/muted | 12px | 400 | #b0b2b8 |
+| Metric value | 24px | 700 | #ff6d5a |
+| Table header | 11px | 600 | #b0b2b8 (uppercase, letter-spacing: 0.5px) |
+
+### Spacing & Layout
+
+| Token | Value | Usage |
+|-------|-------|-------|
+| `--gap-xs` | 4px | Tight spacing (icon + label) |
+| `--gap-sm` | 8px | Compact spacing |
+| `--gap-md` | 12px | Standard spacing |
+| `--gap-lg` | 16px | Section spacing |
+| `--gap-xl` | 24px | Major section breaks |
+| `--radius-sm` | 4px | Small elements (badges, chips) |
+| `--radius-md` | 8px | Cards, panels |
+| `--radius-lg` | 12px | Large containers, modals |
+
+### Components
+
+#### Cards
+```css
+.card {
+  background: #2b2d31;
+  border-radius: 8px;
+  padding: 16px;
+  border: 1px solid #3a3c41;
+}
+```
+
+#### Buttons
+```css
+.btn-primary {
+  background: #ff6d5a;
+  color: #ffffff;
+  border: none;
+  padding: 8px 16px;
+  border-radius: 6px;
+  font-size: 13px;
+  font-weight: 600;
+  cursor: pointer;
+  transition: background 0.15s;
+}
+.btn-primary:hover { background: #ff8574; }
+.btn-primary:focus-visible { outline: 2px solid #ff6d5a; outline-offset: 2px; }
+
+.btn-secondary {
+  background: transparent;
+  color: #dcddde;
+  border: 1px solid #3a3c41;
+  padding: 8px 16px;
+  border-radius: 6px;
+  font-size: 13px;
+  cursor: pointer;
+  transition: all 0.15s;
+}
+.btn-secondary:hover { background: #35373c; border-color: #4a4c51; }
+.btn-secondary:focus-visible { outline: 2px solid #ff6d5a; outline-offset: 2px; }
+```
+
+#### Status badges
+```css
+.badge { padding: 2px 8px; border-radius: 10px; font-size: 11px; font-weight: 600; }
+.badge-success { background: rgba(67, 181, 129, 0.15); color: #43b581; }
+.badge-warning { background: rgba(250, 166, 26, 0.15); color: #faa61a; }
+.badge-danger { background: rgba(240, 71, 71, 0.15); color: #f04747; }
+.badge-info { background: rgba(88, 101, 242, 0.15); color: #5865f2; }
+.badge-accent { background: rgba(255, 109, 90, 0.15); color: #ff6d5a; }
+.badge-neutral { background: rgba(176, 178, 184, 0.15); color: #b0b2b8; }
+```
+
+---
+
+## 3. Data Visualization Primitives
+
+All visualizations use pure CSS/SVG — zero external dependencies. Copy these snippets into any app template.
+
+### 3.1 Line / Area Chart (SVG Polyline)
+
+```html
+<!-- Line Chart: pass an array of {x, y} normalized to viewBox -->
+<svg viewBox="0 0 300 100" style="width:100%;height:160px" role="img" aria-label="Line chart showing trend data">
+  <!-- Grid lines -->
+  <line x1="0" y1="25" x2="300" y2="25" stroke="#3a3c41" stroke-width="0.5" stroke-dasharray="4"/>
+  <line x1="0" y1="50" x2="300" y2="50" stroke="#3a3c41" stroke-width="0.5" stroke-dasharray="4"/>
+  <line x1="0" y1="75" x2="300" y2="75" stroke="#3a3c41" stroke-width="0.5" stroke-dasharray="4"/>
+  <!-- Area fill -->
+  <polygon fill="rgba(255,109,90,0.1)" points="0,100 0,70 50,55 100,60 150,30 200,40 250,20 300,15 300,100"/>
+  <!-- Line -->
+  <polyline fill="none" stroke="#ff6d5a" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"
+    points="0,70 50,55 100,60 150,30 200,40 250,20 300,15"/>
+  <!-- Data points -->
+  <circle cx="0" cy="70" r="3" fill="#ff6d5a"/>
+  <circle cx="150" cy="30" r="3" fill="#ff6d5a"/>
+  <circle cx="300" cy="15" r="3" fill="#ff6d5a"/>
+</svg>
+```
+
+**JS helper to generate points from data:**
+```javascript
+function makeLinePoints(data, width, height) {
+  const max = Math.max(...data.map(d => d.value), 1);
+  const step = width / Math.max(data.length - 1, 1);
+  return data.map((d, i) => `${i * step},${height - (d.value / max) * (height - 10)}`).join(' ');
+}
+// Usage: <polyline points="${makeLinePoints(data, 300, 100)}"/>
+```
+
+### 3.2 Donut / Pie Chart (SVG Circle)
+
+```html
+<!-- Donut chart using stroke-dasharray trick -->
+<svg viewBox="0 0 36 36" style="width:120px;height:120px" role="img" aria-label="Donut chart: 72% complete">
+  <!-- Background ring -->
+  <circle cx="18" cy="18" r="15.9" fill="none" stroke="#2b2d31" stroke-width="3"/>
+  <!-- Segment 1: 72% (accent) -->
+  <circle cx="18" cy="18" r="15.9" fill="none" stroke="#ff6d5a" stroke-width="3"
+    stroke-dasharray="72 28" stroke-dashoffset="25" stroke-linecap="round"/>
+  <!-- Segment 2: 28% (muted) -->
+  <circle cx="18" cy="18" r="15.9" fill="none" stroke="#3a3c41" stroke-width="3"
+    stroke-dasharray="28 72" stroke-dashoffset="53"/>
+  <!-- Center label -->
+  <text x="18" y="18" text-anchor="middle" dy="0.35em" fill="#ffffff" font-size="8" font-weight="700">72%</text>
+</svg>
+```
+
+**JS helper for multi-segment donut:**
+```javascript
+function makeDonutSegments(segments, radius) {
+  const circumference = 2 * Math.PI * radius;
+  let offset = 25; // Start from top (25% offset = 12 o'clock)
+  return segments.map(seg => {
+    const dashArray = `${seg.percent} ${100 - seg.percent}`;
+    const html = `<circle cx="18" cy="18" r="${radius}" fill="none" stroke="${seg.color}" stroke-width="3" stroke-dasharray="${dashArray}" stroke-dashoffset="${offset}"/>`;
+    offset -= seg.percent;
+    return html;
+  }).join('');
+}
+```
+
+### 3.3 Sparklines (Inline SVG)
+
+```html
+<!-- Tiny inline sparkline — 80x24px, no axes -->
+<svg viewBox="0 0 100 30" style="width:80px;height:24px;vertical-align:middle" role="img" aria-label="Trend: increasing">
+  <polyline fill="none" stroke="#ff6d5a" stroke-width="2" stroke-linecap="round"
+    points="0,25 15,20 30,22 45,10 60,15 75,8 90,12 100,5"/>
+</svg>
+
+<!-- Green sparkline for positive trends -->
+<svg viewBox="0 0 100 30" style="width:80px;height:24px;vertical-align:middle" role="img" aria-label="Trend: stable">
+  <polyline fill="none" stroke="#43b581" stroke-width="2" stroke-linecap="round"
+    points="0,20 15,18 30,22 45,16 60,18 75,14 90,16 100,12"/>
+</svg>
+```
+
+### 3.4 Progress Bars (CSS-Only)
+
+```html
+<!-- Basic progress bar -->
+<div style="background:#232529;border-radius:4px;height:8px;overflow:hidden" role="progressbar" aria-valuenow="72" aria-valuemin="0" aria-valuemax="100" aria-label="Progress: 72%">
+  <div style="background:#ff6d5a;height:100%;width:72%;border-radius:4px;transition:width 0.6s ease"></div>
+</div>
+
+<!-- Labeled progress bar -->
+<div style="display:flex;justify-content:space-between;align-items:center;gap:12px;margin-bottom:8px">
+  <span style="font-size:12px;color:#b0b2b8;min-width:80px">Conversion</span>
+  <div style="flex:1;background:#232529;border-radius:4px;height:8px;overflow:hidden" role="progressbar" aria-valuenow="45" aria-valuemin="0" aria-valuemax="100">
+    <div style="background:#43b581;height:100%;width:45%;border-radius:4px;transition:width 0.6s ease"></div>
+  </div>
+  <span style="font-size:12px;color:#b0b2b8;min-width:35px;text-align:right">45%</span>
+</div>
+```
+
+### 3.5 Horizontal Bar Charts (CSS Flexbox)
+
+```html
+<!-- Horizontal bar chart — great for rankings/comparisons -->
+<div style="display:flex;flex-direction:column;gap:8px">
+  <div style="display:flex;align-items:center;gap:8px">
+    <span style="font-size:12px;color:#b0b2b8;min-width:80px;text-align:right">Email</span>
+    <div style="flex:1;background:#232529;border-radius:4px;height:20px;overflow:hidden">
+      <div style="background:#ff6d5a;height:100%;width:82%;border-radius:4px;display:flex;align-items:center;padding-left:8px">
+        <span style="font-size:11px;color:#fff;font-weight:600">82%</span>
+      </div>
+    </div>
+  </div>
+  <div style="display:flex;align-items:center;gap:8px">
+    <span style="font-size:12px;color:#b0b2b8;min-width:80px;text-align:right">Social</span>
+    <div style="flex:1;background:#232529;border-radius:4px;height:20px;overflow:hidden">
+      <div style="background:#5865f2;height:100%;width:54%;border-radius:4px;display:flex;align-items:center;padding-left:8px">
+        <span style="font-size:11px;color:#fff;font-weight:600">54%</span>
+      </div>
+    </div>
+  </div>
+  <div style="display:flex;align-items:center;gap:8px">
+    <span style="font-size:12px;color:#b0b2b8;min-width:80px;text-align:right">Direct</span>
+    <div style="flex:1;background:#232529;border-radius:4px;height:20px;overflow:hidden">
+      <div style="background:#43b581;height:100%;width:31%;border-radius:4px;display:flex;align-items:center;padding-left:8px">
+        <span style="font-size:11px;color:#fff;font-weight:600">31%</span>
+      </div>
+    </div>
+  </div>
+</div>
+```
+
+**JS helper for horizontal bars from data:**
+```javascript
+function renderHorizontalBars(items, colorFn) {
+  const max = Math.max(...items.map(d => d.value), 1);
+  return items.map(d => {
+    const pct = Math.round((d.value / max) * 100);
+    const color = colorFn ? colorFn(d) : '#ff6d5a';
+    return `
+      <div style="display:flex;align-items:center;gap:8px">
+        <span style="font-size:12px;color:#b0b2b8;min-width:80px;text-align:right;overflow:hidden;text-overflow:ellipsis;white-space:nowrap">${escapeHtml(d.label)}</span>
+        <div style="flex:1;background:#232529;border-radius:4px;height:20px;overflow:hidden">
+          <div style="background:${color};height:100%;width:${pct}%;border-radius:4px;display:flex;align-items:center;padding-left:8px;min-width:30px">
+            <span style="font-size:11px;color:#fff;font-weight:600">${formatNumber(d.value)}</span>
+          </div>
+        </div>
+      </div>`;
+  }).join('');
+}
+```
+
+---
+
+## 4. Data Flow: How Data Gets to the App
+
+### Architecture
+
+```
+User sends message in thread
+       │
+       ▼
+AI calls MCP tool → tool returns result
+       │
+       ├─── structuredContent (MCP protocol)  ← typed JSON data from tool
+       └─── content (text fallback)           ← human-readable text
+       │
+       ▼
+AI generates response + APP_DATA block
+       │
+       ▼
+<!--APP_DATA:{"contacts":[...]}:END_APP_DATA-->
+       │
+       ▼
+LocalBosses chat/route.ts parses APP_DATA
+       │
+       ▼
+Stores in app-data endpoint & sends via postMessage
+       │
+       ▼
+iframe receives data → app renders
+```
+
+### MCP `structuredContent` Context
+
+> **Important distinction:** The `APP_DATA` block format (`<!--APP_DATA:{...}:END_APP_DATA-->`) is a **LocalBosses-specific** pattern for passing structured data from the AI's text response to the app iframe. It is NOT part of the MCP protocol.
+>
+> In the MCP protocol (spec 2025-06-18+), tools return typed data via `structuredContent` alongside a text fallback in `content`. The flow is:
+>
+> 1. **MCP tool** returns `{ content: [...], structuredContent: { data: [...], meta: {...} } }`
+> 2. **LocalBosses** receives the tool result — the `structuredContent` is the typed data
+> 3. **AI** uses `structuredContent` to generate the `APP_DATA` block in its response text
+> 4. **LocalBosses route.ts** parses `APP_DATA` from the AI's response and sends it to the iframe
+>
+> The app itself doesn't interact with MCP directly — it receives data via `postMessage` or polling, regardless of whether the data originally came from `structuredContent` or was generated by the AI. The apps are a pure rendering layer.
+
+### Two data reception methods (apps MUST support both):
+
+1. **postMessage** — Primary. Host sends data to iframe.
+2. **Polling** — Fallback. App fetches from `/api/app-data` with exponential backoff.
+
+---
+
+## 5. The HTML App Template
+
+This is the EXACT base template for every app. Copy and customize.
+
+```html
+<!DOCTYPE html>
+<html lang="en">
+<head>
+  <meta charset="UTF-8">
+  <meta name="viewport" content="width=device-width, initial-scale=1.0">
+  <meta http-equiv="Content-Security-Policy" content="default-src 'none'; script-src 'unsafe-inline'; style-src 'unsafe-inline'; img-src data: blob:; connect-src 'self'; frame-ancestors 'self';">
+  <title>{App Name}</title>
+  <style>
+    /* ═══ RESET ═══ */
+    *, *::before, *::after { margin: 0; padding: 0; box-sizing: border-box; }
+
+    /* ═══ BASE ═══ */
+    body {
+      font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, sans-serif;
+      background: #1a1d23;
+      color: #dcddde;
+      padding: 16px;
+      font-size: 13px;
+      line-height: 1.5;
+      overflow-x: hidden;
+    }
+
+    /* ═══ ACCESSIBILITY ═══ */
+    /* Screen reader only — visually hidden but available to assistive technology */
+    .sr-only {
+      position: absolute;
+      width: 1px;
+      height: 1px;
+      padding: 0;
+      margin: -1px;
+      overflow: hidden;
+      clip: rect(0, 0, 0, 0);
+      white-space: nowrap;
+      border: 0;
+    }
+    /* Focus visible for keyboard users */
+    :focus-visible {
+      outline: 2px solid #ff6d5a;
+      outline-offset: 2px;
+    }
+
+    /* ═══ LOADING SKELETON ═══ */
+    .skeleton {
+      background: linear-gradient(90deg, #2b2d31 25%, #35373c 50%, #2b2d31 75%);
+      background-size: 200% 100%;
+      animation: shimmer 1.5s infinite;
+      border-radius: 4px;
+    }
+    @keyframes shimmer {
+      0% { background-position: 200% 0; }
+      100% { background-position: -200% 0; }
+    }
+    .skeleton-line { height: 14px; margin-bottom: 8px; }
+    .skeleton-line:last-child { width: 60%; }
+    .skeleton-card { height: 80px; margin-bottom: 12px; border-radius: 8px; }
+
+    /* Respect reduced motion preference */
+    @media (prefers-reduced-motion: reduce) {
+      .skeleton { animation: none; background: #2b2d31; }
+      .row-enter { animation: none !important; opacity: 1 !important; }
+      .metric-count { transition: none !important; }
+      .cross-fade { transition: none !important; }
+    }
+
+    /* ═══ EMPTY STATE ═══ */
+    .empty-state {
+      text-align: center;
+      padding: 48px 24px;
+      color: #b0b2b8;
+    }
+    .empty-state-icon { font-size: 48px; margin-bottom: 16px; opacity: 0.5; }
+    .empty-state-title { font-size: 16px; font-weight: 600; color: #dcddde; margin-bottom: 8px; }
+    .empty-state-text { font-size: 13px; max-width: 300px; margin: 0 auto; }
+
+    /* ═══ HEADER ═══ */
+    .app-header {
+      display: flex;
+      justify-content: space-between;
+      align-items: center;
+      margin-bottom: 16px;
+      padding-bottom: 12px;
+      border-bottom: 1px solid #3a3c41;
+    }
+    .app-title { font-size: 18px; font-weight: 700; color: #ffffff; }
+    .app-subtitle { font-size: 12px; color: #b0b2b8; margin-top: 2px; }
+
+    /* ═══ CARDS ═══ */
+    .card {
+      background: #2b2d31;
+      border-radius: 8px;
+      padding: 16px;
+      border: 1px solid #3a3c41;
+      transition: border-color 0.15s;
+    }
+    .card:hover { border-color: #4a4c51; }
+
+    /* ═══ METRICS ROW ═══ */
+    .metrics-row {
+      display: grid;
+      grid-template-columns: repeat(auto-fit, minmax(120px, 1fr));
+      gap: 12px;
+      margin-bottom: 16px;
+    }
+    .metric-card {
+      background: #2b2d31;
+      border-radius: 8px;
+      padding: 12px;
+      border: 1px solid #3a3c41;
+    }
+    .metric-label { font-size: 11px; color: #b0b2b8; text-transform: uppercase; letter-spacing: 0.5px; }
+    .metric-value { font-size: 24px; font-weight: 700; color: #ff6d5a; margin-top: 4px; }
+    .metric-change { font-size: 11px; margin-top: 2px; }
+    .metric-change.up { color: #43b581; }
+    .metric-change.down { color: #f04747; }
+
+    /* ═══ TABLE ═══ */
+    .data-table { width: 100%; border-collapse: collapse; }
+    .data-table th {
+      text-align: left;
+      padding: 8px 12px;
+      font-size: 11px;
+      font-weight: 600;
+      color: #b0b2b8;
+      text-transform: uppercase;
+      letter-spacing: 0.5px;
+      border-bottom: 1px solid #3a3c41;
+    }
+    .data-table td {
+      padding: 10px 12px;
+      border-bottom: 1px solid rgba(58, 60, 65, 0.5);
+      font-size: 13px;
+    }
+    .data-table tr:hover td { background: #35373c; }
+
+    /* ═══ BADGES ═══ */
+    .badge { display: inline-block; padding: 2px 8px; border-radius: 10px; font-size: 11px; font-weight: 600; }
+    .badge-success { background: rgba(67, 181, 129, 0.15); color: #43b581; }
+    .badge-warning { background: rgba(250, 166, 26, 0.15); color: #faa61a; }
+    .badge-danger { background: rgba(240, 71, 71, 0.15); color: #f04747; }
+    .badge-info { background: rgba(88, 101, 242, 0.15); color: #5865f2; }
+    .badge-accent { background: rgba(255, 109, 90, 0.15); color: #ff6d5a; }
+    .badge-neutral { background: rgba(176, 178, 184, 0.15); color: #b0b2b8; }
+
+    /* ═══ MICRO-INTERACTIONS ═══ */
+    /* Staggered row entrance — apply via JS: el.style.animationDelay = `${i * 50}ms` */
+    .row-enter {
+      animation: fadeSlideIn 0.25s ease-out forwards;
+      opacity: 0;
+    }
+    @keyframes fadeSlideIn {
+      from { opacity: 0; transform: translateY(4px); }
+      to { opacity: 1; transform: translateY(0); }
+    }
+    /* Cross-fade for data updates */
+    .cross-fade {
+      transition: opacity 0.2s ease;
+    }
+
+    /* ═══ UPDATING OVERLAY ═══ */
+    /* 4th state: shown over existing data while new data loads */
+    .updating-overlay {
+      position: absolute;
+      inset: 0;
+      background: rgba(26, 29, 35, 0.6);
+      display: flex;
+      align-items: center;
+      justify-content: center;
+      border-radius: 8px;
+      z-index: 10;
+    }
+    .updating-overlay .updating-text {
+      font-size: 13px;
+      color: #b0b2b8;
+      display: flex;
+      align-items: center;
+      gap: 8px;
+    }
+    .updating-spinner {
+      width: 16px;
+      height: 16px;
+      border: 2px solid #3a3c41;
+      border-top-color: #ff6d5a;
+      border-radius: 50%;
+      animation: spin 0.8s linear infinite;
+    }
+    @keyframes spin { to { transform: rotate(360deg); } }
+
+    /* ═══ RESPONSIVE ═══ */
+    @media (max-width: 400px) {
+      body { padding: 12px; }
+      .metrics-row { grid-template-columns: repeat(2, 1fr); gap: 8px; }
+      .app-title { font-size: 16px; }
+      .data-table { font-size: 12px; }
+      .data-table th, .data-table td { padding: 8px 8px; }
+    }
+    @media (max-width: 300px) {
+      .metrics-row { grid-template-columns: 1fr; }
+      body { padding: 8px; }
+    }
+  </style>
+</head>
+<body>
+  <div id="app">
+    <!-- LOADING STATE (shown by default) -->
+    <div id="loading" role="status" aria-label="Loading content">
+      <span class="sr-only">Loading content, please wait…</span>
+      <div class="app-header">
+        <div>
+          <div class="skeleton skeleton-line" style="width:140px;height:20px"></div>
+          <div class="skeleton skeleton-line" style="width:200px;height:12px;margin-top:6px"></div>
+        </div>
+      </div>
+      <div class="metrics-row">
+        <div class="skeleton skeleton-card" style="height:70px"></div>
+        <div class="skeleton skeleton-card" style="height:70px"></div>
+        <div class="skeleton skeleton-card" style="height:70px"></div>
+      </div>
+      <div class="skeleton skeleton-card"></div>
+      <div class="skeleton skeleton-card"></div>
+      <div class="skeleton skeleton-card"></div>
+    </div>
+
+    <!-- EMPTY STATE (hidden by default) — customize per app type -->
+    <div id="empty" style="display:none">
+      <div class="empty-state">
+        <div class="empty-state-icon">📋</div>
+        <div class="empty-state-title">No data yet</div>
+        <div class="empty-state-text">Ask me a question in the chat to populate this view with data.</div>
+      </div>
+    </div>
+
+    <!-- DATA STATE (hidden by default) -->
+    <div id="content" style="display:none;position:relative" aria-live="polite">
+      <!-- Populated by render() -->
+      <!-- UPDATING OVERLAY — subtle indicator on existing data while new data loads -->
+      <div id="updating-overlay" class="updating-overlay" style="display:none" role="status">
+        <div class="updating-text">
+          <div class="updating-spinner"></div>
+          <span>Updating…</span>
+        </div>
+      </div>
+    </div>
+  </div>
+
+  <script>
+    // ═══════════════════════════════════════
+    // ERROR BOUNDARY — catch render failures
+    // ═══════════════════════════════════════
+
+    window.onerror = function(msg, url, line, col, error) {
+      console.error('App error:', msg, 'at line', line);
+      try {
+        document.getElementById('content').innerHTML = `
+          <div class="empty-state">
+            <div class="empty-state-icon">⚠️</div>
+            <div class="empty-state-title">Display Error</div>
+            <div class="empty-state-text">The app encountered an issue rendering the data. Try sending a new message.</div>
+          </div>`;
+        showState('data');
+      } catch (e) {
+        // Last resort — at least show something
+        document.body.innerHTML = '<div style="text-align:center;padding:48px;color:#b0b2b8">⚠️ Display error. Try sending a new message.</div>';
+      }
+      return true; // Prevent default error handling
+    };
+
+    window.addEventListener('unhandledrejection', function(event) {
+      console.error('Unhandled promise rejection:', event.reason);
+    });
+
+    // ═══════════════════════════════════════
+    // DATA RECEPTION — postMessage + polling
+    // ═══════════════════════════════════════
+
+    let currentData = null;
+
+    // Trusted origins for postMessage validation
+    // Configure for your environment: same-origin + localhost + any custom trusted origins
+    const TRUSTED_ORIGINS = [window.location.origin, 'http://localhost:3000', 'http://localhost:3001'];
+
+    // Method 1: postMessage from host
+    window.addEventListener('message', (event) => {
+      // Validate origin — allow same-origin, localhost, and configured trusted origins
+      if (event.origin && event.origin !== window.location.origin && !TRUSTED_ORIGINS.includes(event.origin)) {
+        console.warn('[App] Rejected postMessage from untrusted origin:', event.origin);
+        return;
+      }
+      try {
+        const msg = event.data;
+        // Handle "updating" state — triggered when user sends a new message
+        if (msg.type === 'user_message_sent') {
+          if (currentData) showState('updating'); // Show overlay on existing data
+          return;
+        }
+        // Handle multiple message formats
+        if (msg.type === 'mcp_app_data' && msg.data) {
+          handleData(msg.data);
+        } else if (msg.type === 'app_data' && msg.data) {
+          handleData(msg.data);
+        } else if (msg.type === 'mcp-app-init' && msg.data) {
+          handleData(msg.data);
+        } else if (typeof msg === 'object' && !msg.type) {
+          // Raw data object
+          handleData(msg);
+        }
+      } catch (e) {
+        console.error('postMessage handler error:', e);
+      }
+    });
+
+    // Method 2: Polling fallback with exponential backoff
+    const APP_ID = '{app-id}'; // Replace with actual app ID
+    let pollTimer = null;
+    let pollCount = 0;
+    const POLL_INTERVALS = [3000, 5000, 10000, 30000]; // Exponential backoff
+    const MAX_POLLS = 20;
+
+    async function pollForData() {
+      // Don't poll if tab is hidden or max attempts reached
+      if (document.hidden) return schedulePoll();
+      if (pollCount >= MAX_POLLS) {
+        showState('empty');
+        document.querySelector('#empty .empty-state-title').textContent = 'Timed Out';
+        document.querySelector('#empty .empty-state-text').textContent = 'Data took too long to load. Try sending a new message.';
+        return;
+      }
+
+      pollCount++;
+      try {
+        const res = await fetch(`/api/app-data?app=${APP_ID}&t=${Date.now()}`);
+        if (res.ok) {
+          const data = await res.json();
+          if (data && Object.keys(data).length > 0) {
+            handleData(data);
+            return; // Stop polling — data received
+          }
+        }
+      } catch (e) {
+        // Silently fail — polling is a fallback
+      }
+      schedulePoll();
+    }
+
+    function schedulePoll() {
+      if (currentData) return; // Already have data, stop
+      const intervalIndex = Math.min(pollCount, POLL_INTERVALS.length - 1);
+      pollTimer = setTimeout(pollForData, POLL_INTERVALS[intervalIndex]);
+    }
+
+    // Pause/resume polling on visibility change
+    document.addEventListener('visibilitychange', () => {
+      if (!document.hidden && !currentData && pollCount < MAX_POLLS) {
+        pollForData();
+      }
+    });
+
+    // Start polling after short delay (give postMessage a chance first)
+    setTimeout(pollForData, 500);
+
+    // ═══════════════════════════════════════
+    // DATA HANDLING
+    // ═══════════════════════════════════════
+
+    function handleData(data) {
+      // Deduplicate — don't re-render identical data
+      const dataStr = JSON.stringify(data);
+      if (dataStr === JSON.stringify(currentData)) return;
+      currentData = data;
+
+      // Stop polling once we have data
+      if (pollTimer) { clearTimeout(pollTimer); pollTimer = null; }
+
+      // Route to render
+      if (!data || (typeof data === 'object' && Object.keys(data).length === 0)) {
+        showState('empty');
+      } else {
+        try {
+          render(data);
+        } catch (e) {
+          console.error('Render error:', e);
+          document.getElementById('content').innerHTML = `
+            <div class="empty-state">
+              <div class="empty-state-icon">⚠️</div>
+              <div class="empty-state-title">Display Error</div>
+              <div class="empty-state-text">Could not render the data. Try a different query.</div>
+            </div>`;
+          showState('data');
+        }
+      }
+    }
+
+    // ═══════════════════════════════════════
+    // STATE MANAGEMENT
+    // ═══════════════════════════════════════
+
+    function showState(state) {
+      document.getElementById('loading').style.display = state === 'loading' ? 'block' : 'none';
+      document.getElementById('empty').style.display = state === 'empty' ? 'block' : 'none';
+      const content = document.getElementById('content');
+      content.style.display = (state === 'data' || state === 'updating') ? 'block' : 'none';
+
+      // Updating overlay — subtle indicator on existing data while new data loads
+      const overlay = document.getElementById('updating-overlay');
+      if (overlay) overlay.style.display = state === 'updating' ? 'flex' : 'none';
+
+      // Focus management: move focus to content when data loads
+      if (state === 'data') {
+        content.setAttribute('tabindex', '-1');
+        content.focus({ preventScroll: true });
+      }
+    }
+
+    // ═══════════════════════════════════════
+    // DATA VALIDATION
+    // ═══════════════════════════════════════
+
+    /**
+     * Validate that data contains expected fields.
+     * Logs warnings for missing fields instead of crashing.
+     * @param {object} data - The data object to validate
+     * @param {string[]} requiredFields - Array of field names/paths expected
+     * @returns {boolean} - true if all fields present, false if any missing
+     */
+    function validateData(data, requiredFields) {
+      if (!data || typeof data !== 'object') {
+        console.warn('[App] validateData: data is not an object', data);
+        return false;
+      }
+      let valid = true;
+      requiredFields.forEach(field => {
+        const parts = field.split('.');
+        let val = data;
+        for (const part of parts) {
+          val = val?.[part];
+        }
+        if (val === undefined || val === null) {
+          console.warn(`[App] Missing expected field: "${field}"`, data);
+          valid = false;
+        }
+      });
+      return valid;
+    }
+
+    // ═══════════════════════════════════════
+    // BIDIRECTIONAL COMMUNICATION
+    // ═══════════════════════════════════════
+
+    /**
+     * Send an action from the app back to the host.
+     * @param {'refresh'|'navigate'|'tool_call'} action - The action type
+     * @param {object} payload - Action-specific data
+     *
+     * Usage examples:
+     *   sendToHost('refresh', {});
+     *   sendToHost('navigate', { app: 'contact-card', params: { id: '123' } });
+     *   sendToHost('tool_call', { tool: 'delete_contact', args: { id: '123' } });
+     */
+    function sendToHost(action, payload) {
+      window.parent.postMessage({
+        type: 'mcp_app_action',
+        action: action,
+        payload: payload,
+        appId: APP_ID
+      }, '*');
+    }
+
+    // ═══════════════════════════════════════
+    // RENDER — Customize per app type
+    // ═══════════════════════════════════════
+
+    function render(data) {
+      showState('data');
+      const el = document.getElementById('content');
+
+      // === YOUR APP-SPECIFIC RENDERING HERE ===
+      el.innerHTML = `
+        <div class="app-header">
+          <div>
+            <div class="app-title">{App Title}</div>
+            <div class="app-subtitle">${escapeHtml(data.subtitle || '')}</div>
+          </div>
+        </div>
+        <!-- Render your data here -->
+      `;
+    }
+
+    // ═══════════════════════════════════════
+    // MICRO-INTERACTIONS
+    // ═══════════════════════════════════════
+
+    /**
+     * Apply staggered entrance animation to rows.
+     * Call after inserting rows into the DOM.
+     * @param {string} selector - CSS selector for the rows
+     * @param {number} delayMs - Delay between each row (default 50ms)
+     */
+    function staggerRows(selector, delayMs = 50) {
+      document.querySelectorAll(selector).forEach((row, i) => {
+        row.classList.add('row-enter');
+        row.style.animationDelay = `${i * delayMs}ms`;
+      });
+    }
+
+    /**
+     * Animate a number counting up from 0 to its target value.
+     * @param {HTMLElement} el - The element containing the number
+     * @param {number} target - The target number
+     * @param {number} duration - Animation duration in ms (default 600)
+     * @param {function} formatter - Formatting function (default formatNumber)
+     */
+    function animateCount(el, target, duration = 600, formatter = formatNumber) {
+      // Respect reduced motion
+      if (window.matchMedia('(prefers-reduced-motion: reduce)').matches) {
+        el.textContent = formatter(target);
+        return;
+      }
+      const start = performance.now();
+      function step(now) {
+        const elapsed = now - start;
+        const progress = Math.min(elapsed / duration, 1);
+        // Ease-out cubic
+        const eased = 1 - Math.pow(1 - progress, 3);
+        el.textContent = formatter(Math.round(target * eased));
+        if (progress < 1) requestAnimationFrame(step);
+      }
+      requestAnimationFrame(step);
+    }
+
+    /**
+     * Smooth cross-fade when updating content.
+     * @param {HTMLElement} container - The container to update
+     * @param {string} newHtml - The new HTML content
+     */
+    function crossFadeUpdate(container, newHtml) {
+      if (window.matchMedia('(prefers-reduced-motion: reduce)').matches) {
+        container.innerHTML = newHtml;
+        return;
+      }
+      container.style.opacity = '0';
+      setTimeout(() => {
+        container.innerHTML = newHtml;
+        container.style.opacity = '1';
+      }, 200);
+    }
+
+    // ═══════════════════════════════════════
+    // UTILITIES
+    // ═══════════════════════════════════════
+
+    function escapeHtml(text) {
+      if (!text) return '';
+      return String(text)
+        .replace(/&/g, '&amp;')
+        .replace(/</g, '&lt;')
+        .replace(/>/g, '&gt;')
+        .replace(/"/g, '&quot;')
+        .replace(/'/g, '&#39;');
+    }
+
+    function formatNumber(num) {
+      if (num == null) return '—';
+      if (typeof num !== 'number') num = parseFloat(num);
+      if (isNaN(num)) return '—';
+      if (num >= 1000000) return (num / 1000000).toFixed(1) + 'M';
+      if (num >= 1000) return (num / 1000).toFixed(1) + 'K';
+      return num.toLocaleString();
+    }
+
+    function formatCurrency(num) {
+      if (num == null) return '—';
+      return '$' + Number(num).toLocaleString(undefined, { minimumFractionDigits: 0, maximumFractionDigits: 0 });
+    }
+
+    function formatDate(dateStr) {
+      if (!dateStr) return '—';
+      try {
+        const d = new Date(dateStr);
+        return d.toLocaleDateString('en-US', { month: 'short', day: 'numeric', year: 'numeric' });
+      } catch { return dateStr; }
+    }
+
+    function formatDateTime(dateStr) {
+      if (!dateStr) return '—';
+      try {
+        const d = new Date(dateStr);
+        return d.toLocaleDateString('en-US', { month: 'short', day: 'numeric' }) + ' ' +
+               d.toLocaleTimeString('en-US', { hour: 'numeric', minute: '2-digit' });
+      } catch { return dateStr; }
+    }
+
+    function getBadgeClass(status) {
+      const s = String(status).toLowerCase();
+      if (['active', 'open', 'won', 'completed', 'paid', 'success', 'live'].includes(s)) return 'badge-success';
+      if (['pending', 'in progress', 'processing', 'draft'].includes(s)) return 'badge-warning';
+      if (['closed', 'lost', 'failed', 'overdue', 'cancelled', 'error'].includes(s)) return 'badge-danger';
+      if (['new', 'scheduled', 'upcoming'].includes(s)) return 'badge-info';
+      return 'badge-neutral';
+    }
+
+    /**
+     * Copy text to clipboard and show brief visual feedback.
+     * @param {string} text - Text to copy
+     * @param {HTMLElement} [feedbackEl] - Optional element to flash "Copied!"
+     */
+    function copyToClipboard(text, feedbackEl) {
+      navigator.clipboard.writeText(text).then(() => {
+        if (feedbackEl) {
+          const orig = feedbackEl.textContent;
+          feedbackEl.textContent = 'Copied!';
+          feedbackEl.style.color = '#43b581';
+          setTimeout(() => {
+            feedbackEl.textContent = orig;
+            feedbackEl.style.color = '';
+          }, 1500);
+        }
+      }).catch(() => {
+        // Fallback for older browsers
+        const ta = document.createElement('textarea');
+        ta.value = text;
+        ta.style.position = 'fixed';
+        ta.style.opacity = '0';
+        document.body.appendChild(ta);
+        ta.select();
+        document.execCommand('copy');
+        document.body.removeChild(ta);
+      });
+    }
+  </script>
+</body>
+</html>
+```
+
+---
+
+## 6. App Type Templates
+
+### 6.1 Dashboard
+
+**Use when:** Aggregate KPIs, overview metrics, recent activity summary.
+
+**Expected data shape:** `{ title?, timeFrame?, metrics: { [key]: number }, recent?: { title, description?, date }[] }`
+
+**Empty state:** "Ask me for a performance overview, KPIs, or a metrics summary."
+
+```javascript
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  // Validate expected shape
+  validateData(data, ['metrics']);
+
+  const metrics = data.metrics || {};
+  const recentItems = Array.isArray(data.recent) ? data.recent : [];
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(data.title || '{Service} Dashboard')}</div>
+        <div class="app-subtitle">${escapeHtml(data.timeFrame || 'Last 30 days')}</div>
+      </div>
+    </div>
+
+    <div class="metrics-row" role="list" aria-label="Key metrics">
+      ${Object.entries(metrics).map(([key, val]) => `
+        <div class="metric-card" role="listitem">
+          <div class="metric-label">${escapeHtml(key.replace(/_/g, ' '))}</div>
+          <div class="metric-value" data-count="${typeof val === 'number' ? val : ''}">${typeof val === 'number' && key.includes('revenue') ? formatCurrency(val) : formatNumber(val)}</div>
+        </div>
+      `).join('')}
+    </div>
+
+    ${recentItems.length > 0 ? `
+      <div class="card">
+        <div style="font-size:14px;font-weight:600;color:#fff;margin-bottom:12px">Recent Activity</div>
+        ${recentItems.slice(0, 10).map((item, i) => `
+          <div class="row-enter" style="display:flex;justify-content:space-between;align-items:center;padding:8px 0;border-bottom:1px solid rgba(58,60,65,0.5);animation-delay:${i * 50}ms">
+            <div>
+              <div style="font-weight:500">${escapeHtml(item.title || item.name || '—')}</div>
+              <div style="font-size:12px;color:#b0b2b8">${escapeHtml(item.description || item.type || '')}</div>
+            </div>
+            <div style="font-size:12px;color:#b0b2b8">${formatDateTime(item.date || item.createdAt)}</div>
+          </div>
+        `).join('')}
+      </div>
+    ` : ''}
+  `;
+
+  // Animate metric numbers
+  el.querySelectorAll('.metric-value[data-count]').forEach(el => {
+    const target = parseFloat(el.dataset.count);
+    if (!isNaN(target)) {
+      const isCurrency = el.textContent.startsWith('$');
+      animateCount(el, target, 600, isCurrency ? formatCurrency : formatNumber);
+    }
+  });
+}
+```
+
+**Dashboard empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">📊</div>
+    <div class="empty-state-title">Dashboard</div>
+    <div class="empty-state-text">Ask me for a performance overview, revenue metrics, or a summary of recent activity.</div>
+  </div>
+</div>
+```
+
+### 6.2 Data Grid
+
+**Use when:** Searchable/filterable lists, table views.
+
+**Expected data shape:** `{ title?, data|items|contacts|results: object[], meta?: { total, page, pageSize } }`
+
+**Empty state:** "Try 'show me all active contacts' or 'list recent invoices.'"
+
+```javascript
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  const items = Array.isArray(data) ? data : (data.data || data.items || data.contacts || data.results || []);
+  const total = data.meta?.total || data.total || items.length;
+
+  // Validate
+  if (!Array.isArray(items)) {
+    console.warn('[DataGrid] Expected array for items, got:', typeof items);
+  }
+
+  // Auto-detect columns from first item
+  const columns = items.length > 0
+    ? Object.keys(items[0]).filter(k => !['id', '_id', '__v'].includes(k)).slice(0, 6)
+    : [];
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(data.title || 'Results')}</div>
+        <div class="app-subtitle">${total} record${total !== 1 ? 's' : ''}</div>
+      </div>
+    </div>
+
+    <div class="card" style="overflow-x:auto">
+      <table class="data-table" role="table" aria-label="${escapeHtml(data.title || 'Data grid')}">
+        <thead>
+          <tr>${columns.map(col => `<th scope="col">${escapeHtml(col.replace(/_/g, ' '))}</th>`).join('')}</tr>
+        </thead>
+        <tbody>
+          ${items.map((item, i) => `
+            <tr class="row-enter" style="animation-delay:${i * 50}ms">
+              ${columns.map(col => {
+                const val = item[col];
+                if (col === 'status' || col === 'state') {
+                  return `<td><span class="badge ${getBadgeClass(val)}"><span class="sr-only">Status: </span>${escapeHtml(String(val || '—'))}</span></td>`;
+                }
+                if (typeof val === 'number' && (col.includes('amount') || col.includes('revenue') || col.includes('price'))) {
+                  return `<td>${formatCurrency(val)}</td>`;
+                }
+                if (typeof val === 'string' && val.match(/^\d{4}-\d{2}-\d{2}/)) {
+                  return `<td>${formatDate(val)}</td>`;
+                }
+                return `<td>${escapeHtml(String(val ?? '—'))}</td>`;
+              }).join('')}
+            </tr>
+          `).join('')}
+        </tbody>
+      </table>
+    </div>
+  `;
+}
+```
+
+**Data Grid empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">📋</div>
+    <div class="empty-state-title">No records yet</div>
+    <div class="empty-state-text">Try "show me all active contacts" or "list recent invoices."</div>
+  </div>
+</div>
+```
+
+### 6.3 Detail Card
+
+**Use when:** Single entity deep-dive (contact, invoice, appointment).
+
+**Expected data shape:** `{ data|contact|item: { name?, title?, email?, status?, ...fields } }`
+
+**Empty state:** "Ask about a specific record by name or ID to see its details."
+
+```javascript
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  // Flatten data — support nested formats
+  const item = data.data || data.contact || data.item || data;
+  const fields = Object.entries(item).filter(([k]) => !['id', '_id', '__v'].includes(k));
+
+  // Validate
+  validateData(item, ['name']);
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(item.name || item.title || 'Details')}</div>
+        <div class="app-subtitle">${escapeHtml(item.email || item.type || item.status || '')}</div>
+      </div>
+      ${item.status ? `<span class="badge ${getBadgeClass(item.status)}"><span class="sr-only">Status: </span>${escapeHtml(item.status)}</span>` : ''}
+    </div>
+
+    <div class="card" role="list" aria-label="Record details">
+      ${fields.map(([key, val], i) => {
+        if (val == null || val === '') return '';
+        if (typeof val === 'object') val = JSON.stringify(val);
+        return `
+          <div role="listitem" class="row-enter" style="display:flex;justify-content:space-between;padding:8px 0;border-bottom:1px solid rgba(58,60,65,0.3);animation-delay:${i * 50}ms">
+            <span style="color:#b0b2b8;font-size:12px;text-transform:capitalize">${escapeHtml(key.replace(/_/g, ' '))}</span>
+            <span style="font-weight:500;max-width:60%;text-align:right;word-break:break-word">${escapeHtml(String(val))}</span>
+          </div>
+        `;
+      }).join('')}
+    </div>
+  `;
+}
+```
+
+**Detail Card empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">🔍</div>
+    <div class="empty-state-title">No details to show</div>
+    <div class="empty-state-text">Ask about a specific record by name or ID to see its full details here.</div>
+  </div>
+</div>
+```
+
+### 6.4 Form / Wizard
+
+**Use when:** Multi-step creation or edit flows.
+
+**Expected data shape:** `{ title?, description?, fields: { name, label?, type?, required?, placeholder?, options?: {value, label}[] }[] }`
+
+**Empty state:** "Tell me what you'd like to create and I'll set up the form."
+
+```javascript
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  // Validate
+  validateData(data, ['fields']);
+
+  const fields = data.fields || [];
+  const title = data.title || 'Create New';
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(title)}</div>
+        <div class="app-subtitle">${escapeHtml(data.description || 'Fill in the details below')}</div>
+      </div>
+    </div>
+
+    <div class="card">
+      <form id="appForm" onsubmit="return false" aria-label="${escapeHtml(title)}">
+        ${fields.map((field, i) => `
+          <div style="margin-bottom:16px" class="row-enter" style="animation-delay:${i * 50}ms">
+            <label for="field-${escapeHtml(field.name)}" style="display:block;font-size:12px;color:#b0b2b8;margin-bottom:4px;text-transform:capitalize">
+              ${escapeHtml(field.label || field.name)}${field.required ? ' *' : ''}
+            </label>
+            ${field.type === 'select' ? `
+              <select id="field-${escapeHtml(field.name)}" name="${escapeHtml(field.name)}" style="width:100%;padding:8px 12px;background:#1e2024;border:1px solid #3a3c41;border-radius:6px;color:#dcddde;font-size:13px" ${field.required ? 'required' : ''} aria-label="${escapeHtml(field.label || field.name)}">
+                <option value="">Select...</option>
+                ${(field.options || []).map(opt => `<option value="${escapeHtml(opt.value || opt)}">${escapeHtml(opt.label || opt)}</option>`).join('')}
+              </select>
+            ` : field.type === 'textarea' ? `
+              <textarea id="field-${escapeHtml(field.name)}" name="${escapeHtml(field.name)}" rows="3" style="width:100%;padding:8px 12px;background:#1e2024;border:1px solid #3a3c41;border-radius:6px;color:#dcddde;font-size:13px;resize:vertical" ${field.required ? 'required' : ''} placeholder="${escapeHtml(field.placeholder || '')}" aria-label="${escapeHtml(field.label || field.name)}"></textarea>
+            ` : `
+              <input id="field-${escapeHtml(field.name)}" type="${field.type || 'text'}" name="${escapeHtml(field.name)}" style="width:100%;padding:8px 12px;background:#1e2024;border:1px solid #3a3c41;border-radius:6px;color:#dcddde;font-size:13px" ${field.required ? 'required' : ''} placeholder="${escapeHtml(field.placeholder || '')}" value="${escapeHtml(field.value || '')}" aria-label="${escapeHtml(field.label || field.name)}">
+            `}
+          </div>
+        `).join('')}
+        <button class="btn-primary" type="button" onclick="submitForm()" style="width:100%;margin-top:16px;padding:10px 16px">
+          ${escapeHtml(data.submitLabel || 'Submit')}
+        </button>
+      </form>
+    </div>
+  `;
+}
+
+// Form submit handler — collects values, validates required fields, sends to host
+function submitForm() {
+  const form = document.getElementById('appForm');
+  if (!form) return;
+  const formData = {};
+  const fields = form.querySelectorAll('input, select, textarea');
+
+  // Reset field borders
+  fields.forEach(f => { f.style.borderColor = '#3a3c41'; });
+
+  // Collect values
+  fields.forEach(field => {
+    if (field.name) formData[field.name] = field.value;
+  });
+
+  // Validate required fields
+  const missing = [...fields].filter(f => f.required && !f.value);
+  if (missing.length > 0) {
+    missing.forEach(f => { f.style.borderColor = '#f04747'; });
+    missing[0].focus();
+    return;
+  }
+
+  // Send to host for tool execution
+  sendToHost('tool_call', {
+    tool: 'create_' + APP_ID.split('-').pop(),
+    args: formData
+  });
+
+  // Show confirmation state
+  showState('empty');
+  document.querySelector('#empty .empty-state-icon').textContent = '✅';
+  document.querySelector('#empty .empty-state-title').textContent = 'Submitted!';
+  document.querySelector('#empty .empty-state-text').textContent = 'Your request has been sent. Check the chat for confirmation.';
+}
+```
+
+**Form empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">✏️</div>
+    <div class="empty-state-title">Ready to create</div>
+    <div class="empty-state-text">Tell me what you'd like to create and I'll set up the form for you.</div>
+  </div>
+</div>
+```
+
+### 6.5 Timeline
+
+**Use when:** Chronological events, activity feeds, audit logs.
+
+**Expected data shape:** `{ title?, events|activities|timeline: { title, description?, date|timestamp, user|actor? }[] }`
+
+**Empty state:** "Ask to see recent activity, event history, or an audit log."
+
+```javascript
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  const events = Array.isArray(data) ? data : (data.events || data.activities || data.timeline || []);
+
+  // Validate
+  if (events.length > 0) validateData(events[0], ['title']);
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(data.title || 'Activity Timeline')}</div>
+        <div class="app-subtitle">${events.length} event${events.length !== 1 ? 's' : ''}</div>
+      </div>
+    </div>
+
+    <div style="position:relative;padding-left:24px" role="list" aria-label="Timeline events">
+      <div style="position:absolute;left:8px;top:0;bottom:0;width:2px;background:#3a3c41" aria-hidden="true"></div>
+      ${events.map((event, i) => `
+        <div style="position:relative;padding-bottom:${i < events.length - 1 ? '20px' : '0'}" role="listitem" class="row-enter" style="animation-delay:${i * 50}ms">
+          <div style="position:absolute;left:-20px;top:4px;width:12px;height:12px;border-radius:50%;background:${i === 0 ? '#ff6d5a' : '#3a3c41'};border:2px solid #1a1d23" aria-hidden="true"></div>
+          <div class="card" style="margin-left:8px">
+            <div style="display:flex;justify-content:space-between;align-items:start">
+              <div>
+                <div style="font-weight:600;color:#fff">${escapeHtml(event.title || event.type || event.action || '—')}</div>
+                <div style="font-size:12px;color:#b0b2b8;margin-top:2px">${escapeHtml(event.description || event.details || '')}</div>
+              </div>
+              <div style="font-size:11px;color:#b0b2b8;white-space:nowrap;margin-left:12px">${formatDateTime(event.date || event.timestamp || event.createdAt)}</div>
+            </div>
+            ${event.user || event.actor ? `<div style="font-size:12px;color:#b0b2b8;margin-top:6px">by ${escapeHtml(event.user || event.actor)}</div>` : ''}
+          </div>
+        </div>
+      `).join('')}
+    </div>
+  `;
+}
+```
+
+**Timeline empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">🕐</div>
+    <div class="empty-state-title">No activity yet</div>
+    <div class="empty-state-text">Ask to see recent activity, event history, or an audit trail.</div>
+  </div>
+</div>
+```
+
+### 6.6 Funnel / Pipeline
+
+**Use when:** Stage-based progression (sales pipeline, deal stages).
+
+**Expected data shape:** `{ title?, stages|pipeline: { name|title, items|deals: { name|title, value|amount?, contact|company? }[] }[] }`
+
+**Empty state:** "Ask to see your sales pipeline or a specific deal stage."
+
+```javascript
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  const stages = Array.isArray(data) ? data : (data.stages || data.pipeline || []);
+
+  // Validate
+  if (stages.length > 0) validateData(stages[0], ['name']);
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(data.title || 'Pipeline')}</div>
+        <div class="app-subtitle">${escapeHtml(data.subtitle || '')}</div>
+      </div>
+    </div>
+
+    <div style="display:flex;gap:12px;overflow-x:auto;padding-bottom:8px" role="list" aria-label="Pipeline stages">
+      ${stages.map((stage, i) => {
+        const items = stage.items || stage.deals || stage.opportunities || [];
+        return `
+          <div style="min-width:220px;flex:1" role="listitem" aria-label="${escapeHtml(stage.name || stage.title)} stage, ${items.length} items">
+            <div style="display:flex;justify-content:space-between;align-items:center;margin-bottom:8px;padding:8px 12px;background:#2b2d31;border-radius:8px 8px 0 0;border:1px solid #3a3c41;border-bottom:2px solid #ff6d5a">
+              <span style="font-weight:600;font-size:13px;color:#fff">${escapeHtml(stage.name || stage.title)}</span>
+              <span style="font-size:12px;color:#b0b2b8">${items.length}</span>
+            </div>
+            <div style="display:flex;flex-direction:column;gap:8px">
+              ${items.map((item, j) => `
+                <div class="card row-enter" style="padding:12px;animation-delay:${(i * 3 + j) * 50}ms">
+                  <div style="font-weight:500;font-size:13px;margin-bottom:4px">${escapeHtml(item.name || item.title)}</div>
+                  ${item.value || item.amount ? `<div style="font-size:14px;font-weight:600;color:#ff6d5a">${formatCurrency(item.value || item.amount)}</div>` : ''}
+                  ${item.contact || item.company ? `<div style="font-size:12px;color:#b0b2b8;margin-top:4px">${escapeHtml(item.contact || item.company)}</div>` : ''}
+                </div>
+              `).join('')}
+              ${items.length === 0 ? '<div style="text-align:center;padding:16px;color:#b0b2b8;font-size:12px">No items</div>' : ''}
+            </div>
+          </div>
+        `;
+      }).join('')}
+    </div>
+  `;
+}
+```
+
+**Pipeline empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">🔄</div>
+    <div class="empty-state-title">Pipeline empty</div>
+    <div class="empty-state-text">Ask to see your sales pipeline, deal stages, or project workflow.</div>
+  </div>
+</div>
+```
+
+### 6.7 Calendar
+
+**Use when:** Date-based data (appointments, events, schedules).
+
+**Expected data shape:** `{ title?, events|appointments: { title|name, date|start|startTime, description?, location?, attendee|contact?, status? }[] }`
+
+**Empty state:** "Ask to see upcoming appointments, scheduled events, or your calendar."
+
+```javascript
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  const events = Array.isArray(data) ? data : (data.events || data.appointments || []);
+  const today = new Date();
+
+  // Validate
+  if (events.length > 0) validateData(events[0], ['title']);
+
+  // Group events by date
+  const byDate = {};
+  events.forEach(evt => {
+    const dateStr = new Date(evt.date || evt.start || evt.startTime).toISOString().split('T')[0];
+    if (!byDate[dateStr]) byDate[dateStr] = [];
+    byDate[dateStr].push(evt);
+  });
+
+  const sortedDates = Object.keys(byDate).sort();
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(data.title || 'Calendar')}</div>
+        <div class="app-subtitle">${events.length} event${events.length !== 1 ? 's' : ''}</div>
+      </div>
+    </div>
+
+    <div role="list" aria-label="Calendar events grouped by date">
+      ${sortedDates.map(dateStr => {
+        const d = new Date(dateStr + 'T12:00:00');
+        const isToday = dateStr === today.toISOString().split('T')[0];
+        return `
+          <div style="margin-bottom:16px" role="listitem">
+            <div style="font-size:13px;font-weight:600;color:${isToday ? '#ff6d5a' : '#fff'};margin-bottom:8px;padding:4px 0;border-bottom:1px solid #3a3c41">
+              ${isToday ? '📍 Today — ' : ''}${d.toLocaleDateString('en-US', { weekday: 'long', month: 'long', day: 'numeric' })}
+            </div>
+            ${byDate[dateStr].map((evt, i) => `
+              <div class="card row-enter" style="margin-bottom:8px;padding:12px;display:flex;gap:12px;align-items:start;animation-delay:${i * 50}ms">
+                <div style="font-size:12px;color:#ff6d5a;font-weight:600;white-space:nowrap;min-width:55px">
+                  ${formatTime(evt.start || evt.startTime || evt.date)}
+                </div>
+                <div style="flex:1">
+                  <div style="font-weight:500">${escapeHtml(evt.title || evt.name || '—')}</div>
+                  ${evt.description || evt.location ? `<div style="font-size:12px;color:#b0b2b8;margin-top:2px">${escapeHtml(evt.description || evt.location || '')}</div>` : ''}
+                  ${evt.attendee || evt.contact ? `<div style="font-size:12px;color:#b0b2b8;margin-top:2px">👤 ${escapeHtml(evt.attendee || evt.contact)}</div>` : ''}
+                </div>
+                ${evt.status ? `<span class="badge ${getBadgeClass(evt.status)}"><span class="sr-only">Status: </span>${escapeHtml(evt.status)}</span>` : ''}
+              </div>
+            `).join('')}
+          </div>
+        `;
+      }).join('')}
+    </div>
+  `;
+}
+
+function formatTime(dateStr) {
+  if (!dateStr) return '';
+  try {
+    return new Date(dateStr).toLocaleTimeString('en-US', { hour: 'numeric', minute: '2-digit' });
+  } catch { return ''; }
+}
+```
+
+**Calendar empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">📅</div>
+    <div class="empty-state-title">No events scheduled</div>
+    <div class="empty-state-text">Ask to see upcoming appointments, scheduled events, or your calendar for a specific date range.</div>
+  </div>
+</div>
+```
+
+### 6.8 Analytics / Chart
+
+**Use when:** Data visualization, trends, comparisons. Pure CSS charts (no external libs).
+
+**Expected data shape:** `{ title?, subtitle|timeFrame?, metrics?: { [key]: number }, chart|series: { label|name, value|count }[], chartTitle? }`
+
+**Empty state:** "Ask for analytics, performance trends, or a breakdown of your data."
+
+```javascript
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  // Validate
+  validateData(data, ['chart']);
+
+  const chartData = data.chart || data.series || [];
+  const maxVal = Math.max(...chartData.map(d => d.value || d.count || 0), 1);
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(data.title || 'Analytics')}</div>
+        <div class="app-subtitle">${escapeHtml(data.subtitle || data.timeFrame || '')}</div>
+      </div>
+    </div>
+
+    ${data.metrics ? `
+      <div class="metrics-row" role="list" aria-label="Key metrics">
+        ${Object.entries(data.metrics).map(([key, val]) => `
+          <div class="metric-card" role="listitem">
+            <div class="metric-label">${escapeHtml(key.replace(/_/g, ' '))}</div>
+            <div class="metric-value" data-count="${typeof val === 'number' ? val : ''}">${formatNumber(val)}</div>
+          </div>
+        `).join('')}
+      </div>
+    ` : ''}
+
+    <div class="card">
+      <div style="font-size:14px;font-weight:600;color:#fff;margin-bottom:16px">${escapeHtml(data.chartTitle || 'Overview')}</div>
+      <div style="display:flex;align-items:flex-end;gap:4px;height:160px;padding:0 4px" role="img" aria-label="Bar chart showing ${escapeHtml(data.chartTitle || 'data')}">
+        ${chartData.map((d, i) => {
+          const pct = ((d.value || d.count || 0) / maxVal) * 100;
+          return `
+            <div style="flex:1;display:flex;flex-direction:column;align-items:center;gap:4px" class="row-enter" style="animation-delay:${i * 50}ms">
+              <div style="font-size:10px;color:#b0b2b8">${formatNumber(d.value || d.count)}</div>
+              <div style="width:100%;background:#ff6d5a;border-radius:4px 4px 0 0;height:${Math.max(pct, 2)}%;min-height:4px;transition:height 0.3s"></div>
+              <div style="font-size:10px;color:#b0b2b8;white-space:nowrap;overflow:hidden;text-overflow:ellipsis;max-width:100%;text-align:center">${escapeHtml(d.label || d.name || '')}</div>
+            </div>
+          `;
+        }).join('')}
+      </div>
+    </div>
+  `;
+
+  // Animate metric numbers
+  el.querySelectorAll('.metric-value[data-count]').forEach(el => {
+    const target = parseFloat(el.dataset.count);
+    if (!isNaN(target)) animateCount(el, target);
+  });
+}
+```
+
+**Analytics empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">📈</div>
+    <div class="empty-state-title">No analytics data</div>
+    <div class="empty-state-text">Ask for performance trends, a revenue breakdown, or a comparison report.</div>
+  </div>
+</div>
+```
+
+### 6.9 Interactive Data Grid
+
+**Use when:** Data tables that need client-side sorting, filtering, searching, copy-to-clipboard, expand/collapse, or bulk selection. Use this instead of the basic Data Grid (6.2) when users need to interact with the data beyond reading it.
+
+**Expected data shape:** `{ title?, data|items: object[], columns?: { key, label, sortable?, copyable? }[], meta?: { total } }`
+
+**Empty state:** "Try 'show me all contacts' or 'list invoices from this month.'"
+
+This template includes all 5 interactive patterns. Include only the patterns your app needs.
+
+```html
+<!-- Additional CSS for Interactive Data Grid (add to <style>) -->
+<style>
+  /* ═══ INTERACTIVE DATA GRID ═══ */
+  .grid-toolbar {
+    display: flex;
+    gap: 8px;
+    margin-bottom: 12px;
+    align-items: center;
+    flex-wrap: wrap;
+  }
+  .grid-search {
+    flex: 1;
+    min-width: 160px;
+    padding: 6px 12px;
+    background: #1e2024;
+    border: 1px solid #3a3c41;
+    border-radius: 6px;
+    color: #dcddde;
+    font-size: 13px;
+  }
+  .grid-search:focus { border-color: #ff6d5a; outline: none; }
+  .grid-search::placeholder { color: #b0b2b8; }
+
+  /* Sortable column headers */
+  .sortable {
+    cursor: pointer;
+    user-select: none;
+    position: relative;
+    padding-right: 20px !important;
+  }
+  .sortable:hover { color: #dcddde; }
+  .sortable::after {
+    content: '⇅';
+    position: absolute;
+    right: 4px;
+    opacity: 0.4;
+    font-size: 10px;
+  }
+  .sortable.asc::after { content: '↑'; opacity: 1; color: #ff6d5a; }
+  .sortable.desc::after { content: '↓'; opacity: 1; color: #ff6d5a; }
+
+  /* Bulk selection */
+  .bulk-bar {
+    display: flex;
+    align-items: center;
+    justify-content: space-between;
+    padding: 8px 12px;
+    background: rgba(255, 109, 90, 0.1);
+    border: 1px solid rgba(255, 109, 90, 0.3);
+    border-radius: 6px;
+    margin-bottom: 8px;
+    font-size: 13px;
+    color: #ff6d5a;
+  }
+  .bulk-bar button {
+    background: #ff6d5a;
+    color: #fff;
+    border: none;
+    padding: 4px 12px;
+    border-radius: 4px;
+    font-size: 12px;
+    cursor: pointer;
+    font-weight: 600;
+  }
+  .bulk-bar button:hover { background: #ff8574; }
+
+  /* Copyable cells */
+  .copyable {
+    cursor: pointer;
+    border-bottom: 1px dashed #3a3c41;
+    transition: color 0.15s;
+  }
+  .copyable:hover { color: #ff6d5a; }
+
+  /* Accordion / expand-collapse */
+  .expandable-row { cursor: pointer; }
+  .expandable-row:hover td { background: #35373c; }
+  .expand-icon { display: inline-block; transition: transform 0.15s; margin-right: 4px; font-size: 10px; }
+  .expand-icon.open { transform: rotate(90deg); }
+  .detail-row { display: none; }
+  .detail-row.open { display: table-row; }
+  .detail-row td {
+    background: #232529;
+    padding: 12px 16px !important;
+    border-bottom: 1px solid #3a3c41;
+  }
+
+  /* Grid checkbox */
+  .grid-check {
+    appearance: none;
+    width: 16px;
+    height: 16px;
+    border: 2px solid #3a3c41;
+    border-radius: 3px;
+    background: #1e2024;
+    cursor: pointer;
+    vertical-align: middle;
+  }
+  .grid-check:checked {
+    background: #ff6d5a;
+    border-color: #ff6d5a;
+    background-image: url("data:image/svg+xml,%3Csvg viewBox='0 0 16 16' fill='white' xmlns='http://www.w3.org/2000/svg'%3E%3Cpath d='M12.207 4.793a1 1 0 010 1.414l-5 5a1 1 0 01-1.414 0l-2-2a1 1 0 011.414-1.414L6.5 9.086l4.293-4.293a1 1 0 011.414 0z'/%3E%3C/svg%3E");
+  }
+  .grid-check:focus-visible { outline: 2px solid #ff6d5a; outline-offset: 2px; }
+</style>
+```
+
+```javascript
+// ═══ Interactive Data Grid — Full Implementation ═══
+
+let gridState = {
+  items: [],
+  filteredItems: [],
+  sortCol: null,
+  sortDir: 'asc',
+  searchQuery: '',
+  selectedIds: new Set(),
+  expandedIds: new Set()
+};
+
+function render(data) {
+  showState('data');
+  const el = document.getElementById('content');
+
+  // Parse items from various data shapes
+  const rawItems = Array.isArray(data) ? data : (data.data || data.items || data.contacts || data.results || []);
+  gridState.items = rawItems.map((item, i) => ({ ...item, _idx: i, _id: item.id || item._id || `row-${i}` }));
+  gridState.filteredItems = [...gridState.items];
+
+  // Auto-detect columns (or use provided columns config)
+  const columnConfig = data.columns || (rawItems.length > 0
+    ? Object.keys(rawItems[0])
+        .filter(k => !['id', '_id', '__v', '_idx'].includes(k))
+        .slice(0, 6)
+        .map(k => ({ key: k, label: k.replace(/_/g, ' '), sortable: true, copyable: k === 'email' || k === 'id' }))
+    : []);
+
+  const total = data.meta?.total || data.total || rawItems.length;
+
+  el.innerHTML = `
+    <div class="app-header">
+      <div>
+        <div class="app-title">${escapeHtml(data.title || 'Data Explorer')}</div>
+        <div class="app-subtitle"><span id="grid-count">${total}</span> record${total !== 1 ? 's' : ''}</div>
+      </div>
+      <button class="btn-secondary" onclick="sendToHost('refresh', {})" aria-label="Refresh data" tabindex="0">↻ Refresh</button>
+    </div>
+
+    <!-- Toolbar: Search -->
+    <div class="grid-toolbar">
+      <input type="text" class="grid-search" placeholder="Search records…" id="grid-search"
+        oninput="handleSearch(this.value)" aria-label="Search records" tabindex="0">
+    </div>
+
+    <!-- Bulk action bar (hidden until selection) -->
+    <div id="bulk-bar" class="bulk-bar" style="display:none" role="status">
+      <span><span id="bulk-count">0</span> selected</span>
+      <div style="display:flex;gap:8px">
+        <button onclick="handleBulkAction('export')" tabindex="0">Export</button>
+        <button onclick="clearSelection()" style="background:transparent;color:#b0b2b8;border:1px solid #3a3c41" tabindex="0">Clear</button>
+      </div>
+    </div>
+
+    <!-- Data table -->
+    <div class="card" style="overflow-x:auto">
+      <table class="data-table" role="table" aria-label="${escapeHtml(data.title || 'Interactive data grid')}">
+        <thead>
+          <tr>
+            <th style="width:32px"><input type="checkbox" class="grid-check" id="select-all" onchange="toggleSelectAll(this.checked)" aria-label="Select all rows" tabindex="0"></th>
+            ${columnConfig.map(col => `
+              <th scope="col" class="${col.sortable !== false ? 'sortable' : ''}"
+                ${col.sortable !== false ? `onclick="handleSort('${col.key}')" tabindex="0" role="button" aria-label="Sort by ${escapeHtml(col.label)}"` : ''}
+                id="col-${col.key}">
+                ${escapeHtml(col.label)}
+              </th>
+            `).join('')}
+            <th style="width:32px" scope="col"><span class="sr-only">Expand</span></th>
+          </tr>
+        </thead>
+        <tbody id="grid-body">
+        </tbody>
+      </table>
+    </div>
+  `;
+
+  // Store column config for re-renders
+  gridState.columns = columnConfig;
+  renderRows();
+}
+
+function renderRows() {
+  const tbody = document.getElementById('grid-body');
+  if (!tbody) return;
+
+  const items = gridState.filteredItems;
+  const cols = gridState.columns;
+
+  tbody.innerHTML = items.map((item, i) => {
+    const isSelected = gridState.selectedIds.has(item._id);
+    const isExpanded = gridState.expandedIds.has(item._id);
+
+    return `
+      <tr class="expandable-row row-enter" style="animation-delay:${i * 30}ms" data-id="${escapeHtml(String(item._id))}">
+        <td><input type="checkbox" class="grid-check" ${isSelected ? 'checked' : ''} onchange="toggleSelect('${escapeHtml(String(item._id))}', this.checked)" aria-label="Select row ${i + 1}" tabindex="0"></td>
+        ${cols.map(col => {
+          const val = item[col.key];
+          let cellContent;
+
+          if (col.key === 'status' || col.key === 'state') {
+            cellContent = `<span class="badge ${getBadgeClass(val)}"><span class="sr-only">Status: </span>${escapeHtml(String(val || '—'))}</span>`;
+          } else if (col.copyable) {
+            cellContent = `<span class="copyable" onclick="event.stopPropagation();copyToClipboard('${escapeHtml(String(val || ''))}', this)" title="Click to copy" tabindex="0" role="button" aria-label="Copy ${escapeHtml(col.label)}: ${escapeHtml(String(val || ''))}">${escapeHtml(String(val ?? '—'))}</span>`;
+          } else if (typeof val === 'number' && (col.key.includes('amount') || col.key.includes('revenue') || col.key.includes('price'))) {
+            cellContent = formatCurrency(val);
+          } else if (typeof val === 'string' && val.match(/^\d{4}-\d{2}-\d{2}/)) {
+            cellContent = formatDate(val);
+          } else {
+            cellContent = escapeHtml(String(val ?? '—'));
+          }
+
+          return `<td>${cellContent}</td>`;
+        }).join('')}
+        <td>
+          <span class="expand-icon ${isExpanded ? 'open' : ''}" onclick="toggleExpand('${escapeHtml(String(item._id))}')" tabindex="0" role="button" aria-label="${isExpanded ? 'Collapse' : 'Expand'} row details" aria-expanded="${isExpanded}">▶</span>
+        </td>
+      </tr>
+      <tr class="detail-row ${isExpanded ? 'open' : ''}" id="detail-${escapeHtml(String(item._id))}">
+        <td colspan="${cols.length + 2}">
+          <div style="display:grid;grid-template-columns:repeat(auto-fill,minmax(200px,1fr));gap:8px">
+            ${Object.entries(item).filter(([k]) => !k.startsWith('_')).map(([k, v]) => `
+              <div>
+                <span style="color:#b0b2b8;font-size:11px;text-transform:capitalize">${escapeHtml(k.replace(/_/g, ' '))}</span><br>
+                <span style="font-size:13px">${escapeHtml(String(v ?? '—'))}</span>
+              </div>
+            `).join('')}
+          </div>
+        </td>
+      </tr>
+    `;
+  }).join('');
+
+  // Update count
+  const countEl = document.getElementById('grid-count');
+  if (countEl) countEl.textContent = items.length;
+}
+
+// ── Apply Sort (without toggling direction) ──
+// Extracted so handleSearch can re-apply the current sort without side effects
+function applySort() {
+  const colKey = gridState.sortCol;
+  if (!colKey) return;
+  gridState.filteredItems.sort((a, b) => {
+    let aVal = a[colKey], bVal = b[colKey];
+    if (aVal == null) return 1;
+    if (bVal == null) return -1;
+    if (typeof aVal === 'number' && typeof bVal === 'number') {
+      return gridState.sortDir === 'asc' ? aVal - bVal : bVal - aVal;
+    }
+    aVal = String(aVal).toLowerCase();
+    bVal = String(bVal).toLowerCase();
+    const cmp = aVal.localeCompare(bVal);
+    return gridState.sortDir === 'asc' ? cmp : -cmp;
+  });
+}
+
+// ── Sorting (user clicks column header) ──
+function handleSort(colKey) {
+  if (gridState.sortCol === colKey) {
+    gridState.sortDir = gridState.sortDir === 'asc' ? 'desc' : 'asc';
+  } else {
+    gridState.sortCol = colKey;
+    gridState.sortDir = 'asc';
+  }
+
+  // Update header classes
+  document.querySelectorAll('.sortable').forEach(th => th.classList.remove('asc', 'desc'));
+  const activeHeader = document.getElementById(`col-${colKey}`);
+  if (activeHeader) activeHeader.classList.add(gridState.sortDir);
+
+  applySort();
+  renderRows();
+}
+
+// ── Filtering / Search ──
+function handleSearch(query) {
+  gridState.searchQuery = query.toLowerCase().trim();
+  if (!gridState.searchQuery) {
+    gridState.filteredItems = [...gridState.items];
+  } else {
+    gridState.filteredItems = gridState.items.filter(item =>
+      Object.values(item).some(v =>
+        v != null && String(v).toLowerCase().includes(gridState.searchQuery)
+      )
+    );
+  }
+  // Re-apply current sort without toggling direction
+  if (gridState.sortCol) {
+    applySort();
+  }
+  renderRows();
+}
+
+// ── Bulk Selection ──
+function toggleSelect(id, checked) {
+  if (checked) {
+    gridState.selectedIds.add(id);
+  } else {
+    gridState.selectedIds.delete(id);
+  }
+  updateBulkBar();
+}
+
+function toggleSelectAll(checked) {
+  if (checked) {
+    gridState.filteredItems.forEach(item => gridState.selectedIds.add(item._id));
+  } else {
+    gridState.selectedIds.clear();
+  }
+  // Update all checkboxes
+  document.querySelectorAll('#grid-body .grid-check').forEach(cb => cb.checked = checked);
+  updateBulkBar();
+}
+
+function clearSelection() {
+  gridState.selectedIds.clear();
+  document.querySelectorAll('.grid-check').forEach(cb => cb.checked = false);
+  updateBulkBar();
+}
+
+function updateBulkBar() {
+  const bar = document.getElementById('bulk-bar');
+  const count = gridState.selectedIds.size;
+  if (bar) {
+    bar.style.display = count > 0 ? 'flex' : 'none';
+    document.getElementById('bulk-count').textContent = count;
+  }
+}
+
+function handleBulkAction(action) {
+  const selectedItems = gridState.items.filter(item => gridState.selectedIds.has(item._id));
+  sendToHost('tool_call', { action, items: selectedItems.map(i => ({ ...i, _idx: undefined, _id: undefined })) });
+}
+
+// ── Expand/Collapse ──
+function toggleExpand(id) {
+  if (gridState.expandedIds.has(id)) {
+    gridState.expandedIds.delete(id);
+  } else {
+    gridState.expandedIds.add(id);
+  }
+  const detailRow = document.getElementById(`detail-${id}`);
+  const icon = document.querySelector(`tr[data-id="${id}"] .expand-icon`);
+  if (detailRow) detailRow.classList.toggle('open');
+  if (icon) {
+    icon.classList.toggle('open');
+    icon.setAttribute('aria-expanded', gridState.expandedIds.has(id));
+  }
+}
+```
+
+> **Performance Note (100+ rows):** For datasets over 100 rows, the full DOM render becomes slow. Two mitigation strategies:
+> 1. **Client-side pagination:** Render 50 rows at a time with prev/next controls. All data is already loaded — just slice the array.
+> 2. **Virtual scrolling:** Only render visible rows + a buffer zone (±10 rows). Recalculate on scroll. More complex but handles 10K+ rows.
+>
+> For most MCP apps, client-side pagination is sufficient. The tool's `meta.pageSize` already limits server-side results to 25-50 rows.
+
+**Interactive Data Grid empty state customization:**
+```html
+<div id="empty" style="display:none">
+  <div class="empty-state">
+    <div class="empty-state-icon">🔎</div>
+    <div class="empty-state-title">Ready to explore</div>
+    <div class="empty-state-text">Try "show me all contacts" or "list invoices from this month" to load data you can sort, filter, and explore.</div>
+  </div>
+</div>
+```
+
+---
+
+## 7. Bidirectional Communication Patterns
+
+Apps can send actions back to the LocalBosses host using `sendToHost()`. The host listens for `mcp_app_action` messages on the iframe's parent window.
+
+### Pattern 1: Request Data Refresh
+
+```javascript
+// User clicks a "Refresh" button in the app
+document.getElementById('refreshBtn').addEventListener('click', () => {
+  sendToHost('refresh', {});
+  showState('loading'); // Show loading while refresh happens
+});
+```
+
+### Pattern 2: Navigate to Another App (Drill-Down)
+
+```javascript
+// User clicks a contact name → open their detail card
+function openContact(contactId, contactName) {
+  sendToHost('navigate', {
+    app: 'contact-card',
+    params: { id: contactId, name: contactName }
+  });
+}
+
+// In a table row:
+// <td><a href="#" onclick="openContact('${item.id}', '${escapeHtml(item.name)}')" tabindex="0">${escapeHtml(item.name)}</a></td>
+```
+
+> **App-to-App Navigation (Drill-Down):** The `sendToHost('navigate', ...)` pattern enables interconnected apps. Example flows:
+> - **Data Grid → Detail Card:** Click a contact name in the grid → host opens the contact-card app with that contact's data
+> - **Dashboard → Data Grid:** Click a metric card → host opens the grid filtered to that metric
+> - **Detail Card → Form:** Click "Edit" → host opens the form pre-filled with the entity's data
+>
+> The host must listen for `mcp_app_action` messages with `action: 'navigate'` and handle the app switch (see `mcp-localbosses-integrator` Phase 4 for host-side wiring).
+
+### Pattern 3: Trigger a Tool Call
+
+```javascript
+// User clicks "Delete" on a row
+function deleteItem(itemId) {
+  if (confirm('Are you sure you want to delete this item?')) {
+    sendToHost('tool_call', {
+      tool: 'delete_contact',
+      args: { id: itemId }
+    });
+  }
+}
+```
+
+---
+
+## 8. Responsive Design Requirements
+
+Apps must work from **280px to 800px width**.
+
+### Breakpoints:
+
+| Width | Behavior |
+|-------|----------|
+| 280-399px | Single column. Compact padding. Smaller fonts. Horizontal scroll for tables. |
+| 400-599px | Two columns for metrics. Standard padding. |
+| 600-800px | Full layout. Three+ metric columns. Tables without scroll. |
+
+### Required CSS:
+```css
+@media (max-width: 400px) {
+  body { padding: 12px; }
+  .metrics-row { grid-template-columns: repeat(2, 1fr); gap: 8px; }
+  .app-title { font-size: 16px; }
+  .data-table { font-size: 12px; }
+}
+@media (max-width: 300px) {
+  .metrics-row { grid-template-columns: 1fr; }
+  body { padding: 8px; }
+}
+```
+
+### Key rules:
+- Use `grid-template-columns: repeat(auto-fit, minmax(Xpx, 1fr))` for adaptive grids
+- Tables get `overflow-x: auto` on the container
+- Pipeline columns scroll horizontally on narrow screens
+- All text uses `word-break: break-word` or `text-overflow: ellipsis`
+
+---
+
+## 9. Three Required States
+
+Every app MUST implement all three:
+
+### 1. Loading State (visible on page load)
+- Use CSS skeleton animations (shimmer effect)
+- Match the layout of the data state (skeletons should look like the content)
+- Default state — visible when page first loads
+- Must include `role="status"` and `aria-label="Loading content"` for screen readers
+- Must include `<span class="sr-only">Loading content, please wait…</span>`
+- Skeleton animation respects `prefers-reduced-motion` (degrades to static background)
+
+### 2. Empty State (when data is null or empty)
+- Center-aligned with large icon, title, and description
+- **Context-specific prompt per app type** (NOT generic "Ask me a question"):
+  - Dashboard: "Ask me for a performance overview, KPIs, or a metrics summary."
+  - Data Grid: "Try 'show me all active contacts' or 'list recent invoices.'"
+  - Detail Card: "Ask about a specific record by name or ID to see its details."
+  - Form: "Tell me what you'd like to create and I'll set up the form."
+  - Timeline: "Ask to see recent activity, event history, or an audit trail."
+  - Pipeline: "Ask to see your sales pipeline or a specific deal stage."
+  - Calendar: "Ask to see upcoming appointments or your calendar for a date range."
+  - Analytics: "Ask for analytics, performance trends, or a data breakdown."
+  - Interactive Grid: "Try 'show me all contacts' to load data you can sort and explore."
+- Friendly, not error-like
+
+### 3. Data State (when data is received)
+- Full app rendering with `aria-live="polite"` on the content container
+- Handle missing/null fields gracefully (show "—" not "undefined")
+- Handle unexpected data shapes (arrays where objects expected, etc.)
+- Validate data shape with `validateData()` before rendering
+- Apply staggered row entrance animations where appropriate
+- Focus moves to content container when data loads
+
+---
+
+## 10. Rules & Constraints
+
+### MUST:
+- [x] Single HTML file — all CSS/JS inline
+- [x] Zero external dependencies — no CDN links, no fetch to external URLs
+- [x] Dark theme matching LocalBosses palette
+- [x] All three states (loading, empty, data)
+- [x] Both data reception methods (postMessage + polling with exponential backoff)
+- [x] HTML escaping on all user data (`escapeHtml()`)
+- [x] Responsive from 280px to 800px
+- [x] Graceful with missing fields (never show "undefined")
+- [x] Error boundary — `window.onerror` handler, try/catch in render
+- [x] WCAG AA contrast — secondary text `#b0b2b8` (5.0:1), never `#96989d`
+- [x] Accessibility — ARIA attributes, keyboard navigation, focus management
+- [x] Data validation — `validateData()` before rendering
+- [x] Context-specific empty state prompts per app type
+- [x] `prefers-reduced-motion` respected for all animations
+- [x] File size under 50KB per app (ideally under 30KB) — budget enforced during QA
+
+### MUST NOT:
+- [ ] No external CSS/JS files
+- [ ] No CDN links (Chart.js, D3, etc.)
+- [ ] No `<iframe>` inception
+- [ ] No localStorage/sessionStorage (data comes from host)
+- [ ] No hardcoded API calls (data comes via postMessage/polling)
+- [ ] No light theme elements
+- [ ] No use of `#96989d` for text (fails WCAG AA)
+
+---
+
+## 11. Quality Gate Checklist
+
+Before passing apps to Phase 4, verify:
+
+- [ ] **Every app renders with sample data** — no blank screens
+- [ ] **Every app has loading skeleton** — visible on first load, with `role="status"` and sr-only text
+- [ ] **Every app has empty state** — context-specific prompt matching its app type
+- [ ] **Dark theme is consistent** — #1a1d23 bg, #2b2d31 cards, #ff6d5a accent
+- [ ] **WCAG AA contrast** — all secondary text uses `#b0b2b8`, NOT `#96989d`
+- [ ] **Works at 280px width** — no broken layouts, all content accessible
+- [ ] **Works at 800px width** — no excessive whitespace, uses available space
+- [ ] **No external dependencies** — zero CDN links, zero fetch to external URLs
+- [ ] **HTML is escaped** — no XSS from user data
+- [ ] **Handles missing fields** — shows "—" not "undefined" or "null"
+- [ ] **Error boundary present** — `window.onerror` handler catches render failures
+- [ ] **Accessibility basics** — ARIA roles/labels on tables, lists, interactive elements
+- [ ] **Keyboard navigable** — all interactive elements focusable with visible focus indicator
+- [ ] **Reduced motion respected** — `prefers-reduced-motion` disables animations
+- [ ] **Polling uses exponential backoff** — 3s → 5s → 10s → 30s, max 20 attempts
+- [ ] **Data validation** — `validateData()` called before rendering
+- [ ] **File size is reasonable** — single HTML under 50KB (ideally under 30KB)
+
+---
+
+## 12. Execution Workflow
+
+```
+1. Read {service}-api-analysis.md — App Candidates section
+2. For each app candidate:
+   a. Choose app type (dashboard/grid/card/form/timeline/funnel/calendar/analytics/interactive-grid)
+   b. Copy the base HTML template
+   c. Customize the render() function using the type-specific template
+   d. Set correct APP_ID for polling
+   e. Customize loading skeleton to match content layout
+   f. Customize empty state with context-specific icon and message for this app type
+   g. Add ARIA attributes (role, aria-label) to dynamic content regions
+   h. Verify error boundary is present (window.onerror)
+   i. Verify polling uses exponential backoff pattern
+   j. Add data validation with validateData() for expected fields
+   k. Test with sample data mentally (does the render handle edge cases?)
+3. Save all files to {service}-mcp/app-ui/
+4. Verify all apps against quality gate
+```
+
+**Estimated time:** 15-30 minutes per app, 1-3 hours for a full set.
+
+**Agent model recommendation:** Sonnet — well-defined templates, HTML/CSS generation.
+
+---
+
+*This skill is Phase 3 of the MCP Factory pipeline. It produces the visual HTML apps that render inside LocalBosses threads.*
diff --git a/skills/mcp-apps-integration/SKILL.md b/skills/mcp-apps-integration/SKILL.md
new file mode 100644
index 0000000..f35c45f
--- /dev/null
+++ b/skills/mcp-apps-integration/SKILL.md
@@ -0,0 +1,772 @@
+# MCP Apps Integration — Building Servers with Rich UI
+
+**When to use this skill:** Adding rich UI components (structuredContent) to MCP servers. Use when tool results benefit from visual presentation beyond plain text/JSON.
+
+**What this covers:** Integrating MCP Apps with server tools, based on 11 production GHL apps (Contact Grid, Pipeline Board, Calendar View, Invoice Preview, etc.).
+
+---
+
+## 1. What Are MCP Apps?
+
+**MCP Apps = Tools that return `structuredContent`** (HTML-based UI components that render in Claude Desktop)
+
+**Use cases:**
+- **Data grids:** Contact lists, search results
+- **Dashboards:** Stats, metrics, KPIs
+- **Cards:** Opportunity cards, invoice previews
+- **Timelines:** Activity feeds, history
+- **Forms:** Quick actions embedded in UI
+- **Visualizations:** Charts, graphs, calendars
+
+**When to use apps vs regular tools:**
+- ✅ Use apps: Visual data (grids, cards, timelines)
+- ❌ Skip apps: Simple CRUD operations, plain JSON responses
+
+---
+
+## 2. Architecture Pattern
+
+### Server + Apps Integration
+```
+mcp-server-myservice/
+├── src/
+│   ├── index.ts              # Main server (or server.ts)
+│   ├── clients/
+│   │   └── api-client.ts     # API client
+│   ├── apps/
+│   │   └── index.ts          # Apps manager + tool definitions
+│   ├── ui/
+│   │   ├── contact-grid.html
+│   │   ├── dashboard.html
+│   │   └── ...
+│   └── types/
+│       └── index.ts          # Shared TypeScript types
+├── dist/
+│   ├── index.js              # Compiled server
+│   ├── apps/
+│   ├── app-ui/               # Compiled HTML files (copied during build)
+│   └── ...
+├── package.json
+├── tsconfig.json
+└── README.md
+```
+
+**Key points:**
+- Apps manager lives in `src/apps/index.ts`
+- HTML UI files live in `src/ui/` or `app-ui/`
+- Compiled UI files must be accessible at runtime (copy during build)
+
+---
+
+## 3. Apps Manager Pattern
+
+### Basic MCPAppsManager Class
+
+```typescript
+import { Tool } from '@modelcontextprotocol/sdk/types.js';
+import { MyAPIClient } from '../clients/api-client.js';
+import * as fs from 'fs';
+import * as path from 'path';
+import { fileURLToPath } from 'url';
+
+export interface AppToolResult {
+  content: Array<{ type: 'text'; text: string }>;
+  structuredContent?: Record<string, unknown>;
+}
+
+export interface AppResourceHandler {
+  uri: string;
+  mimeType: string;
+  getContent: () => string;
+}
+
+// ESM __dirname equivalent
+const __filename = fileURLToPath(import.meta.url);
+const __dirname = path.dirname(__filename);
+
+function getUIBuildPath(): string {
+  // When compiled, this file is at dist/apps/index.js
+  // UI files are at dist/app-ui/
+  const fromDist = path.resolve(__dirname, '..', 'app-ui');
+  if (fs.existsSync(fromDist)) {
+    return fromDist;
+  }
+  // Fallback
+  return fromDist;
+}
+
+export class MCPAppsManager {
+  private apiClient: MyAPIClient;
+  private resourceHandlers: Map<string, AppResourceHandler> = new Map();
+  private uiBuildPath: string;
+
+  constructor(apiClient: MyAPIClient) {
+    this.apiClient = apiClient;
+    this.uiBuildPath = getUIBuildPath();
+    this.registerResourceHandlers();
+  }
+
+  /**
+   * Register all UI resource handlers
+   */
+  private registerResourceHandlers(): void {
+    const resources: Array<{ uri: string; file: string }> = [
+      { uri: 'ui://myservice/contact-grid', file: 'contact-grid.html' },
+      { uri: 'ui://myservice/dashboard', file: 'dashboard.html' },
+    ];
+
+    for (const resource of resources) {
+      this.resourceHandlers.set(resource.uri, {
+        uri: resource.uri,
+        mimeType: 'text/html;profile=mcp-app',
+        getContent: () => this.loadUIResource(resource.file),
+      });
+    }
+  }
+
+  /**
+   * Load UI resource from build directory
+   */
+  private loadUIResource(filename: string): string {
+    const filePath = path.join(this.uiBuildPath, filename);
+    try {
+      return fs.readFileSync(filePath, 'utf-8');
+    } catch (error) {
+      console.error(`UI resource not found: ${filePath}`);
+      return this.getFallbackHTML(filename);
+    }
+  }
+
+  /**
+   * Generate fallback HTML when UI resource is not built
+   */
+  private getFallbackHTML(filename: string): string {
+    const componentName = filename.replace('.html', '');
+    return `
+<!DOCTYPE html>
+<html>
+<head>
+  <meta charset="UTF-8">
+  <title>${componentName}</title>
+</head>
+<body>
+  <div style="text-align: center; padding: 20px; color: #666;">
+    <p>UI component "${componentName}" is loading...</p>
+  </div>
+</body>
+</html>
+    `.trim();
+  }
+
+  /**
+   * Get tool definitions for all app tools
+   */
+  getToolDefinitions(): Tool[] {
+    return [
+      {
+        name: 'view_contact_grid',
+        description: 'Display contact search results in a data grid. Returns a visual UI component.',
+        inputSchema: {
+          type: 'object',
+          properties: {
+            query: { type: 'string', description: 'Search query' },
+            limit: { type: 'number', description: 'Max results (default: 25)' },
+          },
+        },
+      },
+      // ... more app tools
+    ];
+  }
+
+  /**
+   * Get resource handlers (for server registration)
+   */
+  getResourceHandlers(): Map<string, AppResourceHandler> {
+    return this.resourceHandlers;
+  }
+
+  /**
+   * Handle app tool calls
+   */
+  async handleAppTool(name: string, args: Record<string, unknown>): Promise<AppToolResult> {
+    switch (name) {
+      case 'view_contact_grid':
+        return this.viewContactGrid(args);
+      
+      default:
+        throw new Error(`Unknown app tool: ${name}`);
+    }
+  }
+
+  /**
+   * Example: Contact Grid App
+   */
+  private async viewContactGrid(args: Record<string, unknown>): Promise<AppToolResult> {
+    const { query = '', limit = 25 } = args;
+
+    // Call API to get data
+    const contacts = await this.apiClient.searchContacts({ query, limit: Number(limit) });
+
+    // Return structuredContent pointing to UI resource
+    return {
+      content: [{ type: 'text', text: `Found ${contacts.length} contacts` }],
+      structuredContent: {
+        type: 'ui',
+        uri: 'ui://myservice/contact-grid',
+        data: {
+          contacts,
+          query,
+          timestamp: new Date().toISOString(),
+        },
+      },
+    };
+  }
+}
+```
+
+---
+
+## 4. Server Integration
+
+### In `src/index.ts` or `src/server.ts`
+
+```typescript
+import { Server } from "@modelcontextprotocol/sdk/server/index.js";
+import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
+import {
+  CallToolRequestSchema,
+  ListToolsRequestSchema,
+  ListResourcesRequestSchema,
+  ReadResourceRequestSchema,
+} from "@modelcontextprotocol/sdk/types.js";
+import { MyAPIClient } from './clients/api-client.js';
+import { MCPAppsManager } from './apps/index.js';
+
+async function main() {
+  // Initialize API client
+  const apiClient = new MyAPIClient(process.env.API_KEY!);
+
+  // Initialize apps manager
+  const appsManager = new MCPAppsManager(apiClient);
+
+  // Create MCP server
+  const server = new Server(
+    { name: 'myservice-mcp', version: '1.0.0' },
+    { capabilities: { tools: {}, resources: {} } } // ✅ Enable resources
+  );
+
+  // List tools (regular tools + app tools)
+  server.setRequestHandler(ListToolsRequestSchema, async () => {
+    const regularTools = [
+      // ... your regular tools
+    ];
+    const appTools = appsManager.getToolDefinitions();
+    
+    return {
+      tools: [...regularTools, ...appTools],
+    };
+  });
+
+  // Handle tool calls
+  server.setRequestHandler(CallToolRequestSchema, async (request) => {
+    const { name, arguments: args } = request.params;
+
+    try {
+      // Check if it's an app tool
+      const appTools = appsManager.getToolDefinitions().map(t => t.name);
+      if (appTools.includes(name)) {
+        return await appsManager.handleAppTool(name, args || {});
+      }
+
+      // Handle regular tools
+      const result = await handleRegularTool(apiClient, name, args || {});
+      return {
+        content: [{ type: 'text', text: JSON.stringify(result, null, 2) }],
+      };
+    } catch (error) {
+      const message = error instanceof Error ? error.message : String(error);
+      return {
+        content: [{ type: 'text', text: `Error: ${message}` }],
+        isError: true,
+      };
+    }
+  });
+
+  // List resources (UI files)
+  server.setRequestHandler(ListResourcesRequestSchema, async () => {
+    const handlers = appsManager.getResourceHandlers();
+    const resources = Array.from(handlers.values()).map(h => ({
+      uri: h.uri,
+      mimeType: h.mimeType,
+      name: h.uri.split('/').pop() || h.uri,
+    }));
+    return { resources };
+  });
+
+  // Read resources (serve UI HTML)
+  server.setRequestHandler(ReadResourceRequestSchema, async (request) => {
+    const { uri } = request.params;
+    const handler = appsManager.getResourceHandlers().get(uri);
+
+    if (!handler) {
+      throw new Error(`Resource not found: ${uri}`);
+    }
+
+    return {
+      contents: [{
+        uri,
+        mimeType: handler.mimeType,
+        text: handler.getContent(),
+      }],
+    };
+  });
+
+  // Start server
+  const transport = new StdioServerTransport();
+  await server.connect(transport);
+  console.error('MyService MCP server with apps running on stdio');
+}
+
+main().catch(console.error);
+```
+
+**Key additions for apps:**
+1. `capabilities: { tools: {}, resources: {} }` — Enable resources
+2. `ListResourcesRequestSchema` handler — List UI files
+3. `ReadResourceRequestSchema` handler — Serve UI HTML
+4. Check if tool is an app tool before routing
+
+---
+
+## 5. HTML UI Component Template
+
+### Example: Contact Grid (`src/ui/contact-grid.html`)
+
+```html
+<!DOCTYPE html>
+<html>
+<head>
+  <meta charset="UTF-8">
+  <meta name="viewport" content="width=device-width, initial-scale=1.0">
+  <title>Contact Grid</title>
+  <style>
+    * {
+      margin: 0;
+      padding: 0;
+      box-sizing: border-box;
+    }
+    body {
+      font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, sans-serif;
+      padding: 20px;
+      background: #f5f5f5;
+    }
+    .grid-container {
+      background: white;
+      border-radius: 8px;
+      padding: 20px;
+      box-shadow: 0 2px 4px rgba(0,0,0,0.1);
+    }
+    .grid-header {
+      display: flex;
+      justify-content: space-between;
+      align-items: center;
+      margin-bottom: 20px;
+      padding-bottom: 15px;
+      border-bottom: 2px solid #e0e0e0;
+    }
+    .grid-title {
+      font-size: 20px;
+      font-weight: 600;
+      color: #333;
+    }
+    .grid-count {
+      font-size: 14px;
+      color: #666;
+    }
+    .contacts-table {
+      width: 100%;
+      border-collapse: collapse;
+    }
+    .contacts-table th {
+      text-align: left;
+      padding: 12px;
+      background: #f8f9fa;
+      color: #555;
+      font-weight: 600;
+      font-size: 13px;
+      text-transform: uppercase;
+      letter-spacing: 0.5px;
+    }
+    .contacts-table td {
+      padding: 12px;
+      border-bottom: 1px solid #e0e0e0;
+    }
+    .contacts-table tr:hover {
+      background: #f8f9fa;
+    }
+    .contact-name {
+      font-weight: 600;
+      color: #2563eb;
+    }
+    .contact-email {
+      color: #666;
+      font-size: 14px;
+    }
+    .contact-status {
+      display: inline-block;
+      padding: 4px 12px;
+      border-radius: 12px;
+      font-size: 12px;
+      font-weight: 600;
+    }
+    .status-active {
+      background: #d1fae5;
+      color: #065f46;
+    }
+    .status-inactive {
+      background: #fee2e2;
+      color: #991b1b;
+    }
+  </style>
+</head>
+<body>
+  <div class="grid-container">
+    <div class="grid-header">
+      <div class="grid-title">Contacts</div>
+      <div class="grid-count" id="count"></div>
+    </div>
+    <table class="contacts-table">
+      <thead>
+        <tr>
+          <th>Name</th>
+          <th>Email</th>
+          <th>Phone</th>
+          <th>Status</th>
+        </tr>
+      </thead>
+      <tbody id="contacts-tbody">
+        <!-- Populated by JavaScript -->
+      </tbody>
+    </table>
+  </div>
+
+  <script>
+    // Listen for data from MCP
+    window.addEventListener('message', (event) => {
+      if (event.data?.type === 'mcp-app-init') {
+        const data = event.data.data;
+        renderContacts(data);
+      }
+    });
+
+    function renderContacts(data) {
+      const { contacts, query } = data;
+      
+      // Update count
+      document.getElementById('count').textContent = 
+        `${contacts.length} result${contacts.length !== 1 ? 's' : ''}`;
+
+      // Render table rows
+      const tbody = document.getElementById('contacts-tbody');
+      tbody.innerHTML = contacts.map(contact => `
+        <tr>
+          <td class="contact-name">${escapeHtml(contact.name)}</td>
+          <td class="contact-email">${escapeHtml(contact.email || 'N/A')}</td>
+          <td>${escapeHtml(contact.phone || 'N/A')}</td>
+          <td>
+            <span class="contact-status status-${contact.status || 'active'}">
+              ${escapeHtml(contact.status || 'Active')}
+            </span>
+          </td>
+        </tr>
+      `).join('');
+    }
+
+    function escapeHtml(text) {
+      const div = document.createElement('div');
+      div.textContent = text;
+      return div.innerHTML;
+    }
+  </script>
+</body>
+</html>
+```
+
+**Key patterns:**
+- Self-contained (all CSS/JS inline)
+- `window.addEventListener('message', ...)` to receive data
+- `event.data.type === 'mcp-app-init'` to detect init
+- `event.data.data` contains the structuredContent.data object
+- Escape HTML to prevent XSS
+- Clean, modern styling
+
+---
+
+## 6. Common UI Patterns
+
+### 1. Data Grid (List View)
+**Use for:** Contact lists, search results, transaction history
+**Components:** Table, sorting, pagination indicators
+**Example apps:** Contact Grid, Pipeline Board
+
+### 2. Card View (Detail View)
+**Use for:** Single item details, opportunity cards, invoices
+**Components:** Card container, labeled fields, actions
+**Example apps:** Opportunity Card, Invoice Preview
+
+### 3. Dashboard (Stats/Metrics)
+**Use for:** Analytics, KPIs, performance metrics
+**Components:** Stat cards, charts (use Chart.js), progress bars
+**Example apps:** Campaign Stats, Agent Stats
+
+### 4. Timeline (Activity Feed)
+**Use for:** History, activity logs, event streams
+**Components:** Timeline with timestamps, event types, icons
+**Example apps:** Contact Timeline, Workflow Status
+
+### 5. Calendar View
+**Use for:** Appointments, events, schedules
+**Components:** Calendar grid, event markers, time slots
+**Example apps:** Calendar View
+
+---
+
+## 7. Build Configuration
+
+### package.json Scripts
+
+```json
+{
+  "scripts": {
+    "build": "npm run build:ts && npm run build:ui",
+    "build:ts": "tsc",
+    "build:ui": "node scripts/copy-ui.js",
+    "dev": "tsx src/index.ts",
+    "start": "node dist/index.js"
+  }
+}
+```
+
+### scripts/copy-ui.js
+
+```javascript
+import fs from 'fs-extra';
+import path from 'path';
+
+const uiSource = path.join(process.cwd(), 'src', 'ui');
+const uiDest = path.join(process.cwd(), 'dist', 'app-ui');
+
+console.log('Copying UI files...');
+console.log(`From: ${uiSource}`);
+console.log(`To: ${uiDest}`);
+
+// Ensure dist/app-ui exists
+fs.ensureDirSync(uiDest);
+
+// Copy all HTML files from src/ui to dist/app-ui
+fs.copySync(uiSource, uiDest, { overwrite: true });
+
+console.log('✅ UI files copied successfully');
+```
+
+**Install fs-extra:**
+```bash
+npm install --save-dev fs-extra @types/fs-extra
+```
+
+### tsconfig.json
+
+```json
+{
+  "compilerOptions": {
+    "target": "ES2022",
+    "module": "ES2022",
+    "moduleResolution": "node",
+    "outDir": "./dist",
+    "rootDir": "./src",
+    "strict": true,
+    "esModuleInterop": true,
+    "skipLibCheck": true,
+    "forceConsistentCasingInFileNames": true
+  },
+  "include": ["src/**/*"],
+  "exclude": ["node_modules", "dist", "src/ui"]
+}
+```
+
+**Note:** Exclude `src/ui` from TypeScript compilation (HTML files don't need compiling)
+
+---
+
+## 8. Testing Apps
+
+### 1. Build the server
+```bash
+npm run build
+```
+
+### 2. Add to Claude Desktop config
+```json
+{
+  "mcpServers": {
+    "myservice": {
+      "command": "node",
+      "args": ["/absolute/path/to/dist/index.js"],
+      "env": {
+        "API_KEY": "your_key_here"
+      }
+    }
+  }
+}
+```
+
+### 3. Restart Claude Desktop
+
+### 4. Call an app tool
+```
+Can you show me the contact grid for "john"?
+```
+
+Claude will call `view_contact_grid` → Server returns `structuredContent` → UI renders in Claude Desktop
+
+---
+
+## 9. When to Use Apps vs Regular Tools
+
+| Scenario | Use App | Use Regular Tool |
+|----------|---------|------------------|
+| Display contact list | ✅ Grid UI | ❌ JSON dump |
+| Show dashboard stats | ✅ Dashboard UI | ❌ Plain numbers |
+| Get single contact by ID | ❌ Overkill | ✅ JSON response |
+| Create a new record | ❌ No UI needed | ✅ POST + return result |
+| Search + display results | ✅ Grid UI | Maybe (depends on result size) |
+| Calendar of appointments | ✅ Calendar UI | ❌ JSON dates hard to parse |
+| Invoice details | ✅ Card UI | Maybe |
+
+**Rule of thumb:** If the result benefits from visual formatting, use an app. If it's pure data/CRUD, use a regular tool.
+
+---
+
+## 10. Common Pitfalls
+
+### ❌ UI files not copied to dist/
+**Solution:** Add `build:ui` script that copies HTML from `src/ui/` to `dist/app-ui/`
+
+### ❌ UI path resolution fails
+**Solution:** Use `fileURLToPath` for ESM `__dirname` equivalent + check `fs.existsSync()`
+
+### ❌ Data not showing in UI
+**Solution:** Check `event.data.type === 'mcp-app-init'` and log `event.data.data` to console
+
+### ❌ Resources not registered
+**Solution:** Add `capabilities: { resources: {} }` and implement `ListResourcesRequestSchema` + `ReadResourceRequestSchema`
+
+### ❌ HTML escaping issues
+**Solution:** Always escape user data with `escapeHtml()` function
+
+---
+
+## 11. App Tool Naming Convention
+
+**Pattern:** `view_` or `show_` prefix for app tools
+
+- `view_contact_grid` → Display contact grid
+- `show_dashboard` → Display dashboard
+- `view_opportunity_card` → Display opportunity card
+- `show_calendar` → Display calendar
+
+**Why:**
+- Differentiates app tools from regular tools
+- Signals to Claude that result is visual
+- Clear intent (viewing vs fetching)
+
+---
+
+## 12. Example: Complete App Tool
+
+```typescript
+{
+  name: 'view_pipeline_board',
+  description: 'Display sales pipeline board with opportunities grouped by stage. Returns an interactive visual component.',
+  inputSchema: {
+    type: 'object',
+    properties: {
+      pipelineId: { 
+        type: 'string', 
+        description: 'Pipeline ID (optional, defaults to main pipeline)' 
+      },
+      includeWon: { 
+        type: 'boolean', 
+        description: 'Include won deals (default: false)' 
+      },
+    },
+  },
+}
+```
+
+```typescript
+private async viewPipelineBoard(args: Record<string, unknown>): Promise<AppToolResult> {
+  const { pipelineId, includeWon = false } = args;
+
+  // Fetch pipeline data from API
+  const pipeline = await this.apiClient.getPipeline(pipelineId);
+  const opportunities = await this.apiClient.getOpportunities({ 
+    pipelineId,
+    status: includeWon ? 'all' : 'active',
+  });
+
+  // Group by stage
+  const groupedByStage = opportunities.reduce((acc, opp) => {
+    if (!acc[opp.stageId]) acc[opp.stageId] = [];
+    acc[opp.stageId].push(opp);
+    return acc;
+  }, {} as Record<string, any[]>);
+
+  return {
+    content: [{
+      type: 'text',
+      text: `Pipeline Board: ${pipeline.name} (${opportunities.length} opportunities)`,
+    }],
+    structuredContent: {
+      type: 'ui',
+      uri: 'ui://myservice/pipeline-board',
+      data: {
+        pipeline,
+        opportunities,
+        groupedByStage,
+        includeWon,
+        timestamp: new Date().toISOString(),
+      },
+    },
+  };
+}
+```
+
+---
+
+## 13. Resources
+
+- **MCP Apps Docs:** https://modelcontextprotocol.io/docs/apps
+- **Example Apps:** `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/ghl-mcp-apps-only/`
+- **GHL MCP Server:** `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/GoHighLevel-MCP/`
+
+---
+
+## Summary
+
+**To add apps to an MCP server:**
+1. Create `MCPAppsManager` class in `src/apps/index.ts`
+2. Build HTML UI components in `src/ui/`
+3. Register resource handlers in apps manager
+4. Add `capabilities: { resources: {} }` to server
+5. Implement `ListResourcesRequestSchema` and `ReadResourceRequestSchema`
+6. Return `structuredContent` from app tool handlers
+7. Copy UI files to `dist/app-ui/` during build
+
+**Benefits:**
+- Rich visual presentation of data
+- Better UX in Claude Desktop
+- Interactive components (grids, cards, dashboards)
+- Clear separation of regular tools vs visual tools
+
+Follow this pattern and your apps will integrate seamlessly with your MCP server.
diff --git a/skills/mcp-apps-merged/CHANGELOG.md b/skills/mcp-apps-merged/CHANGELOG.md
new file mode 100644
index 0000000..dcd920a
--- /dev/null
+++ b/skills/mcp-apps-merged/CHANGELOG.md
@@ -0,0 +1,86 @@
+# MCP Apps Merged Skill — Changelog
+
+## Merge Date: 2026-02-03
+
+### Added from Official ext-apps Skill
+
+These sections/content were **new** from the official skill and did not exist in the custom skill:
+
+| Addition | Location in Merged Doc |
+|----------|----------------------|
+| YAML frontmatter with trigger phrases | Top of file |
+| Framework selection table (Vue/Svelte/Preact/Solid) | Quick Start Decision Tree |
+| Project context section (adding to existing vs new) | Quick Start Decision Tree |
+| Full framework templates table (6 frameworks) | Get Reference Code → Framework Templates |
+| API Reference source files table | Get Reference Code → API Reference |
+| Additional advanced examples (video-resource, sheet-music, threejs, transcript, basic-host) | Get Reference Code → Advanced Examples |
+| `npm install` vs manual versions warning | Required Packages + MUST DO table |
+| `tsx` vs `bun` compatibility note | Required Packages |
+| `ontoolinputpartial` streaming section with use cases table and code patterns | New section: Streaming Partial Input |
+| Visibility-based resource management (IntersectionObserver) | New section: Visibility-Based Resource Management |
+| Fullscreen mode (`requestDisplayMode()`) with CSS patterns | New section: Fullscreen Mode |
+| Safe area insets handling (`safeAreaInsets`) | New section: Safe Area Insets |
+| `sendLog` for debug logging to host | Client-Side + Debugging Checklist |
+| Testing with basic-host example | Testing section |
+| Host styling split into Applying / Using subsections | Host CSS Variables |
+| `line-height` CSS variable example | Host CSS Variables |
+
+### Kept from Custom Battle-Tested Skill
+
+Everything from the custom skill was preserved as the structural base:
+
+| Content | Status |
+|---------|--------|
+| "What MCP Apps Actually Are" explanation | Kept verbatim |
+| "How The Data Flows" numbered diagram | Kept verbatim |
+| Quick Reference pattern | Kept verbatim |
+| Required Packages table | Kept, enhanced with official notes |
+| Project Structure tree | Kept verbatim |
+| Complete Server Template (annotated) | Kept verbatim |
+| Tool Registration quick reference | Kept verbatim |
+| Resource Registration snippet | Kept verbatim |
+| Tool Visibility Options | Kept (merged — both had same content) |
+| Server-Side Action Tools (view vs action) | Kept verbatim |
+| Complete HTML Template | Kept verbatim |
+| Complete UI Logic template | Kept verbatim |
+| Vanilla JS Full Lifecycle pattern | Kept, enhanced with `ontoolinputpartial` + safe area |
+| React Pattern | Kept verbatim |
+| Calling Server Tools snippet | Kept verbatim |
+| Updating Model Context snippet | Kept verbatim |
+| App Lifecycle Handlers table | Kept, added `ontoolinputpartial` row |
+| Host CSS Variables section | Kept, merged with official's richer version |
+| Build Configuration (vite, tsconfig, package.json) | Kept verbatim |
+| **Module Scope Issue** (critical gotcha) | Kept verbatim |
+| **Drag-and-Drop Pattern** (complete) | Kept verbatim |
+| **Edit Modal Pattern** (complete) | Kept verbatim |
+| CSS for Drag-and-Drop & Modals | Kept verbatim |
+| Key Lessons Learned (6 items) | Kept verbatim |
+| Data Handling Gotchas (array validation, escaping) | Kept verbatim |
+| MUST DO table | Kept, added 4 rows from official |
+| MUST NOT table | Kept, added 3 rows from official |
+| Host-Specific Notes (Goose, Claude Desktop) | Kept verbatim |
+| JSONRPC pipe testing commands | Kept verbatim |
+| Debugging Checklist | Kept, added sendLog step |
+| Common Errors table (10+ entries) | Kept, added 2 rows from official |
+| Summary: The Golden Path | Kept, expanded with new steps |
+| `git clone` reference code command | Kept verbatim |
+
+### Reorganization
+
+| Change | Rationale |
+|--------|-----------|
+| Moved "Get Reference Code" earlier (after Project Structure) | Better flow — see examples before diving into implementation |
+| Merged framework templates + advanced examples into one "Get Reference Code" section | Eliminates duplicate reference code sections |
+| Added dedicated sections for Streaming, Visibility, Fullscreen, Safe Areas | These were inline in the official skill; standalone sections are more scannable |
+| Split Host CSS Variables into "Applying" and "Using" subsections | Clearer separation of setup vs usage |
+| Enhanced MUST DO/MUST NOT tables with official skill's requirements | Single authoritative rules table |
+| Added `sendLog` to both client-side section and debugging checklist | Available in two contexts where you'd need it |
+| Expanded Golden Path summary from 6 to 10 steps | Covers all new features (streaming, fullscreen, visibility, styling) |
+
+### Deduplication
+
+Only literally identical content was deduplicated:
+- Tool visibility options (identical in both) → kept once
+- `git clone` command (identical) → kept once
+- CSS variable examples (nearly identical) → merged into richer version with `line-height`
+- Handler registration order warning (same in both) → kept custom's more detailed version
diff --git a/skills/mcp-apps-merged/SKILL.md b/skills/mcp-apps-merged/SKILL.md
new file mode 100644
index 0000000..2777010
--- /dev/null
+++ b/skills/mcp-apps-merged/SKILL.md
@@ -0,0 +1,1294 @@
+---
+name: Create MCP App
+description: This skill should be used when the user asks to "create an MCP App", "add a UI to an MCP tool", "build an interactive MCP View", "scaffold an MCP App", or needs guidance on MCP Apps SDK patterns, UI-resource registration, MCP App lifecycle, host integration, streaming input, fullscreen mode, or interactive UI patterns. The definitive guide for building MCP Apps with interactive UIs.
+---
+
+# MCP Apps Complete Guide
+
+> **Purpose**: The definitive guide to building MCP Apps correctly. Invoke with `/mcp-apps` when working on MCP Apps. This merges official SDK documentation with battle-tested patterns from real-world debugging.
+
+---
+
+## What MCP Apps Actually Are
+
+### The Core Concept
+MCP Apps is an **official extension** to the Model Context Protocol (SEP-1865) that allows MCP servers to deliver **interactive HTML user interfaces** that render inside AI chat windows (Claude Desktop, Goose, etc.).
+
+**Key Distinction:**
+- **MCP Apps** = Official standard from `@modelcontextprotocol/ext-apps` (USE THIS)
+- **MCP-UI** = Older community library from `@mcp-ui/server` (DEPRECATED - avoid)
+
+### The Fundamental Pattern
+```
+MCP App = Tool Definition + UI Resource + Proper Response
+```
+
+Three things MUST be linked:
+1. **Tool Definition** with `_meta.ui.resourceUri`
+2. **UI Resource** registered with `text/html;profile=mcp-app` MIME type
+3. **Tool Response** that triggers the host to fetch and render the UI
+
+### How The Data Flows
+```
+1. Server starts, registers tools with _meta.ui.resourceUri
+2. Server registers UI resources via resources/list
+3. Host (Goose/Claude) calls tools/list, sees _meta.ui.resourceUri
+4. User triggers tool call
+5. Server executes tool, returns content + structuredContent
+6. Host sees tool has UI, fetches resource via resources/read
+7. Host renders HTML in sandboxed iframe
+8. UI calls app.connect(), receives tool result via ontoolresult
+9. UI renders the data
+```
+
+---
+
+## Quick Reference: The Pattern
+
+```
+MCP App = Tool + UI Resource + Link
+```
+
+1. **Tool** - Called by LLM/host, returns data with `structuredContent`
+2. **Resource** - Serves bundled HTML (single file via vite-plugin-singlefile)
+3. **Link** - Tool's `_meta.ui.resourceUri` points to the resource URI
+
+```
+Host calls tool → Server returns result → Host fetches resource → UI receives result via ontoolresult
+```
+
+---
+
+## Quick Start Decision Tree
+
+### Framework Selection
+
+| Framework | SDK Support | Best For |
+|-----------|-------------|----------|
+| React | `useApp` hook provided | Teams familiar with React |
+| Vanilla JS | Manual lifecycle | Simple apps, no build complexity |
+| Vue | Manual lifecycle | Vue teams (template available) |
+| Svelte | Manual lifecycle | Svelte teams (template available) |
+| Preact | Manual lifecycle | Lightweight React alternative |
+| Solid | Manual lifecycle | Solid teams (template available) |
+
+### Project Context
+
+**Adding to existing MCP server:**
+- Import `registerAppTool`, `registerAppResource` from SDK
+- Add tool registration with `_meta.ui.resourceUri`
+- Add resource registration serving bundled HTML
+
+**Creating new MCP server:**
+- Set up server with transport (stdio or HTTP)
+- Register tools and resources
+- Configure build system with `vite-plugin-singlefile`
+
+---
+
+## Required Packages
+
+```bash
+npm install @modelcontextprotocol/ext-apps @modelcontextprotocol/sdk zod
+npm install -D typescript vite vite-plugin-singlefile tsx
+```
+
+> **CRITICAL**: Use `npm install` to add dependencies rather than manually writing version numbers. This lets npm resolve the latest compatible versions. Never specify version numbers from memory.
+
+| Package | Purpose | REQUIRED |
+|---------|---------|----------|
+| `@modelcontextprotocol/ext-apps` | Official MCP Apps SDK (server + client) | YES |
+| `@modelcontextprotocol/sdk` | Base MCP SDK with McpServer class | YES |
+| `zod` | Schema validation for tool parameters | YES |
+| `vite` + `vite-plugin-singlefile` | Bundle UI to single HTML file | YES |
+| `tsx` | Run TypeScript server files | YES |
+
+**Do NOT use**: `@mcp-ui/server` (deprecated, different API)
+
+> **Note**: The SDK examples use `bun` but generated projects should use `tsx` for broader compatibility.
+
+---
+
+## Project Structure
+
+```
+my-mcp-app/
+├── package.json
+├── tsconfig.json
+├── src/
+│   ├── server.ts              # MCP server with tools + resources
+│   └── app-ui/
+│       ├── mcp-app.html       # UI HTML template
+│       ├── src/
+│       │   └── mcp-app.ts     # UI logic
+│       └── vite.config.ts     # Bundles to single file
+└── dist/
+    ├── server.js              # Compiled server
+    └── app-ui/
+        └── mcp-app.html       # Bundled single-file UI
+```
+
+---
+
+## Get Reference Code
+
+Clone the SDK repository for working examples and API documentation:
+
+```bash
+git clone --branch "v$(npm view @modelcontextprotocol/ext-apps version)" --depth 1 \
+  https://github.com/modelcontextprotocol/ext-apps.git /tmp/mcp-ext-apps
+```
+
+### Framework Templates
+
+Learn and adapt from `/tmp/mcp-ext-apps/examples/basic-server-{framework}/`:
+
+| Template | Key Files |
+|----------|-----------|
+| `basic-server-vanillajs/` | `server.ts`, `src/mcp-app.ts`, `mcp-app.html` |
+| `basic-server-react/` | `server.ts`, `src/mcp-app.tsx` (uses `useApp` hook) |
+| `basic-server-vue/` | `server.ts`, `src/App.vue` |
+| `basic-server-svelte/` | `server.ts`, `src/App.svelte` |
+| `basic-server-preact/` | `server.ts`, `src/mcp-app.tsx` |
+| `basic-server-solid/` | `server.ts`, `src/mcp-app.tsx` |
+
+Each template includes:
+- Complete `server.ts` with `registerAppTool` and `registerAppResource`
+- Client-side app with all lifecycle handlers
+- `vite.config.ts` with `vite-plugin-singlefile`
+- `package.json` with all required dependencies
+- `.gitignore` excluding `node_modules/` and `dist/`
+
+### API Reference (Source Files)
+
+Read JSDoc documentation directly from `/tmp/mcp-ext-apps/src/`:
+
+| File | Contents |
+|------|----------|
+| `src/app.ts` | `App` class, handlers (`ontoolinput`, `ontoolresult`, `onhostcontextchanged`, `onteardown`), lifecycle |
+| `src/server/index.ts` | `registerAppTool`, `registerAppResource`, tool visibility options |
+| `src/spec.types.ts` | All type definitions: `McpUiHostContext`, CSS variable keys, display modes |
+| `src/styles.ts` | `applyDocumentTheme`, `applyHostStyleVariables`, `applyHostFonts` |
+| `src/react/useApp.tsx` | `useApp` hook for React apps |
+| `src/react/useHostStyles.ts` | `useHostStyles`, `useHostStyleVariables`, `useHostFonts` hooks |
+
+### Advanced Examples
+
+| Example | Pattern Demonstrated |
+|---------|---------------------|
+| `examples/shadertoy-server/` | **Streaming partial input** + visibility-based pause/play (best practice for large inputs) |
+| `examples/wiki-explorer-server/` | `callServerTool` for interactive data fetching |
+| `examples/system-monitor-server/` | Polling pattern with interval management |
+| `examples/video-resource-server/` | Binary/blob resources |
+| `examples/sheet-music-server/` | `ontoolinput` - processing tool args before execution completes |
+| `examples/threejs-server/` | `ontoolinputpartial` - streaming/progressive rendering |
+| `examples/map-server/` | `updateModelContext` - keeping model informed of UI state |
+| `examples/transcript-server/` | `updateModelContext` + `sendMessage` - background context updates + user-initiated messages |
+| `examples/basic-host/` | Reference host implementation using `AppBridge` |
+
+---
+
+## Server-Side Implementation
+
+### Critical Imports
+```typescript
+import {
+  registerAppTool,
+  registerAppResource,
+  RESOURCE_MIME_TYPE,  // = "text/html;profile=mcp-app"
+} from "@modelcontextprotocol/ext-apps/server";
+import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
+import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
+import { z } from "zod";
+```
+
+### Complete Server Template
+```typescript
+import {
+  registerAppTool,
+  registerAppResource,
+  RESOURCE_MIME_TYPE,
+} from "@modelcontextprotocol/ext-apps/server";
+import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
+import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
+import { z } from "zod";
+import * as fs from "node:fs/promises";
+import * as path from "node:path";
+
+// Path to bundled UI (after vite build)
+const DIST_DIR = path.join(__dirname, "app-ui");
+
+export function createServer(): McpServer {
+  const server = new McpServer({
+    name: "my-mcp-app",
+    version: "1.0.0",
+  });
+
+  // CRITICAL: Define the resource URI (must use ui:// scheme)
+  const widgetResourceUri = "ui://my-app/widget.html";
+
+  // ============================================
+  // STEP 1: Register Tool with UI Metadata
+  // ============================================
+  registerAppTool(
+    server,
+    "my_tool",  // Tool name (snake_case recommended)
+    {
+      title: "My Tool",  // Human-readable title
+      description: "Description for the LLM to understand when to use this",
+      inputSchema: {
+        // Use Zod schemas for parameters
+        param1: z.string().describe("Parameter description"),
+        param2: z.number().optional().describe("Optional number"),
+      },
+      // CRITICAL: This links tool to UI resource
+      _meta: {
+        ui: { resourceUri: widgetResourceUri },
+      },
+    },
+    async (args: { param1: string; param2?: number }) => {
+      // Your tool logic here
+      const result = {
+        data: args.param1,
+        timestamp: new Date().toISOString(),
+      };
+
+      // CRITICAL: Return format
+      return {
+        // Text fallback for non-UI hosts
+        content: [
+          {
+            type: "text" as const,
+            text: `Result: ${JSON.stringify(result)}`,
+          },
+        ],
+        // Structured data passed to UI via ontoolresult
+        structuredContent: result,
+      };
+    }
+  );
+
+  // ============================================
+  // STEP 2: Register UI Resource
+  // ============================================
+  registerAppResource(
+    server as any,  // Type cast needed due to SDK types
+    widgetResourceUri,  // Resource name
+    widgetResourceUri,  // Resource URI (same as name for simplicity)
+    { mimeType: RESOURCE_MIME_TYPE },  // CRITICAL: Use this constant
+    async () => {
+      console.error(`[MCP App] Reading UI resource`);
+      try {
+        const html = await fs.readFile(
+          path.join(DIST_DIR, "mcp-app.html"),
+          "utf-8"
+        );
+        console.error(`[MCP App] UI loaded (${html.length} bytes)`);
+        return {
+          contents: [
+            {
+              uri: widgetResourceUri,
+              mimeType: RESOURCE_MIME_TYPE,
+              text: html,
+            },
+          ],
+        };
+      } catch (error) {
+        console.error(`[MCP App] Failed to read UI:`, error);
+        return {
+          contents: [
+            {
+              uri: widgetResourceUri,
+              mimeType: RESOURCE_MIME_TYPE,
+              text: "<html><body>UI not built</body></html>",
+            },
+          ],
+        };
+      }
+    }
+  );
+
+  // Add more tools (with or without UI)
+  server.tool(
+    "helper_tool",
+    "A helper tool without UI",
+    {
+      input: z.string().describe("Input value"),
+    },
+    async (args) => {
+      return {
+        content: [{ type: "text" as const, text: `Got: ${args.input}` }],
+      };
+    }
+  );
+
+  return server;
+}
+
+// Entry point
+async function main() {
+  const server = createServer();
+  const transport = new StdioServerTransport();
+  await server.connect(transport);
+  console.error("MCP App Server started");
+}
+
+main().catch((e) => {
+  console.error(`Error: ${e.message}`);
+  process.exit(1);
+});
+```
+
+### Tool Registration (Quick Reference)
+```typescript
+const resourceUri = "ui://my-app/widget.html";
+
+registerAppTool(
+  server,
+  "tool_name",
+  {
+    title: "Tool Title",
+    description: "Description for LLM",
+    inputSchema: {
+      param: z.string().describe("Parameter description"),
+    },
+    _meta: {
+      ui: { resourceUri },
+    },
+  },
+  async (args) => {
+    const result = { data: args.param };
+    return {
+      content: [{ type: "text" as const, text: JSON.stringify(result) }],
+      structuredContent: result,  // This goes to UI
+    };
+  }
+);
+```
+
+### Resource Registration
+```typescript
+registerAppResource(
+  server as any,
+  resourceUri,
+  resourceUri,
+  { mimeType: RESOURCE_MIME_TYPE },
+  async () => ({
+    contents: [{
+      uri: resourceUri,
+      mimeType: RESOURCE_MIME_TYPE,
+      text: await fs.readFile(path.join(__dirname, "app-ui/mcp-app.html"), "utf-8"),
+    }],
+  })
+);
+```
+
+### Tool Visibility Options
+```typescript
+// Default: visible to both model and app
+_meta: { ui: { resourceUri, visibility: ["model", "app"] } }
+
+// UI-only (hidden from model) - for refresh buttons, form submissions, internal actions
+_meta: { ui: { resourceUri, visibility: ["app"] } }
+
+// Model-only (app cannot call)
+_meta: { ui: { resourceUri, visibility: ["model"] } }
+```
+
+### Server-Side Action Tools
+
+For interactive UIs, add **action tools** that the UI calls via `callServerTool`:
+
+```typescript
+// View tool (returns UI) - called by model
+registerAppTool(server, "view_board", { /* ... */ }, async (args) => {
+  const data = await getData();
+  return {
+    content: [{ type: "text", text: `Board loaded` }],
+    structuredContent: data
+  };
+});
+
+// Action tool (no UI) - called by UI via callServerTool
+{
+  name: 'update_item',
+  description: 'Update an item (used by UI for drag-drop/edit)',
+  inputSchema: {
+    type: 'object',
+    properties: {
+      itemId: { type: 'string' },
+      columnId: { type: 'string' },
+      name: { type: 'string' },
+      value: { type: 'number' }
+    },
+    required: ['itemId']
+  }
+  // No _meta.ui - this is not a view tool
+}
+
+// Handler in executeTool:
+case 'update_item':
+  const result = await apiClient.updateItem(args.itemId, args);
+  return {
+    content: [{ type: 'text', text: `Updated ${result.name}` }],
+    structuredContent: { success: true, item: result }
+  };
+```
+
+---
+
+## Client-Side Implementation
+
+### Complete HTML Template (src/app-ui/mcp-app.html)
+```html
+<!DOCTYPE html>
+<html lang="en">
+<head>
+  <meta charset="UTF-8" />
+  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
+  <title>My MCP App</title>
+  <style>
+    /* All styles inline - will be bundled */
+    * { margin: 0; padding: 0; box-sizing: border-box; }
+    body {
+      font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, sans-serif;
+      padding: 16px;
+      background: #f5f5f5;
+    }
+    .loading { text-align: center; color: #666; padding: 20px; }
+    .error { color: #dc2626; background: #fef2f2; padding: 16px; border-radius: 8px; }
+    .content { background: white; padding: 16px; border-radius: 8px; }
+  </style>
+</head>
+<body>
+  <div id="app">
+    <div class="loading">Loading...</div>
+  </div>
+  <script type="module" src="/src/mcp-app.ts"></script>
+</body>
+</html>
+```
+
+### Complete UI Logic (src/app-ui/src/mcp-app.ts)
+```typescript
+import { App } from "@modelcontextprotocol/ext-apps";
+
+// Define your data types
+interface MyData {
+  data: string;
+  timestamp: string;
+}
+
+const appEl = document.getElementById("app")!;
+
+// Create app instance
+const app = new App({
+  name: "My MCP App",
+  version: "1.0.0"
+});
+
+// ============================================
+// CRITICAL: Set handlers BEFORE connect()
+// ============================================
+app.ontoolresult = (result) => {
+  console.log("Tool result received:", result);
+
+  try {
+    // Get data from structuredContent (preferred) or parse text
+    let data: MyData;
+
+    if (result.structuredContent) {
+      data = result.structuredContent as MyData;
+    } else {
+      const textContent = result.content?.find((c) => c.type === "text")?.text;
+      if (textContent) {
+        data = JSON.parse(textContent);
+      } else {
+        throw new Error("No data in result");
+      }
+    }
+
+    renderContent(data);
+  } catch (error) {
+    console.error("Failed to parse result:", error);
+    appEl.innerHTML = `<div class="error">Failed to load: ${error}</div>`;
+  }
+};
+
+app.onerror = (error) => {
+  console.error("App error:", error);
+  appEl.innerHTML = `<div class="error">Error: ${error.message}</div>`;
+};
+
+// ============================================
+// CRITICAL: Connect to host (must be called)
+// ============================================
+app.connect();
+
+// Render function
+function renderContent(data: MyData) {
+  appEl.innerHTML = `
+    <div class="content">
+      <h2>Result</h2>
+      <p><strong>Data:</strong> ${escapeHtml(data.data)}</p>
+      <p><strong>Time:</strong> ${data.timestamp}</p>
+      <button id="refresh">Refresh</button>
+    </div>
+  `;
+
+  // Add interactivity
+  document.getElementById("refresh")?.addEventListener("click", async () => {
+    try {
+      const result = await app.callServerTool({
+        name: "my_tool",
+        arguments: { param1: "refreshed" }
+      });
+      const newData = result.structuredContent as MyData;
+      renderContent(newData);
+    } catch (error) {
+      console.error("Refresh failed:", error);
+    }
+  });
+}
+
+// Helper to prevent XSS
+function escapeHtml(text: string): string {
+  const div = document.createElement('div');
+  div.textContent = text;
+  return div.innerHTML;
+}
+```
+
+### Vanilla JS Pattern (Full Lifecycle)
+```typescript
+import { App } from "@modelcontextprotocol/ext-apps";
+import { applyDocumentTheme, applyHostStyleVariables, applyHostFonts } from "@modelcontextprotocol/ext-apps";
+
+const app = new App({ name: "My App", version: "1.0.0" });
+
+// CRITICAL: Set ALL handlers BEFORE connect()
+app.ontoolinput = (params) => {
+  // Tool args available immediately (before execution completes)
+  console.log("Input:", params.arguments);
+};
+
+app.ontoolinputpartial = (params) => {
+  // Streaming partial input (healed JSON, always valid)
+  console.log("Partial:", params.arguments);
+};
+
+app.ontoolresult = (result) => {
+  // Tool execution complete - render the data
+  const data = result.structuredContent;
+  renderUI(data);
+};
+
+app.onhostcontextchanged = (ctx) => {
+  // Theme/style updates from host
+  if (ctx.theme) applyDocumentTheme(ctx.theme);
+  if (ctx.styles?.variables) applyHostStyleVariables(ctx.styles.variables);
+  if (ctx.styles?.css?.fonts) applyHostFonts(ctx.styles.css.fonts);
+  // Handle safe area insets
+  if (ctx.safeAreaInsets) {
+    const { top, right, bottom, left } = ctx.safeAreaInsets;
+    document.body.style.padding = `${top}px ${right}px ${bottom}px ${left}px`;
+  }
+};
+
+app.onteardown = async () => {
+  // Cleanup before UI closes
+  return {};
+};
+
+app.onerror = (error) => {
+  console.error("App error:", error);
+};
+
+// THEN connect
+await app.connect();
+```
+
+### React Pattern
+```typescript
+import { useApp, useHostStyles } from "@modelcontextprotocol/ext-apps/react";
+
+function MyApp() {
+  const [data, setData] = useState(null);
+
+  const { app } = useApp({
+    appInfo: { name: "My App", version: "1.0.0" },
+    onToolResult: (result) => setData(result.structuredContent),
+  });
+
+  useHostStyles(app);  // Injects CSS variables, making var(--*) available
+
+  return <div>{data && <Content data={data} />}</div>;
+}
+```
+
+### Calling Server Tools from UI
+```typescript
+const result = await app.callServerTool({
+  name: "tool_name",
+  arguments: { param: "value" }
+});
+const newData = result.structuredContent;
+```
+
+### Updating Model Context
+```typescript
+// Keep the model informed of UI state changes
+app.updateModelContext({
+  summary: "User selected 3 items",
+  details: { selectedIds: [1, 2, 3] }
+});
+```
+
+### Debug Logging to Host
+```typescript
+// Send debug logs to the host application (not just iframe dev console)
+await app.sendLog({ level: "info", data: "Debug message" });
+await app.sendLog({ level: "error", data: { error: err.message } });
+```
+
+---
+
+## App Lifecycle Handlers
+
+| Handler | When It Fires | Use For |
+|---------|---------------|---------|
+| `ontoolinput` | Tool args available (before execution) | Show loading state, preview |
+| `ontoolinputpartial` | Streaming partial input (healed JSON) | Progressive rendering during generation |
+| `ontoolresult` | Tool execution complete | Render main UI |
+| `onhostcontextchanged` | Theme/style/locale/display mode changes | Apply host styling, safe areas, fullscreen |
+| `onteardown` | UI closing | Cleanup, save state |
+| `onerror` | Error occurred | Error display |
+
+---
+
+## Streaming Partial Input
+
+For large tool inputs, use `ontoolinputpartial` to show progress during LLM generation. The partial JSON is healed (always valid), enabling progressive UI updates.
+
+**Spec:** [ui/notifications/tool-input-partial](https://github.com/modelcontextprotocol/ext-apps/blob/main/specification/2026-01-26/apps.mdx#streaming-tool-input)
+
+```typescript
+app.ontoolinputpartial = (params) => {
+  const args = params.arguments; // Healed partial JSON - always valid, fields appear as generated
+  // Use args directly for progressive rendering
+};
+
+app.ontoolinput = (params) => {
+  // Final complete input - switch from preview to full render
+};
+```
+
+### Streaming Use Cases
+
+| Pattern | Example |
+|---------|---------|
+| Code preview | Show streaming code in `<pre>`, render on complete (`examples/shadertoy-server/`) |
+| Progressive form | Fill form fields as they stream in |
+| Live chart | Add data points to chart as array grows |
+| Partial render | Render incomplete structured data (tables, lists, trees) |
+
+### Simple Streaming Pattern (Code Preview)
+```typescript
+app.ontoolinputpartial = (params) => {
+  codePreview.textContent = params.arguments?.code ?? "";
+  codePreview.style.display = "block";
+  canvas.style.display = "none";
+};
+app.ontoolinput = (params) => {
+  codePreview.style.display = "none";
+  canvas.style.display = "block";
+  render(params.arguments);
+};
+```
+
+---
+
+## Visibility-Based Resource Management
+
+Pause expensive operations (animations, WebGL, polling) when view scrolls out of viewport:
+
+```typescript
+const observer = new IntersectionObserver((entries) => {
+  entries.forEach((entry) => {
+    if (entry.isIntersecting) {
+      animation.play(); // or: startPolling(), shaderToy.play()
+    } else {
+      animation.pause(); // or: stopPolling(), shaderToy.pause()
+    }
+  });
+});
+observer.observe(document.querySelector(".main"));
+```
+
+See `examples/shadertoy-server/` for a real implementation combining streaming input + visibility pause.
+
+---
+
+## Fullscreen Mode
+
+Request fullscreen via `app.requestDisplayMode()`. Check availability in host context:
+
+```typescript
+let currentMode: "inline" | "fullscreen" = "inline";
+
+app.onhostcontextchanged = (ctx) => {
+  // Check if fullscreen available
+  if (ctx.availableDisplayModes?.includes("fullscreen")) {
+    fullscreenBtn.style.display = "block";
+  }
+  // Track current mode
+  if (ctx.displayMode) {
+    currentMode = ctx.displayMode;
+    container.classList.toggle("fullscreen", currentMode === "fullscreen");
+  }
+};
+
+async function toggleFullscreen() {
+  const newMode = currentMode === "fullscreen" ? "inline" : "fullscreen";
+  const result = await app.requestDisplayMode({ mode: newMode });
+  currentMode = result.mode;
+}
+```
+
+### CSS for Fullscreen — Remove border radius in fullscreen:
+```css
+.main { border-radius: var(--border-radius-lg); overflow: hidden; }
+.main.fullscreen { border-radius: 0; }
+```
+
+See `examples/shadertoy-server/` for complete implementation.
+
+---
+
+## Safe Area Insets
+
+Always respect `safeAreaInsets` to avoid content being clipped by host chrome:
+
+```typescript
+app.onhostcontextchanged = (ctx) => {
+  if (ctx.safeAreaInsets) {
+    const { top, right, bottom, left } = ctx.safeAreaInsets;
+    document.body.style.padding = `${top}px ${right}px ${bottom}px ${left}px`;
+  }
+};
+```
+
+---
+
+## Host CSS Variables
+
+### Applying Styles
+
+**Vanilla JS** - Use helper functions:
+```typescript
+import { applyDocumentTheme, applyHostStyleVariables, applyHostFonts } from "@modelcontextprotocol/ext-apps";
+
+app.onhostcontextchanged = (ctx) => {
+  if (ctx.theme) applyDocumentTheme(ctx.theme);
+  if (ctx.styles?.variables) applyHostStyleVariables(ctx.styles.variables);
+  if (ctx.styles?.css?.fonts) applyHostFonts(ctx.styles.css.fonts);
+};
+```
+
+**React** - Use hooks:
+```typescript
+import { useApp, useHostStyles } from "@modelcontextprotocol/ext-apps/react";
+
+const { app } = useApp({ appInfo, capabilities, onAppCreated });
+useHostStyles(app); // Injects CSS variables to document, making var(--*) available
+```
+
+### Using Variables in CSS
+
+After applying with `applyHostStyleVariables()` or `useHostStyles()`:
+
+```css
+.container {
+  background: var(--color-background-secondary);
+  color: var(--color-text-primary);
+  font-family: var(--font-sans);
+  border-radius: var(--border-radius-md);
+}
+.code {
+  font-family: var(--font-mono);
+  font-size: var(--font-text-sm-size);
+  line-height: var(--font-text-sm-line-height);
+  color: var(--color-text-secondary);
+}
+.heading {
+  font-size: var(--font-heading-lg-size);
+  font-weight: var(--font-weight-semibold);
+}
+```
+
+Key variable groups: `--color-background-*`, `--color-text-*`, `--color-border-*`, `--font-sans`, `--font-mono`, `--font-text-*-size`, `--font-heading-*-size`, `--border-radius-*`. See `src/spec.types.ts` for full list.
+
+---
+
+## Build Configuration
+
+### Vite Config (src/app-ui/vite.config.ts)
+```typescript
+import { defineConfig } from "vite";
+import { viteSingleFile } from "vite-plugin-singlefile";
+
+export default defineConfig({
+  plugins: [viteSingleFile()],  // CRITICAL: Bundles everything into one HTML
+  root: __dirname,
+  build: {
+    outDir: "../../dist/app-ui",
+    emptyOutDir: true,
+    rollupOptions: { input: "mcp-app.html" },
+  },
+});
+```
+
+### tsconfig.json
+```json
+{
+  "compilerOptions": {
+    "target": "ES2022",
+    "module": "NodeNext",
+    "moduleResolution": "NodeNext",
+    "strict": true,
+    "esModuleInterop": true,
+    "skipLibCheck": true,
+    "outDir": "./dist"
+  },
+  "include": ["src/**/*"],
+  "exclude": ["node_modules", "dist", "src/app-ui/**/*"]
+}
+```
+
+**CRITICAL**: Exclude `src/app-ui/**/*` from TypeScript - Vite handles that separately.
+
+### package.json Scripts
+```json
+{
+  "type": "module",
+  "scripts": {
+    "build:ui": "cd src/app-ui && npx vite build",
+    "build:server": "tsc",
+    "build": "npm run build:ui && npm run build:server",
+    "serve": "tsx src/server.ts",
+    "start": "node dist/server.js"
+  }
+}
+```
+
+---
+
+## Interactive UI Patterns
+
+### CRITICAL: Module Scope Issue
+
+When using `<script type="module">`, all variables are **module-scoped** and NOT accessible from:
+- External `<script>` tags
+- HTML attributes (`onclick`, `onsubmit`, `ondblclick`)
+
+**Solution 1: Keep everything in the module**
+```javascript
+// Inside the module script - works because app and functions are in scope
+function renderUI(data) {
+  // Render UI
+  renderHTML(data);
+  // Store data globally for interactive features
+  window._appData = data;
+  // Setup interactivity after render
+  setupInteractivity();
+}
+
+function setupInteractivity() {
+  document.querySelectorAll('.card').forEach(card => {
+    card.draggable = true;
+    card.ondragstart = (e) => { /* has access to module's app variable */ };
+    card.ondblclick = () => openEditModal(card.dataset.id);
+  });
+}
+```
+
+**Solution 2: Expose to window (for HTML attributes)**
+```javascript
+// At end of module, expose functions needed by HTML onclick/onsubmit
+window.openEditModal = function(id) { /* ... */ };
+window.closeEditModal = function() { /* ... */ };
+window.saveItem = async function(e) {
+  e.preventDefault();
+  await app.callServerTool({ name: 'update_item', arguments: { id, data } });
+};
+```
+
+### Drag-and-Drop Pattern
+
+```javascript
+// Store current data globally for access during drag/drop
+window._appData = null;
+
+function setupDragDrop() {
+  let draggedId = null;
+
+  // Make cards draggable
+  document.querySelectorAll('.card').forEach(card => {
+    card.draggable = true;
+
+    card.ondragstart = (e) => {
+      draggedId = card.dataset.id;
+      card.classList.add('dragging');
+      e.dataTransfer.effectAllowed = 'move';
+    };
+
+    card.ondragend = () => {
+      card.classList.remove('dragging');
+      document.querySelectorAll('.column').forEach(c => c.classList.remove('drag-over'));
+    };
+  });
+
+  // Setup drop zones
+  document.querySelectorAll('.column').forEach(col => {
+    col.ondragover = (e) => {
+      e.preventDefault();
+      col.classList.add('drag-over');
+    };
+
+    col.ondragleave = (e) => {
+      if (!col.contains(e.relatedTarget)) col.classList.remove('drag-over');
+    };
+
+    col.ondrop = async (e) => {
+      e.preventDefault();
+      col.classList.remove('drag-over');
+      if (!draggedId) return;
+
+      const newColumnId = col.dataset.columnId;
+
+      // Call server to persist the move
+      try {
+        await app.callServerTool({
+          name: 'update_item',
+          arguments: { itemId: draggedId, columnId: newColumnId }
+        });
+
+        // Update local data and re-render
+        const item = window._appData.items.find(i => i.id === draggedId);
+        if (item) {
+          item.columnId = newColumnId;
+          renderUI(window._appData);  // Re-render (which calls setupDragDrop again)
+        }
+      } catch (err) {
+        alert('Move failed: ' + err.message);
+      }
+
+      draggedId = null;
+    };
+  });
+}
+```
+
+### Edit Modal Pattern
+
+**HTML Structure:**
+```html
+<div id="editModal" class="modal-overlay" style="display:none;">
+  <div class="modal-content">
+    <div class="modal-header">
+      <h3>Edit Item</h3>
+      <button onclick="closeEditModal()">&times;</button>
+    </div>
+    <form onsubmit="saveItem(event)">
+      <input type="hidden" id="editItemId">
+      <div class="form-group">
+        <label>Name</label>
+        <input type="text" id="editName" required>
+      </div>
+      <div class="form-group">
+        <label>Value</label>
+        <input type="number" id="editValue">
+      </div>
+      <button type="submit">Save</button>
+    </form>
+  </div>
+</div>
+```
+
+**JavaScript (in module, exposed to window):**
+```javascript
+window.openEditModal = function(id) {
+  const item = window._appData?.items?.find(i => i.id === id);
+  if (!item) return;
+
+  document.getElementById('editItemId').value = id;
+  document.getElementById('editName').value = item.name || '';
+  document.getElementById('editValue').value = item.value || 0;
+  document.getElementById('editModal').style.display = 'flex';
+};
+
+window.closeEditModal = function() {
+  document.getElementById('editModal').style.display = 'none';
+};
+
+window.saveItem = async function(e) {
+  e.preventDefault();
+  const id = document.getElementById('editItemId').value;
+  const name = document.getElementById('editName').value;
+  const value = parseFloat(document.getElementById('editValue').value) || 0;
+
+  try {
+    await app.callServerTool({
+      name: 'update_item',
+      arguments: { itemId: id, name, value }
+    });
+
+    // Update local data and re-render
+    const item = window._appData?.items?.find(i => i.id === id);
+    if (item) {
+      item.name = name;
+      item.value = value;
+      renderUI(window._appData);
+    }
+    closeEditModal();
+  } catch (err) {
+    alert('Save failed: ' + err.message);
+  }
+};
+```
+
+### CSS for Drag-and-Drop & Modals
+
+```css
+.card {
+  cursor: grab;
+  transition: transform 0.2s, opacity 0.2s;
+}
+
+.card:active {
+  cursor: grabbing;
+}
+
+.card.dragging {
+  opacity: 0.5;
+  transform: rotate(2deg);
+}
+
+.column.drag-over .cards-container {
+  background: #e0e7ff;
+  border: 2px dashed #4f46e5;
+}
+
+.modal-overlay {
+  position: fixed;
+  inset: 0;
+  background: rgba(0,0,0,0.5);
+  display: flex;
+  align-items: center;
+  justify-content: center;
+  z-index: 1000;
+}
+
+.modal-content {
+  background: white;
+  padding: 24px;
+  border-radius: 8px;
+  min-width: 300px;
+}
+```
+
+### Key Lessons Learned
+
+1. **Module scope**: `<script type="module">` creates isolated scope - use `window.fn = fn` for HTML attribute access
+2. **Re-render after changes**: Store data in `window._appData`, update it, call render function again
+3. **Setup handlers after render**: Call `setupDragDrop()` at end of render function since DOM changes
+4. **Use dataset attributes**: `card.dataset.id` maps to `data-id="..."` in HTML
+5. **Column identification**: Add `data-column-id` or `data-stage-id` to drop zone containers
+6. **Action vs View tools**: View tools have `_meta.ui.resourceUri`, action tools don't
+
+---
+
+## Data Handling Gotchas
+
+### Array Data Must Be Validated
+```typescript
+// BAD - crashes if data.items isn't an array
+const html = data.items.map(item => `<li>${item}</li>`).join('');
+
+// GOOD - defensive coding
+let items: Item[] = [];
+if (Array.isArray(data.items)) {
+  items = data.items;
+} else if (data.items && typeof data.items === 'object') {
+  // Handle nested { items: [...] } format
+  const nested = (data.items as any).items;
+  if (Array.isArray(nested)) items = nested;
+}
+const html = items.map(item => `<li>${item}</li>`).join('');
+```
+
+### Always Escape User Content
+```typescript
+function escapeHtml(text: string): string {
+  const div = document.createElement('div');
+  div.textContent = text;
+  return div.innerHTML;
+}
+```
+
+---
+
+## Critical Rules
+
+### MUST DO
+
+| Requirement | Why It Matters |
+|-------------|----------------|
+| Use `McpServer` class | Low-level `Server` class doesn't work with helpers |
+| Use `registerAppTool` | Properly sets up `_meta.ui.resourceUri` in both formats |
+| Use `registerAppResource` | Registers resource with correct MIME type |
+| Use `RESOURCE_MIME_TYPE` constant | Must be exactly `text/html;profile=mcp-app` |
+| Use `ui://` scheme for URIs | Required by spec |
+| Bundle UI to SINGLE HTML file | Host expects one file, use `vite-plugin-singlefile` |
+| Set ALL handlers BEFORE `app.connect()` | Miss initial data otherwise |
+| Call `app.connect()` | UI won't communicate without it |
+| Return `structuredContent` in tool response | This is how UI receives typed data |
+| Return text `content` as fallback | Non-UI hosts need this |
+| Use Zod schemas for tool params | McpServer requires Zod, not plain objects |
+| Exclude app-ui from tsconfig | Vite compiles UI separately |
+| Handle `safeAreaInsets` | Avoid content clipped by host chrome |
+| Use `npm install` for dependencies | Never hardcode version numbers from memory |
+| Use host CSS variables | Theme integration, don't hardcode colors |
+| Use `ontoolinputpartial` for large inputs | Show progress during LLM generation |
+
+### MUST NOT
+
+| Anti-Pattern | What Happens |
+|--------------|--------------|
+| Using `Server` instead of `McpServer` | `registerAppTool` won't work |
+| Using `@mcp-ui/server` | Different API, deprecated |
+| Plain object inputSchema | TypeScript errors, runtime failures |
+| Multiple JS/CSS files in UI | Host can't load them |
+| Setting handlers after `connect()` | Miss the initial tool result |
+| Forgetting `_meta.ui.resourceUri` | Host won't fetch UI |
+| Wrong MIME type | Host ignores resource |
+| `import.meta.url` in CommonJS | Build errors |
+| Returning `_meta` in tool response | Not needed if tool definition has it |
+| Hardcoded styles instead of CSS vars | Breaks host theme integration |
+| Ignoring safe area insets | Content clipped on some hosts |
+| Manually writing dependency versions | Version mismatches, use `npm install` |
+
+---
+
+## Host-Specific Notes
+
+### Goose
+- **Minimum version**: 1.19.0 for MCP Apps
+- **Works in**: Goose Desktop, Goose Web (`goose web`), Goose CLI
+- **Config location**: `~/.config/goose/config.yaml`
+- **Enable alpha features**: Add `ALPHA_FEATURES: true` to config
+- **Logs**: Server stderr goes to Goose logs
+
+### Claude Desktop
+- **Config location**: `~/Library/Application Support/Claude/claude_desktop_config.json`
+- **Node path**: Use full path like `/opt/homebrew/opt/node@22/bin/node`
+- **Logs**: Check Claude Desktop developer console
+
+---
+
+## Testing
+
+### Quick Pipe Tests (JSONRPC)
+```bash
+# Test tools/list - verify _meta.ui.resourceUri present
+echo '{"jsonrpc":"2.0","id":1,"method":"tools/list","params":{}}' | node dist/server.js
+
+# Test resources/list - verify MIME type correct
+echo '{"jsonrpc":"2.0","id":1,"method":"resources/list","params":{}}' | node dist/server.js
+
+# Test resources/read - verify HTML returned
+echo '{"jsonrpc":"2.0","id":1,"method":"resources/read","params":{"uri":"ui://my-app/widget.html"}}' | node dist/server.js
+```
+
+### Using basic-host (Interactive Testing)
+
+Test MCP Apps locally with the basic-host example from the SDK repo:
+
+```bash
+# Terminal 1: Build and run your server
+npm run build && npm run serve
+
+# Terminal 2: Run basic-host (from cloned repo)
+cd /tmp/mcp-ext-apps/examples/basic-host
+npm install
+SERVERS='["http://localhost:3001/mcp"]' npm run start
+# Open http://localhost:8080
+```
+
+Configure `SERVERS` with a JSON array of your server URLs (default: `http://localhost:3001/mcp`).
+
+---
+
+## Debugging Checklist
+
+When UI doesn't render:
+
+1. **Check tools/list** - Does tool have `_meta.ui.resourceUri`?
+2. **Check resources/list** - Is resource listed with correct MIME type?
+3. **Check resources/read** - Does it return HTML content?
+4. **Check browser console** - Any JS errors in the iframe?
+5. **Check server stderr** - Is resource being fetched?
+6. **Verify build** - Is `dist/app-ui/mcp-app.html` a single file with inlined JS?
+7. **Use sendLog** - Send debug logs to host from UI:
+   ```typescript
+   await app.sendLog({ level: "info", data: "Debug message" });
+   await app.sendLog({ level: "error", data: { error: err.message } });
+   ```
+
+Add logging to resource handler:
+```typescript
+async () => {
+  console.error(`[DEBUG] resources/read called for: ${uri}`);
+  // ...
+}
+```
+
+---
+
+## Common Errors
+
+| Error | Cause | Fix |
+|-------|-------|-----|
+| `n.map is not a function` | Data isn't array | Validate with `Array.isArray()` |
+| UI shows "Loading..." forever | `connect()` not called | Add `await app.connect()` |
+| UI doesn't receive data | Handlers set after connect | Set handlers BEFORE `connect()` |
+| TypeScript errors with inputSchema | Plain objects used | Use Zod schemas |
+| Host doesn't fetch resource | Tool missing `_meta` | Use `registerAppTool` helper |
+| onclick/ondblclick not firing | Functions scoped in module | Assign to `window` |
+| MIME type mismatch | Hardcoded wrong type | Use `RESOURCE_MIME_TYPE` constant |
+| "UI not built" error | Vite didn't run | Run `npm run build:ui` |
+| `_meta.ui.resourceUri` not in tools/list | Using wrong SDK | Use `@modelcontextprotocol/ext-apps/server` |
+| Drag-and-drop not working | Event handlers not accessible | Use module-internal handlers |
+| Content clipped at edges | Missing safe area handling | Handle `ctx.safeAreaInsets` |
+| Theme looks wrong in host | Hardcoded colors | Use `--color-*` CSS variables |
+
+---
+
+## Summary: The Golden Path
+
+1. **Install**: `@modelcontextprotocol/ext-apps`, `@modelcontextprotocol/sdk`, `zod`, `vite`, `vite-plugin-singlefile`
+2. **Server**: Use `McpServer` + `registerAppTool` + `registerAppResource`
+3. **Schemas**: Use Zod for all tool parameters
+4. **UI**: Use `App` from ext-apps, set ALL handlers BEFORE `connect()`
+5. **Streaming**: Use `ontoolinputpartial` for large inputs, `ontoolinput` for final
+6. **Styling**: Apply host CSS variables, handle safe area insets
+7. **Fullscreen**: Check `availableDisplayModes`, use `requestDisplayMode()`
+8. **Resources**: Use IntersectionObserver to pause expensive ops when off-screen
+9. **Build**: Vite with singlefile plugin, exclude from tsconfig
+10. **Test**: Verify tools/list shows `_meta`, resources/read returns HTML, use basic-host
+
+Follow this exactly and MCP Apps will work. Deviate and you'll debug for hours.
+
+---
+
+*This guide merges the official @modelcontextprotocol/ext-apps SDK documentation with battle-tested patterns from real-world debugging. Follow these patterns exactly.*
diff --git a/skills/mcp-apps-official/SKILL.md b/skills/mcp-apps-official/SKILL.md
new file mode 100644
index 0000000..ef064da
--- /dev/null
+++ b/skills/mcp-apps-official/SKILL.md
@@ -0,0 +1,1136 @@
+---
+name: Create MCP App
+description: This skill should be used when the user asks to "create an MCP App", "add a UI to an MCP tool", "build an interactive MCP View", "scaffold an MCP App", or needs guidance on MCP Apps SDK patterns, UI-resource registration, MCP App lifecycle, or host integration. Provides comprehensive guidance for building MCP Apps with interactive UIs.
+---
+
+# Create MCP App
+
+Build interactive UIs that run inside MCP-enabled hosts like Claude Desktop, Goose, VS Code. An MCP App combines an MCP tool with an HTML resource to display rich, interactive content.
+
+## Core Concept: Tool + Resource
+
+Every MCP App requires two parts linked together:
+
+1. **Tool** - Called by the LLM/host, returns data
+2. **Resource** - Serves the bundled HTML UI that displays the data
+3. **Link** - The tool's `_meta.ui.resourceUri` references the resource
+
+```
+Host calls tool → Server returns result → Host renders resource UI → UI receives result
+```
+
+## Architecture Decision: Direct Composition vs Dynamic Rendering
+
+### ✅ RECOMMENDED: Direct Component Composition (Per-App Files)
+
+Each MCP App is a standalone Vite-bundled HTML file that directly imports and composes the components it needs. The layout is hardcoded — no runtime interpretation layer.
+
+```tsx
+// pipeline-board-app.tsx
+import { PageHeader } from '../components/layout/PageHeader';
+import { KanbanBoard } from '../components/data/KanbanBoard';
+import { MetricCard } from '../components/data/MetricCard';
+
+export function PipelineBoardApp({ data }) {
+  return (
+    <PageHeader title={data.pipeline.name}>
+      <KanbanBoard columns={data.stages} />
+    </PageHeader>
+  );
+}
+```
+
+**Why this works:**
+- Each app knows exactly what it renders — no dynamic interpretation
+- React state is stable (no tree replacement issues)
+- Easy to debug — one app, one file, one purpose
+- Shared component library across all apps via imports
+- Vite bundles each app into a single HTML file
+
+### ❌ AVOID: Dynamic JSON Tree Rendering
+
+Do NOT build a universal renderer that receives a JSON UI tree at runtime and dynamically maps type strings to components. This approach causes:
+
+- **State destruction** — Every tool result replaces the entire tree, unmounting all React components and destroying local state (forms, drag positions, open dropdowns)
+- **Key instability** — Dynamically generated keys cause React to unmount/remount entire subtrees
+- **Silent failures** — Registry lookups for unknown types fail silently
+- **Debugging nightmare** — Issues are in the interpretation layer, not the components
+
+If you MUST use dynamic rendering (e.g., AI-generated UIs), apply these mitigations:
+- Use `mergeUITrees(prev, next)` instead of wholesale tree replacement
+- Ensure ALL element keys are semantic and deterministic (`"pipeline-kanban"` not `"el-${i}"`)
+- Separate data-result tool calls from navigation-intent tool calls (only navigation should replace the tree)
+
+## Interactivity Patterns (Ranked by Reliability)
+
+### Pattern 1: Client-Side State Only ⭐ MOST RELIABLE
+Send all data upfront via `ontoolresult`. The UI handles ALL interactions locally with React state. No server calls needed for sorting, filtering, drag-drop, tab switching, form editing, etc.
+
+```tsx
+// All interactivity is local — works on EVERY host
+const [items, setItems] = useState(data.items);
+const onDragEnd = (draggedId, targetColumn) => {
+  setItems(prev => prev.map(item => 
+    item.id === draggedId ? { ...item, column: targetColumn } : item
+  ));
+};
+```
+
+**Use for:** DataTable sorting/filtering, KanbanBoard drag-drop, TabGroup switching, form editing, chart interactions — anything that doesn't NEED fresh server data.
+
+### Pattern 2: `callServerTool` for On-Demand Data
+Only use when the UI needs fresh data FROM the server (not for writes). Requires host support.
+
+```tsx
+const loadMore = async () => {
+  const result = await app.callServerTool({ 
+    name: "search_contacts", 
+    arguments: { query, offset: page * 25 } 
+  });
+  setContacts(prev => [...prev, ...result.contacts]);
+};
+```
+
+**Use for:** Pagination, search-as-you-type, expanding tree nodes, loading related records.
+
+### Pattern 3: `updateModelContext` for Background Sync
+Silently inform the model what the user did in the UI. No visible message appears in chat. The model has this context for its next interaction.
+
+```tsx
+const onUserAction = (action) => {
+  // Update local state immediately
+  setLocalState(newState);
+  // Silently tell the model
+  app.updateModelContext({ 
+    text: `User moved Deal "${deal.name}" to stage "${newStage}"` 
+  });
+};
+```
+
+**Use for:** Tracking user interactions, keeping model informed of UI state changes.
+
+### Pattern 4: `sendMessage` for Triggering Model Actions
+Sends a visible message in the conversation that triggers the model to respond and take action.
+
+```tsx
+const onSave = () => {
+  app.sendMessage({ 
+    text: `Please save these changes:\n${getChangesSummary()}` 
+  });
+};
+```
+
+**Use for:** Explicit save/submit actions, batch syncing changes to server via the model.
+
+### Pattern 5: App-Only Tools (`visibility: ["app"]`)
+Tools hidden from the model, only callable from the UI. Useful for UI-specific server operations.
+
+```tsx
+_meta: { ui: { resourceUri, visibility: ["app"] } }
+```
+
+**Use for:** Polling, refresh buttons, pagination controls, form submissions.
+
+## Hybrid Interactivity (Recommended for Write Operations)
+
+For operations that need to persist to a server (creating invoices, updating deals, etc.), use capability detection to choose the best path:
+
+```tsx
+function useSmartAction() {
+  const { app } = useMCPApp();
+  const canCallTools = !!app?.getHostCapabilities()?.serverTools;
+
+  const executeAction = async (toolName, args, description) => {
+    if (canCallTools) {
+      // Direct path: call server tool immediately
+      try {
+        return await app.callServerTool({ name: toolName, arguments: args });
+      } catch (err) {
+        // Fallback on failure
+        app.updateModelContext({ text: `Action failed, please retry: ${description}` });
+      }
+    } else {
+      // Fallback: track locally, auto-save via sendMessage after debounce
+      trackChange({ toolName, args, description });
+      app.updateModelContext({ text: `User action: ${description}` });
+    }
+  };
+
+  return { executeAction, canCallTools };
+}
+```
+
+**Flow on hosts WITH `callServerTool`:** button click → server call → instant result
+**Flow on hosts WITHOUT `callServerTool`:** button click → local state update → auto-save after 3s idle → model executes writes
+
+## Host Compatibility Matrix
+
+As of **2026-01-26** (MCP Apps v1.0.1 — first official stable release):
+
+| Host | Renders UI | `callServerTool` | `updateModelContext` | `sendMessage` | Transport |
+|------|-----------|-------------------|----------------------|---------------|-----------|
+| Claude Desktop | ✅ | ✅ | ✅ | ✅ | stdio |
+| Claude Web | ✅ | ✅ | ✅ | ✅ | HTTP |
+| ChatGPT | ✅ | ✅ | ✅ | ✅ | HTTP |
+| VS Code Insiders | ✅ | ✅ | ✅ | ✅ | stdio |
+| Goose | ✅ | ⚠️ Partial | ✅ | ✅ | stdio/HTTP |
+| Postman | ✅ | ✅ | ✅ | ✅ | HTTP |
+| MCPJam | ✅ | ✅ | ✅ | ✅ | HTTP |
+| JetBrains IDEs | 🔜 Coming | — | — | — | stdio |
+
+**Rule:** Design for Pattern 1 (client-side state) first. Layer on `callServerTool` as progressive enhancement.
+**Transport rule:** Support BOTH stdio and HTTP in your server entry point — Claude Desktop and VS Code use stdio, web hosts use HTTP.
+
+## PostMessage Bridge Protocol
+
+MCP Apps use **JSON-RPC 2.0 over `window.postMessage`**:
+
+- App → Host: `{ jsonrpc: "2.0", id: N, method: "tools/call", params: { name, arguments } }`
+- Host → App: `{ jsonrpc: "2.0", id: N, result: { content: [...] } }`
+- Validates `event.source` (Window identity, NOT origin string)
+- Invalid messages are silently dropped (Zod validation)
+
+**Key requirements for `callServerTool` to work:**
+1. Host must declare `serverTools: {}` in `hostCapabilities` during `ui/initialize`
+2. Host must register `oncalltool` handler on `AppBridge`
+3. Host must relay `tools/call` requests to the real MCP server
+
+**Known failure modes:**
+- `srcdoc` iframes get origin `"null"` — use `document.write()` instead
+- Init handshake (`ui/initialize`) never completing — `callServerTool` hangs forever
+- Double-iframe `event.source` mismatch — messages silently dropped
+
+## Quick Start Decision Tree
+
+### Framework Selection
+
+| Framework | SDK Support | Best For |
+|-----------|-------------|----------|
+| React | `useApp` hook provided | Teams familiar with React |
+| Vanilla JS | Manual lifecycle | Simple apps, no build complexity |
+| Vue/Svelte/Preact/Solid | Manual lifecycle | Framework preference |
+
+### Project Structure (Multi-App Server)
+
+For servers with many views (like a CRM), share components across apps:
+
+```
+src/
+├── components/          # Shared component library
+│   ├── layout/          # PageHeader, Card, SplitLayout, StatsGrid, Section
+│   ├── data/            # DataTable, KanbanBoard, MetricCard, Timeline, etc.
+│   ├── charts/          # BarChart, LineChart, PieChart, FunnelChart
+│   ├── interactive/     # ContactPicker, InvoiceBuilder, FormGroup
+│   └── shared/          # ActionButton, SearchBar, Toast, Modal
+├── hooks/               # useCallTool, useSmartAction, useHostCapabilities
+├── styles/              # base.css, interactive.css
+├── apps/                # Individual app entry points
+│   ├── contact-grid/    # Contact list app
+│   │   ├── App.tsx
+│   │   ├── index.html
+│   │   └── vite.config.ts
+│   ├── pipeline-board/  # Kanban board app
+│   │   ├── App.tsx
+│   │   ├── index.html
+│   │   └── vite.config.ts
+│   └── invoice-preview/ # Invoice view app
+│       ├── App.tsx
+│       ├── index.html
+│       └── vite.config.ts
+├── server.ts            # MCP server with all tools + resources
+└── package.json
+```
+
+Each app has its own `vite.config.ts` with `vite-plugin-singlefile`, outputting to `dist/app-ui/{app-name}.html`.
+
+## Getting Reference Code
+
+**SDK Version:** `@modelcontextprotocol/ext-apps` v1.0.1 (Stable spec: 2026-01-26)
+**Spec:** [SEP-1865](https://github.com/modelcontextprotocol/modelcontextprotocol/pull/1865) — first official MCP extension
+
+Clone the SDK repository for working examples and API documentation:
+
+```bash
+git clone --branch "v$(npm view @modelcontextprotocol/ext-apps version)" --depth 1 https://github.com/modelcontextprotocol/ext-apps.git /tmp/mcp-ext-apps
+```
+
+### Framework Templates
+
+Learn and adapt from `/tmp/mcp-ext-apps/examples/basic-server-{framework}/`:
+
+| Template | Key Files |
+|----------|-----------|
+| `basic-server-vanillajs/` | `server.ts`, `src/mcp-app.ts`, `mcp-app.html` |
+| `basic-server-react/` | `server.ts`, `src/mcp-app.tsx` (uses `useApp` hook) |
+| `basic-server-vue/` | `server.ts`, `src/App.vue` |
+| `basic-server-svelte/` | `server.ts`, `src/App.svelte` |
+| `basic-server-preact/` | `server.ts`, `src/mcp-app.tsx` |
+| `basic-server-solid/` | `server.ts`, `src/mcp-app.tsx` |
+
+### API Reference (Source Files)
+
+Read JSDoc documentation directly from `/tmp/mcp-ext-apps/src/`:
+
+| File | Contents |
+|------|----------|
+| `src/app.ts` | `App` class, handlers, lifecycle |
+| `src/server/index.ts` | `registerAppTool`, `registerAppResource`, visibility |
+| `src/spec.types.ts` | All types: `McpUiHostContext`, CSS variables, display modes |
+| `src/styles.ts` | `applyDocumentTheme`, `applyHostStyleVariables`, `applyHostFonts` |
+| `src/react/useApp.tsx` | `useApp` hook for React apps |
+| `src/react/useHostStyles.ts` | `useHostStyles`, `useHostStyleVariables`, `useHostFonts` |
+
+### Advanced Examples
+
+| Example | Pattern Demonstrated |
+|---------|---------------------|
+| `examples/shadertoy-server/` | Streaming partial input + visibility-based pause/play |
+| `examples/wiki-explorer-server/` | `callServerTool` for interactive data fetching |
+| `examples/system-monitor-server/` | Polling pattern with interval management |
+| `examples/video-resource-server/` | Binary/blob resources |
+| `examples/sheet-music-server/` | `ontoolinput` - processing args before execution |
+| `examples/threejs-server/` | `ontoolinputpartial` - streaming/progressive rendering |
+| `examples/map-server/` | `updateModelContext` - keeping model informed of UI state |
+| `examples/transcript-server/` | `updateModelContext` + `sendMessage` - background updates + user messages |
+| `examples/cohort-heatmap-server/` | Complex data visualization (heatmap grid) |
+| `examples/scenario-modeler-server/` | Multi-parameter interactive modeling |
+| `examples/budget-allocator-server/` | Form with interdependent calculated fields |
+| `examples/customer-segmentation-server/` | Data filtering + visualization combo |
+| `examples/pdf-server/` | Document rendering in iframe |
+| `examples/qr-server/` | Python MCP server (non-TypeScript example) |
+| `examples/say-server/` | Simple demo — minimal MCP App |
+| `examples/quickstart/` | Official quickstart tutorial (start here) |
+| `examples/basic-host/` | Reference host implementation using `AppBridge` |
+
+## Critical Implementation Notes
+
+### Adding Dependencies
+
+```bash
+npm install @modelcontextprotocol/ext-apps @modelcontextprotocol/sdk express cors zod
+npm install -D typescript tsx vite vite-plugin-singlefile @types/express @types/cors @types/node concurrently cross-env
+```
+
+### Server-Side Registration (Official Pattern — v1.0.1)
+
+**ALWAYS use `registerAppTool()` and `registerAppResource()`** from `@modelcontextprotocol/ext-apps/server`. Do NOT manually register tools and resources separately — the helpers handle proper metadata linkage, MIME types, and resource registration.
+
+```typescript
+// server.ts
+import {
+  registerAppTool,
+  registerAppResource,
+  RESOURCE_MIME_TYPE,
+} from "@modelcontextprotocol/ext-apps/server";
+import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
+import fs from "node:fs/promises";
+import path from "node:path";
+
+const DIST_DIR = path.join(import.meta.dirname, "dist");
+
+export function createServer(): McpServer {
+  const server = new McpServer({
+    name: "My MCP App Server",
+    version: "1.0.0",
+  });
+
+  const resourceUri = "ui://my-server/contact-grid.html";
+
+  // Register tool WITH UI metadata
+  registerAppTool(
+    server,
+    "view_contact_grid",
+    {
+      title: "Contact Grid",                              // Human-readable title
+      description: "Display contact search results",
+      inputSchema: { query: { type: "string" } },
+      _meta: { ui: { resourceUri } },                     // Links tool → resource
+    },
+    async (args) => {
+      const contacts = await fetchContacts(args.query);
+      return {
+        content: [{ type: "text", text: JSON.stringify(contacts) }],  // Text fallback
+      };
+    },
+  );
+
+  // Register resource that serves the bundled HTML
+  registerAppResource(
+    server,
+    resourceUri,                                          // URI to match
+    "contact-grid",                                       // Resource name
+    { mimeType: RESOURCE_MIME_TYPE },                     // ALWAYS use this constant
+    async () => {
+      const html = await fs.readFile(
+        path.join(DIST_DIR, "contact-grid.html"), "utf-8"
+      );
+      return {
+        contents: [{ uri: resourceUri, mimeType: RESOURCE_MIME_TYPE, text: html }],
+      };
+    },
+  );
+
+  return server;
+}
+```
+
+### Server Entry Point (HTTP + Stdio)
+
+Servers should support BOTH HTTP (for web/testing) and stdio (for Claude Desktop, VS Code):
+
+```typescript
+// main.ts
+import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
+import { StreamableHTTPServerTransport } from "@modelcontextprotocol/sdk/server/streamableHttp.js";
+import cors from "cors";
+import express from "express";
+import { createServer } from "./server.js";
+
+async function main() {
+  if (process.argv.includes("--stdio")) {
+    // Stdio transport — for Claude Desktop, VS Code
+    await createServer().connect(new StdioServerTransport());
+  } else {
+    // HTTP transport — for web hosts, testing
+    const port = parseInt(process.env.PORT ?? "3001", 10);
+    const app = express();
+    app.use(cors());
+    app.use(express.json());
+
+    app.all("/mcp", async (req, res) => {
+      const server = createServer();
+      const transport = new StreamableHTTPServerTransport({
+        sessionIdGenerator: undefined,
+      });
+      res.on("close", () => { transport.close(); server.close(); });
+      await server.connect(transport);
+      await transport.handleRequest(req, res, req.body);
+    });
+
+    app.listen(port, () => console.log(`MCP server: http://localhost:${port}/mcp`));
+  }
+}
+main().catch(console.error);
+```
+
+### Package Scripts
+
+```json
+"scripts": {
+  "build": "tsc --noEmit && vite build",
+  "start": "concurrently 'vite build --watch' 'tsx watch main.ts'",
+  "serve": "tsx main.ts",
+  "stdio": "tsx main.ts --stdio"
+}
+```
+
+### Handler Registration Order
+
+Register ALL handlers BEFORE calling `app.connect()`:
+
+```typescript
+const app = new App({ name: "My App", version: "1.0.0" });
+app.ontoolinput = (params) => { /* handle input */ };
+app.ontoolresult = (result) => { /* handle result */ };
+app.onhostcontextchanged = (ctx) => { /* handle context */ };
+app.onteardown = async () => { return {}; };
+await app.connect();
+```
+
+### Tool Visibility
+
+```typescript
+// Default: visible to both model and app
+_meta: { ui: { resourceUri, visibility: ["model", "app"] } }
+
+// UI-only (hidden from model) - for refresh, form submissions
+_meta: { ui: { resourceUri, visibility: ["app"] } }
+
+// Model-only (app cannot call)
+_meta: { ui: { resourceUri, visibility: ["model"] } }
+```
+
+### Content Security Policy (CSP) & Permissions
+
+If your app needs to load external resources (CDN scripts, map tiles, API endpoints) or access device capabilities (microphone, camera), declare them in `_meta.ui`:
+
+```typescript
+_meta: {
+  ui: {
+    resourceUri,
+    // Allow loading from specific external origins
+    csp: {
+      "script-src": ["https://cdn.example.com"],
+      "img-src": ["https://tiles.mapbox.com", "https://api.mapbox.com"],
+      "connect-src": ["https://api.example.com"],
+    },
+    // Request device permissions (host will prompt user for consent)
+    permissions: ["microphone", "camera"],
+  },
+}
+```
+
+**Default:** Apps run in a sandboxed iframe with NO external access. If you don't declare CSP, all external requests are blocked. Only declare what you actually need.
+
+### Host Styling Integration
+
+**React:**
+```typescript
+import { useApp, useHostStyles } from "@modelcontextprotocol/ext-apps/react";
+const { app } = useApp({ appInfo, capabilities, onAppCreated });
+useHostStyles(app);
+```
+
+**CSS variables available after applying:**
+```css
+.container {
+  background: var(--color-background-secondary);
+  color: var(--color-text-primary);
+  font-family: var(--font-sans);
+  border-radius: var(--border-radius-md);
+}
+```
+
+### Safe Area Handling
+
+```typescript
+app.onhostcontextchanged = (ctx) => {
+  if (ctx.safeAreaInsets) {
+    const { top, right, bottom, left } = ctx.safeAreaInsets;
+    document.body.style.padding = `${top}px ${right}px ${bottom}px ${left}px`;
+  }
+};
+```
+
+### Streaming Partial Input
+
+For large tool inputs, use `ontoolinputpartial` to show progress:
+
+```typescript
+app.ontoolinputpartial = (params) => {
+  const args = params.arguments; // Healed partial JSON - always valid
+  preview.textContent = JSON.stringify(args, null, 2);
+};
+app.ontoolinput = (params) => {
+  render(params.arguments); // Final complete input
+};
+```
+
+### Visibility-Based Resource Management
+
+Pause expensive operations when scrolled out of viewport:
+
+```typescript
+const observer = new IntersectionObserver((entries) => {
+  entries.forEach((entry) => {
+    if (entry.isIntersecting) animation.play();
+    else animation.pause();
+  });
+});
+observer.observe(document.querySelector(".main"));
+```
+
+### Fullscreen Mode
+
+```typescript
+app.onhostcontextchanged = (ctx) => {
+  if (ctx.availableDisplayModes?.includes("fullscreen")) {
+    fullscreenBtn.style.display = "block";
+  }
+  if (ctx.displayMode) {
+    container.classList.toggle("fullscreen", ctx.displayMode === "fullscreen");
+  }
+};
+
+async function toggleFullscreen() {
+  const result = await app.requestDisplayMode({ 
+    mode: currentMode === "fullscreen" ? "inline" : "fullscreen" 
+  });
+  currentMode = result.mode;
+}
+```
+
+## Common Mistakes to Avoid
+
+1. **Dynamic JSON tree rendering** — Use direct component composition per app instead
+2. **Replacing entire UI tree on every tool result** — Merge trees or use stable component structure
+3. **Relying on `callServerTool` for basic interactivity** — Use client-side state first
+4. **No capability detection** — Always check `app.getHostCapabilities()?.serverTools` before using `callServerTool`
+5. **Handlers after connect()** — Register ALL handlers BEFORE `app.connect()`
+6. **Missing single-file bundling** — Must use `vite-plugin-singlefile`
+7. **Forgetting resource registration** — Both tool AND resource must be registered
+8. **No text fallback** — Always provide `content` array for non-UI hosts
+9. **Hardcoded styles** — Use host CSS variables for theme integration
+10. **No streaming for large inputs** — Use `ontoolinputpartial` for progress
+11. **No timeout on `callServerTool`** — Always wrap in `Promise.race` with 5s timeout; degrade to read-only on failure (see Graceful Degradation section)
+12. **Sending `sendMessage` on every micro-edit** — Batch changes locally, submit once on explicit save action
+13. **Inconsistent status colors across apps** — Use the shared `StatusBadge` with standard green/yellow/red/blue convention
+14. **No dirty-state indicator on config UIs** — Users must know they have unsaved changes; use `useDirtyState` hook + `SaveBar` component
+
+## Testing
+
+### Using basic-host
+
+```bash
+# Terminal 1: Build and run your server
+npm run build && npm run serve
+
+# Terminal 2: Run basic-host
+cd /tmp/mcp-ext-apps/examples/basic-host
+npm install
+SERVERS='["http://localhost:3001/mcp"]' npm run start
+# Open http://localhost:8080
+```
+
+### Debug with sendLog
+
+```typescript
+await app.sendLog({ level: "info", data: "Debug message" });
+await app.sendLog({ level: "error", data: { error: err.message } });
+```
+
+### Standalone Testing
+
+Build and test each app as a standalone web page with hardcoded data before integrating with the MCP server. This catches rendering/interactivity bugs before the MCP lifecycle adds complexity:
+
+```bash
+# Add a dev mode that uses mock data
+VITE_DEV_MODE=true npm run dev
+```
+
+```tsx
+const data = import.meta.env.VITE_DEV_MODE 
+  ? MOCK_DATA 
+  : useToolResult();
+```
+
+---
+
+## Real-World Examples from Production Apps
+
+**Reference:** 11 GHL MCP Apps built in `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/ghl-mcp-apps-only/`
+
+### 1. Contact Grid (Data Table)
+**Tool:** `view_contact_grid`
+**Use case:** Display contact search results in sortable grid
+**Pattern:** Client-side sorting/filtering of initial dataset
+**Components:** DataTable with column headers, row selection, status badges
+**Data flow:** Send all contacts upfront → All interactions are local (React state)
+
+```typescript
+{
+  name: 'view_contact_grid',
+  description: 'Display contact search results in a data grid',
+  inputSchema: {
+    type: 'object',
+    properties: {
+      query: { type: 'string', description: 'Search query' },
+      limit: { type: 'number', description: 'Max results (default: 25)' },
+    },
+  },
+  _meta: {
+    ui: { resourceUri: 'ui://ghl/contact-grid' },
+  },
+}
+```
+
+### 2. Pipeline Board (Kanban)
+**Tool:** `view_pipeline_board`
+**Use case:** Visual sales pipeline with drag-drop
+**Pattern:** Hybrid — Client-side drag-drop + `updateModelContext` to track moves
+**Components:** KanbanBoard, OpportunityCard, StageColumn
+**Interactivity:** Drag opportunity → Update local state → Silently inform model via `updateModelContext`
+
+```typescript
+const onDragEnd = (opportunityId: string, newStageId: string) => {
+  // Update local state immediately
+  setOpportunities(prev => prev.map(opp =>
+    opp.id === opportunityId ? { ...opp, stageId: newStageId } : opp
+  ));
+  
+  // Inform model (no visible message)
+  app?.updateModelContext({
+    text: `User moved opportunity "${opp.name}" to stage "${newStage.name}"`
+  });
+};
+```
+
+### 3. Calendar View
+**Tool:** `view_calendar`
+**Use case:** Monthly appointment calendar
+**Pattern:** Client-side navigation between months
+**Components:** CalendarGrid, EventMarker, MonthNav
+**State:** Month/year navigation is purely client-side
+
+### 4. Opportunity Card (Detail View)
+**Tool:** `view_opportunity_card`
+**Use case:** Single opportunity detail card
+**Pattern:** Static display (no interactivity needed)
+**Components:** Card, LabeledField, StatusBadge, Timeline
+**Data:** All details sent upfront
+
+### 5. Invoice Preview
+**Tool:** `view_invoice`
+**Use case:** Invoice detail with line items
+**Pattern:** Static display
+**Components:** InvoiceHeader, LineItemTable, TotalsSummary
+**Layout:** Fixed header + scrollable line items + sticky totals
+
+### 6. Campaign Stats Dashboard
+**Tool:** `show_campaign_stats`
+**Use case:** Marketing campaign performance metrics
+**Pattern:** Client-side stat calculations
+**Components:** MetricCard, ProgressBar, TrendIndicator
+**Calculations:** CTR, conversion rate, ROI calculated in UI from raw data
+
+### 7. Agent Stats Dashboard
+**Tool:** `show_agent_stats`
+**Use case:** Agent performance leaderboard
+**Pattern:** Client-side sorting/ranking
+**Components:** LeaderboardTable, PerformanceBadge, RankIndicator
+**Interactivity:** Sort by different metrics (calls, revenue, close rate)
+
+### 8. Contact Timeline
+**Tool:** `view_contact_timeline`
+**Use case:** Activity feed for a contact
+**Pattern:** Static display with expandable items
+**Components:** TimelineItem, EventIcon, ExpandableDetails
+**Layout:** Vertical timeline with timestamps
+
+### 9. Workflow Status
+**Tool:** `view_workflow_status`
+**Use case:** Workflow execution progress
+**Pattern:** Hybrid — Display current state + optional refresh
+**Components:** WorkflowSteps, ProgressIndicator, StepStatus
+**Refresh:** Optional `callServerTool` to re-fetch if host supports it
+
+### 10. Quick Book (Appointment Booking)
+**Tool:** `show_quick_book`
+**Use case:** Embedded appointment booking form
+**Pattern:** Hybrid — Form state local, submission via `sendMessage`
+**Components:** DateTimePicker, ContactSelector, ServiceDropdown
+**Flow:** User fills form → Click submit → `sendMessage` with booking details → Model executes booking
+
+```typescript
+const onSubmit = () => {
+  app?.sendMessage({
+    text: `Please book this appointment:\nContact: ${contact.name}\nDate: ${selectedDate}\nService: ${service}`
+  });
+};
+```
+
+### 11. Dashboard (Multi-Widget)
+**Tool:** `view_dashboard`
+**Use case:** Overview dashboard with multiple widgets
+**Pattern:** Client-side layout with multiple components
+**Components:** PageHeader, MetricCard, RecentActivity, QuickActions
+**Layout:** Grid layout with responsive columns
+
+### 12. Estimate Builder (Complex Form with Calculations)
+**Tool:** `build_estimate`
+**Use case:** Multi-line estimate/quote builder with live price calculations
+**Pattern:** Client-side form state with derived calculations + `sendMessage` on submit
+**Components:** FormGroup, LineItemEditor, CalculatedTotal, TaxSelector
+**Interactivity:** Add/remove line items → recalculate subtotals, tax, total in real-time → Submit via model
+**Key lesson:** All math runs client-side — server only needed at final submission
+
+```typescript
+const [lineItems, setLineItems] = useState<LineItem[]>([]);
+const subtotal = useMemo(() => lineItems.reduce((sum, li) => sum + li.qty * li.price, 0), [lineItems]);
+const tax = subtotal * taxRate;
+const total = subtotal + tax;
+// Submit only when user clicks "Send Estimate"
+const onSubmit = () => app?.sendMessage({ text: `Create estimate:\n${JSON.stringify({ lineItems, total })}` });
+```
+
+### 13. Duplicate Checker (Comparison UI with Merge Actions)
+**Tool:** `check_duplicates`
+**Use case:** Side-by-side comparison of potential duplicate records with merge/dismiss
+**Pattern:** Client-side pair navigation + `sendMessage` for merge action
+**Components:** ComparisonCard, FieldDiffHighlight, MergeSelector, DismissButton
+**Interactivity:** Navigate pairs locally → Select winning fields per row → Submit merge decision via model
+**Key lesson:** Highlight field-level differences with color coding (green = match, yellow = conflict, red = missing)
+
+### 14. Media Library (Async Asset Grid)
+**Tool:** `view_media_library`
+**Use case:** Browsable grid of uploaded images/files with preview
+**Pattern:** Client-side grid with lazy thumbnail loading + `callServerTool` for pagination
+**Components:** AssetGrid, ThumbnailCard, PreviewModal, FilterBar, UploadDropzone
+**Interactivity:** Click thumbnail → expand preview modal (local) | Load more → `callServerTool` pagination
+**Key lesson:** Use `loading="lazy"` on `<img>` tags and intersection observer for progressive loading — don't load 200 thumbnails at once
+
+```typescript
+const loadPage = async (page: number) => {
+  if (!canCallTools) return; // Degrade: show "load more in chat" button
+  const result = await withTimeout(app.callServerTool({
+    name: "list_media", arguments: { offset: page * 50, limit: 50 }
+  }), 5000);
+  setAssets(prev => [...prev, ...result.assets]);
+};
+```
+
+### 15. Inventory Dashboard (Multi-Widget Composition)
+**Tool:** `view_inventory`
+**Use case:** Stock levels, low-stock alerts, category breakdown in one view
+**Pattern:** Client-side widget composition with independent state per widget
+**Components:** StockLevelGauge, LowStockAlert, CategoryBreakdownChart, ReorderQueue
+**Layout:** CSS Grid with responsive breakpoints — 3 columns on desktop, 1 on mobile
+**Key lesson:** Each widget manages its own state independently; parent only provides data. No cross-widget state coupling.
+
+### 16. Conversation Thread (Chat-Style Feed)
+**Tool:** `view_conversation`
+**Use case:** Message history displayed as chat bubbles with sender alignment
+**Pattern:** Static display with scroll-to-bottom + optional `callServerTool` for older messages
+**Components:** MessageBubble, SenderAvatar, TimestampDivider, AttachmentPreview
+**Layout:** Flex column with `flex-direction: column-reverse` for natural scroll behavior
+**Key lesson:** Distinguish inbound vs outbound messages via alignment (left/right) and color, not just labels
+
+### 17. Free Slots Finder (Interactive Scheduling)
+**Tool:** `find_free_slots`
+**Use case:** Display available time slots for booking, filterable by date/duration
+**Pattern:** Client-side date navigation + slot selection + `sendMessage` to book
+**Components:** DateStrip, SlotGrid, DurationFilter, SelectedSlotSummary
+**Interactivity:** Swipe dates (local) → Filter by duration (local) → Select slot → "Book This" sends to model
+**Key lesson:** Send a full week of slots upfront to enable instant date switching without server calls
+
+### 18. Custom Fields Manager (Configuration/Settings UI)
+**Tool:** `manage_custom_fields`
+**Use case:** CRUD interface for custom field definitions (add, reorder, edit types)
+**Pattern:** Client-side list management with drag-to-reorder + batch `sendMessage` save
+**Components:** FieldRow, TypeSelector, DragHandle, AddFieldButton, SaveBar
+**Interactivity:** All adds/edits/reorders happen locally → Sticky save bar appears with change count → Submit all changes at once
+**Key lesson:** Track a `dirty` flag and show unsaved changes indicator — users need to know they have pending edits
+
+```typescript
+const [fields, setFields] = useState(data.fields);
+const [originalFields] = useState(data.fields);
+const isDirty = JSON.stringify(fields) !== JSON.stringify(originalFields);
+// Sticky save bar only appears when isDirty
+```
+
+### 19. Pipeline Analytics (Visualization Dashboard)
+**Tool:** `view_pipeline_analytics`
+**Use case:** Funnel visualization, conversion rates, stage duration metrics
+**Pattern:** Client-side chart rendering with time-range selector
+**Components:** FunnelChart, ConversionRateCard, StageDurationBar, TimeRangeSelector
+**Interactivity:** Switch time ranges locally (7d/30d/90d) → Charts recalculate from full dataset
+**Key lesson:** Send raw data for all time ranges upfront, let the UI slice — avoids server roundtrips for every filter change
+
+---
+
+## Common Patterns from Real Apps (65 Production Apps)
+
+### Pattern: Static Display (No Interactivity)
+**Use for:** Detail views, invoices, timelines
+**Apps:** Opportunity Card, Invoice Preview, Contact Timeline
+**Approach:** Send all data upfront, render with zero client logic
+
+### Pattern: Client-Side Interactivity
+**Use for:** Sorting, filtering, navigation, local forms
+**Apps:** Contact Grid, Calendar View, Agent Stats
+**Approach:** All interactions via React state, no server calls
+
+### Pattern: Drag-Drop with Silent Sync
+**Use for:** Kanban boards, reordering
+**Apps:** Pipeline Board
+**Approach:** Update local state + `updateModelContext` (no visible message)
+
+### Pattern: Form + Submit via Model
+**Use for:** Booking, creating records
+**Apps:** Quick Book
+**Approach:** Local form state, `sendMessage` on submit, model handles server write
+
+### Pattern: Dashboard with Multiple Widgets
+**Use for:** Overview screens, analytics
+**Apps:** Dashboard, Campaign Stats, Inventory Dashboard, Revenue Dashboard, Location Dashboard
+**Approach:** Grid layout, each widget self-contained, calculations client-side
+
+### Pattern: Complex Form Builder
+**Use for:** Creating/editing multi-field records with calculations
+**Apps:** Estimate Builder, Invoice Builder, Contact Creator, Message Composer, Social Post Composer
+**Approach:** All form state + derived calculations are client-side; `sendMessage` only on final submit. Show live totals/previews as user edits.
+
+### Pattern: Comparison / Deduplication
+**Use for:** Side-by-side record comparison, merge decisions
+**Apps:** Duplicate Checker
+**Approach:** Present pairs with field-level diff highlighting. User selects winning values locally, submits merge decision via model.
+
+### Pattern: Asset Grid / Gallery
+**Use for:** Browsable collections of images, files, templates
+**Apps:** Media Library, Template Library
+**Approach:** Lazy-loading grid with thumbnail cards. Preview modal on click (local). Pagination via `callServerTool` as progressive enhancement.
+
+### Pattern: Master → Detail Navigation
+**Use for:** List views that link to detail views
+**Apps:** Company List → Company Detail, Product Catalog → Product Detail, Funnel List → Funnel Detail, Course Catalog → Course Detail, Order List → Order Detail, Invoice List → Invoice Preview
+**Approach:** List view sends all summary data upfront; clicking an item either expands inline (local state) or triggers a new tool call for the detail view via `sendMessage`.
+
+### Pattern: Analytics / Visualization
+**Use for:** Charts, funnels, conversion tracking
+**Apps:** Pipeline Analytics, Pipeline Funnel, Revenue Dashboard, Reviews Dashboard
+**Approach:** Send raw data for all time ranges/filters upfront. All chart rendering, time-range switching, and metric calculations happen client-side. Avoid server calls for filter changes.
+
+### Pattern: Configuration / Settings
+**Use for:** Managing field definitions, tags, accounts, team members
+**Apps:** Custom Fields Manager, Tags Manager, Social Accounts, Subscription Manager, Team Management
+**Approach:** Local CRUD with dirty-state tracking. Sticky save bar with change count. Batch submit all changes via single `sendMessage`.
+
+### Pattern: Feed / Conversation
+**Use for:** Chat history, activity logs, notification streams
+**Apps:** Conversation List, Conversation Thread, Message Detail, Call Log
+**Approach:** Chronological display with sender differentiation. Use `flex-direction: column-reverse` for auto-scroll-to-bottom. Lazy-load older messages via `callServerTool` if supported.
+
+### Pattern: Interactive Scheduling
+**Use for:** Time slot selection, calendar-based booking
+**Apps:** Free Slots Finder, Calendar Resources, Social Calendar, Appointment Booker
+**Approach:** Send a full week/month of slots upfront for instant navigation. Date/duration filtering is local. Selection triggers `sendMessage` to book via model.
+
+---
+
+## Lessons Learned from 65 Production Apps
+
+### 1. Send All Data Upfront When Possible
+**Why:** Avoids host compatibility issues, works everywhere
+**Pattern:** Contact Grid sends all 25 results → All sorting/filtering is local
+**Extended:** Estimate Builder sends tax rates, product list, and customer info upfront — all calculations are instant
+
+### 2. Use `updateModelContext` for Silent Tracking
+**Why:** Keeps model informed without cluttering chat
+**Pattern:** Pipeline Board silently tracks every drag-drop move
+**Extended:** Custom Fields Manager tracks all add/edit/reorder/delete actions silently until explicit save
+
+### 3. Reserve `sendMessage` for Explicit Actions
+**Why:** Visible messages should be intentional user requests
+**Pattern:** Quick Book only sends message when user clicks "Book Appointment"
+**Extended:** Estimate Builder, Invoice Builder, Contact Creator all follow this — form state is local, submission is explicit
+
+### 4. Static Views Are Valid (Not Everything Needs Buttons)
+**Why:** Sometimes you just need to display data beautifully
+**Pattern:** Invoice Preview, Opportunity Card are pure display
+**Extended:** Call Detail, Order Detail, User Detail, Estimate Preview — roughly 20% of all 65 apps are pure static display
+
+### 5. Avoid Premature `callServerTool` Optimization
+**Why:** Not all hosts support it, adds complexity
+**Pattern:** Build client-side first, layer on `callServerTool` for refresh/pagination only if needed
+**Extended:** Only ~5 of 65 apps actually need `callServerTool` (Media Library pagination, Conversation Thread history loading). The other 60 work perfectly with upfront data.
+
+### 6. Shared Component Library = Consistency Win
+**Why:** Reusable UI components across all 65 apps
+**Components:** See **Shared Component Catalog** section below
+**Location:** Shared `components/` directory imported by all apps
+
+### 7. Inline HTML Works Great for Simple Apps
+**Why:** For apps under 200 lines, skip React and use vanilla HTML
+**Pattern:** Several GHL apps use inline HTML with minimal JavaScript
+**Benefits:** Zero build step, instant preview, easy to debug
+
+### 8. Track Dirty State for Settings/Config UIs
+**Why:** Users need to know they have unsaved changes
+**Pattern:** Custom Fields Manager, Tags Manager show a sticky save bar with change count when edits are pending
+**Implementation:** Compare current state to original snapshot; show "X unsaved changes" indicator
+
+### 9. Batch Changes, Don't Spam the Model
+**Why:** Sending a `sendMessage` for every micro-edit floods the conversation
+**Pattern:** Custom Fields Manager, Team Management collect all changes locally → single batch submit
+**Anti-pattern:** ╳ Sending `sendMessage` on every field edit, every drag, every toggle
+
+### 10. Master-Detail Can Be One App or Two
+**Why:** Some detail views are complex enough to warrant separate apps
+**Decision:** If detail view is <100 lines → expand inline (Company List with accordion). If detail view is >100 lines or has its own interactivity → separate app (Invoice List → Invoice Preview)
+**Pattern:** 6 master-detail pairs in the 65 apps, split roughly 50/50 between inline and separate
+
+### 11. Pre-Calculate ALL Time Ranges
+**Why:** Users expect instant filter switching; server roundtrips feel broken
+**Pattern:** Pipeline Analytics, Revenue Dashboard send raw data for 7d/30d/90d/all → UI slices and re-renders charts locally
+**Anti-pattern:** ╳ Calling `callServerTool` every time user switches from "7 days" to "30 days"
+
+### 12. Color-Code Status Consistently Across Apps
+**Why:** Users build muscle memory for what green/yellow/red mean
+**Convention used across 65 apps:**
+- Green: active, complete, paid, healthy
+- Yellow/amber: pending, in-progress, due soon
+- Red: overdue, failed, critical, inactive
+- Blue: informational, new, neutral
+
+---
+
+## Shared Component Catalog
+
+Reusable components available across all apps. Import from `../components/` when building new apps.
+
+### Layout Components (`components/layout/`)
+| Component | Purpose | Used In |
+|-----------|---------|---------|
+| `PageHeader` | Title bar with optional subtitle, actions, breadcrumbs | Nearly all apps |
+| `Card` | Bordered container with optional header/footer | Detail views, dashboard widgets |
+| `SplitLayout` | Two-panel side-by-side layout (list + detail) | Duplicate Checker, Master-Detail pairs |
+| `StatsGrid` | Responsive grid for metric cards (auto 1-4 columns) | All dashboard/analytics apps |
+| `Section` | Collapsible section with header | Settings UIs, long forms |
+| `StickyFooter` | Fixed bottom bar for save/submit actions | Form builders, config UIs |
+
+### Data Components (`components/data/`)
+| Component | Purpose | Used In |
+|-----------|---------|---------|
+| `DataTable` | Sortable, filterable table with column headers | Contact Grid, Invoice List, Order List, Transaction List |
+| `KanbanBoard` | Drag-drop column board | Pipeline Kanban, Task Board |
+| `MetricCard` | Single stat with label, value, trend indicator | All dashboards |
+| `Timeline` | Vertical chronological event list | Contact Timeline, Workflow Status |
+| `StatusBadge` | Colored pill badge (green/yellow/red/blue) | Everywhere — status display |
+| `LeaderboardTable` | Ranked table with position indicators | Agent Stats |
+| `ComparisonCard` | Side-by-side field comparison with diff highlighting | Duplicate Checker |
+| `FieldDiffHighlight` | Color-coded field match/conflict/missing indicator | Duplicate Checker |
+
+### Chart Components (`components/charts/`)
+| Component | Purpose | Used In |
+|-----------|---------|---------|
+| `BarChart` | Horizontal/vertical bar chart | Campaign Stats, Agent Stats |
+| `LineChart` | Time-series line chart | Revenue Dashboard, Pipeline Analytics |
+| `PieChart` | Pie/donut chart | Inventory Dashboard, Category breakdowns |
+| `FunnelChart` | Conversion funnel visualization | Pipeline Funnel, Pipeline Analytics |
+| `ProgressBar` | Horizontal progress indicator | Campaign Stats, Workflow Status |
+| `TrendIndicator` | Up/down arrow with percentage change | MetricCard companion |
+| `StockLevelGauge` | Fill-level indicator (0-100%) | Inventory Dashboard |
+
+### Interactive Components (`components/interactive/`)
+| Component | Purpose | Used In |
+|-----------|---------|---------|
+| `ContactPicker` | Searchable contact selector dropdown | Quick Book, Message Composer |
+| `InvoiceBuilder` | Line item editor with add/remove/reorder | Invoice Builder, Estimate Builder |
+| `FormGroup` | Label + input + validation error display | All form apps |
+| `DateTimePicker` | Date and time selection | Quick Book, Free Slots Finder |
+| `DurationFilter` | Duration range selector (15min/30min/1hr) | Free Slots Finder |
+| `DateStrip` | Horizontal scrollable date selector | Free Slots Finder, Social Calendar |
+| `SlotGrid` | Time slot grid with selection state | Free Slots Finder, Calendar Resources |
+| `LineItemEditor` | Add/remove/edit rows with calculated totals | Estimate Builder, Invoice Builder |
+| `TypeSelector` | Dropdown for field type selection | Custom Fields Manager |
+| `DragHandle` | Drag affordance for reorderable lists | Custom Fields Manager, Task Board |
+
+### Shared Components (`components/shared/`)
+| Component | Purpose | Used In |
+|-----------|---------|---------|
+| `ActionButton` | Primary/secondary/danger button variants | All interactive apps |
+| `SearchBar` | Debounced search input with clear button | Contact Grid, Media Library, Smartlist Viewer |
+| `Toast` | Temporary notification popup | Form submissions, error feedback |
+| `Modal` | Overlay dialog with backdrop | Media Library preview, confirmation dialogs |
+| `EmptyState` | Illustration + message when no data | All list/grid apps |
+| `LoadingSpinner` | Consistent loading indicator | Apps using `callServerTool` |
+| `SaveBar` | Sticky bar showing "X unsaved changes" + Save/Discard | Config UIs (Custom Fields, Tags Manager) |
+| `FilterBar` | Horizontal filter chips/dropdowns | Media Library, Smartlist Viewer, List apps |
+| `ThumbnailCard` | Image card with overlay info | Media Library, Template Library |
+| `MessageBubble` | Chat-style message with sender alignment | Conversation Thread |
+| `TimestampDivider` | "Today" / "Yesterday" divider in feeds | Conversation Thread, Call Log |
+
+### Hooks (`hooks/`)
+| Hook | Purpose |
+|------|---------|
+| `useCallTool` | Wrapper for `callServerTool` with loading/error state |
+| `useSmartAction` | Capability-detected action dispatch (direct vs fallback) |
+| `useHostCapabilities` | Read host capabilities once on mount |
+| `useDirtyState` | Track original vs current state, expose `isDirty` and `changeCount` |
+| `useDebounce` | Debounce value changes (search input, auto-save) |
+| `useLazyLoad` | Intersection observer for lazy loading grid items |
+
+---
+
+## Graceful Degradation & Timeout Strategy
+
+### `callServerTool` Timeout (MANDATORY)
+
+If `ui/initialize` never completes or `callServerTool` hangs, apps must degrade gracefully within 5 seconds — not spin forever.
+
+```typescript
+// hooks/useCallTool.ts
+function withTimeout<T>(promise: Promise<T>, ms: number): Promise<T> {
+  return Promise.race([
+    promise,
+    new Promise<never>((_, reject) =>
+      setTimeout(() => reject(new Error(`callServerTool timed out after ${ms}ms`)), ms)
+    ),
+  ]);
+}
+
+function useCallTool() {
+  const { app } = useMCPApp();
+  const canCallTools = !!app?.getHostCapabilities()?.serverTools;
+
+  const callTool = async (name: string, args: Record<string, unknown>) => {
+    if (!canCallTools) return { ok: false, reason: 'unsupported' as const };
+    try {
+      const result = await withTimeout(
+        app!.callServerTool({ name, arguments: args }),
+        5000 // 5 second hard timeout
+      );
+      return { ok: true, data: result };
+    } catch (err) {
+      return { ok: false, reason: 'timeout' as const, error: err };
+    }
+  };
+
+  return { callTool, canCallTools };
+}
+```
+
+### Degradation Tiers
+
+| Tier | Condition | Behavior |
+|------|-----------|----------|
+| Full | Host supports `callServerTool` + responds in <5s | All features enabled |
+| Read-Only | `callServerTool` times out or errors | Display data from initial `ontoolresult` only; disable pagination/refresh; show "Data loaded at [time]" |
+| Fallback | Host doesn't support `callServerTool` at all | Same as Read-Only; show "Load more in chat" button that uses `sendMessage` |
+| Text-Only | Host doesn't render UI | Return `content` array with formatted text (ALWAYS provide this) |
+
+### Init Handshake Timeout
+
+If `ui/initialize` hasn't completed within 3 seconds, assume limited host and proceed in read-only mode:
+
+```typescript
+const [initComplete, setInitComplete] = useState(false);
+const [timedOut, setTimedOut] = useState(false);
+
+useEffect(() => {
+  const timer = setTimeout(() => {
+    if (!initComplete) setTimedOut(true);
+  }, 3000);
+  return () => clearTimeout(timer);
+}, [initComplete]);
+
+// In render:
+if (timedOut && !initComplete) {
+  return <ReadOnlyView data={data} notice="Running in read-only mode" />;
+}
+```
+
+### Text Fallback (ALWAYS required)
+
+Every tool MUST return a `content` array with meaningful text, even when a UI resource exists. Non-UI hosts (CLI tools, API consumers) only see this:
+
+```typescript
+return {
+  content: [
+    { type: 'text', text: `Found ${contacts.length} contacts matching "${query}":\n${contacts.map(c => `- ${c.name} (${c.email})`).join('\n')}` }
+  ],
+  _meta: { ui: { resourceUri: 'ui://ghl/contact-grid' } },
+};
+```
+
+---
+
+## Reference Implementations
+
+**Full source code (65 apps):**
+- `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/GoHighLevel-MCP/src/ui/react-app/src/apps/`
+- 65 complete apps across 14 pattern categories
+- Shared component library (`components/`)
+- Shared hooks library (`hooks/`)
+- Build scripts for copying HTML to dist/
+
+**Standalone app reference (11 apps with structuredContent):**
+- `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/ghl-mcp-apps-only/`
+
+**Server integration:**
+- `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/GoHighLevel-MCP/src/apps/index.ts`
+- MCPAppsManager class pattern
+- Resource handler registration
+- Tool handler implementation
+
+Use these as templates when building new MCP Apps.
diff --git a/skills/mcp-deployment/SKILL.md b/skills/mcp-deployment/SKILL.md
new file mode 100644
index 0000000..da5570c
--- /dev/null
+++ b/skills/mcp-deployment/SKILL.md
@@ -0,0 +1,885 @@
+# MCP Deployment & Distribution
+
+**When to use this skill:** Packaging and distributing MCP servers. Use when preparing servers for production, Docker containers, Railway deployment, or GitHub publishing.
+
+**What this covers:** Deployment patterns from 30+ production MCP servers including Docker, Railway, npm publishing, and GitHub repository setup.
+
+---
+
+## 1. Deployment Overview
+
+### Common Deployment Targets
+
+1. **Local (Claude Desktop)** — Development + personal use
+2. **Docker Container** — Portable, isolated environment
+3. **Railway.app** — Hosted deployment (for web-accessible MCPs)
+4. **npm Registry** — Public distribution
+5. **GitHub** — Source code + documentation
+
+---
+
+## 2. Local Deployment (Claude Desktop)
+
+### Standard Configuration
+
+**Location:** `~/Library/Application Support/Claude/claude_desktop_config.json` (macOS)
+
+```json
+{
+  "mcpServers": {
+    "myservice": {
+      "command": "node",
+      "args": [
+        "/absolute/path/to/mcp-server-myservice/dist/index.js"
+      ],
+      "env": {
+        "MY_SERVICE_API_KEY": "your_api_key_here",
+        "MY_SERVICE_API_SECRET": "your_secret_here"
+      }
+    }
+  }
+}
+```
+
+**Key points:**
+- Use absolute paths for `args`
+- Environment variables in `env` object
+- Server name (`myservice`) appears in Claude Desktop
+- Restart Claude Desktop after config changes
+
+### Alternative: npx Installation
+
+```json
+{
+  "mcpServers": {
+    "myservice": {
+      "command": "npx",
+      "args": ["-y", "mcp-server-myservice"],
+      "env": {
+        "MY_SERVICE_API_KEY": "your_api_key_here"
+      }
+    }
+  }
+}
+```
+
+**Requires:**
+- Package published to npm
+- `bin` field in package.json pointing to executable
+
+---
+
+## 3. Docker Containerization
+
+### Dockerfile Template
+
+```dockerfile
+# Multi-stage build for smaller final image
+FROM node:20-alpine AS builder
+
+# Set working directory
+WORKDIR /app
+
+# Copy package files
+COPY package*.json ./
+
+# Install dependencies
+RUN npm ci
+
+# Copy source code
+COPY . .
+
+# Build TypeScript
+RUN npm run build
+
+# Production stage
+FROM node:20-alpine
+
+WORKDIR /app
+
+# Copy package files
+COPY package*.json ./
+
+# Install production dependencies only
+RUN npm ci --production
+
+# Copy built files from builder
+COPY --from=builder /app/dist ./dist
+
+# Copy UI files (if using MCP Apps)
+COPY --from=builder /app/dist/app-ui ./dist/app-ui
+
+# Expose port (if using HTTP transport)
+# EXPOSE 3000
+
+# Set environment variable defaults
+ENV NODE_ENV=production
+
+# Run the MCP server
+CMD ["node", "dist/index.js"]
+```
+
+**Key features:**
+- Multi-stage build → Smaller final image
+- `npm ci` → Faster, more reliable than `npm install`
+- `--production` → Excludes devDependencies
+- `node:20-alpine` → Lightweight base image
+
+### .dockerignore
+
+```
+node_modules
+dist
+.env
+.git
+.gitignore
+*.md
+npm-debug.log
+```
+
+**Why:** Prevents unnecessary files from being copied into image
+
+### Build & Run
+
+```bash
+# Build image
+docker build -t mcp-server-myservice .
+
+# Run container
+docker run -it --rm \
+  -e MY_SERVICE_API_KEY=your_key \
+  -e MY_SERVICE_API_SECRET=your_secret \
+  mcp-server-myservice
+
+# Run with env file
+docker run -it --rm \
+  --env-file .env \
+  mcp-server-myservice
+```
+
+### Docker Compose (Optional)
+
+```yaml
+# docker-compose.yml
+version: '3.8'
+
+services:
+  mcp-server:
+    build: .
+    environment:
+      - MY_SERVICE_API_KEY=${MY_SERVICE_API_KEY}
+      - MY_SERVICE_API_SECRET=${MY_SERVICE_API_SECRET}
+    restart: unless-stopped
+```
+
+```bash
+# Run with docker-compose
+docker-compose up -d
+```
+
+---
+
+## 4. Railway Deployment
+
+### railway.json
+
+```json
+{
+  "$schema": "https://railway.app/railway.schema.json",
+  "build": {
+    "builder": "NIXPACKS",
+    "buildCommand": "npm run build"
+  },
+  "deploy": {
+    "startCommand": "node dist/index.js",
+    "restartPolicyType": "ON_FAILURE",
+    "restartPolicyMaxRetries": 10
+  }
+}
+```
+
+**Key fields:**
+- `buildCommand` → Compile TypeScript
+- `startCommand` → Run compiled server
+- `restartPolicyType` → Auto-restart on failure
+
+### Environment Variables
+
+**In Railway Dashboard:**
+1. Go to project → Variables
+2. Add all required environment variables:
+   - `MY_SERVICE_API_KEY`
+   - `MY_SERVICE_API_SECRET`
+   - `NODE_ENV=production`
+
+### Deployment Commands
+
+```bash
+# Install Railway CLI
+npm install -g @railway/cli
+
+# Login
+railway login
+
+# Link to project
+railway link
+
+# Deploy
+railway up
+
+# View logs
+railway logs
+```
+
+### railway.toml (Alternative)
+
+```toml
+[build]
+builder = "NIXPACKS"
+buildCommand = "npm ci && npm run build"
+
+[deploy]
+startCommand = "node dist/index.js"
+restartPolicyType = "ON_FAILURE"
+restartPolicyMaxRetries = 10
+
+[env]
+NODE_ENV = "production"
+```
+
+---
+
+## 5. npm Publishing
+
+### package.json Configuration
+
+```json
+{
+  "name": "mcp-server-myservice",
+  "version": "1.0.0",
+  "description": "MCP server for MyService integration",
+  "type": "module",
+  "main": "dist/index.js",
+  "bin": {
+    "mcp-server-myservice": "dist/index.js"
+  },
+  "files": [
+    "dist",
+    "README.md",
+    "LICENSE"
+  ],
+  "keywords": [
+    "mcp",
+    "mcp-server",
+    "model-context-protocol",
+    "myservice",
+    "claude-desktop"
+  ],
+  "author": "Your Name <your.email@example.com>",
+  "license": "MIT",
+  "repository": {
+    "type": "git",
+    "url": "https://github.com/yourusername/mcp-server-myservice.git"
+  },
+  "bugs": {
+    "url": "https://github.com/yourusername/mcp-server-myservice/issues"
+  },
+  "homepage": "https://github.com/yourusername/mcp-server-myservice#readme"
+}
+```
+
+**Key fields:**
+- `bin` → Makes package executable via `npx`
+- `files` → Only include necessary files in package
+- `keywords` → Helps with npm search
+- `repository` → Links to GitHub
+
+### .npmignore
+
+```
+src
+*.ts
+tsconfig.json
+.env
+.env.example
+node_modules
+.git
+.DS_Store
+```
+
+**Why:** Prevents source files from being published (only `dist/` is needed)
+
+### Publishing Workflow
+
+```bash
+# 1. Ensure you're logged in to npm
+npm login
+
+# 2. Build the project
+npm run build
+
+# 3. Test locally before publishing
+npm pack
+# This creates a .tgz file - inspect it to verify contents
+
+# 4. Publish to npm
+npm publish
+
+# For scoped packages (e.g., @yourorg/mcp-server-myservice)
+npm publish --access public
+```
+
+### Versioning
+
+```bash
+# Patch release (1.0.0 -> 1.0.1)
+npm version patch
+
+# Minor release (1.0.0 -> 1.1.0)
+npm version minor
+
+# Major release (1.0.0 -> 2.0.0)
+npm version major
+
+# Then publish
+npm publish
+```
+
+---
+
+## 6. GitHub Repository Setup
+
+### File Structure
+
+```
+mcp-server-myservice/
+├── .github/
+│   └── workflows/
+│       ├── build.yml           # CI/CD
+│       └── publish.yml         # npm publish automation
+├── src/
+│   └── index.ts
+├── dist/                        # gitignored
+├── .env.example                 # Template for env vars
+├── .gitignore
+├── .npmignore
+├── Dockerfile
+├── docker-compose.yml
+├── railway.json
+├── package.json
+├── tsconfig.json
+├── README.md
+├── LICENSE
+└── CHANGELOG.md
+```
+
+### .gitignore
+
+```
+# Dependencies
+node_modules/
+
+# Build output
+dist/
+
+# Environment variables
+.env
+.env.local
+
+# Logs
+*.log
+npm-debug.log*
+
+# OS files
+.DS_Store
+Thumbs.db
+
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+
+# Misc
+.cache/
+```
+
+### README.md Template
+
+```markdown
+# MCP Server for MyService
+
+MCP (Model Context Protocol) server integration for MyService. Enables Claude Desktop to interact with MyService API.
+
+## Features
+
+- ✅ List and search contacts
+- ✅ Get contact details
+- ✅ Create and update contacts
+- ✅ View dashboard metrics
+- ✅ Rich UI components (contact grid, dashboard)
+
+## Installation
+
+### Claude Desktop
+
+Add to your `claude_desktop_config.json`:
+
+\`\`\`json
+{
+  "mcpServers": {
+    "myservice": {
+      "command": "npx",
+      "args": ["-y", "mcp-server-myservice"],
+      "env": {
+        "MY_SERVICE_API_KEY": "your_api_key_here"
+      }
+    }
+  }
+}
+\`\`\`
+
+### Manual Installation
+
+\`\`\`bash
+git clone https://github.com/yourusername/mcp-server-myservice.git
+cd mcp-server-myservice
+npm install
+npm run build
+\`\`\`
+
+Add to `claude_desktop_config.json`:
+
+\`\`\`json
+{
+  "mcpServers": {
+    "myservice": {
+      "command": "node",
+      "args": ["/absolute/path/to/mcp-server-myservice/dist/index.js"],
+      "env": {
+        "MY_SERVICE_API_KEY": "your_api_key_here"
+      }
+    }
+  }
+}
+\`\`\`
+
+## Configuration
+
+### Required Environment Variables
+
+- `MY_SERVICE_API_KEY` — Your MyService API key ([get one here](https://myservice.com/api-keys))
+- `MY_SERVICE_API_SECRET` — Your MyService API secret (optional)
+
+### Optional Environment Variables
+
+- `MY_SERVICE_BASE_URL` — Override API base URL (default: `https://api.myservice.com`)
+- `LOG_LEVEL` — Logging level: `debug`, `info`, `warn`, `error` (default: `info`)
+
+## Available Tools
+
+### Core Tools
+
+- `list_contacts` — List contacts with pagination and filters
+- `get_contact` — Get detailed contact information
+- `create_contact` — Create a new contact
+- `update_contact` — Update existing contact
+- `delete_contact` — Delete a contact
+
+### App Tools (Rich UI)
+
+- `view_contact_grid` — Display contact search results in a data grid
+- `show_dashboard` — Display dashboard with metrics and KPIs
+
+## Usage Examples
+
+### List contacts
+
+\`\`\`
+Can you show me all active contacts?
+\`\`\`
+
+### Search and display
+
+\`\`\`
+Search for contacts with "john" in their name and show me the grid
+\`\`\`
+
+### Create contact
+
+\`\`\`
+Create a new contact:
+Name: Jane Smith
+Email: jane@example.com
+Phone: 555-1234
+\`\`\`
+
+## Development
+
+\`\`\`bash
+# Install dependencies
+npm install
+
+# Run in development mode
+npm run dev
+
+# Build for production
+npm run build
+
+# Run tests
+npm test
+\`\`\`
+
+## Docker
+
+\`\`\`bash
+# Build image
+docker build -t mcp-server-myservice .
+
+# Run container
+docker run -it --rm \
+  -e MY_SERVICE_API_KEY=your_key \
+  mcp-server-myservice
+\`\`\`
+
+## Railway Deployment
+
+1. Fork this repository
+2. Connect to Railway
+3. Add environment variables in Railway dashboard
+4. Deploy
+
+## Contributing
+
+Pull requests are welcome! Please read [CONTRIBUTING.md](CONTRIBUTING.md) for details.
+
+## License
+
+MIT License - see [LICENSE](LICENSE) file for details.
+
+## Support
+
+- [Open an issue](https://github.com/yourusername/mcp-server-myservice/issues)
+- [MyService API Documentation](https://myservice.com/docs)
+- [MCP Documentation](https://modelcontextprotocol.io)
+```
+
+### LICENSE (MIT Template)
+
+```
+MIT License
+
+Copyright (c) 2026 Your Name
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
+```
+
+---
+
+## 7. GitHub Actions CI/CD
+
+### .github/workflows/build.yml
+
+```yaml
+name: Build and Test
+
+on:
+  push:
+    branches: [ main, develop ]
+  pull_request:
+    branches: [ main ]
+
+jobs:
+  build:
+    runs-on: ubuntu-latest
+
+    strategy:
+      matrix:
+        node-version: [18.x, 20.x]
+
+    steps:
+      - uses: actions/checkout@v3
+
+      - name: Use Node.js ${{ matrix.node-version }}
+        uses: actions/setup-node@v3
+        with:
+          node-version: ${{ matrix.node-version }}
+
+      - name: Install dependencies
+        run: npm ci
+
+      - name: Build
+        run: npm run build
+
+      - name: Run tests
+        run: npm test
+        if: ${{ hashFiles('**/*.test.ts') != '' }}
+
+      - name: Verify dist exists
+        run: test -d dist && test -f dist/index.js
+```
+
+### .github/workflows/publish.yml
+
+```yaml
+name: Publish to npm
+
+on:
+  release:
+    types: [created]
+
+jobs:
+  publish:
+    runs-on: ubuntu-latest
+
+    steps:
+      - uses: actions/checkout@v3
+
+      - name: Use Node.js
+        uses: actions/setup-node@v3
+        with:
+          node-version: '20.x'
+          registry-url: 'https://registry.npmjs.org'
+
+      - name: Install dependencies
+        run: npm ci
+
+      - name: Build
+        run: npm run build
+
+      - name: Publish to npm
+        run: npm publish
+        env:
+          NODE_AUTH_TOKEN: ${{ secrets.NPM_TOKEN }}
+```
+
+**Setup:**
+1. Go to npmjs.com → Account Settings → Access Tokens
+2. Create new token (Automation or Publish)
+3. Add to GitHub repo → Settings → Secrets → `NPM_TOKEN`
+
+---
+
+## 8. Distribution Checklist
+
+Before publishing/deploying:
+
+### Code Quality
+- [ ] All TypeScript compiles without errors
+- [ ] No console.logs in production code (use proper logging)
+- [ ] Error handling implemented for all tools
+- [ ] Environment variables validated on startup
+
+### Documentation
+- [ ] README.md with installation instructions
+- [ ] .env.example with all required variables
+- [ ] Tool descriptions are clear and helpful
+- [ ] Examples provided in README
+
+### Package Configuration
+- [ ] `package.json` has correct `name`, `version`, `description`
+- [ ] `files` field only includes necessary files
+- [ ] `keywords` added for npm search
+- [ ] `repository`, `bugs`, `homepage` URLs set
+- [ ] License file included
+
+### Testing
+- [ ] Tested locally in Claude Desktop
+- [ ] All tools work as expected
+- [ ] Apps render correctly (if applicable)
+- [ ] Error cases handled gracefully
+
+### Security
+- [ ] No API keys hardcoded
+- [ ] `.env` in `.gitignore`
+- [ ] Sensitive data not logged
+- [ ] Dependencies up to date (`npm audit`)
+
+### Deployment
+- [ ] Dockerfile builds successfully
+- [ ] Docker container runs without errors
+- [ ] Railway deployment works (if applicable)
+- [ ] npm package installs and runs via `npx`
+
+---
+
+## 9. Version Management
+
+### Semantic Versioning
+
+- **Patch (1.0.0 → 1.0.1):** Bug fixes, no API changes
+- **Minor (1.0.0 → 1.1.0):** New features, backward compatible
+- **Major (1.0.0 → 2.0.0):** Breaking changes
+
+### CHANGELOG.md
+
+```markdown
+# Changelog
+
+All notable changes to this project will be documented in this file.
+
+The format is based on [Keep a Changelog](https://keepachangelog.com/),
+and this project adheres to [Semantic Versioning](https://semver.org/).
+
+## [Unreleased]
+
+### Added
+- New `search_contacts` tool with full-text search
+
+### Changed
+- Improved error messages for API failures
+
+### Fixed
+- Fixed pagination issue in `list_contacts`
+
+## [1.1.0] - 2026-02-03
+
+### Added
+- Contact grid MCP app
+- Dashboard MCP app
+- Docker support
+
+### Changed
+- Updated dependencies to latest versions
+
+## [1.0.0] - 2026-01-15
+
+### Added
+- Initial release
+- Basic CRUD tools for contacts
+- MyService API integration
+```
+
+---
+
+## 10. Multi-Platform Distribution
+
+### npm + Docker + GitHub
+
+**Best practice:** Offer multiple installation methods
+
+**README.md section:**
+
+```markdown
+## Installation Methods
+
+### 1. npx (Easiest)
+
+\`\`\`bash
+# Add to claude_desktop_config.json
+{
+  "mcpServers": {
+    "myservice": {
+      "command": "npx",
+      "args": ["-y", "mcp-server-myservice"]
+    }
+  }
+}
+\`\`\`
+
+### 2. npm Global Install
+
+\`\`\`bash
+npm install -g mcp-server-myservice
+
+# Then reference in claude_desktop_config.json
+{
+  "mcpServers": {
+    "myservice": {
+      "command": "mcp-server-myservice"
+    }
+  }
+}
+\`\`\`
+
+### 3. Docker
+
+\`\`\`bash
+docker run -it --rm \
+  -e MY_SERVICE_API_KEY=your_key \
+  ghcr.io/yourusername/mcp-server-myservice:latest
+\`\`\`
+
+### 4. From Source
+
+\`\`\`bash
+git clone https://github.com/yourusername/mcp-server-myservice.git
+cd mcp-server-myservice
+npm install && npm run build
+
+# Reference dist/index.js in claude_desktop_config.json
+\`\`\`
+```
+
+---
+
+## 11. Common Deployment Issues
+
+### Issue: "Cannot find module"
+**Cause:** Missing dependencies or incorrect path
+**Fix:** Run `npm ci` and use absolute paths in config
+
+### Issue: "Environment variable not set"
+**Cause:** Missing env vars
+**Fix:** Add to `env` object in Claude Desktop config or `.env` file
+
+### Issue: "UI files not found"
+**Cause:** `dist/app-ui/` not copied during build
+**Fix:** Add `build:ui` script to copy HTML files
+
+### Issue: "ENOENT: no such file or directory"
+**Cause:** Path resolution fails in compiled code
+**Fix:** Use `fileURLToPath` for ESM `__dirname` equivalent
+
+### Issue: Docker build fails
+**Cause:** Missing build step or dependencies
+**Fix:** Ensure `npm run build` runs in Dockerfile and all deps installed
+
+---
+
+## 12. Resources
+
+- **MCP Deployment Guide:** https://modelcontextprotocol.io/docs/deployment
+- **Railway Docs:** https://docs.railway.app
+- **npm Publishing Guide:** https://docs.npmjs.com/creating-and-publishing-scoped-public-packages
+- **Docker Best Practices:** https://docs.docker.com/develop/dev-best-practices
+
+---
+
+## Summary
+
+**Distribution workflow:**
+1. Build: `npm run build`
+2. Test locally in Claude Desktop
+3. Create README.md with installation instructions
+4. Add Dockerfile + railway.json (if deploying)
+5. Publish to npm: `npm publish`
+6. Push to GitHub with proper README
+7. Tag releases for versioning
+8. Automate with GitHub Actions
+
+**Key files:**
+- `package.json` → npm distribution
+- `Dockerfile` → Docker containerization
+- `railway.json` → Railway deployment
+- `README.md` → User documentation
+- `.env.example` → Configuration template
+- `.github/workflows/` → CI/CD automation
+
+Follow these patterns and your MCP servers will be production-ready and easy to distribute.
diff --git a/skills/mcp-localbosses-integrator/SKILL.md b/skills/mcp-localbosses-integrator/SKILL.md
new file mode 100644
index 0000000..17ff51a
--- /dev/null
+++ b/skills/mcp-localbosses-integrator/SKILL.md
@@ -0,0 +1,1543 @@
+# MCP LocalBosses Integrator — Phase 4: Wire Into LocalBosses
+
+**When to use this skill:** You have a built MCP server (Phase 2) and HTML apps (Phase 3) and need to wire them into the LocalBosses Next.js app so they appear as a channel in the sidebar with working apps, threads, and AI interactions.
+
+**What this covers:** Exact files to update, channel configuration, app registration, route mapping, system prompt engineering, APP_DATA block format, thread lifecycle integration, integration validation, and rollback strategy.
+
+**Pipeline position:** Phase 4 of 6 → Input from Phases 2 & 3, output feeds `mcp-qa-tester` (Phase 5).
+
+---
+
+## 1. Inputs & Outputs
+
+**Inputs:**
+- Built MCP server in `{service}-mcp/` (from Phase 2)
+- HTML app files in `{service}-mcp/app-ui/` (from Phase 3)
+- `{service}-api-analysis.md` (from Phase 1 — for tool names and app IDs)
+
+**Output:** A fully wired LocalBosses channel where:
+- Channel appears in sidebar under correct category
+- All apps appear in the app toolbar
+- Clicking an app opens a thread with an intake question
+- AI responses include APP_DATA blocks that update the visual app
+- Thread lifecycle (create → interact → delete) works end-to-end
+
+**LocalBosses app location:** `localbosses-app/` (Next.js app)
+
+---
+
+## 2. Files to Update (Checklist)
+
+| # | File | Purpose |
+|---|------|---------|
+| 1 | `src/lib/channels.ts` | Add channel definition (sidebar entry) |
+| 2 | `src/lib/appNames.ts` | Add display names + icons for all apps |
+| 3 | `src/lib/app-intakes.ts` | Add intake questions for each app |
+| 4 | `src/app/api/mcp-apps/route.ts` | Add app ID → filename mapping + directory |
+| 5 | `src/app/api/chat/route.ts` | Add tool routing + system prompt for channel |
+
+---
+
+## 3. File 1: `src/lib/channels.ts`
+
+### What it does:
+Defines the channel that appears in the LocalBosses sidebar. Controls name, icon, category, description, system prompt, default app, and available apps.
+
+### Template:
+
+```typescript
+{
+  id: "{service}",
+  name: "{service}",
+  icon: "🔥",                          // Single emoji
+  category: "BUSINESS OPS",            // "BUSINESS OPS" | "MARKETING" | "TOOLS" | "SYSTEM"
+  description: "{One-line description of what this channel does}",
+  systemPrompt: `You are the {Service Name} Specialist for LocalBosses AI.
+
+Your expertise:
+- {Capability 1 — what the user can do}
+- {Capability 2}
+- {Capability 3}
+- {Capability 4}
+
+TOOL SELECTION RULES:
+- SEE/BROWSE/LIST multiple items → use list_* tools
+- ONE specific item by name/ID → use get_* tools
+- CREATE/ADD/NEW → use create_* tools
+- CHANGE/UPDATE/MODIFY → use update_* tools
+- DELETE/REMOVE → use delete_* tools (always confirm first)
+- STATS/METRICS/OVERVIEW → use analytics tools
+
+Before calling any tool, briefly state which tool you're choosing and why.
+
+MULTI-INTENT MESSAGES:
+- If the user asks for multiple things in one message, address them sequentially.
+- State which you're handling first and that you'll get to the others.
+- Complete one action before starting the next.
+
+CORRECTIONS:
+- If the user says "actually", "wait", "no I meant", "the other one", treat this as a correction to your previous action.
+- If they reference "the other one" or "that one", check previous results in the conversation and clarify if needed.
+- Never repeat the same action — understand what changed.
+
+Do NOT call tools when the user asks general questions about best practices, strategy, or how-to advice. Respond from your expertise instead.
+
+Be concise, practical, and action-oriented. When presenting data, always include an APP_DATA block so the visual app updates.`,
+  defaultApp: "{service}-dashboard",    // Optional: auto-opens on channel entry. Omit if no dashboard.
+  mcpApps: [
+    // List ALL app IDs registered for this channel
+    "{service}-dashboard",
+    "{service}-contact-grid",
+    "{service}-contact-card",
+    "{service}-contact-creator",
+    "{service}-calendar-view",
+    "{service}-pipeline-kanban",
+    "{service}-activity-timeline",
+    // ... all apps from Phase 3
+  ],
+},
+```
+
+### Placement:
+Add the new channel object to the `channels` array. Place it in the appropriate category section.
+
+### Real example (from automations channel):
+
+```typescript
+{
+  id: "automations",
+  name: "automations",
+  icon: "⚡",
+  category: "BUSINESS OPS",
+  description: "Build n8n workflows with natural language",
+  systemPrompt: `You are the Automations Specialist for LocalBosses AI, powered by n8n workflow automation.
+
+Your expertise:
+- Building n8n workflows from natural language descriptions
+- Connecting 1,084+ integrations (apps, APIs, databases)
+- Automation best practices (error handling, scheduling, data transformation)
+- Workflow optimization and debugging
+- Common automation patterns (lead capture, email sequences, data sync, notifications)
+
+TOOL SELECTION RULES:
+- SEE/BROWSE/LIST workflows → use list_workflows
+- ONE specific workflow by ID → use get_workflow
+- CREATE/ADD/NEW workflow → use create_workflow
+- CHANGE/UPDATE/MODIFY → use update_workflow
+- DELETE/REMOVE → use delete_workflow (always confirm first)
+- STATS/EXECUTION HISTORY → use list_executions
+
+Before calling any tool, briefly state which tool you're choosing and why.
+
+Do NOT call tools when users ask about automation best practices, n8n concepts, or workflow design patterns. Respond from your expertise instead.
+
+When users describe what they want to automate:
+1. Break it down into workflow steps
+2. Identify which n8n nodes to use
+3. Explain the data flow
+4. Suggest error handling approaches
+
+Always be practical and implementation-focused. If a workflow would be complex, break it into phases.`,
+  defaultApp: "n8n-workflow-builder",
+  mcpApps: [
+    "n8n-workflow-builder",
+    "n8n-execution-monitor",
+    "n8n-workflow-templates",
+    "n8n-node-config",
+    "n8n-health-monitor",
+    "n8n-webhook-tester",
+    "n8n-workflow-detail",
+  ],
+},
+```
+
+---
+
+## 4. File 2: `src/lib/appNames.ts`
+
+### What it does:
+Maps app IDs to human-friendly display names and emoji icons. Used by the app toolbar and anywhere apps are shown by name.
+
+### Template:
+
+```typescript
+// In the APP_DISPLAY_NAMES object, add one entry per app:
+
+// ═══════════════════════════════════════════
+// {Service Name} Apps
+// ═══════════════════════════════════════════
+"{service}-dashboard": { name: "Dashboard", icon: "📊" },
+"{service}-contact-grid": { name: "Contacts", icon: "👥" },
+"{service}-contact-card": { name: "Contact Card", icon: "👤" },
+"{service}-contact-creator": { name: "New Contact", icon: "➕" },
+"{service}-calendar-view": { name: "Calendar", icon: "📆" },
+"{service}-pipeline-kanban": { name: "Pipeline", icon: "📈" },
+"{service}-activity-timeline": { name: "Activity", icon: "📅" },
+```
+
+### Icon guidelines:
+- Use a single emoji that represents the app type
+- 📊 for dashboards/analytics
+- 👥 for contact lists, 👤 for single contact
+- ➕ for creation forms
+- 📆 for calendars
+- 📈 for pipeline/funnel
+- 📅 for timeline/activity
+- 🔍 for search
+- 📋 for lists
+- 📄 for detail views
+- 💰 for financial/invoice
+- ⚙️ for settings/config
+
+---
+
+## 5. File 3: `src/lib/app-intakes.ts`
+
+### What it does:
+Defines the intake question shown when a user clicks an app. This creates a conversational thread where the AI generates data for the app.
+
+### Interface:
+
+```typescript
+export interface AppIntake {
+  category: string;         // Grouping category for similar apps
+  question: string;         // The question shown to the user in the thread
+  skipLabel?: string;       // If defined, shows a "skip" button with this label
+  systemPromptAddon: string; // Extra AI instructions for generating APP_DATA
+}
+```
+
+### Intake Question Quality Criteria
+
+Every intake question MUST meet these standards:
+
+| Criterion | Requirement | Example |
+|-----------|-------------|---------|
+| **Input format hint** | Suggest what to provide | "Provide a name, email, or ID" |
+| **skipLabel** | Most common default action | `"All upcoming events"` |
+| **Length** | Under 20 words | ✓ "What contacts? Filter by name, status, or tag." |
+| **Action-oriented** | Tell what to DO, not ASK | ✓ "Filter contacts by name, status, or tag" ✗ "What would you like to see?" |
+| **Context-specific** | Tied to this app's data | ✓ "Which pipeline stage?" ✗ "What data do you want?" |
+
+**Bad examples:**
+- ❌ "What would you like to see?" — too vague, no format hint
+- ❌ "Please tell me what you're looking for in this application" — too long, not action-oriented
+- ❌ "Enter your query" — no context, no format hint
+
+**Good examples:**
+- ✅ "Filter contacts by name, status, or tag — or say 'show all'." (skipLabel: "All contacts")
+- ✅ "Which date range? e.g., 'this week', 'Feb 2026', 'next 7 days'" (skipLabel: "This week")
+- ✅ "Which contact? Provide a name, email, or ID."
+
+> **Note on MCP Elicitation:** The intake question pattern maps conceptually to MCP's `elicitation/create` capability (spec 2025-06-18). In the future, intake questions could be served as MCP elicitation requests rather than hardcoded in `app-intakes.ts`, enabling servers to dynamically request user input mid-flow. This would also support mid-conversation elicitation (e.g., "Which account?" during an OAuth flow, or "Confirm delete?" for destructive operations).
+
+### Template per app type:
+
+#### Dashboard apps:
+```typescript
+"{service}-dashboard": {
+  category: "dashboard",
+  question: "What time frame? e.g., last 7 days, this month, last quarter",
+  skipLabel: "Last 30 days",
+  systemPromptAddon: `The user is viewing the {Service} Dashboard. Generate APP_DATA with these fields:
+{
+  "title": "Service Dashboard",
+  "timeFrame": "Last 30 days",
+  "metrics": {
+    "total_contacts": 1234,
+    "active_deals": 56,
+    "revenue": 78900,
+    "appointments_today": 3
+  },
+  "recent": [
+    { "title": "Event name", "description": "Details", "date": "2026-02-04T10:30:00Z", "type": "event_type" }
+  ]
+}`,
+},
+```
+
+#### Data grid apps:
+```typescript
+"{service}-contact-grid": {
+  category: "data-grid",
+  question: "Filter contacts by name, status, or tag — or say 'show all'.",
+  skipLabel: "All contacts",
+  systemPromptAddon: `The user is viewing the contact grid. Generate APP_DATA with an array of contacts:
+{
+  "title": "Contacts",
+  "data": [
+    { "name": "John Smith", "email": "john@example.com", "phone": "(555) 123-4567", "status": "active", "created": "2026-01-15" }
+  ],
+  "meta": { "total": 150, "page": 1, "pageSize": 25 }
+}
+Include 5-10 realistic records. Match any filters the user requested.`,
+},
+```
+
+#### Detail card apps:
+```typescript
+"{service}-contact-card": {
+  category: "detail-card",
+  question: "Which contact? Provide a name, email, or ID.",
+  systemPromptAddon: `The user wants to view a specific contact's details. Generate APP_DATA with full contact info:
+{
+  "name": "John Smith",
+  "email": "john@example.com",
+  "phone": "(555) 123-4567",
+  "status": "active",
+  "company": "Acme Inc",
+  "tags": ["vip", "lead"],
+  "created": "2026-01-15",
+  "lastActivity": "2026-02-03T14:30:00Z",
+  "notes": "Key decision maker"
+}`,
+},
+```
+
+#### Form/wizard apps:
+```typescript
+"{service}-contact-creator": {
+  category: "form",
+  question: "Describe the new contact — I'll pre-fill the form for you.",
+  systemPromptAddon: `The user wants to create a new contact. Generate APP_DATA defining the form fields:
+{
+  "title": "Create New Contact",
+  "description": "Fill in the contact details",
+  "fields": [
+    { "name": "name", "label": "Full Name", "type": "text", "required": true, "placeholder": "John Smith" },
+    { "name": "email", "label": "Email", "type": "email", "required": false, "placeholder": "john@example.com" },
+    { "name": "phone", "label": "Phone", "type": "tel", "required": false, "placeholder": "(555) 123-4567" },
+    { "name": "status", "label": "Status", "type": "select", "options": ["active", "inactive", "lead"] }
+  ]
+}
+Pre-fill any values the user mentioned.`,
+},
+```
+
+#### Calendar apps:
+```typescript
+"{service}-calendar-view": {
+  category: "calendar",
+  question: "Which date range? e.g., this week, Feb 2026, next month",
+  skipLabel: "This week",
+  systemPromptAddon: `The user is viewing the calendar. Generate APP_DATA with events:
+{
+  "title": "Calendar",
+  "events": [
+    { "title": "Meeting with John", "start": "2026-02-04T10:00:00Z", "end": "2026-02-04T11:00:00Z", "contact": "John Smith", "status": "confirmed", "location": "Office" }
+  ]
+}
+Include events for the requested time range.`,
+},
+```
+
+#### Timeline apps:
+```typescript
+"{service}-activity-timeline": {
+  category: "timeline",
+  question: "Whose activity? Provide a contact name, or say 'all recent'.",
+  skipLabel: "All recent activity",
+  systemPromptAddon: `The user is viewing an activity timeline. Generate APP_DATA with events:
+{
+  "title": "Activity Timeline",
+  "events": [
+    { "title": "Email sent", "description": "Follow-up email to John", "date": "2026-02-04T10:30:00Z", "type": "email", "user": "Jake" },
+    { "title": "Call completed", "description": "15 min call discussing proposal", "date": "2026-02-03T16:00:00Z", "type": "call", "user": "Jake" }
+  ]
+}
+Order events from newest to oldest.`,
+},
+```
+
+#### Pipeline/funnel apps:
+```typescript
+"{service}-pipeline-kanban": {
+  category: "pipeline",
+  question: "Which pipeline? e.g., 'sales pipeline', 'hiring pipeline'",
+  skipLabel: "Main pipeline",
+  systemPromptAddon: `The user is viewing the pipeline board. Generate APP_DATA with stages and deals:
+{
+  "title": "Sales Pipeline",
+  "stages": [
+    {
+      "name": "New Leads",
+      "items": [
+        { "name": "Acme Deal", "value": 25000, "contact": "John Smith" }
+      ]
+    },
+    {
+      "name": "Qualified",
+      "items": [
+        { "name": "Beta Contract", "value": 50000, "contact": "Jane Doe" }
+      ]
+    },
+    { "name": "Proposal", "items": [] },
+    { "name": "Closed Won", "items": [] }
+  ]
+}`,
+},
+```
+
+---
+
+## 6. File 4: `src/app/api/mcp-apps/route.ts`
+
+### What it does:
+Maps app IDs to their HTML filenames and tells the server where to find the files.
+
+### Changes needed:
+
+#### A. Add to `APP_NAME_MAP`:
+
+```typescript
+const APP_NAME_MAP: Record<string, string> = {
+  // ... existing entries ...
+
+  // {Service Name} apps
+  "{service}-dashboard": "dashboard",
+  "{service}-contact-grid": "contact-grid",
+  "{service}-contact-card": "contact-card",
+  "{service}-contact-creator": "contact-creator",
+  "{service}-calendar-view": "calendar-view",
+  "{service}-pipeline-kanban": "pipeline-kanban",
+  "{service}-activity-timeline": "activity-timeline",
+};
+```
+
+**Rule:** Left side is the app ID (used in channels.ts, appNames.ts, intakes). Right side is the HTML filename WITHOUT the `.html` extension.
+
+#### B. Add to `APP_DIRS`:
+
+```typescript
+const APP_DIRS = [
+  // ... existing directories ...
+
+  // {Service Name} apps
+  join(process.cwd(), "../{service}-mcp/app-ui"),
+  // OR if using dist: join(process.cwd(), "../{service}-mcp/dist/app-ui"),
+];
+```
+
+**Rule:** Order matters — first match wins. Add new directories at the bottom unless they need priority.
+
+### How file resolution works:
+```
+1. User requests app ID "{service}-dashboard"
+2. APP_NAME_MAP maps it to filename "dashboard"
+3. For each directory in APP_DIRS:
+   a. Check: {dir}/dashboard.html (flat format)
+   b. Check: {dir}/dashboard/index.html (subdirectory format)
+4. First match wins, HTML is returned
+```
+
+---
+
+## 7. File 5: `src/app/api/chat/route.ts`
+
+### What it does:
+The chat route handles AI conversations. For app threads, it injects system prompts that tell the AI to include APP_DATA blocks in responses.
+
+### The APP_DATA Block Format:
+
+> **Important:** APP_DATA is a **LocalBosses-specific** convention for embedding structured data in LLM responses. It is NOT part of the MCP protocol. MCP's native equivalent is `structuredContent` on tool results (see Section 14 for the bridge roadmap).
+
+The AI response includes a hidden block that gets parsed by the frontend and sent to the app:
+
+```
+Your visible text response here...
+
+<!--APP_DATA:{"key":"value","data":[...]}:END_APP_DATA-->
+```
+
+**Rules:**
+1. EVERY response in an app thread MUST include exactly one APP_DATA block
+2. The JSON must be valid and on a SINGLE LINE (no line breaks inside)
+3. Place it AFTER the text explanation
+4. The block is automatically parsed and hidden from the user
+5. When the user refines, generate completely NEW APP_DATA (replace, don't append)
+
+### APP_DATA Failure Modes & Parsing Guidelines
+
+LLMs don't always produce perfect APP_DATA. Document and handle these known failure modes:
+
+| Failure Mode | Example | Fix |
+|---|---|---|
+| **Line breaks in JSON** | `<!--APP_DATA:{\n"key":"val"\n}:END_APP_DATA-->` | Strip all `\n` and `\r` before JSON.parse |
+| **Wrapped in code block** | ````json\n<!--APP_DATA:...-->` `` | Strip `` ```json `` and `` ``` `` wrappers before extracting |
+| **Invalid JSON** | Missing closing brace, trailing comma | Try JSON.parse, on failure try to fix common issues (trailing commas, unquoted keys) |
+| **Text after END_APP_DATA** | `...END_APP_DATA--> more text here` | Only extract between delimiters; ignore trailing content |
+| **No APP_DATA at all** | LLM just responds with plain text | Fallback: scan for JSON objects in the response heuristically |
+| **Multiple APP_DATA blocks** | Two blocks in one response | Use the LAST block (most likely the refined one) |
+
+#### Recommended Parser Pattern:
+
+```typescript
+function parseAppData(response: string): Record<string, unknown> | null {
+  // 1. Try exact match first
+  const exactMatch = response.match(/<!--APP_DATA:(.*?):END_APP_DATA-->/s);
+  if (exactMatch) {
+    const jsonStr = exactMatch[1].replace(/[\n\r]/g, '').trim();
+    try { return JSON.parse(jsonStr); } catch {}
+    // Try fixing common issues
+    try {
+      const fixed = jsonStr.replace(/,\s*([}\]])/g, '$1'); // trailing commas
+      return JSON.parse(fixed);
+    } catch {}
+  }
+
+  // 2. Try stripping code block wrappers
+  const stripped = response.replace(/```(?:json)?\s*/g, '').replace(/```/g, '');
+  const codeBlockMatch = stripped.match(/<!--APP_DATA:(.*?):END_APP_DATA-->/s);
+  if (codeBlockMatch) {
+    try { return JSON.parse(codeBlockMatch[1].replace(/[\n\r]/g, '').trim()); } catch {}
+  }
+
+  // 3. Heuristic fallback: find largest JSON object in response
+  const jsonMatches = response.match(/\{[^{}]*(?:\{[^{}]*\}[^{}]*)*\}/g);
+  if (jsonMatches) {
+    const largest = jsonMatches.sort((a, b) => b.length - a.length)[0];
+    try { return JSON.parse(largest); } catch {}
+  }
+
+  return null; // All parsing failed
+}
+```
+
+**Track success rate:** If APP_DATA parsing fails more than 10% of the time for a channel, the system prompt needs revision — add more explicit formatting examples or stronger instructions.
+
+### APP_DATA Schema Validation
+
+After parsing APP_DATA, validate it against the app's expected data shape **before** sending to the iframe. This catches silent data shape mismatches (e.g., tool returns `{contacts: [...]}` but app expects `{data: [...]}`).
+
+```typescript
+// Schema contracts per app type — shared between integrator and designer
+const APP_SCHEMAS: Record<string, { required: string[]; arrayFields?: string[] }> = {
+  'dashboard': { required: ['metrics'], arrayFields: ['recent'] },
+  'data-grid': { required: ['data', 'meta'], arrayFields: ['data'] },
+  'detail-card': { required: ['name'] },
+  'form': { required: ['fields'], arrayFields: ['fields'] },
+  'calendar': { required: ['events'], arrayFields: ['events'] },
+  'timeline': { required: ['events'], arrayFields: ['events'] },
+  'pipeline': { required: ['stages'], arrayFields: ['stages'] },
+};
+
+function validateAppData(data: Record<string, unknown>, appType: string): { valid: boolean; errors: string[] } {
+  const schema = APP_SCHEMAS[appType];
+  if (!schema) return { valid: true, errors: [] };
+
+  const errors: string[] = [];
+  for (const field of schema.required) {
+    if (!(field in data) || data[field] == null) {
+      errors.push(`Missing required field: "${field}"`);
+    }
+  }
+  for (const field of schema.arrayFields || []) {
+    if (field in data && !Array.isArray(data[field])) {
+      errors.push(`Expected array for "${field}", got ${typeof data[field]}`);
+    }
+  }
+  return { valid: errors.length === 0, errors };
+}
+```
+
+**Usage:** Call `validateAppData()` after `parseAppData()`. If validation fails, log the errors and either attempt auto-fix (wrap non-array in array) or show a diagnostic empty state in the app.
+
+### The Thread System Prompt (already exists in chat/route.ts):
+
+```typescript
+const THREAD_SYSTEM_PROMPT = `
+
+## MANDATORY: APP_DATA BLOCK (DO NOT SKIP)
+
+You are in an APP THREAD. Every response you give MUST include a hidden APP_DATA block that updates the visual app above the conversation. This is NOT optional.
+
+FORMAT (place at the VERY END of your response):
+<!--APP_DATA:{"key":"value"}:END_APP_DATA-->
+
+RULES:
+1. EVERY response MUST have exactly one APP_DATA block — no exceptions
+2. The JSON must be valid and on a SINGLE LINE (no line breaks inside)
+3. Place it AFTER your text explanation
+4. Generate REALISTIC data matching what the user requested
+5. Include 5-10 records for lists, complete details for single items
+6. The block is automatically parsed and hidden from the user
+7. Also write a brief natural language explanation before the block
+8. When the user refines, generate completely NEW APP_DATA (replace, don't append)
+
+If you forget the APP_DATA block, the visual app won't update and the user will see stale data. ALWAYS include it.`;
+```
+
+### What you MAY need to add to chat/route.ts:
+
+Usually the existing THREAD_SYSTEM_PROMPT + the intake's `systemPromptAddon` is sufficient. But if your service needs special tool routing or a channel-specific system prompt override, you may need to add logic:
+
+```typescript
+// Example: If the channel has MCP server tools that need explicit routing
+if (channelId === '{service}') {
+  // Add service-specific context to the system prompt
+  systemPrompt += `\n\nYou have access to the following {Service} tools:\n${toolList}`;
+}
+```
+
+### For workflow-type apps (like n8n):
+
+Use `WORKFLOW_JSON` format instead of `APP_DATA`:
+```
+<!--WORKFLOW_JSON:{"name":"...","nodes":[...]}:END_WORKFLOW-->
+```
+
+This is only for n8n-style workflow builders. All other apps use `APP_DATA`.
+
+### APP_DATA Output Formatting — Required Fields Per App Type
+
+When writing `systemPromptAddon` instructions, be explicit about exact required fields. Vague instructions produce inconsistent data:
+
+| App Type | Required APP_DATA Fields | Notes |
+|----------|--------------------------|-------|
+| **Dashboard** | `title`, `timeFrame`, `metrics` (object with 3-6 key/value pairs), `recent` (array of 3-5 items with `title`, `date`, `type`) | Metrics keys should match the dashboard's render function |
+| **Data Grid** | `title`, `data` (array of objects — each MUST have the same keys), `meta` (`total`, `page`, `pageSize`) | Every object in `data` must have identical field names |
+| **Detail Card** | All entity fields as top-level keys (no wrapping `data` object), must include `name` or `title` | Include `status`, `created`, `lastActivity` for consistency |
+| **Form** | `title`, `description`, `fields` (array with `name`, `label`, `type`, `required`) | Pre-fill values in `value` field when user provides info |
+| **Calendar** | `title`, `events` (array with `title`, `start` ISO, `end` ISO, `status`) | Always use ISO 8601 dates |
+| **Timeline** | `title`, `events` (array with `title`, `description`, `date` ISO, `type`) | Order newest → oldest |
+| **Pipeline** | `title`, `stages` (array with `name`, `items` array — each item has `name`, `value`) | Include 4-6 stages even if some are empty |
+| **Analytics** | `title`, `timeFrame`, `metrics`, `chartData` (array with `label`, `value`) | Values should be realistic percentages or counts |
+
+---
+
+## 7b. Host-Side Handler for App Actions (sendToHost)
+
+The App Designer's `sendToHost()` function posts `mcp_app_action` messages to the parent window. **The host (LocalBosses) must listen for these messages** — otherwise navigate, refresh, and tool_call actions from apps are dead features.
+
+### Implementation (in the iframe wrapper component):
+
+```typescript
+// In the component that renders the app iframe
+useEffect(() => {
+  function handleAppAction(event: MessageEvent) {
+    if (event.data?.type !== 'mcp_app_action') return;
+
+    const { action, payload, appId } = event.data;
+
+    switch (action) {
+      case 'navigate':
+        // App-to-app drill-down: open a different app with params
+        // e.g., click contact in grid → open contact-card
+        openApp(payload.app, payload.params);
+        break;
+
+      case 'refresh':
+        // Re-send the last tool call to get fresh data
+        resendLastToolCall(appId);
+        break;
+
+      case 'tool_call':
+        // App triggered a tool call (e.g., form submit, bulk action)
+        // Inject as a message into the thread so the AI executes it
+        sendMessageToThread(
+          `[Action] Call ${payload.tool} with: ${JSON.stringify(payload.args)}`,
+          { hidden: true } // Don't show raw JSON to user
+        );
+        break;
+
+      default:
+        console.warn('[Host] Unknown app action:', action);
+    }
+  }
+
+  window.addEventListener('message', handleAppAction);
+  return () => window.removeEventListener('message', handleAppAction);
+}, []);
+```
+
+### Key behaviors:
+- **`navigate`** — Opens the target app in a new thread (or switches to existing). Pass `payload.params` as initial context so the AI knows what data to fetch.
+- **`refresh`** — Re-executes the last tool call for that app's thread. The AI regenerates APP_DATA with fresh data.
+- **`tool_call`** — Injects a tool invocation into the thread. The AI sees the request, calls the MCP tool, and returns updated APP_DATA. Used by form submits, bulk actions, and in-app buttons.
+
+### Sending 'user_message_sent' to apps:
+
+When the user sends a new message in a thread, notify the app so it can show the "updating" overlay:
+
+```typescript
+// In the chat message send handler
+function onUserMessageSent() {
+  const iframe = document.querySelector(`iframe[data-app-id="${activeAppId}"]`);
+  if (iframe?.contentWindow) {
+    iframe.contentWindow.postMessage({ type: 'user_message_sent' }, '*');
+  }
+}
+```
+
+---
+
+## 8. System Prompt Engineering Guidelines
+
+The channel system prompt is the most critical piece. It determines:
+- What the AI knows about the service
+- When it uses tools vs just responds
+- How it formats data for apps
+- The tone and expertise level
+
+### Prompt Budget Targets
+
+Keep prompts lean. Every token in the system prompt is consumed on every single user message.
+
+| Prompt Component | Budget Target | Why |
+|---|---|---|
+| Channel system prompt | **< 500 tokens** | Loaded on every message in the channel |
+| systemPromptAddon (per app intake) | **< 300 tokens** | Only loaded in that app's thread |
+| THREAD_SYSTEM_PROMPT (shared) | ~200 tokens (fixed) | Already written; don't expand |
+| **Total per-thread context** | **< 1,000 tokens** | System prompt + addon + thread prompt |
+
+**Measure:** Paste your system prompt into a token counter. If it exceeds the budget, cut capability descriptions to single lines and remove examples from the channel prompt (put them in the addon instead).
+
+### Structure:
+
+```
+1. IDENTITY — "You are the {Service} Specialist for LocalBosses AI" (1 line)
+2. EXPERTISE — Bullet list of capabilities (4-6 bullets, < 15 words each)
+3. TOOL ROUTING — Structured decision tree (always include)
+4. NEGATIVE INSTRUCTIONS — When NOT to use tools (2-3 lines)
+5. MULTI-INTENT — How to handle multiple requests in one message
+6. CORRECTIONS — How to handle "actually/wait/no I meant" messages
+7. RATIONALE REQUIREMENT — "State which tool and why before calling"
+8. BEHAVIOR — How to respond (1-2 lines)
+```
+
+### Multi-Intent Handling (ALWAYS include):
+
+```
+MULTI-INTENT MESSAGES:
+- If the user asks for multiple things in one message, address them sequentially.
+- State which you're handling first and that you'll get to the others.
+- Complete one action before starting the next.
+```
+
+### Correction Handling (ALWAYS include):
+
+```
+CORRECTIONS:
+- If the user says "actually", "wait", "no I meant", "the other one",
+  treat this as a correction to your previous action.
+- If they reference "the other one" or "that one", check previous results
+  in the conversation and clarify if needed.
+- Never repeat the same action — understand what changed.
+```
+
+### Tool Routing Rules (ALWAYS include in channel system prompt):
+
+This is the single highest-impact section. Research shows structured decision trees reduce tool misrouting by ~30%.
+
+```
+TOOL SELECTION RULES:
+- SEE/BROWSE/LIST multiple items → use list_* tools
+- ONE specific item by name/ID → use get_* tools
+- CREATE/ADD/NEW → use create_* tools
+- CHANGE/UPDATE/MODIFY → use update_* tools
+- DELETE/REMOVE → use delete_* tools (always confirm first)
+- STATS/METRICS/OVERVIEW → use analytics tools
+
+Before calling any tool, briefly state which tool you're choosing and why.
+```
+
+**Customize the routing rules per service.** Replace `list_*` with actual tool names when the channel has few enough tools (< 15):
+
+```
+TOOL SELECTION RULES:
+- SEE/BROWSE events → use list_scheduled_events
+- ONE specific event → use get_event
+- CREATE new event type → use create_event_type
+- CANCEL/RESCHEDULE → use cancel_event (always confirm first)
+- SCHEDULING METRICS → use get_scheduling_analytics
+```
+
+### Negative Instructions (ALWAYS include):
+
+```
+Do NOT call tools when the user asks:
+- General questions about best practices or strategy
+- How-to advice that doesn't require their specific data
+- Clarifying questions about what they want (ask them back instead)
+- About features that don't exist in the system
+
+Do NOT use list tools when the user clearly knows which specific record they want — use the get tool instead.
+```
+
+### Rationale Requirement:
+
+Add this line to every channel system prompt:
+```
+Before calling any tool, briefly state which tool you're choosing and why.
+```
+
+This reduces misrouting by forcing the LLM to reason about tool selection before acting.
+
+### Tool description in system prompts:
+
+DON'T list raw tool names. DO describe capabilities in natural language:
+
+```
+❌ BAD:
+"Tools: list_contacts, get_contact, create_contact, update_contact, delete_contact"
+
+✅ GOOD:
+"You can manage contacts — search and browse the contact list, view detailed contact profiles,
+create new contacts, update existing information, and remove contacts when needed."
+```
+
+### When to use tools:
+
+Tell the AI explicitly when to use tools vs respond from knowledge:
+
+```
+When the user asks about their specific data (contacts, deals, appointments), use the appropriate
+MCP tools to fetch real data. When they ask general questions about CRM best practices or strategy,
+respond from your expertise without making API calls.
+```
+
+### Few-Shot Examples (include 2-3 in systemPromptAddon):
+
+Few-shot examples are the single most effective technique for consistent tool routing and APP_DATA generation. Include them in the `systemPromptAddon`, NOT the channel prompt (to stay within token budget).
+
+#### Example 1: List operation → Data Grid
+
+```
+USER: "Show me all active contacts"
+
+ASSISTANT: I'll use list_contacts to pull up your active contacts, filtered by status.
+
+[calls list_contacts with status="active"]
+
+Here are your active contacts. You have 23 active contacts in your system.
+
+<!--APP_DATA:{"title":"Active Contacts","data":[{"name":"John Smith","email":"john@example.com","phone":"(555) 123-4567","status":"active","created":"2026-01-15"},{"name":"Jane Doe","email":"jane@example.com","phone":"(555) 987-6543","status":"active","created":"2026-01-20"}],"meta":{"total":23,"page":1,"pageSize":25}}:END_APP_DATA-->
+```
+
+#### Example 2: Specific item → Detail Card
+
+```
+USER: "What's John Smith's email?"
+
+ASSISTANT: I'll use get_contact to look up John Smith's full profile.
+
+[calls get_contact with name="John Smith"]
+
+John Smith's email is john@example.com. Here's his full profile:
+
+<!--APP_DATA:{"name":"John Smith","email":"john@example.com","phone":"(555) 123-4567","status":"active","company":"Acme Inc","tags":["vip"],"created":"2026-01-15","lastActivity":"2026-02-03T14:30:00Z"}:END_APP_DATA-->
+```
+
+#### Example 3: General question → NO tool call
+
+```
+USER: "What's the best way to organize my contacts?"
+
+ASSISTANT: Great question! Here are some CRM best practices for contact organization:
+
+1. **Use tags consistently** — Create a standard set of tags (lead, customer, vip, partner) and apply them to every contact.
+2. **Keep statuses updated** — Move contacts through your pipeline stages promptly.
+3. **Add notes after every interaction** — Even a one-liner helps you remember context.
+4. **Regular cleanup** — Review inactive contacts quarterly and archive or re-engage.
+
+Would you like me to pull up your current contacts so we can start organizing?
+```
+
+> **Note on MCP Prompts:** Channel system prompts could be registered as MCP Prompt resources (via the server's `prompts` capability) for discoverability and versioning. Instead of hardcoding prompts in `route.ts`, servers could expose them as `prompts/list` entries, allowing clients to discover available interaction modes and enabling prompt versioning through the MCP protocol.
+
+> **Note on MCP Roots:** MCP Roots let clients inform servers about workspace/project scope. For LocalBosses, roots could scope which data is relevant per channel — e.g., a "CRM" root that tells the server to only expose CRM-related tools and data. This would enable dynamic tool filtering based on channel context rather than static system prompts.
+
+---
+
+## 9. Thread Lifecycle & State Management
+
+### How threads work:
+
+```
+1. User clicks app in toolbar
+2. App intake question appears (from app-intakes.ts)
+3. User responds (or clicks "skip" if skipLabel exists)
+4. AI receives: channel system prompt + THREAD_SYSTEM_PROMPT + intake systemPromptAddon + user message
+5. AI generates response + APP_DATA block
+6. Frontend parses APP_DATA, sends to iframe via postMessage
+7. App renders the data
+8. User can continue chatting to refine
+9. Each AI response generates new APP_DATA (replaces old)
+```
+
+### Thread-specific behavior:
+- Each thread is tied to ONE app — the app stays open above the chat
+- The AI always includes APP_DATA in thread responses
+- When user refines ("show me only active contacts"), AI generates NEW APP_DATA
+- Thread can be closed/deleted without affecting the app or other threads
+
+### Thread State Management
+
+Threads use **localStorage** for persistence. Be aware of these operational constraints:
+
+| Concern | Details | Mitigation |
+|---------|---------|------------|
+| **Storage mechanism** | `localStorage` in the browser — key-value, synchronous, per-origin | Thread data is JSON-serialized per thread ID |
+| **Persistence** | Survives page reload and browser restart. Cleared on cache clear or incognito close. | Not a permanent store — don't rely on it for critical data |
+| **Expiry / Cleanup** | No automatic expiry. Old threads accumulate indefinitely. | Implement cleanup: delete threads older than 30 days on app load |
+| **Max thread count** | No hard limit, but performance degrades with 100+ threads in localStorage | Warn or auto-archive after 50 threads per channel. Archive = move to a compressed summary. |
+| **Storage quota** | ~5-10 MB per origin (browser-dependent). Each thread with APP_DATA ≈ 2-20 KB. | At 5 MB limit: ~250-2,500 threads before quota exceeded. Handle `QuotaExceededError` gracefully. |
+| **Quota exceeded handling** | `localStorage.setItem()` throws `QuotaExceededError` | Catch the error, delete oldest threads until space is available, notify user |
+
+**Recommended cleanup pattern:**
+```typescript
+function cleanupOldThreads(maxAgeDays = 30, maxCount = 50) {
+  const threads = getAllThreads(); // from localStorage
+  const now = Date.now();
+  const cutoff = now - (maxAgeDays * 24 * 60 * 60 * 1000);
+
+  // Delete by age
+  threads.filter(t => t.lastActivity < cutoff).forEach(t => deleteThread(t.id));
+
+  // Delete by count (keep newest)
+  const remaining = getAllThreads().sort((a, b) => b.lastActivity - a.lastActivity);
+  if (remaining.length > maxCount) {
+    remaining.slice(maxCount).forEach(t => deleteThread(t.id));
+  }
+}
+```
+
+---
+
+## 10. Channel Configuration Rollback Strategy
+
+Adding a channel requires editing 4 source files. If integration fails or QA reveals problems, you need a clean way to undo.
+
+### Strategy 1: Git-Based Rollback (Recommended)
+
+```bash
+# BEFORE integration: create a checkpoint
+git add -A && git commit -m "pre-integration checkpoint: {service}"
+
+# DO the integration (edit all 4 files)
+# ... edit channels.ts, appNames.ts, app-intakes.ts, route.ts ...
+
+# TEST the integration
+npm run build && npm run dev
+# Run QA checks...
+
+# IF QA PASSES:
+git add -A && git commit -m "feat: add {service} channel integration"
+
+# IF QA FAILS:
+git checkout -- src/lib/channels.ts src/lib/appNames.ts src/lib/app-intakes.ts src/app/api/mcp-apps/route.ts
+# Clean revert, no broken state
+```
+
+### Strategy 2: Feature-Flag Rollback
+
+For production deployments, use a feature flag so new channels can be toggled without code changes:
+
+```typescript
+// In channels.ts:
+{
+  id: "{service}",
+  name: "{service}",
+  enabled: process.env.ENABLE_SERVICE_CHANNEL === "true", // default: disabled
+  // ... rest of config
+}
+
+// Filter in sidebar rendering:
+const visibleChannels = channels.filter(c => c.enabled !== false);
+```
+
+**Workflow:**
+1. Integrate with `enabled: false` (or env var `ENABLE_SERVICE_CHANNEL=false`)
+2. Deploy to production — channel is invisible
+3. QA in production with `ENABLE_SERVICE_CHANNEL=true` in your session
+4. If QA passes: set env var to `true` globally
+5. If QA fails: leave disabled, fix, redeploy
+
+### Strategy 3: Manifest-Based (Future)
+
+Instead of editing 4 shared TypeScript files, each channel could be defined in a single JSON manifest file:
+```
+channels/{service}.json  →  contains all config (channel def, app names, intakes, route map)
+```
+Delete the file = remove the channel. This is the cleanest approach but requires refactoring the LocalBosses codebase.
+
+---
+
+## 11. Complete Example: Adding a New Service
+
+Let's walk through adding "Calendly" as a complete example, applying all patterns from this guide:
+
+### channels.ts:
+```typescript
+{
+  id: "calendly",
+  name: "calendly",
+  icon: "📅",
+  category: "BUSINESS OPS",
+  description: "Manage scheduling, appointments, and calendars",
+  systemPrompt: `You are the Scheduling Specialist for LocalBosses AI, powered by Calendly.
+
+Your expertise:
+- Managing event types and scheduling links
+- Viewing and managing scheduled events
+- Finding available time slots
+- Scheduling analytics and insights
+
+TOOL SELECTION RULES:
+- SEE/BROWSE events → use list_scheduled_events
+- ONE specific event by ID → use get_event
+- VIEW event types → use list_event_types
+- CANCEL/RESCHEDULE → use cancel_event (always confirm first)
+- SCHEDULING METRICS → use get_scheduling_analytics
+- AVAILABILITY → use get_availability
+
+Before calling any tool, briefly state which tool you're choosing and why.
+
+Do NOT call tools when users ask about scheduling best practices, time management tips, or general calendar advice. Respond from your expertise instead.
+
+Be concise and action-oriented.`,
+  defaultApp: "calendly-dashboard",
+  mcpApps: [
+    "calendly-dashboard",
+    "calendly-event-grid",
+    "calendly-event-detail",
+    "calendly-calendar",
+    "calendly-availability",
+  ],
+},
+```
+
+### appNames.ts:
+```typescript
+"calendly-dashboard": { name: "Dashboard", icon: "📊" },
+"calendly-event-grid": { name: "Events", icon: "📋" },
+"calendly-event-detail": { name: "Event Detail", icon: "📄" },
+"calendly-calendar": { name: "Calendar", icon: "📆" },
+"calendly-availability": { name: "Availability", icon: "🕐" },
+```
+
+### app-intakes.ts:
+```typescript
+"calendly-dashboard": {
+  category: "dashboard",
+  question: "What time frame? e.g., this week, last month, Q1 2026",
+  skipLabel: "Last 30 days",
+  systemPromptAddon: `Generate APP_DATA for the Calendly dashboard.
+
+Required fields:
+- "title": descriptive (e.g., "Scheduling Dashboard — Last 30 Days")
+- "timeFrame": string matching user request
+- "metrics": { "total_events", "upcoming", "completed", "cancelled" }
+- "recent": array of 3-5 recent events with { "title", "date" (ISO), "type" }
+
+Example interaction:
+USER: "Show me last week's stats"
+→ Use get_scheduling_analytics with date range = last 7 days
+→ Return APP_DATA with metrics and recent events from that period
+
+<!--APP_DATA:{"title":"Scheduling Dashboard — Last 7 Days","timeFrame":"Last 7 days","metrics":{"total_events":12,"upcoming":3,"completed":8,"cancelled":1},"recent":[{"title":"Strategy Call","date":"2026-02-03T10:00:00Z","type":"completed"}]}:END_APP_DATA-->`,
+},
+"calendly-event-grid": {
+  category: "data-grid",
+  question: "Filter events by date, status, or type — or say 'all upcoming'.",
+  skipLabel: "All upcoming events",
+  systemPromptAddon: `Generate APP_DATA for the event grid.
+
+Required fields:
+- "title": descriptive (e.g., "Upcoming Events")
+- "data": array of events, each with { "name", "email" (invitee), "date" (ISO), "status", "duration", "type" }
+- "meta": { "total", "page", "pageSize" }
+
+Include 5-10 realistic records matching the user's filters.`,
+},
+"calendly-event-detail": {
+  category: "detail-card",
+  question: "Which event? Provide a name, date, or invitee.",
+  systemPromptAddon: `Generate APP_DATA for a single event detail.
+
+Required fields: "title", "name", "status", "start" (ISO), "end" (ISO), "attendee", "email", "eventType", "location", "notes"
+
+All fields top-level (no wrapping data object).`,
+},
+"calendly-calendar": {
+  category: "calendar",
+  question: "Which date range? e.g., this week, February, next 14 days",
+  skipLabel: "This week",
+  systemPromptAddon: `Generate APP_DATA for the calendar view with events in the requested range.
+
+Required fields:
+- "title": descriptive
+- "events": array with { "title", "start" (ISO), "end" (ISO), "contact", "status", "location" }`,
+},
+"calendly-availability": {
+  category: "form",
+  question: "Which schedule's availability? e.g., 'my default schedule'",
+  systemPromptAddon: `Generate APP_DATA with availability settings as form fields.
+
+Required fields:
+- "title", "description"
+- "fields": array with { "name", "label", "type", "required" }`,
+},
+```
+
+### mcp-apps/route.ts:
+```typescript
+// In APP_NAME_MAP:
+"calendly-dashboard": "dashboard",
+"calendly-event-grid": "event-grid",
+"calendly-event-detail": "event-detail",
+"calendly-calendar": "calendar-view",
+"calendly-availability": "availability",
+
+// In APP_DIRS:
+join(process.cwd(), "../calendly-mcp/app-ui"),
+```
+
+---
+
+## 12. Integration Validation Script
+
+**Run this script after every integration to catch missing or orphaned entries across all 4 files.**
+
+Save as `scripts/validate-integration.ts` and run with `npx ts-node scripts/validate-integration.ts` (or transpile and run with Node).
+
+```typescript
+#!/usr/bin/env ts-node
+/**
+ * MCP LocalBosses Integration Validator
+ *
+ * Cross-references all 4 integration files to find:
+ * - Missing entries (app ID in channels.ts but not in other files)
+ * - Orphaned entries (app ID in appNames/intakes/route but not in any channel)
+ * - File resolution failures (APP_NAME_MAP entry doesn't resolve to an HTML file)
+ *
+ * Usage: npx ts-node scripts/validate-integration.ts
+ *    or: node scripts/validate-integration.js (after compiling)
+ *
+ * Exit code: 0 = all good, 1 = errors found
+ */
+
+import * as fs from "fs";
+import * as path from "path";
+
+// ─── Configuration ───────────────────────────────────────────
+const BASE_DIR = path.resolve(__dirname, "../src");
+const CHANNELS_FILE = path.join(BASE_DIR, "lib/channels.ts");
+const APP_NAMES_FILE = path.join(BASE_DIR, "lib/appNames.ts");
+const APP_INTAKES_FILE = path.join(BASE_DIR, "lib/app-intakes.ts");
+const ROUTE_FILE = path.join(BASE_DIR, "app/api/mcp-apps/route.ts");
+
+// ─── Parsers ─────────────────────────────────────────────────
+
+function readFile(filePath: string): string {
+  if (!fs.existsSync(filePath)) {
+    console.error(`❌ File not found: ${filePath}`);
+    process.exit(1);
+  }
+  return fs.readFileSync(filePath, "utf-8");
+}
+
+/**
+ * Extract all app IDs from channels.ts mcpApps arrays.
+ * Looks for patterns like: mcpApps: ["app-1", "app-2", ...]
+ * and string literals inside those arrays.
+ */
+function parseChannelApps(source: string): { channelId: string; apps: string[] }[] {
+  const channels: { channelId: string; apps: string[] }[] = [];
+
+  // Match channel blocks with id and mcpApps
+  const channelBlockRegex = /\{\s*(?:[^{}]*?)id:\s*["'`]([^"'`]+)["'`][^{}]*?mcpApps:\s*\[([\s\S]*?)\]/g;
+  let match: RegExpExecArray | null;
+
+  while ((match = channelBlockRegex.exec(source)) !== null) {
+    const channelId = match[1];
+    const appsArrayContent = match[2];
+    const appIds = [...appsArrayContent.matchAll(/["'`]([^"'`]+)["'`]/g)].map((m) => m[1]);
+    channels.push({ channelId, apps: appIds });
+  }
+
+  // Fallback: if regex didn't catch structured blocks, try simpler pattern
+  if (channels.length === 0) {
+    const simpleRegex = /mcpApps:\s*\[([\s\S]*?)\]/g;
+    while ((match = simpleRegex.exec(source)) !== null) {
+      const appIds = [...match[1].matchAll(/["'`]([^"'`]+)["'`]/g)].map((m) => m[1]);
+      if (appIds.length > 0) {
+        channels.push({ channelId: "unknown", apps: appIds });
+      }
+    }
+  }
+
+  return channels;
+}
+
+/**
+ * Extract all keys from appNames.ts APP_DISPLAY_NAMES object.
+ * Looks for patterns like: "app-id": { name: "...", icon: "..." }
+ */
+function parseAppNames(source: string): string[] {
+  const keys: string[] = [];
+  const regex = /["'`]([a-z0-9][\w-]*)["'`]\s*:\s*\{\s*name\s*:/g;
+  let match: RegExpExecArray | null;
+  while ((match = regex.exec(source)) !== null) {
+    keys.push(match[1]);
+  }
+  return keys;
+}
+
+/**
+ * Extract all keys from app-intakes.ts APP_INTAKES object.
+ * Looks for patterns like: "app-id": { category: "...", question: "..." }
+ */
+function parseAppIntakes(source: string): string[] {
+  const keys: string[] = [];
+  const regex = /["'`]([a-z0-9][\w-]*)["'`]\s*:\s*\{\s*(?:category|question)\s*:/g;
+  let match: RegExpExecArray | null;
+  while ((match = regex.exec(source)) !== null) {
+    keys.push(match[1]);
+  }
+  return keys;
+}
+
+/**
+ * Extract APP_NAME_MAP keys and values, plus APP_DIRS paths.
+ */
+function parseRouteFile(source: string): { nameMap: Map<string, string>; dirs: string[] } {
+  const nameMap = new Map<string, string>();
+
+  // Extract APP_NAME_MAP entries: "app-id": "filename"
+  const mapRegex = /["'`]([a-z0-9][\w-]*)["'`]\s*:\s*["'`]([^"'`]+)["'`]/g;
+  // Only match within APP_NAME_MAP block
+  const mapBlockMatch = source.match(/APP_NAME_MAP[^{]*\{([\s\S]*?)\}/);
+  if (mapBlockMatch) {
+    let match: RegExpExecArray | null;
+    const block = mapBlockMatch[1];
+    while ((match = mapRegex.exec(block)) !== null) {
+      nameMap.set(match[1], match[2]);
+    }
+  }
+
+  // Extract APP_DIRS paths
+  const dirs: string[] = [];
+  const dirsBlockMatch = source.match(/APP_DIRS\s*=\s*\[([\s\S]*?)\]/);
+  if (dirsBlockMatch) {
+    const pathRegex = /["'`]([^"'`]+)["'`]/g;
+    let match: RegExpExecArray | null;
+    while ((match = pathRegex.exec(dirsBlockMatch[1])) !== null) {
+      dirs.push(match[1]);
+    }
+    // Also handle join() patterns
+    const joinRegex = /join\s*\([^)]*["'`]([^"'`]+)["'`]\s*\)/g;
+    while ((match = joinRegex.exec(dirsBlockMatch[1])) !== null) {
+      dirs.push(match[1]);
+    }
+  }
+
+  return { nameMap, dirs };
+}
+
+/**
+ * Check if an HTML file exists for a given filename in any of the app directories.
+ */
+function resolveHtmlFile(filename: string, dirs: string[], projectRoot: string): string | null {
+  for (const dir of dirs) {
+    const resolvedDir = dir.startsWith("/") ? dir : path.resolve(projectRoot, dir);
+    const flatPath = path.join(resolvedDir, `${filename}.html`);
+    const indexPath = path.join(resolvedDir, filename, "index.html");
+
+    if (fs.existsSync(flatPath)) return flatPath;
+    if (fs.existsSync(indexPath)) return indexPath;
+  }
+  return null;
+}
+
+// ─── Main Validation ─────────────────────────────────────────
+
+function validate() {
+  console.log("🔍 MCP LocalBosses Integration Validator\n");
+  console.log("═".repeat(60));
+
+  let errors = 0;
+  let warnings = 0;
+
+  // 1. Parse all files
+  const channelsSource = readFile(CHANNELS_FILE);
+  const appNamesSource = readFile(APP_NAMES_FILE);
+  const appIntakesSource = readFile(APP_INTAKES_FILE);
+  const routeSource = readFile(ROUTE_FILE);
+
+  const channelData = parseChannelApps(channelsSource);
+  const appNameKeys = new Set(parseAppNames(appNamesSource));
+  const appIntakeKeys = new Set(parseAppIntakes(appIntakesSource));
+  const { nameMap: routeNameMap, dirs: routeDirs } = parseRouteFile(routeSource);
+
+  // Collect ALL app IDs referenced in channels
+  const allChannelApps = new Set<string>();
+  for (const channel of channelData) {
+    for (const app of channel.apps) {
+      allChannelApps.add(app);
+    }
+  }
+
+  console.log(`\n📊 Parsed Summary:`);
+  console.log(`   Channels: ${channelData.length}`);
+  console.log(`   Channel app references: ${allChannelApps.size}`);
+  console.log(`   appNames entries: ${appNameKeys.size}`);
+  console.log(`   app-intakes entries: ${appIntakeKeys.size}`);
+  console.log(`   route APP_NAME_MAP entries: ${routeNameMap.size}`);
+  console.log(`   route APP_DIRS: ${routeDirs.length}`);
+
+  // 2. Cross-reference: every app in channels must exist in other 3 files
+  console.log(`\n${"─".repeat(60)}`);
+  console.log(`\n🔗 Cross-Reference: Apps in channels.ts → other files\n`);
+
+  for (const channel of channelData) {
+    for (const appId of channel.apps) {
+      const inNames = appNameKeys.has(appId);
+      const inIntakes = appIntakeKeys.has(appId);
+      const inRoute = routeNameMap.has(appId);
+
+      if (!inNames || !inIntakes || !inRoute) {
+        const missing: string[] = [];
+        if (!inNames) missing.push("appNames.ts");
+        if (!inIntakes) missing.push("app-intakes.ts");
+        if (!inRoute) missing.push("route.ts");
+        console.log(`   ❌ "${appId}" (channel: ${channel.channelId}) — MISSING from: ${missing.join(", ")}`);
+        errors++;
+      }
+    }
+  }
+
+  if (errors === 0) {
+    console.log(`   ✅ All channel apps found in all 3 files`);
+  }
+
+  // 3. Find orphaned entries (in appNames/intakes/route but not in any channel)
+  console.log(`\n${"─".repeat(60)}`);
+  console.log(`\n🗑️  Orphaned Entries (in files but not in any channel)\n`);
+
+  let orphanCount = 0;
+  for (const key of appNameKeys) {
+    if (!allChannelApps.has(key)) {
+      console.log(`   ⚠️  "${key}" in appNames.ts but not in any channel's mcpApps`);
+      warnings++;
+      orphanCount++;
+    }
+  }
+  for (const key of appIntakeKeys) {
+    if (!allChannelApps.has(key)) {
+      console.log(`   ⚠️  "${key}" in app-intakes.ts but not in any channel's mcpApps`);
+      warnings++;
+      orphanCount++;
+    }
+  }
+  for (const key of routeNameMap.keys()) {
+    if (!allChannelApps.has(key)) {
+      console.log(`   ⚠️  "${key}" in route.ts APP_NAME_MAP but not in any channel's mcpApps`);
+      warnings++;
+      orphanCount++;
+    }
+  }
+  if (orphanCount === 0) {
+    console.log(`   ✅ No orphaned entries`);
+  }
+
+  // 4. Verify HTML file resolution
+  console.log(`\n${"─".repeat(60)}`);
+  console.log(`\n📁 HTML File Resolution (APP_NAME_MAP → actual files)\n`);
+
+  const projectRoot = path.resolve(__dirname, "..");
+  let resolutionFailures = 0;
+
+  for (const [appId, filename] of routeNameMap.entries()) {
+    const resolved = resolveHtmlFile(filename, routeDirs, projectRoot);
+    if (!resolved) {
+      console.log(`   ❌ "${appId}" → "${filename}.html" — NOT FOUND in any APP_DIRS`);
+      errors++;
+      resolutionFailures++;
+    }
+  }
+
+  if (resolutionFailures === 0) {
+    console.log(`   ✅ All APP_NAME_MAP entries resolve to HTML files`);
+  }
+
+  // 5. Summary
+  console.log(`\n${"═".repeat(60)}`);
+  console.log(`\n📋 RESULTS: ${errors} errors, ${warnings} warnings`);
+
+  if (errors > 0) {
+    console.log(`\n❌ VALIDATION FAILED — fix ${errors} error(s) before deploying`);
+    process.exit(1);
+  } else if (warnings > 0) {
+    console.log(`\n⚠️  VALIDATION PASSED with ${warnings} warning(s) — review orphaned entries`);
+    process.exit(0);
+  } else {
+    console.log(`\n✅ VALIDATION PASSED — all integrations are consistent`);
+    process.exit(0);
+  }
+}
+
+validate();
+```
+
+**Run in CI:**
+```bash
+# Add to package.json scripts:
+"validate:integration": "ts-node scripts/validate-integration.ts"
+
+# Or without ts-node (compile first):
+"validate:integration": "tsc scripts/validate-integration.ts --outDir scripts/dist && node scripts/dist/validate-integration.js"
+```
+
+**Run before every deploy and as part of Phase 5 QA.**
+
+---
+
+## 13. Quality Gate Checklist
+
+Before passing to Phase 5 (QA), verify:
+
+- [ ] **Channel appears in sidebar** — under correct category with correct icon
+- [ ] **All apps appear in toolbar** — when channel is selected
+- [ ] **Default app auto-opens** — if defaultApp is configured
+- [ ] **Clicking each app opens a thread** — with the intake question
+- [ ] **"Skip" button works** — if skipLabel is defined
+- [ ] **AI generates APP_DATA** — in every thread response
+- [ ] **App receives data** — visual app updates when AI responds
+- [ ] **Refinement works** — asking follow-up questions generates new APP_DATA
+- [ ] **System prompt is comprehensive** — includes tool routing rules, negative instructions, rationale requirement
+- [ ] **System prompt is under budget** — channel prompt < 500 tokens, addons < 300 tokens each
+- [ ] **No 404s for app files** — all HTML files resolve in mcp-apps route
+- [ ] **No missing entries** — every app ID appears in all 4 files (channels, appNames, intakes, route)
+- [ ] **Validation script passes** — `npm run validate:integration` exits with code 0
+- [ ] **Intake questions meet quality criteria** — format hints, skipLabels, under 20 words, action-oriented
+- [ ] **Test fixtures generated** — `test-fixtures/tool-routing.json` baseline created for QA (see below)
+
+### Per-Service Test Fixture Generation
+
+The integrator should generate a `test-fixtures/tool-routing.json` baseline for the QA tester (Phase 5). This file maps natural-language user messages to expected tool calls, derived from the system prompt's tool routing rules:
+
+```json
+{
+  "service": "{service}",
+  "fixtures": [
+    { "message": "show me all contacts", "expectedTool": "list_contacts", "expectedArgs": {} },
+    { "message": "find John Smith", "expectedTool": "get_contact", "expectedArgs": { "name": "John Smith" } },
+    { "message": "add a new contact named Sarah", "expectedTool": "create_contact", "expectedArgs": { "name": "Sarah" } },
+    { "message": "delete the old lead", "expectedTool": null, "expectedBehavior": "should ask for confirmation and specifics" },
+    { "message": "what's the best way to organize contacts?", "expectedTool": null, "expectedBehavior": "respond from expertise, no tool call" }
+  ]
+}
+```
+
+**Generate at least 20 fixtures per service** covering: list, get, create, update, delete, analytics, no-tool-needed, ambiguous queries, and multi-intent messages. Save to `{service}-mcp/test-fixtures/tool-routing.json`. The QA tester uses these for tool routing validation.
+
+### Cross-reference check (critical):
+Every app ID must appear in ALL of these:
+1. `channels.ts` — in the `mcpApps` array
+2. `appNames.ts` — in `APP_DISPLAY_NAMES`
+3. `app-intakes.ts` — in `APP_INTAKES`
+4. `mcp-apps/route.ts` — in `APP_NAME_MAP`
+
+Missing from any one = broken experience. **Use the validation script (Section 12) to automate this check.**
+
+---
+
+## 14. MCP Protocol Bridge: structuredContent → APP_DATA
+
+> This section documents how MCP's native `structuredContent` relates to LocalBosses' APP_DATA pattern, and the roadmap for convergence.
+
+### The Two Layers
+
+**MCP Protocol Layer** (standard):
+- MCP tools return results with `content` (text fallback) and `structuredContent` (typed JSON)
+- Tools declare `outputSchema` so clients know the data shape
+- This is the standard way to send typed data from tools to clients
+
+**LocalBosses Application Layer** (custom):
+- The APP_DATA block (`<!--APP_DATA:...:END_APP_DATA-->`) embeds structured data in LLM-generated text
+- The frontend parses APP_DATA and routes it to the appropriate iframe app via postMessage
+- This is a LocalBosses-specific convention, NOT part of the MCP protocol
+
+### How They Connect Today
+
+```
+MCP Tool → structuredContent (typed JSON)
+    ↓
+LLM receives tool result, generates response
+    ↓
+LLM embeds data as APP_DATA block in response text
+    ↓
+LocalBosses frontend parses APP_DATA
+    ↓
+Frontend sends data to app iframe via postMessage
+```
+
+The LLM is the bridge — it receives `structuredContent` from the tool and re-serializes it as APP_DATA. This works but is lossy (the LLM may modify, truncate, or malform the data).
+
+### Roadmap
+
+| Phase | Approach | Status |
+|-------|----------|--------|
+| **Short-term (current)** | APP_DATA pattern — LLM embeds JSON in response text, frontend parses | ✅ Implemented |
+| **Medium-term** | Route `structuredContent` directly to apps — bypass LLM re-serialization. When a tool returns `structuredContent`, send it directly to the appropriate app without waiting for the LLM to echo it. | 🔜 Planned |
+| **Long-term** | Adopt official MCP Apps protocol (launched Jan 2026) — tools declare `_meta.ui.resourceUri`, apps communicate via JSON-RPC over postMessage, bidirectional data flow. **⚠️ This is live NOW** — Claude, ChatGPT, VS Code, and Goose all support MCP Apps today. | 🔴 Live — Adopt ASAP |
+
+### Medium-Term Architecture
+
+```
+MCP Tool returns structuredContent
+    ↓
+LocalBosses chat route intercepts structuredContent from tool result
+    ↓
+Routes directly to app iframe via postMessage (no LLM re-serialization)
+    ↓
+LLM still generates text explanation, but data is sourced from tool result, not LLM output
+```
+
+**Benefits:** No JSON parsing failures, no data loss from LLM re-serialization, schema-validated data.
+
+### Long-Term → NOW: MCP Apps Protocol (⚠️ Live — Adopt ASAP)
+
+> **Urgency:** The MCP Apps extension launched January 26, 2026 and is **already supported** by Claude, ChatGPT, VS Code, and Goose. This is NOT a future consideration — it's a live standard. Our APP_DATA pattern works only in LocalBosses; MCP Apps works in ANY MCP client.
+
+The official MCP Apps extension defines:
+- `_meta.ui.resourceUri` on tools — declares which UI resource to render
+- `ui://` resource URIs served by the MCP server
+- `@modelcontextprotocol/ext-apps` SDK — standardized App class with `ontoolresult`, `callServerTool`, `updateModelContext`
+- JSON-RPC over postMessage for bidirectional app ↔ server communication
+
+**Migration path:**
+1. Add `_meta.ui.resourceUri` to tool definitions in the server builder
+2. Register app HTML files as `ui://` resources in each MCP server
+3. Update app template to use `@modelcontextprotocol/ext-apps` App class for data reception
+4. Maintain backward compatibility with postMessage/APP_DATA for LocalBosses during transition
+
+**Impact:** MCP tools work in ANY MCP client (Claude, ChatGPT, VS Code) — not just LocalBosses. Massive distribution multiplier.
+
+---
+
+## 15. Execution Workflow
+
+```
+1. Create git checkpoint: git add -A && git commit -m "pre-integration: {service}"
+2. Read {service}-api-analysis.md — get app IDs and tool groups
+3. Update channels.ts — add channel definition with system prompt (include tool routing rules)
+4. Update appNames.ts — add display names and icons
+5. Update app-intakes.ts — add intake questions (meeting quality criteria) and systemPromptAddons
+6. Update mcp-apps/route.ts — add APP_NAME_MAP entries and APP_DIRS path
+7. Verify chat/route.ts — ensure THREAD_SYSTEM_PROMPT works (usually no changes needed)
+8. Run validation script: npx ts-node scripts/validate-integration.ts
+9. Fix any errors/warnings from validation
+10. Test: build LocalBosses, open channel, click app, verify thread + data flow
+11. If QA passes: git add -A && git commit -m "feat: add {service} channel integration"
+12. If QA fails: git checkout -- src/lib/channels.ts src/lib/appNames.ts src/lib/app-intakes.ts src/app/api/mcp-apps/route.ts
+```
+
+**Estimated time:** 30-60 minutes per channel.
+
+**Agent model recommendation:** Sonnet — well-defined patterns, file editing. But system prompt crafting benefits from Opus for nuanced AI instruction design.
+
+---
+
+*This skill is Phase 4 of the MCP Factory pipeline. It wires the server and apps into LocalBosses so everything is accessible through the UI.*
diff --git a/skills/mcp-qa-tester/SKILL.md b/skills/mcp-qa-tester/SKILL.md
new file mode 100644
index 0000000..b80689a
--- /dev/null
+++ b/skills/mcp-qa-tester/SKILL.md
@@ -0,0 +1,3388 @@
+# MCP QA Tester — Automated Testing Framework & Quality Metrics Pipeline
+
+**When to use this skill:** Testing MCP servers, apps, and their LocalBosses integration. Use after Phase 4 (integration) to verify everything works — at the protocol level, visually, functionally, and against live APIs. This is an **automated-first** framework with quantitative metrics, regression baselines, and persistent reporting.
+
+**What this covers:** MCP protocol compliance, automated unit/visual/functional testing, accessibility auditing, performance benchmarking, security validation, chaos testing, and quantitative quality metrics with regression tracking.
+
+---
+
+## Testing Architecture
+
+```
+Layer 0: Protocol Compliance ─── MCP Inspector + JSON-RPC lifecycle validation
+Layer 1: Static Analysis ──────── TypeScript build, linting, file structure, schema validation
+Layer 2: Visual Testing ────────── Playwright screenshots, BackstopJS regression, Gemini analysis
+Layer 2.5: Accessibility ────────── axe-core, keyboard nav, contrast audit, screen reader compat
+Layer 3: Functional Testing ───── Tool routing smoke tests, data flow validation, thread lifecycle
+Layer 3.5: Performance ────────── Cold start, latency, memory, file size budgets
+Layer 4: Live API Testing ──────── Real API calls with credential management strategy
+Layer 4.5: Security ────────────── XSS, CSP, postMessage origin, key exposure
+Layer 5: Integration Testing ──── Full E2E scenarios, chaos testing, cross-browser validation
+```
+
+Every layer has **quantitative pass/fail criteria**. Do NOT skip layers — issues compound.
+
+---
+
+## Quantitative Quality Metrics (REQUIRED)
+
+Every QA report MUST include these metrics. No more pass/fail checklists — we measure.
+
+| Metric | Target | Method | Priority |
+|--------|--------|--------|----------|
+| **MCP Protocol Compliance** | 100% | MCP Inspector — all checks pass | P0 |
+| **Tool Correctness Rate** | >95% | Run 20 NL messages, count correct tool selections | P0 |
+| **Task Completion Rate** | >90% | Run 10 E2E scenarios, count fully completed | P0 |
+| **APP_DATA Schema Match** | 100% | Validate every APP_DATA against JSON schema | P0 |
+| **Response Latency P50** | <3s | Measure 10 read interactions | P1 |
+| **Response Latency P95** | <8s | Measure 10 interactions (reads + writes) | P1 |
+| **App Render Success** | 100% | All apps render data state without console errors | P0 |
+| **Accessibility Score** | >90 | axe-core audit on every app HTML | P1 |
+| **Cold Start Time** | <2s | `time node dist/index.js` → first ListTools response | P1 |
+| **App File Size** | <50KB each | Check all HTML files | P1 |
+| **Security Scan** | 0 critical | XSS + CSP + key exposure checks | P0 |
+
+### How to calculate:
+
+```
+Tool Correctness Rate = (correct_tool_selections / total_test_messages) × 100
+Task Completion Rate  = (completed_scenarios / total_scenarios) × 100
+APP_DATA Schema Match = (valid_app_data_blocks / total_app_data_blocks) × 100
+```
+
+---
+
+## Layer 0: MCP Protocol Compliance Testing
+
+**Why this layer exists:** The MCP spec defines exact JSON-RPC lifecycle, tool definition formats, and error codes. If the server isn't protocol-compliant, nothing else matters. This is the foundation.
+
+### 0.1 — MCP Inspector (Official Tool)
+
+```bash
+# Install and run MCP Inspector against the server
+npx @modelcontextprotocol/inspector stdio node dist/index.js
+
+# The Inspector validates:
+# ✅ initialize → initialized lifecycle
+# ✅ tools/list response format
+# ✅ tools/call request/response format
+# ✅ JSON-RPC message framing
+# ✅ Capability negotiation
+# ✅ Notification handling
+```
+
+### 0.2 — Automated Protocol Test Script
+
+Save as `tests/protocol-compliance.test.ts`:
+
+```typescript
+import { spawn, ChildProcess } from 'child_process';
+import * as readline from 'readline';
+
+// Minimal JSON-RPC client for testing MCP servers over stdio
+class MCPTestClient {
+  private proc: ChildProcess;
+  private rl: readline.Interface;
+  private pending: Map<number, { resolve: Function; reject: Function }> = new Map();
+  private nextId = 1;
+  private notifications: any[] = [];
+
+  constructor(command: string, args: string[]) {
+    this.proc = spawn(command, args, { stdio: ['pipe', 'pipe', 'pipe'] });
+    this.rl = readline.createInterface({ input: this.proc.stdout! });
+    this.rl.on('line', (line) => {
+      try {
+        const msg = JSON.parse(line);
+        if (msg.id && this.pending.has(msg.id)) {
+          this.pending.get(msg.id)!.resolve(msg);
+          this.pending.delete(msg.id);
+        } else if (!msg.id) {
+          this.notifications.push(msg);
+        }
+      } catch (e) { /* ignore non-JSON lines */ }
+    });
+  }
+
+  async request(method: string, params?: any): Promise<any> {
+    const id = this.nextId++;
+    const msg = JSON.stringify({ jsonrpc: '2.0', id, method, params: params || {} });
+    this.proc.stdin!.write(msg + '\n');
+    return new Promise((resolve, reject) => {
+      this.pending.set(id, { resolve, reject });
+      setTimeout(() => {
+        if (this.pending.has(id)) {
+          this.pending.delete(id);
+          reject(new Error(`Timeout on ${method}`));
+        }
+      }, 10000);
+    });
+  }
+
+  getNotifications() { return this.notifications; }
+
+  async close() {
+    this.proc.kill();
+  }
+}
+
+describe('MCP Protocol Compliance', () => {
+  let client: MCPTestClient;
+
+  beforeAll(async () => {
+    client = new MCPTestClient('node', ['dist/index.js']);
+  });
+
+  afterAll(async () => {
+    await client.close();
+  });
+
+  test('initialize → initialized lifecycle', async () => {
+    const initResult = await client.request('initialize', {
+      protocolVersion: '2025-11-25',
+      capabilities: {},
+      clientInfo: { name: 'qa-test-client', version: '1.0.0' }
+    });
+
+    expect(initResult.result).toBeDefined();
+    expect(initResult.result.protocolVersion).toBeDefined();
+    expect(initResult.result.capabilities).toBeDefined();
+    expect(initResult.result.serverInfo).toBeDefined();
+    expect(initResult.result.serverInfo.name).toBeTruthy();
+    expect(initResult.result.serverInfo.version).toBeTruthy();
+
+    // Send initialized notification (no id = notification)
+    client.request('notifications/initialized', {}).catch(() => {});
+  });
+
+  test('tools/list returns valid tool definitions', async () => {
+    const result = await client.request('tools/list', {});
+    
+    expect(result.result).toBeDefined();
+    expect(result.result.tools).toBeInstanceOf(Array);
+    expect(result.result.tools.length).toBeGreaterThan(0);
+
+    for (const tool of result.result.tools) {
+      // Required fields per MCP 2025-11-25
+      expect(tool.name).toBeTruthy();
+      expect(tool.description).toBeTruthy();
+      expect(typeof tool.name).toBe('string');
+      expect(typeof tool.description).toBe('string');
+      
+      // Name format: must be alphanumeric + underscores/hyphens/dots
+      expect(tool.name).toMatch(/^[a-zA-Z0-9_.\-]+$/);
+      
+      // inputSchema must be valid JSON Schema object
+      if (tool.inputSchema) {
+        expect(tool.inputSchema.type).toBe('object');
+      }
+
+      // If title exists, must be string
+      if (tool.title) {
+        expect(typeof tool.title).toBe('string');
+      }
+
+      // If outputSchema exists, validate it
+      if (tool.outputSchema) {
+        expect(tool.outputSchema.type).toBeDefined();
+      }
+
+      // If annotations exist, validate known fields
+      if (tool.annotations) {
+        const validAnnotations = [
+          'readOnlyHint', 'destructiveHint', 'idempotentHint', 'openWorldHint'
+        ];
+        for (const key of Object.keys(tool.annotations)) {
+          if (validAnnotations.includes(key)) {
+            expect(typeof tool.annotations[key]).toBe('boolean');
+          }
+        }
+      }
+    }
+  });
+
+  test('tools/call returns valid response for read-only tools', async () => {
+    // Get list of tools first
+    const listResult = await client.request('tools/list', {});
+    const readOnlyTools = listResult.result.tools.filter(
+      (t: any) => t.annotations?.readOnlyHint === true
+    );
+
+    // Test first read-only tool (safest to call)
+    if (readOnlyTools.length > 0) {
+      const tool = readOnlyTools[0];
+      const callResult = await client.request('tools/call', {
+        name: tool.name,
+        arguments: {}
+      });
+
+      expect(callResult.result).toBeDefined();
+      
+      // Result must have content array
+      if (!callResult.result.isError) {
+        expect(callResult.result.content).toBeInstanceOf(Array);
+        for (const item of callResult.result.content) {
+          expect(item.type).toBeDefined();
+          // Text content must have text field
+          if (item.type === 'text') {
+            expect(typeof item.text).toBe('string');
+          }
+        }
+      }
+
+      // If structuredContent exists, validate against outputSchema
+      if (callResult.result.structuredContent && tool.outputSchema) {
+        // Basic type check — full JSON Schema validation is in the schema validator section
+        expect(typeof callResult.result.structuredContent).toBe('object');
+      }
+    }
+  });
+
+  test('error responses use correct JSON-RPC error codes', async () => {
+    // Call non-existent tool — should get method not found or tool error
+    const result = await client.request('tools/call', {
+      name: 'nonexistent_tool_that_should_not_exist_12345',
+      arguments: {}
+    });
+
+    // Should be an error response
+    expect(
+      result.error || result.result?.isError
+    ).toBeTruthy();
+
+    // If protocol error, must use standard JSON-RPC codes
+    if (result.error) {
+      expect(result.error.code).toBeDefined();
+      expect(typeof result.error.code).toBe('number');
+      expect(result.error.message).toBeTruthy();
+      // Standard codes: -32700 (parse), -32600 (invalid request),
+      // -32601 (method not found), -32602 (invalid params), -32603 (internal)
+    }
+  });
+
+  test('notification handling works', async () => {
+    // Server should handle ping
+    try {
+      await client.request('ping', {});
+      // If no error, ping is supported
+    } catch (e) {
+      // Ping timeout is acceptable for some servers
+    }
+  });
+});
+```
+
+### 0.3 — structuredContent Validation
+
+```typescript
+// tests/structured-content.test.ts
+import Ajv from 'ajv';
+
+const ajv = new Ajv({ allErrors: true });
+
+function validateStructuredContent(
+  toolName: string,
+  outputSchema: object,
+  structuredContent: any
+): { valid: boolean; errors: string[] } {
+  const validate = ajv.compile(outputSchema);
+  const valid = validate(structuredContent);
+  return {
+    valid: !!valid,
+    errors: valid ? [] : (validate.errors || []).map(e =>
+      `${e.instancePath} ${e.message}`
+    )
+  };
+}
+
+// Run this after getting tools/list + tools/call results
+describe('structuredContent schema validation', () => {
+  test('every tool with outputSchema returns conforming structuredContent', async () => {
+    // This would be populated from actual tool calls
+    const toolResults: Array<{
+      toolName: string;
+      outputSchema: object;
+      structuredContent: any;
+    }> = []; // Populate from Layer 4 results
+
+    for (const { toolName, outputSchema, structuredContent } of toolResults) {
+      if (structuredContent && outputSchema) {
+        const result = validateStructuredContent(toolName, outputSchema, structuredContent);
+        expect(result.valid).toBe(true);
+        if (!result.valid) {
+          console.error(`Schema mismatch for ${toolName}:`, result.errors);
+        }
+      }
+    }
+  });
+});
+```
+
+### 0.4 — Tasks & Elicitation Testing (2025-11-25 Spec)
+
+If the server declares `tasks` capability (async operations via SEP-1686), test the task lifecycle:
+
+```typescript
+test('tasks/list returns valid task list', async () => {
+  const result = await client.request('tasks/list', {});
+  if (result.result) {
+    expect(result.result.tasks).toBeInstanceOf(Array);
+  }
+  // Some servers may not implement tasks — that's OK, just verify no crash
+});
+
+test('long-running tool call returns task reference when task-enabled', async () => {
+  // If a tool has execution.taskSupport = "required" or "optional",
+  // calling it with _meta.taskId should return a task reference
+  // rather than blocking until completion
+  const listResult = await client.request('tools/list', {});
+  const taskTools = listResult.result.tools.filter(
+    (t: any) => t.execution?.taskSupport === 'required' || t.execution?.taskSupport === 'optional'
+  );
+  // Log task-capable tools for the report
+  console.log(`Task-capable tools: ${taskTools.map((t: any) => t.name).join(', ') || 'none'}`);
+});
+```
+
+If the server uses **elicitation** (`elicitation/create`), test that:
+- Elicitation requests include valid `requestedSchema` with JSON Schema
+- The server handles user-provided elicitation responses gracefully
+- URL mode elicitation (2025-11-25) correctly redirects to external URLs
+- The server doesn't hang if elicitation is denied by the client
+
+```typescript
+test('server handles elicitation denial gracefully', async () => {
+  // If server requests elicitation and client denies, server should
+  // return a useful error message, not crash or hang
+  // This is tested implicitly by calling tools without providing
+  // elicitation responses — the server should timeout or fallback
+});
+```
+
+### Quality Gate:
+- [ ] MCP Inspector passes all checks
+- [ ] initialize → initialized lifecycle works
+- [ ] tools/list returns valid, non-empty tool array
+- [ ] All tool names match `/^[a-zA-Z0-9_.\-]+$/`
+- [ ] All tool descriptions are non-empty strings
+- [ ] tools/call returns valid content arrays
+- [ ] structuredContent (if present) matches outputSchema
+- [ ] Error responses use correct JSON-RPC codes
+- [ ] Server handles unknown methods gracefully (doesn't crash)
+
+---
+
+## Layer 1: Static Analysis
+
+### 1.1 — TypeScript Compilation
+```bash
+cd {service}-mcp
+npm run build 2>&1
+# Must exit 0 with no errors
+# Warnings are OK but should be reviewed
+
+# Separate type-check (catches issues build might miss)
+npx tsc --noEmit 2>&1
+```
+
+### 1.2 — Code Quality Checks
+```bash
+# Check for `any` types (red flag)
+grep -rn ": any" src/ --include="*.ts" | grep -v "node_modules" | grep -v "// eslint" | grep -v "catch"
+# Goal: zero instances in tool handlers
+# Exception: catch(error: any) is acceptable
+
+# Check for console.log (should use structured logging)
+grep -rn "console.log" src/ --include="*.ts" | grep -v "node_modules"
+# Goal: zero — use console.error for MCP server logging
+
+# Check SDK version is pinned appropriately
+node -e "const p = require('./package.json'); console.log('SDK:', p.dependencies['@modelcontextprotocol/sdk'])"
+# Should be ^1.26.0 or higher (security fix: GHSA-345p-7cg4-v4c7)
+
+# Check Zod version
+node -e "const p = require('./package.json'); console.log('Zod:', p.dependencies['zod'])"
+# Should be ^3.25.0 or higher
+```
+
+### 1.3 — HTML App Validation
+```bash
+# Check all app HTML files exist and are within size budget
+for f in app-ui/*.html ui/dist/*.html; do
+  if [ -f "$f" ]; then
+    SIZE=$(wc -c < "$f" | tr -d ' ')
+    if [ "$SIZE" -gt 51200 ]; then
+      echo "⚠️  $f ($SIZE bytes) — EXCEEDS 50KB budget"
+    else
+      echo "✅ $f ($SIZE bytes)"
+    fi
+  else
+    echo "❌ $f MISSING"
+  fi
+done
+```
+
+### 1.4 — Route Mapping Cross-Reference
+```bash
+# Verify every app ID in channels.ts has a matching entry in ALL integration files
+node -e "
+const fs = require('fs');
+const path = require('path');
+
+const LB_ROOT = 'localbosses-app/src';
+const files = {
+  channels: fs.readFileSync(path.join(LB_ROOT, 'lib/channels.ts'), 'utf8'),
+  appNames: fs.readFileSync(path.join(LB_ROOT, 'lib/appNames.ts'), 'utf8'),
+  intakes: fs.readFileSync(path.join(LB_ROOT, 'lib/app-intakes.ts'), 'utf8'),
+  route: fs.readFileSync(path.join(LB_ROOT, 'app/api/mcp-apps/route.ts'), 'utf8'),
+};
+
+// Extract app IDs from channels (anything in mcpApps arrays)
+const channelApps = [...files.channels.matchAll(/['\"]([a-z0-9-]+)['\"]/g)]
+  .map(m => m[1])
+  .filter(id => id.length > 3 && !['true','false','null'].includes(id));
+
+let issues = 0;
+const unique = [...new Set(channelApps)];
+for (const id of unique) {
+  const inNames = files.appNames.includes(id);
+  const inIntakes = files.intakes.includes(id);
+  const inRoute = files.route.includes(id);
+  if (!inNames || !inIntakes || !inRoute) {
+    console.log('❌ ' + id + ': ' +
+      (!inNames ? 'MISSING appNames ' : '') +
+      (!inIntakes ? 'MISSING app-intakes ' : '') +
+      (!inRoute ? 'MISSING route ' : ''));
+    issues++;
+  }
+}
+if (issues === 0) console.log('✅ All ' + unique.length + ' app IDs cross-referenced');
+else console.log('\\n⚠️  ' + issues + ' cross-reference issues found');
+"
+```
+
+### Quality Gate:
+- [ ] TypeScript compiles with zero errors
+- [ ] `tsc --noEmit` passes clean
+- [ ] No unintended `any` types in tool handlers
+- [ ] SDK pinned to `^1.26.0`+, Zod to `^3.25.0`+ (Do NOT use Zod v4.x with SDK v1.x — known incompatibility, issue #1429)
+- [ ] All HTML app files exist, are >1KB and <50KB
+- [ ] All app IDs cross-referenced across channels, appNames, app-intakes, and route map
+- [ ] All route mappings resolve to actual HTML files
+
+---
+
+## Layer 2: Visual Testing
+
+### 2.1 — Automated Playwright Visual Tests
+
+Save as `tests/visual.test.ts`:
+
+```typescript
+import { test, expect, Page } from '@playwright/test';
+import * as fs from 'fs';
+import * as path from 'path';
+
+// Configuration
+const APP_UI_DIR = path.resolve(__dirname, '../app-ui');
+const SCREENSHOTS_DIR = path.resolve(__dirname, '../test-results/screenshots');
+const BASELINES_DIR = path.resolve(__dirname, '../test-baselines/screenshots');
+const FIXTURES_DIR = path.resolve(__dirname, '../test-fixtures');
+
+// Ensure directories exist
+fs.mkdirSync(SCREENSHOTS_DIR, { recursive: true });
+
+// Discover all HTML app files
+const appFiles = fs.readdirSync(APP_UI_DIR)
+  .filter(f => f.endsWith('.html'))
+  .map(f => path.join(APP_UI_DIR, f));
+
+// Load fixture for app type (or use default)
+function loadFixture(appFile: string): any {
+  const baseName = path.basename(appFile, '.html');
+  const fixturePath = path.join(FIXTURES_DIR, `${baseName}.json`);
+  if (fs.existsSync(fixturePath)) {
+    return JSON.parse(fs.readFileSync(fixturePath, 'utf8'));
+  }
+  // Default fixture
+  return {
+    title: 'Test Data',
+    data: [
+      { name: 'Test Item 1', status: 'active', value: 100 },
+      { name: 'Test Item 2', status: 'inactive', value: 200 },
+      { name: 'Test Item 3', status: 'pending', value: 300 },
+    ],
+    meta: { total: 3, page: 1, pageSize: 25 }
+  };
+}
+
+for (const appFile of appFiles) {
+  const appName = path.basename(appFile, '.html');
+
+  test.describe(`Visual: ${appName}`, () => {
+    let page: Page;
+
+    test.beforeEach(async ({ browser }) => {
+      page = await browser.newPage({ viewport: { width: 400, height: 600 } });
+      await page.goto(`file://${appFile}`);
+      // Collect console errors
+      page.on('console', msg => {
+        if (msg.type() === 'error') {
+          console.error(`[${appName}] Console error:`, msg.text());
+        }
+      });
+    });
+
+    test.afterEach(async () => {
+      await page.close();
+    });
+
+    test('renders loading state initially', async () => {
+      // Before any data, loading state should show
+      const loading = page.locator('#loading');
+      const content = page.locator('#content');
+      // At least one should be visible
+      const loadingVis = await loading.isVisible().catch(() => false);
+      const contentVis = await content.isVisible().catch(() => false);
+      expect(loadingVis || contentVis).toBe(true);
+
+      await page.screenshot({
+        path: path.join(SCREENSHOTS_DIR, `${appName}-loading.png`)
+      });
+    });
+
+    test('renders empty state', async () => {
+      // Inject empty data
+      await page.evaluate(() => {
+        window.postMessage({ type: 'mcp_app_data', data: {} }, '*');
+      });
+      await page.waitForTimeout(500);
+
+      // Should show empty state, not crash
+      const hasError = await page.evaluate(() => {
+        return document.body.innerText.includes('Error') ||
+               document.body.innerText.includes('undefined');
+      });
+      
+      await page.screenshot({
+        path: path.join(SCREENSHOTS_DIR, `${appName}-empty.png`)
+      });
+      
+      // No JS crashes
+      expect(hasError).toBe(false);
+    });
+
+    test('renders data state without console errors', async () => {
+      const fixture = loadFixture(appFile);
+      const consoleErrors: string[] = [];
+      page.on('console', msg => {
+        if (msg.type() === 'error') consoleErrors.push(msg.text());
+      });
+
+      // Inject fixture data
+      await page.evaluate((data) => {
+        window.postMessage({ type: 'mcp_app_data', data }, '*');
+      }, fixture);
+      await page.waitForTimeout(1000);
+
+      // Content should be visible (loading hidden)
+      const loading = page.locator('#loading');
+      const loadingHidden = !(await loading.isVisible().catch(() => true));
+      
+      await page.screenshot({
+        path: path.join(SCREENSHOTS_DIR, `${appName}-data.png`)
+      });
+
+      expect(consoleErrors).toHaveLength(0);
+    });
+
+    test('no horizontal overflow at 320px', async () => {
+      await page.setViewportSize({ width: 320, height: 600 });
+      const fixture = loadFixture(appFile);
+      
+      await page.evaluate((data) => {
+        window.postMessage({ type: 'mcp_app_data', data }, '*');
+      }, fixture);
+      await page.waitForTimeout(500);
+
+      const hasOverflow = await page.evaluate(() => {
+        return document.documentElement.scrollWidth > document.documentElement.clientWidth;
+      });
+
+      await page.screenshot({
+        path: path.join(SCREENSHOTS_DIR, `${appName}-narrow.png`)
+      });
+
+      expect(hasOverflow).toBe(false);
+    });
+
+    test('dark theme compliance', async () => {
+      const fixture = loadFixture(appFile);
+      await page.evaluate((data) => {
+        window.postMessage({ type: 'mcp_app_data', data }, '*');
+      }, fixture);
+      await page.waitForTimeout(500);
+
+      // Check background color is dark
+      const bgColor = await page.evaluate(() => {
+        return getComputedStyle(document.body).backgroundColor;
+      });
+      // Should be dark (r,g,b each < 60)
+      const match = bgColor.match(/\d+/g);
+      if (match) {
+        const [r, g, b] = match.map(Number);
+        expect(r).toBeLessThan(60);
+        expect(g).toBeLessThan(60);
+        expect(b).toBeLessThan(60);
+      }
+    });
+  });
+}
+```
+
+### 2.2 — BackstopJS Visual Regression
+
+```bash
+# Initialize BackstopJS (one-time setup)
+npm install -g backstopjs
+backstop init
+
+# Configure backstop.json:
+```
+
+```json
+{
+  "id": "mcp-apps",
+  "viewports": [
+    { "label": "thread-panel", "width": 400, "height": 600 },
+    { "label": "narrow", "width": 320, "height": 600 },
+    { "label": "wide", "width": 800, "height": 600 }
+  ],
+  "scenarios": [
+    {
+      "label": "contact-grid-data",
+      "url": "file:///path/to/app-ui/contact-grid.html",
+      "onReadyScript": "inject-data.js",
+      "delay": 1000,
+      "misMatchThreshold": 5.0,
+      "requireSameDimensions": true
+    }
+  ],
+  "paths": {
+    "bitmaps_reference": "test-baselines/backstop",
+    "bitmaps_test": "test-results/backstop",
+    "engine_scripts": "tests/backstop-scripts"
+  },
+  "engine": "playwright",
+  "engineOptions": {
+    "args": ["--no-sandbox"]
+  }
+}
+```
+
+```javascript
+// tests/backstop-scripts/inject-data.js
+module.exports = async (page, scenario, viewport, isReference, browserContext) => {
+  const fixtures = require('../test-fixtures/' + scenario.label.split('-')[0] + '.json');
+  await page.evaluate((data) => {
+    window.postMessage({ type: 'mcp_app_data', data }, '*');
+  }, fixtures);
+  await page.waitForTimeout(500);
+};
+```
+
+```bash
+# Capture baselines (run once when apps are verified correct)
+backstop reference
+
+# Test against baselines (run on every QA cycle)
+backstop test
+# Result: PASS if <5% pixel diff, FAIL otherwise
+# Visual diff report opens in browser automatically
+```
+
+### 2.3 — Gemini Multimodal Analysis (Subjective Quality)
+
+```bash
+# After Playwright captures screenshots, run Gemini for subjective quality:
+gemini "Analyze this MCP app screenshot. Check and rate PASS/WARN/FAIL:
+
+1. RENDERING: Does it show real content (not blank/placeholder)?
+2. DARK THEME: Background ~#1a1d23, accent ~#ff6d5a, text ~#dcddde
+3. LAYOUT: Content properly aligned, no overlapping elements?
+4. TYPOGRAPHY: Text readable, proper sizing, no clipping?
+5. DATA QUALITY: Does the rendered data look realistic?
+6. RESPONSIVENESS: Would this work at 280px (thread panel)?
+7. BUGS: Any visual artifacts, broken images, misaligned elements?" -f screenshot.png
+```
+
+### Quality Gate:
+- [ ] All apps render loading → empty → data states without crashes
+- [ ] Zero console errors in data state
+- [ ] No horizontal overflow at 320px width
+- [ ] Dark theme compliance (background RGB each <60)
+- [ ] BackstopJS regression: <5% pixel diff from baselines
+- [ ] Gemini subjective review: no FAIL ratings
+
+---
+
+## Layer 2.5: Accessibility Testing
+
+### 2.5.1 — axe-core Automated Audit
+
+Integrate directly into Playwright tests:
+
+```typescript
+// tests/accessibility.test.ts
+import { test, expect, Page } from '@playwright/test';
+import AxeBuilder from '@axe-core/playwright';
+import * as fs from 'fs';
+import * as path from 'path';
+
+const APP_UI_DIR = path.resolve(__dirname, '../app-ui');
+const FIXTURES_DIR = path.resolve(__dirname, '../test-fixtures');
+
+const appFiles = fs.readdirSync(APP_UI_DIR)
+  .filter(f => f.endsWith('.html'));
+
+for (const appFile of appFiles) {
+  const appName = path.basename(appFile, '.html');
+
+  test.describe(`Accessibility: ${appName}`, () => {
+    test('passes axe-core audit with data loaded', async ({ page }) => {
+      await page.goto(`file://${path.join(APP_UI_DIR, appFile)}`);
+
+      // Load fixture data
+      const fixturePath = path.join(FIXTURES_DIR, `${appName}.json`);
+      const fixture = fs.existsSync(fixturePath)
+        ? JSON.parse(fs.readFileSync(fixturePath, 'utf8'))
+        : { title: 'Test', data: [{ name: 'Test', status: 'active' }] };
+
+      await page.evaluate((data) => {
+        window.postMessage({ type: 'mcp_app_data', data }, '*');
+      }, fixture);
+      await page.waitForTimeout(1000);
+
+      // Run axe-core
+      const results = await new AxeBuilder({ page })
+        .withTags(['wcag2a', 'wcag2aa', 'wcag21a', 'wcag21aa'])
+        .analyze();
+
+      // Log violations for debugging
+      if (results.violations.length > 0) {
+        console.log(`\n[${appName}] Accessibility violations:`);
+        for (const v of results.violations) {
+          console.log(`  ${v.impact}: ${v.id} — ${v.description}`);
+          console.log(`    Help: ${v.helpUrl}`);
+          for (const node of v.nodes.slice(0, 3)) {
+            console.log(`    Target: ${node.target.join(' > ')}`);
+          }
+        }
+      }
+
+      // Calculate score: (passes / (passes + violations)) * 100
+      const totalChecks = results.passes.length + results.violations.length;
+      const score = totalChecks > 0
+        ? Math.round((results.passes.length / totalChecks) * 100)
+        : 100;
+
+      console.log(`[${appName}] Accessibility score: ${score}%`);
+
+      // Target: >90% score, zero critical/serious violations
+      const criticalViolations = results.violations.filter(
+        v => v.impact === 'critical' || v.impact === 'serious'
+      );
+      expect(criticalViolations).toHaveLength(0);
+      expect(score).toBeGreaterThanOrEqual(90);
+    });
+
+    test('all interactive elements reachable via keyboard', async ({ page }) => {
+      await page.goto(`file://${path.join(APP_UI_DIR, appFile)}`);
+      
+      // Inject data first
+      const fixturePath = path.join(FIXTURES_DIR, `${appName}.json`);
+      const fixture = fs.existsSync(fixturePath)
+        ? JSON.parse(fs.readFileSync(fixturePath, 'utf8'))
+        : { title: 'Test', data: [{ name: 'Test' }] };
+
+      await page.evaluate((data) => {
+        window.postMessage({ type: 'mcp_app_data', data }, '*');
+      }, fixture);
+      await page.waitForTimeout(500);
+
+      // Get all interactive elements
+      const interactiveElements = await page.evaluate(() => {
+        const selectors = 'a, button, input, select, textarea, [tabindex], [role="button"], [role="link"], [role="tab"]';
+        const elements = document.querySelectorAll(selectors);
+        return Array.from(elements).map(el => ({
+          tag: el.tagName.toLowerCase(),
+          text: (el as HTMLElement).innerText?.slice(0, 50) || el.getAttribute('aria-label') || '',
+          tabIndex: (el as HTMLElement).tabIndex,
+          visible: (el as HTMLElement).offsetParent !== null,
+        }));
+      });
+
+      // Filter to visible elements
+      const visibleInteractive = interactiveElements.filter(el => el.visible);
+
+      // Tab through all elements and verify focus reaches each
+      let focusedCount = 0;
+      for (let i = 0; i < visibleInteractive.length + 5; i++) {
+        await page.keyboard.press('Tab');
+        const focused = await page.evaluate(() => {
+          const el = document.activeElement;
+          return el ? el.tagName.toLowerCase() : 'none';
+        });
+        if (focused !== 'body' && focused !== 'none') {
+          focusedCount++;
+        }
+      }
+
+      // At least 80% of visible interactive elements should be reachable
+      if (visibleInteractive.length > 0) {
+        const reachRate = focusedCount / visibleInteractive.length;
+        expect(reachRate).toBeGreaterThanOrEqual(0.8);
+      }
+    });
+  });
+}
+```
+
+### 2.5.2 — Standalone axe-core Snippet (for Browser DevTools)
+
+```javascript
+// Paste this into browser console on any app iframe:
+(async () => {
+  if (!window.axe) {
+    const s = document.createElement('script');
+    s.src = 'https://cdnjs.cloudflare.com/ajax/libs/axe-core/4.10.0/axe.min.js';
+    document.head.appendChild(s);
+    await new Promise(r => s.onload = r);
+  }
+  const results = await axe.run(document, {
+    runOnly: ['wcag2a', 'wcag2aa', 'wcag21aa']
+  });
+  console.log('=== Accessibility Results ===');
+  console.log(`Passes: ${results.passes.length}`);
+  console.log(`Violations: ${results.violations.length}`);
+  const score = Math.round(
+    (results.passes.length / (results.passes.length + results.violations.length)) * 100
+  );
+  console.log(`Score: ${score}%`);
+  if (results.violations.length > 0) {
+    console.table(results.violations.map(v => ({
+      impact: v.impact,
+      id: v.id,
+      description: v.description,
+      nodes: v.nodes.length
+    })));
+  }
+  return results;
+})();
+```
+
+### 2.5.3 — Color Contrast Audit
+
+```javascript
+// Validate contrast ratios for all text elements
+// Paste into browser console on any app iframe:
+(function auditContrast() {
+  function luminance(r, g, b) {
+    const a = [r, g, b].map(v => {
+      v /= 255;
+      return v <= 0.03928 ? v / 12.92 : Math.pow((v + 0.055) / 1.055, 2.4);
+    });
+    return a[0] * 0.2126 + a[1] * 0.7152 + a[2] * 0.0722;
+  }
+  function contrastRatio(rgb1, rgb2) {
+    const l1 = luminance(...rgb1) + 0.05;
+    const l2 = luminance(...rgb2) + 0.05;
+    return l1 > l2 ? l1 / l2 : l2 / l1;
+  }
+  function parseRGB(color) {
+    const m = color.match(/\d+/g);
+    return m ? m.slice(0, 3).map(Number) : [0, 0, 0];
+  }
+
+  const textElements = document.querySelectorAll('*');
+  const issues = [];
+  
+  textElements.forEach(el => {
+    const style = getComputedStyle(el);
+    if (!el.textContent?.trim() || style.display === 'none') return;
+    
+    const fgRGB = parseRGB(style.color);
+    const bgRGB = parseRGB(style.backgroundColor);
+    
+    // Skip if background is transparent (would need to walk up)
+    if (style.backgroundColor === 'rgba(0, 0, 0, 0)') return;
+    
+    const ratio = contrastRatio(fgRGB, bgRGB);
+    const fontSize = parseFloat(style.fontSize);
+    const isBold = parseInt(style.fontWeight) >= 700;
+    const isLargeText = fontSize >= 24 || (fontSize >= 18.66 && isBold);
+    const required = isLargeText ? 3.0 : 4.5;
+    
+    if (ratio < required) {
+      issues.push({
+        text: el.textContent.trim().slice(0, 40),
+        fg: style.color,
+        bg: style.backgroundColor,
+        ratio: ratio.toFixed(1),
+        required: required,
+        tag: el.tagName
+      });
+    }
+  });
+  
+  if (issues.length === 0) {
+    console.log('✅ All text passes WCAG AA contrast requirements');
+  } else {
+    console.log(`❌ ${issues.length} contrast failures:`);
+    console.table(issues);
+  }
+})();
+```
+
+### 2.5.4 — Screen Reader Testing (macOS VoiceOver)
+
+```markdown
+### VoiceOver Manual Test Procedure:
+1. Open the app in Safari (VoiceOver works best with Safari)
+2. Enable VoiceOver: Cmd+F5
+3. Navigate with VO+Right Arrow through all elements
+4. Verify:
+   - [ ] App title/heading is announced
+   - [ ] Data table rows are announced with column headers
+   - [ ] Status badges announce text (not just color)
+   - [ ] Loading state announces "Loading" or similar
+   - [ ] Empty state announces helpful message
+   - [ ] Interactive elements announce their purpose
+   - [ ] No "blank" or "group" without context
+5. Disable VoiceOver: Cmd+F5
+```
+
+### Quality Gate:
+- [ ] axe-core score >90% on all apps
+- [ ] Zero critical/serious axe violations
+- [ ] All text meets WCAG AA contrast (4.5:1 normal, 3:1 large)
+- [ ] Secondary text uses #b0b2b8 or lighter (not #96989d)
+- [ ] All interactive elements reachable via Tab
+- [ ] VoiceOver reads meaningful content (no blank/unlabeled regions)
+
+---
+
+## Layer 3: Functional Testing
+
+### 3.1 — Jest Unit Tests with MSW (Mock Service Worker)
+
+Test tool handlers without hitting real APIs:
+
+```typescript
+// tests/tools.test.ts
+import { http, HttpResponse } from 'msw';
+import { setupServer } from 'msw/node';
+
+// Mock API responses
+const mockContacts = [
+  { id: '1', name: 'John Doe', email: 'john@example.com', phone: '555-0101', status: 'active' },
+  { id: '2', name: 'Jane Smith', email: 'jane@example.com', phone: '555-0102', status: 'inactive' },
+  { id: '3', name: 'Bob Wilson', email: 'bob@example.com', phone: '555-0103', status: 'active' },
+];
+
+const handlers = [
+  // Mock the external API endpoints your tools call
+  http.get('https://api.example.com/v1/contacts', ({ request }) => {
+    const url = new URL(request.url);
+    const page = Number(url.searchParams.get('page') || 1);
+    const pageSize = Number(url.searchParams.get('pageSize') || 25);
+    const status = url.searchParams.get('status');
+    
+    let filtered = mockContacts;
+    if (status) filtered = filtered.filter(c => c.status === status);
+    
+    return HttpResponse.json({
+      data: filtered.slice((page - 1) * pageSize, page * pageSize),
+      meta: { total: filtered.length, page, pageSize }
+    });
+  }),
+
+  http.get('https://api.example.com/v1/contacts/:id', ({ params }) => {
+    const contact = mockContacts.find(c => c.id === params.id);
+    if (!contact) {
+      return new HttpResponse(null, { status: 404 });
+    }
+    return HttpResponse.json(contact);
+  }),
+
+  http.post('https://api.example.com/v1/contacts', async ({ request }) => {
+    const body = await request.json() as any;
+    return HttpResponse.json({
+      id: 'new-1',
+      ...body,
+      created_at: new Date().toISOString()
+    }, { status: 201 });
+  }),
+
+  // Mock 500 error for chaos testing
+  http.get('https://api.example.com/v1/error-endpoint', () => {
+    return new HttpResponse(null, { status: 500 });
+  }),
+];
+
+const server = setupServer(...handlers);
+
+beforeAll(() => server.listen({ onUnhandledRequest: 'warn' }));
+afterEach(() => server.resetHandlers());
+afterAll(() => server.close());
+
+describe('Tool Handlers', () => {
+  test('list_contacts returns paginated results', async () => {
+    // Import your actual tool handler
+    // const { handleListContacts } = require('../src/tools/contacts');
+    // const result = await handleListContacts({ page: 1, pageSize: 25 });
+    
+    // For now, test the API client directly
+    const response = await fetch('https://api.example.com/v1/contacts?page=1&pageSize=25');
+    const data = await response.json();
+    
+    expect(data.data).toBeInstanceOf(Array);
+    expect(data.data.length).toBeGreaterThan(0);
+    expect(data.meta.total).toBeDefined();
+    expect(data.meta.page).toBe(1);
+    
+    // Validate each contact shape
+    for (const contact of data.data) {
+      expect(contact.id).toBeTruthy();
+      expect(contact.name).toBeTruthy();
+      expect(contact.email).toBeTruthy();
+    }
+  });
+
+  test('list_contacts filters by status', async () => {
+    const response = await fetch('https://api.example.com/v1/contacts?status=active');
+    const data = await response.json();
+    
+    for (const contact of data.data) {
+      expect(contact.status).toBe('active');
+    }
+  });
+
+  test('get_contact returns single contact', async () => {
+    const response = await fetch('https://api.example.com/v1/contacts/1');
+    const data = await response.json();
+    
+    expect(data.id).toBe('1');
+    expect(data.name).toBe('John Doe');
+  });
+
+  test('get_contact returns 404 for unknown ID', async () => {
+    const response = await fetch('https://api.example.com/v1/contacts/unknown-99');
+    expect(response.status).toBe(404);
+  });
+
+  test('create_contact returns created entity', async () => {
+    const response = await fetch('https://api.example.com/v1/contacts', {
+      method: 'POST',
+      headers: { 'Content-Type': 'application/json' },
+      body: JSON.stringify({ name: 'New Contact', email: 'new@test.com' })
+    });
+    const data = await response.json();
+    
+    expect(response.status).toBe(201);
+    expect(data.id).toBeTruthy();
+    expect(data.name).toBe('New Contact');
+  });
+
+  test('handles API 500 errors gracefully', async () => {
+    const response = await fetch('https://api.example.com/v1/error-endpoint');
+    expect(response.status).toBe(500);
+    // Tool handler should return isError: true, not crash
+  });
+});
+```
+
+> **MSW Mock Validation:** Hand-crafted mocks can drift from real API responses. When credentials are available (Layer 4), validate that MSW mock response shapes match actual API responses. Run a script that calls the real API once and diffs the response keys/types against your mock handlers. Update mocks quarterly or whenever the API ships a new version.
+
+### 3.2 — Tool Routing Smoke Test
+
+Automated script that sends NL messages and checks tool selection:
+
+```typescript
+// tests/tool-routing.test.ts
+import * as fs from 'fs';
+import * as path from 'path';
+
+interface RoutingFixture {
+  message: string;
+  expectedTool: string;
+  category: string;
+}
+
+// Load routing fixtures (maintain this file!)
+const ROUTING_FIXTURES_PATH = path.resolve(__dirname, '../test-fixtures/tool-routing.json');
+
+const routingFixtures: RoutingFixture[] = JSON.parse(
+  fs.readFileSync(ROUTING_FIXTURES_PATH, 'utf8')
+);
+
+describe('Tool Routing', () => {
+  // This test requires the AI/LLM in the loop — typically run via LocalBosses API
+  // or by mocking the tool selection logic
+  
+  test('routing fixtures file is valid', () => {
+    expect(routingFixtures.length).toBeGreaterThanOrEqual(20);
+    
+    for (const fixture of routingFixtures) {
+      expect(fixture.message).toBeTruthy();
+      expect(fixture.expectedTool).toBeTruthy();
+      expect(fixture.category).toBeTruthy();
+    }
+  });
+
+  test('all expected tools exist in server', async () => {
+    // Parse the server's tool definitions to get available tool names
+    const toolNames = new Set<string>();
+    
+    // Read from compiled server or source
+    // This validates that routing fixtures reference real tools
+    const srcDir = path.resolve(__dirname, '../src/tools');
+    if (fs.existsSync(srcDir)) {
+      const toolFiles = fs.readdirSync(srcDir).filter(f => f.endsWith('.ts'));
+      for (const file of toolFiles) {
+        const content = fs.readFileSync(path.join(srcDir, file), 'utf8');
+        const nameMatches = content.matchAll(/name:\s*['"]([^'"]+)['"]/g);
+        for (const match of nameMatches) {
+          toolNames.add(match[1]);
+        }
+      }
+    }
+
+    if (toolNames.size > 0) {
+      for (const fixture of routingFixtures) {
+        expect(toolNames.has(fixture.expectedTool)).toBe(true);
+      }
+    }
+  });
+});
+
+// Tool routing fixtures template — save as test-fixtures/tool-routing.json:
+/*
+[
+  { "message": "Show me all contacts", "expectedTool": "list_contacts", "category": "list" },
+  { "message": "Find John Smith", "expectedTool": "search_contacts", "category": "search" },
+  { "message": "What's John's email?", "expectedTool": "get_contact", "category": "get" },
+  { "message": "Add a new contact", "expectedTool": "create_contact", "category": "create" },
+  { "message": "Update John's phone number", "expectedTool": "update_contact", "category": "update" },
+  { "message": "Remove the test contact", "expectedTool": "delete_contact", "category": "delete" },
+  { "message": "Show me a summary of this month", "expectedTool": "get_dashboard", "category": "analytics" },
+  ... (at least 20 fixtures per server)
+]
+*/
+```
+
+### 3.2b — DeepEval LLM-in-the-Loop Tool Routing Evaluation
+
+Static routing fixtures validate that tool names exist, but they don't test whether the LLM actually selects the right tool. Use **DeepEval** for real LLM tool routing evaluation with `ToolCorrectnessMetric` and `TaskCompletionMetric`.
+
+**Setup:**
+```bash
+pip install deepeval
+deepeval login  # Optional: for dashboard tracking
+```
+
+**Test file** — save as `tests/tool_routing_eval.py`:
+
+```python
+# tests/tool_routing_eval.py
+# Requires: pip install deepeval anthropic
+# Run: deepeval test run tests/tool_routing_eval.py
+
+import json
+import os
+from deepeval import evaluate
+from deepeval.metrics import ToolCorrectnessMetric, TaskCompletionMetric
+from deepeval.test_case import LLMTestCase, ToolCall
+from anthropic import Anthropic
+
+client = Anthropic()
+
+def load_tool_definitions(server_dir: str) -> list[dict]:
+    """Load tool definitions from compiled MCP server."""
+    # Read tool names/schemas from the source files
+    # Adapt path to your server structure
+    import glob
+    tools = []
+    for f in glob.glob(f"{server_dir}/src/tools/*.ts"):
+        with open(f) as fh:
+            content = fh.read()
+            # Extract tool definitions (simplified — adapt to your codebase)
+            import re
+            for match in re.finditer(r'name:\s*["\'](\w+)["\']', content):
+                tools.append({"name": match.group(1)})
+    return tools
+
+def run_agent(message: str, system_prompt: str, tools: list[dict]) -> tuple[str, list[ToolCall]]:
+    """Send message through Claude with tools, return response + tool calls."""
+    # Convert MCP tool defs to Anthropic tool format
+    anthropic_tools = [
+        {
+            "name": t["name"],
+            "description": t.get("description", f"Tool: {t['name']}"),
+            "input_schema": t.get("inputSchema", {"type": "object", "properties": {}})
+        }
+        for t in tools
+    ]
+
+    response = client.messages.create(
+        model="claude-sonnet-4-20250514",
+        max_tokens=1024,
+        system=system_prompt,
+        messages=[{"role": "user", "content": message}],
+        tools=anthropic_tools,
+    )
+
+    tool_calls = []
+    text_response = ""
+    for block in response.content:
+        if block.type == "tool_use":
+            tool_calls.append(ToolCall(name=block.name, arguments=block.input))
+        elif block.type == "text":
+            text_response += block.text
+
+    return text_response, tool_calls
+
+# Load fixtures and system prompt
+FIXTURES_PATH = "test-fixtures/tool-routing.json"
+SYSTEM_PROMPT_PATH = "test-fixtures/system-prompt.txt"
+
+with open(FIXTURES_PATH) as f:
+    fixtures = json.load(f)
+
+system_prompt = ""
+if os.path.exists(SYSTEM_PROMPT_PATH):
+    with open(SYSTEM_PROMPT_PATH) as f:
+        system_prompt = f.read()
+
+# Build test cases
+tool_correctness = ToolCorrectnessMetric()
+task_completion = TaskCompletionMetric()
+
+test_cases = []
+for fixture in fixtures:
+    response_text, actual_calls = run_agent(
+        fixture["message"], system_prompt, load_tool_definitions(".")
+    )
+    test_cases.append(
+        LLMTestCase(
+            input=fixture["message"],
+            actual_output=response_text,
+            expected_tools=[ToolCall(name=fixture["expectedTool"])],
+            tools_called=actual_calls,
+        )
+    )
+
+# Evaluate
+results = evaluate(test_cases, [tool_correctness, task_completion])
+print(f"\n=== DeepEval Results ===")
+print(f"Tool Correctness: {tool_correctness.score:.1%}")
+print(f"Task Completion: {task_completion.score:.1%}")
+# Target: Tool Correctness >95%, Task Completion >90%
+```
+
+**When to run:** After every tool description change, system prompt update, or model upgrade. This is the REAL test of whether the AI routes correctly — fixture files alone are testing theater.
+
+### 3.3 — APP_DATA Schema Validator
+
+```typescript
+// tests/app-data-validator.ts
+import Ajv from 'ajv';
+import * as fs from 'fs';
+import * as path from 'path';
+
+const ajv = new Ajv({ allErrors: true, strict: false });
+
+// Define expected schemas per app type
+const APP_DATA_SCHEMAS: Record<string, object> = {
+  'dashboard': {
+    type: 'object',
+    required: ['title'],
+    properties: {
+      title: { type: 'string' },
+      metrics: {
+        type: 'array',
+        items: {
+          type: 'object',
+          required: ['label', 'value'],
+          properties: {
+            label: { type: 'string' },
+            value: { type: ['string', 'number'] },
+            change: { type: ['string', 'number'] },
+            trend: { enum: ['up', 'down', 'flat'] }
+          }
+        }
+      },
+      charts: { type: 'array' },
+      data: { type: ['array', 'object'] }
+    }
+  },
+  'data-grid': {
+    type: 'object',
+    required: ['data'],
+    properties: {
+      title: { type: 'string' },
+      data: {
+        type: 'array',
+        items: { type: 'object' },
+        minItems: 0
+      },
+      meta: {
+        type: 'object',
+        properties: {
+          total: { type: 'number' },
+          page: { type: 'number' },
+          pageSize: { type: 'number' }
+        }
+      },
+      columns: { type: 'array' }
+    }
+  },
+  'detail-card': {
+    type: 'object',
+    properties: {
+      title: { type: 'string' },
+      data: { type: 'object' },
+      sections: { type: 'array' },
+      fields: { type: 'array' }
+    }
+  },
+  'timeline': {
+    type: 'object',
+    properties: {
+      title: { type: 'string' },
+      events: {
+        type: 'array',
+        items: {
+          type: 'object',
+          required: ['date'],
+          properties: {
+            date: { type: 'string' },
+            title: { type: 'string' },
+            description: { type: 'string' },
+            type: { type: 'string' }
+          }
+        }
+      },
+      data: { type: 'array' }
+    }
+  },
+  'pipeline': {
+    type: 'object',
+    properties: {
+      title: { type: 'string' },
+      stages: {
+        type: 'array',
+        items: {
+          type: 'object',
+          required: ['name'],
+          properties: {
+            name: { type: 'string' },
+            items: { type: 'array' },
+            count: { type: 'number' },
+            value: { type: ['number', 'string'] }
+          }
+        }
+      }
+    }
+  }
+};
+
+export function validateAppData(
+  appType: string,
+  appData: any
+): { valid: boolean; errors: string[]; warnings: string[] } {
+  const errors: string[] = [];
+  const warnings: string[] = [];
+
+  // Basic checks
+  if (!appData || typeof appData !== 'object') {
+    return { valid: false, errors: ['APP_DATA is null or not an object'], warnings: [] };
+  }
+
+  // Schema validation
+  const schema = APP_DATA_SCHEMAS[appType];
+  if (schema) {
+    const validate = ajv.compile(schema);
+    const isValid = validate(appData);
+    if (!isValid && validate.errors) {
+      for (const err of validate.errors) {
+        errors.push(`${err.instancePath || '/'} ${err.message}`);
+      }
+    }
+  } else {
+    warnings.push(`No schema defined for app type: ${appType}`);
+  }
+
+  // Common checks regardless of app type
+  if (appData.data && Array.isArray(appData.data)) {
+    if (appData.data.length === 0) {
+      warnings.push('data array is empty — app will show empty state');
+    }
+    // Check for null/undefined values in data items
+    for (let i = 0; i < Math.min(appData.data.length, 5); i++) {
+      const item = appData.data[i];
+      for (const [key, val] of Object.entries(item || {})) {
+        if (val === undefined) {
+          warnings.push(`data[${i}].${key} is undefined (will show as "undefined" in app)`);
+        }
+      }
+    }
+  }
+
+  return { valid: errors.length === 0, errors, warnings };
+}
+
+// Parse APP_DATA from AI response text
+export function extractAppData(responseText: string): any | null {
+  // Standard format
+  const match = responseText.match(/<!--APP_DATA:([\s\S]*?):END_APP_DATA-->/);
+  if (match) {
+    try {
+      // Strip whitespace/newlines that LLMs sometimes add
+      const cleaned = match[1].replace(/[\n\r]/g, '').trim();
+      return JSON.parse(cleaned);
+    } catch (e) {
+      // Try with more aggressive cleanup
+      try {
+        const aggressive = match[1]
+          .replace(/[\n\r\t]/g, '')
+          .replace(/,\s*}/g, '}')   // trailing commas
+          .replace(/,\s*]/g, ']')   // trailing commas in arrays
+          .trim();
+        return JSON.parse(aggressive);
+      } catch (e2) {
+        return null;
+      }
+    }
+  }
+  
+  // Fallback: try to find JSON in code blocks
+  const codeBlockMatch = responseText.match(/```(?:json)?\s*([\s\S]*?)```/);
+  if (codeBlockMatch) {
+    try {
+      return JSON.parse(codeBlockMatch[1].trim());
+    } catch (e) {
+      return null;
+    }
+  }
+  
+  return null;
+}
+```
+
+### 3.4 — Thread Lifecycle Testing
+
+```markdown
+### Thread Lifecycle: {channel}
+
+1. [ ] Click app in toolbar → thread panel opens
+2. [ ] Intake question appears in thread
+3. [ ] Type response → AI processes in thread context
+4. [ ] App loads in thread panel (if data returned or skipped)
+5. [ ] Send follow-up message → app updates with new data
+6. [ ] Close thread panel (X) → panel closes, thread indicator remains
+7. [ ] Click thread indicator → panel reopens with preserved state
+8. [ ] Delete thread → thread removed, parent message removed
+9. [ ] Switch channels → come back → thread state persists (localStorage)
+```
+
+### Quality Gate:
+- [ ] All tool handler unit tests pass (Jest + MSW)
+- [ ] Tool routing fixtures file has ≥20 test messages
+- [ ] All routing fixture tools exist in the server
+- [ ] APP_DATA schema validation passes for all app types
+- [ ] APP_DATA parser handles malformed JSON gracefully
+- [ ] Thread lifecycle completes without errors
+
+---
+
+## Layer 3.5: Performance Testing
+
+### 3.5.1 — Server Cold Start
+
+```bash
+#!/bin/bash
+# Measure cold start time
+SERVICE_DIR="$1"
+cd "$SERVICE_DIR"
+
+echo "=== Cold Start Benchmark ==="
+
+# Measure time to first ListTools response
+START=$(date +%s%N)
+echo '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2025-11-25","capabilities":{},"clientInfo":{"name":"perf-test","version":"1.0.0"}}}' | \
+  timeout 10 node dist/index.js 2>/dev/null | head -1 > /dev/null
+END=$(date +%s%N)
+
+ELAPSED=$(( (END - START) / 1000000 ))
+echo "Cold start to first response: ${ELAPSED}ms"
+if [ "$ELAPSED" -gt 2000 ]; then
+  echo "❌ FAIL — exceeds 2000ms target"
+else
+  echo "✅ PASS — under 2000ms target"
+fi
+```
+
+### 3.5.2 — Tool Invocation Latency
+
+```typescript
+// tests/performance.test.ts
+import { performance } from 'perf_hooks';
+
+describe('Performance', () => {
+  test('tool invocation overhead is under 100ms (excluding API time)', async () => {
+    // With MSW intercepting API calls (near-zero latency),
+    // measure the tool handler overhead itself
+    const times: number[] = [];
+    
+    for (let i = 0; i < 10; i++) {
+      const start = performance.now();
+      // Call a read-only tool through the handler
+      // await toolHandler({ page: 1, pageSize: 10 });
+      const response = await fetch('https://api.example.com/v1/contacts?page=1&pageSize=10');
+      await response.json();
+      const elapsed = performance.now() - start;
+      times.push(elapsed);
+    }
+
+    const sorted = times.sort((a, b) => a - b);
+    const p50 = sorted[Math.floor(sorted.length * 0.5)];
+    const p95 = sorted[Math.floor(sorted.length * 0.95)];
+
+    console.log(`Tool overhead P50: ${p50.toFixed(1)}ms, P95: ${p95.toFixed(1)}ms`);
+    expect(p50).toBeLessThan(100);
+  });
+
+  test('memory usage stays under 100MB with all tools loaded', async () => {
+    const used = process.memoryUsage();
+    const heapMB = Math.round(used.heapUsed / 1024 / 1024);
+    const rssMB = Math.round(used.rss / 1024 / 1024);
+    
+    console.log(`Heap: ${heapMB}MB, RSS: ${rssMB}MB`);
+    expect(rssMB).toBeLessThan(100);
+  });
+});
+```
+
+### 3.5.3 — App File Size Budget
+
+```bash
+#!/bin/bash
+echo "=== App File Size Budget (max 50KB) ==="
+OVER=0
+for f in app-ui/*.html; do
+  if [ -f "$f" ]; then
+    SIZE=$(wc -c < "$f" | tr -d ' ')
+    KB=$((SIZE / 1024))
+    if [ "$SIZE" -gt 51200 ]; then
+      echo "❌ $(basename $f): ${KB}KB (OVER BUDGET)"
+      OVER=$((OVER + 1))
+    else
+      echo "✅ $(basename $f): ${KB}KB"
+    fi
+  fi
+done
+[ "$OVER" -eq 0 ] && echo "All apps within budget" || echo "⚠️  $OVER apps over 50KB budget"
+```
+
+### 3.5.4 — App Render Performance (Playwright)
+
+```typescript
+// In visual.test.ts, add:
+test('time to first render is under 2s', async ({ page }) => {
+  const start = Date.now();
+  await page.goto(`file://${appFile}`);
+  
+  const fixture = loadFixture(appFile);
+  await page.evaluate((data) => {
+    window.postMessage({ type: 'mcp_app_data', data }, '*');
+  }, fixture);
+  
+  // Wait for content to be visible
+  await page.locator('#content').waitFor({ state: 'visible', timeout: 5000 });
+  const renderTime = Date.now() - start;
+  
+  console.log(`[${appName}] Time to first render: ${renderTime}ms`);
+  expect(renderTime).toBeLessThan(2000);
+});
+```
+
+### 3.5.5 — Load Testing (HTTP Transport)
+
+For servers running with `MCP_TRANSPORT=http`, test concurrent connection handling:
+
+```bash
+#!/bin/bash
+# load-test-http.sh — Test concurrent MCP connections
+# Requires: npm install -g autocannon (or use curl + GNU parallel)
+
+MCP_PORT="${1:-3000}"
+CONCURRENCY="${2:-10}"
+DURATION="${3:-10}"
+
+echo "=== MCP HTTP Load Test ==="
+echo "Target: http://localhost:${MCP_PORT}/mcp"
+echo "Concurrency: ${CONCURRENCY} connections"
+echo "Duration: ${DURATION}s"
+echo ""
+
+# Test 1: Concurrent initialize requests
+echo "--- Test 1: Concurrent initialize ---"
+for i in $(seq 1 $CONCURRENCY); do
+  curl -s -X POST "http://localhost:${MCP_PORT}/mcp" \
+    -H "Content-Type: application/json" \
+    -d '{"jsonrpc":"2.0","id":'$i',"method":"initialize","params":{"protocolVersion":"2025-11-25","capabilities":{},"clientInfo":{"name":"load-test-'$i'","version":"1.0.0"}}}' \
+    -o /dev/null -w "Connection $i: %{http_code} in %{time_total}s\n" &
+done
+wait
+echo ""
+
+# Test 2: Concurrent tools/list under load
+echo "--- Test 2: Concurrent tools/list ---"
+START=$(date +%s%N)
+for i in $(seq 1 $CONCURRENCY); do
+  curl -s -X POST "http://localhost:${MCP_PORT}/mcp" \
+    -H "Content-Type: application/json" \
+    -d '{"jsonrpc":"2.0","id":1,"method":"tools/list","params":{}}' \
+    -o /dev/null -w "%{http_code} " &
+done
+wait
+END=$(date +%s%N)
+ELAPSED=$(( (END - START) / 1000000 ))
+echo ""
+echo "All $CONCURRENCY requests completed in ${ELAPSED}ms"
+echo ""
+
+# Test 3: Session management under load (verify no cross-session leaks)
+echo "--- Test 3: Session isolation ---"
+SESSION1=$(curl -s -X POST "http://localhost:${MCP_PORT}/mcp" \
+  -H "Content-Type: application/json" \
+  -d '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2025-11-25","capabilities":{},"clientInfo":{"name":"session-1","version":"1.0.0"}}}' \
+  -D - -o /dev/null 2>&1 | grep -i "mcp-session-id" | cut -d' ' -f2 | tr -d '\r')
+SESSION2=$(curl -s -X POST "http://localhost:${MCP_PORT}/mcp" \
+  -H "Content-Type: application/json" \
+  -d '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2025-11-25","capabilities":{},"clientInfo":{"name":"session-2","version":"1.0.0"}}}' \
+  -D - -o /dev/null 2>&1 | grep -i "mcp-session-id" | cut -d' ' -f2 | tr -d '\r')
+
+if [ "$SESSION1" != "$SESSION2" ] && [ -n "$SESSION1" ] && [ -n "$SESSION2" ]; then
+  echo "✅ Sessions are unique (no cross-session leaks)"
+else
+  echo "⚠️  Session isolation check inconclusive"
+fi
+
+echo ""
+echo "=== Load Test Complete ==="
+echo "Target: ${CONCURRENCY} concurrent connections should complete without 5xx errors"
+```
+
+**Pass criteria:**
+- Zero 5xx errors under 10 concurrent connections
+- All responses return within 5s
+- No cross-session data leaks (GHSA-345p-7cg4-v4c7 regression test)
+- Memory usage stays under 200MB during load
+
+### Quality Gate:
+- [ ] Cold start <2s to first ListTools response
+- [ ] Tool invocation overhead P50 <100ms (excluding API latency)
+- [ ] Memory usage <100MB after loading all tool groups
+- [ ] All HTML app files <50KB
+- [ ] Time to first render <2s for all apps
+- [ ] HTTP transport handles 10 concurrent connections without errors
+
+---
+
+## Layer 4: Live API Testing
+
+### 4.1 — Credential Management Strategy
+
+**Before running Layer 4, categorize the server:**
+
+| Category | Description | Layer 4 Approach |
+|----------|-------------|-----------------|
+| **has-creds** | API key/OAuth token available in `.env` | Full live testing |
+| **needs-creds** | Credentials needed but not yet obtained | Skip Layer 4, note in report |
+| **sandbox-available** | API provides sandbox/test environment | Use sandbox creds (preferred) |
+| **no-sandbox** | Only production credentials available | Careful read-only testing only |
+
+**Centralized credential management:**
+
+```bash
+# Master credentials file (NOT committed to git)
+# Location: ~/.clawdbot/workspace/.env.mcp-testing
+
+# Format per service:
+# {SERVICE}_API_KEY=xxx
+# {SERVICE}_API_BASE_URL=https://api.example.com
+# {SERVICE}_SANDBOX=true|false
+# {SERVICE}_CRED_STATUS=has-creds|needs-creds|sandbox|no-sandbox
+# {SERVICE}_CRED_EXPIRES=2026-03-01
+
+# Script to distribute to individual servers:
+cat ~/.clawdbot/workspace/.env.mcp-testing | grep "^${SERVICE}_" | sed "s/${SERVICE}_//" > ${SERVICE}-mcp/.env
+```
+
+**For servers WITHOUT credentials, focus on Layers 0-3:**
+- Layer 0: Protocol compliance (no API needed)
+- Layer 1: Static analysis (no API needed)
+- Layer 2: Visual testing with fixture data (no API needed)
+- Layer 2.5: Accessibility (no API needed)
+- Layer 3: Functional testing with MSW mocks (no API needed)
+- Layer 3.5: Performance with mocks (no API needed)
+- Layer 4: **SKIP** — note in report as "No credentials available"
+- Layer 4.5: Security (most checks don't need API)
+- Layer 5: Partial — E2E with mocked responses
+
+### 4.2 — Test Each Tool Group
+
+```markdown
+### Live API Test: {service} / {tool-group}
+
+**Auth:** {method} — Token/key set in .env
+**Base URL:** {url}
+**Cred Status:** {has-creds|sandbox|no-creds}
+
+| Tool | Test Input | Expected | Actual | Latency | Status |
+|------|-----------|----------|--------|---------|--------|
+| list_{entities} | {} (default) | Array of items | | ms | |
+| list_{entities} | { status: "active" } | Filtered array | | ms | |
+| get_{entity} | { id: "known-id" } | Single item | | ms | |
+| create_{entity} | { name: "QA Test" } | Created w/ ID | | ms | |
+| update_{entity} | { id: "id", name: "Updated" } | Updated item | | ms | |
+| delete_{entity} | { id: "qa-test-id" } | Confirmation | | ms | |
+```
+
+### 4.3 — Response Shape Verification
+
+```bash
+# For each tool, verify response shape matches what the app expects
+# Extract field references from app HTML
+grep -oP 'data\.\K[a-zA-Z_]+' app-ui/{app}.html | sort -u > /tmp/expected-fields.txt
+
+# Compare with actual API response fields
+echo '{api_response}' | jq 'keys' > /tmp/actual-fields.txt
+
+# Diff
+diff /tmp/expected-fields.txt /tmp/actual-fields.txt
+```
+
+### Quality Gate:
+- [ ] All read-only tools return valid data
+- [ ] Write tools create/update/delete correctly (use sandbox)
+- [ ] Response shapes match what apps expect
+- [ ] Error responses (401, 403, 404, 422, 429) handled gracefully
+- [ ] All response latencies recorded for P50/P95 metrics
+- [ ] Cleanup: delete any test data created during QA
+
+---
+
+## Layer 4.5: Security Testing
+
+### 4.5.1 — XSS Testing
+
+```typescript
+// tests/security.test.ts
+import { test, expect } from '@playwright/test';
+import * as path from 'path';
+
+const XSS_PAYLOADS = [
+  '<script>alert("xss")</script>',
+  '<img src=x onerror=alert("xss")>',
+  '"><script>alert(1)</script>',
+  "';alert(String.fromCharCode(88,83,83))//",
+  '<svg onload=alert("xss")>',
+  'javascript:alert("xss")',
+  '<iframe src="javascript:alert(1)">',
+  '{{constructor.constructor("return this")().alert(1)}}',
+  '<details open ontoggle=alert(1)>',
+  '<math><mtext><table><mglyph><svg><mtext><style><img src=x onerror=alert(1)>',
+];
+
+test.describe('XSS Security', () => {
+  test('escapeHtml blocks all XSS payloads in text fields', async ({ page }) => {
+    const appFile = path.resolve(__dirname, '../app-ui/contact-grid.html');
+    await page.goto(`file://${appFile}`);
+
+    for (const payload of XSS_PAYLOADS) {
+      let alertFired = false;
+      page.on('dialog', async dialog => {
+        alertFired = true;
+        await dialog.dismiss();
+      });
+
+      // Inject data with XSS payloads in every text field
+      await page.evaluate((xss) => {
+        window.postMessage({
+          type: 'mcp_app_data',
+          data: {
+            title: xss,
+            data: [
+              { name: xss, email: xss, phone: xss, status: xss },
+            ],
+            meta: { total: 1, page: 1, pageSize: 25 }
+          }
+        }, '*');
+      }, payload);
+
+      await page.waitForTimeout(200);
+      expect(alertFired).toBe(false);
+    }
+  });
+});
+```
+
+### 4.5.2 — postMessage Origin Validation
+
+```javascript
+// Check in browser console — app should validate message origin
+// Inject from a different origin simulation:
+(function testOriginValidation() {
+  // Check if app code validates event.origin
+  const appScript = document.querySelector('script')?.textContent || '';
+  const checksOrigin = appScript.includes('event.origin') ||
+                       appScript.includes('e.origin') ||
+                       appScript.includes('message.origin');
+  
+  if (checksOrigin) {
+    console.log('✅ App validates postMessage origin');
+  } else {
+    console.log('⚠️  App does NOT validate postMessage origin — potential security issue');
+    console.log('   Recommended: Add origin check in message event listener');
+  }
+})();
+```
+
+### 4.5.3 — Content Security Policy Check
+
+```bash
+# Check if HTML apps declare CSP
+for f in app-ui/*.html; do
+  if grep -q "Content-Security-Policy" "$f"; then
+    echo "✅ $(basename $f) has CSP meta tag"
+  else
+    echo "⚠️  $(basename $f) — no CSP meta tag"
+  fi
+done
+
+# Check for inline event handlers (CSP-unfriendly)
+for f in app-ui/*.html; do
+  INLINE=$(grep -c 'on[a-z]*=' "$f" || echo "0")
+  if [ "$INLINE" -gt 0 ]; then
+    echo "⚠️  $(basename $f) has $INLINE inline event handlers"
+  fi
+done
+```
+
+### 4.5.4 — API Key Exposure Check
+
+```bash
+# Check for leaked secrets in client-side code
+echo "=== API Key Exposure Scan ==="
+
+# Common patterns for API keys/secrets
+PATTERNS=(
+  'api[_-]?key'
+  'apikey'
+  'secret'
+  'token'
+  'password'
+  'authorization.*Bearer'
+  'sk_live_'
+  'pk_live_'
+  'ghp_'
+  'gho_'
+)
+
+for f in app-ui/*.html; do
+  for pat in "${PATTERNS[@]}"; do
+    MATCHES=$(grep -ci "$pat" "$f" || echo "0")
+    if [ "$MATCHES" -gt 0 ]; then
+      echo "❌ $(basename $f) may contain exposed secrets (pattern: $pat)"
+      grep -in "$pat" "$f" | head -3
+    fi
+  done
+done
+
+# Also check compiled JS
+for f in dist/**/*.js; do
+  if [ -f "$f" ]; then
+    for pat in "${PATTERNS[@]}"; do
+      MATCHES=$(grep -ci "$pat" "$f" || echo "0")
+      if [ "$MATCHES" -gt 0 ]; then
+        echo "⚠️  $(basename $f) references: $pat (verify not actual key)"
+      fi
+    done
+  fi
+done
+```
+
+### Quality Gate:
+- [ ] All XSS payloads blocked (escapeHtml works)
+- [ ] No alert dialogs triggered from any payload
+- [ ] postMessage origin validated (or documented as acceptable risk)
+- [ ] No API keys/secrets exposed in HTML app files
+- [ ] No API keys/secrets in client-facing JavaScript
+- [ ] CSP meta tag present (or documented why not)
+
+---
+
+## Layer 5: Integration & Chaos Testing
+
+### 5.1 — End-to-End Scenarios
+
+Write **at least 1 E2E scenario per app type** (minimum 5 per server):
+
+```markdown
+### E2E Scenario: {scenario-name}
+
+**Channel:** {channel}
+**Goal:** {what the user is trying to accomplish}
+**App type:** {dashboard|grid|card|timeline|pipeline|calendar|analytics|monitor}
+
+**Steps:**
+1. Navigate to #{channel}
+2. Type: "{natural language message}"
+3. Verify: AI responds with correct tool call
+4. Verify: APP_DATA block present and valid JSON
+5. Verify: App {app-id} renders with correct data
+6. In thread, type: "{follow-up message}"
+7. Verify: App updates with new/refined data
+8. Measure: Response latency for each step
+
+**Metrics:**
+- Tool selected correctly: ✅/❌
+- APP_DATA valid: ✅/❌
+- App rendered: ✅/❌
+- Latency step 3: ___ms
+- Latency step 7: ___ms
+
+**Pass criteria:**
+- [ ] All steps complete without errors
+- [ ] Response time <5s for each step
+- [ ] Zero console errors
+- [ ] Data is accurate and well-formatted
+```
+
+### 5.1b — Automated End-to-End Data Flow Test (Playwright)
+
+The magic moment: message → AI → tool → APP_DATA → app render → correct data. This test automates the entire flow:
+
+```typescript
+// tests/e2e-dataflow.test.ts
+import { test, expect } from '@playwright/test';
+
+const LOCALBOSSES_URL = process.env.LB_URL || 'http://localhost:3000';
+
+test.describe('End-to-End Data Flow', () => {
+  test('message triggers tool → APP_DATA → app renders correct data', async ({ page }) => {
+    // 1. Navigate to the channel
+    await page.goto(`${LOCALBOSSES_URL}/#/channel/{channel-id}`);
+    await page.waitForLoadState('networkidle');
+
+    // 2. Send a test message
+    const chatInput = page.locator('[data-testid="chat-input"], textarea, input[type="text"]');
+    await chatInput.fill('Show me all active contacts');
+    await chatInput.press('Enter');
+
+    // 3. Wait for AI response (tool call indicator or text response)
+    const aiResponse = page.locator('[data-testid="ai-response"], .message-content').last();
+    await aiResponse.waitFor({ state: 'visible', timeout: 15000 });
+
+    // 4. Verify APP_DATA block was generated
+    const responseText = await aiResponse.textContent();
+    // The APP_DATA is in the raw response (may be hidden in the UI)
+    // Check that the app iframe loaded
+    const appFrame = page.frameLocator('iframe[data-app-id]').first();
+
+    // 5. Verify app rendered with data (not empty/loading state)
+    const appContent = appFrame.locator('#content');
+    await appContent.waitFor({ state: 'visible', timeout: 10000 });
+
+    // 6. Verify correct data is displayed
+    // App should show contact data, not empty state
+    const appText = await appContent.textContent();
+    expect(appText).toBeTruthy();
+    expect(appText!.length).toBeGreaterThan(10); // Has real content
+
+    // 7. Verify no console errors in the app iframe
+    const consoleErrors: string[] = [];
+    page.on('console', msg => {
+      if (msg.type() === 'error') consoleErrors.push(msg.text());
+    });
+    expect(consoleErrors).toHaveLength(0);
+
+    // 8. Screenshot for the record
+    await page.screenshot({ path: 'test-results/e2e-dataflow.png', fullPage: true });
+  });
+});
+```
+
+> **Note:** This test requires LocalBosses running locally with the integrated channel. It's the most important test — it validates the complete user experience end-to-end. Run this after every integration change.
+
+### 5.2 — Chaos Testing
+
+Test resilience under adverse conditions:
+
+```typescript
+// tests/chaos.test.ts
+
+describe('Chaos Testing', () => {
+  test('API returns 500 on every call', async () => {
+    // Override MSW handlers to return 500
+    server.use(
+      http.get('https://api.example.com/*', () => {
+        return new HttpResponse('Internal Server Error', { status: 500 });
+      }),
+      http.post('https://api.example.com/*', () => {
+        return new HttpResponse('Internal Server Error', { status: 500 });
+      })
+    );
+
+    // Tool should return isError: true, NOT crash
+    // const result = await callTool('list_contacts', {});
+    // expect(result.isError).toBe(true);
+    // expect(result.content[0].text).toContain('error');
+  });
+
+  test('postMessage sends wrong format data', async ({ page }) => {
+    await page.goto(`file://${appFile}`);
+    
+    // Send wrong type
+    await page.evaluate(() => {
+      window.postMessage({ type: 'wrong_type', data: {} }, '*');
+    });
+    await page.waitForTimeout(300);
+    
+    // App should not crash — should still show loading/empty
+    const bodyText = await page.textContent('body');
+    expect(bodyText).not.toContain('undefined');
+    expect(bodyText).not.toContain('TypeError');
+
+    // Send data with wrong shape
+    await page.evaluate(() => {
+      window.postMessage({ type: 'mcp_app_data', data: 'not an object' }, '*');
+    });
+    await page.waitForTimeout(300);
+    
+    const bodyText2 = await page.textContent('body');
+    expect(bodyText2).not.toContain('undefined');
+  });
+
+  test('APP_DATA is 500KB+ (huge dataset)', async ({ page }) => {
+    await page.goto(`file://${appFile}`);
+    
+    // Generate huge dataset
+    const hugeData = {
+      title: 'Performance Stress Test',
+      data: Array.from({ length: 2000 }, (_, i) => ({
+        id: `item-${i}`,
+        name: `Contact ${i} ${'A'.repeat(100)}`,
+        email: `contact${i}@example.com`,
+        phone: `555-${String(i).padStart(4, '0')}`,
+        status: i % 2 === 0 ? 'active' : 'inactive',
+        notes: 'X'.repeat(200)
+      })),
+      meta: { total: 2000, page: 1, pageSize: 2000 }
+    };
+
+    const start = Date.now();
+    await page.evaluate((data) => {
+      window.postMessage({ type: 'mcp_app_data', data }, '*');
+    }, hugeData);
+    
+    // Should render within 5 seconds even with huge data
+    await page.locator('#content').waitFor({ state: 'visible', timeout: 5000 });
+    const renderTime = Date.now() - start;
+    
+    console.log(`Huge dataset render time: ${renderTime}ms`);
+    expect(renderTime).toBeLessThan(5000);
+  });
+
+  test('rapid-fire 10 messages', async ({ page }) => {
+    await page.goto(`file://${appFile}`);
+    
+    // Send 10 data updates in quick succession
+    for (let i = 0; i < 10; i++) {
+      await page.evaluate((idx) => {
+        window.postMessage({
+          type: 'mcp_app_data',
+          data: {
+            title: `Update ${idx}`,
+            data: [{ name: `Item ${idx}`, status: 'active' }],
+            meta: { total: 1, page: 1, pageSize: 25 }
+          }
+        }, '*');
+      }, i);
+    }
+    
+    await page.waitForTimeout(1000);
+    
+    // App should show the LAST update (not crash or show stale data)
+    const content = await page.textContent('body');
+    expect(content).toContain('Update 9');
+  });
+
+  test('two apps rendering simultaneously', async ({ browser }) => {
+    const page1 = await browser.newPage();
+    const page2 = await browser.newPage();
+    
+    await page1.goto(`file://${appFile}`);
+    await page2.goto(`file://${appFile}`);
+    
+    // Send data to both simultaneously
+    await Promise.all([
+      page1.evaluate(() => {
+        window.postMessage({
+          type: 'mcp_app_data',
+          data: { title: 'App 1', data: [{ name: 'One' }] }
+        }, '*');
+      }),
+      page2.evaluate(() => {
+        window.postMessage({
+          type: 'mcp_app_data',
+          data: { title: 'App 2', data: [{ name: 'Two' }] }
+        }, '*');
+      })
+    ]);
+    
+    await page1.waitForTimeout(500);
+    await page2.waitForTimeout(500);
+    
+    // Both should render their respective data
+    expect(await page1.textContent('body')).toContain('One');
+    expect(await page2.textContent('body')).toContain('Two');
+    
+    await page1.close();
+    await page2.close();
+  });
+});
+```
+
+### 5.3 — Cross-Browser Testing Notes
+
+| Browser | Priority | Key Differences | How to Test |
+|---------|----------|----------------|-------------|
+| **Chrome** | P0 | Primary target — test all features here | Playwright `chromium` channel |
+| **Firefox** | P1 | CSS Grid/Flexbox rendering differs slightly; `backdrop-filter` needs `-webkit-` prefix | Playwright `firefox` channel |
+| **Mobile Safari** | P1 | Touch targets (min 44×44px), safe area insets, `-webkit-` prefixes, no `backdrop-filter` | Playwright `webkit` channel or real device |
+| **Electron** | P2 | If LocalBosses ships as desktop app; test Node integration, `contextBridge` | Playwright with Electron |
+
+```typescript
+// playwright.config.ts — multi-browser setup
+import { defineConfig, devices } from '@playwright/test';
+
+export default defineConfig({
+  projects: [
+    { name: 'chromium', use: { ...devices['Desktop Chrome'] } },
+    { name: 'firefox', use: { ...devices['Desktop Firefox'] } },
+    { name: 'webkit', use: { ...devices['Desktop Safari'] } },
+    { name: 'mobile-chrome', use: { ...devices['Pixel 5'] } },
+    { name: 'mobile-safari', use: { ...devices['iPhone 13'] } },
+  ],
+});
+```
+
+### Quality Gate:
+- [ ] All E2E scenarios pass (≥1 per app type)
+- [ ] Chaos tests: API 500s handled gracefully
+- [ ] Chaos tests: wrong postMessage format doesn't crash app
+- [ ] Chaos tests: 500KB+ dataset renders within 5s
+- [ ] Chaos tests: rapid-fire messages show final state
+- [ ] Cross-browser: Chrome + Firefox + WebKit all render correctly
+
+---
+
+## Layer 5.5: Production Smoke Test (Post-Deployment)
+
+After deploying a server + apps to production, run this validation before considering it shipped:
+
+```bash
+#!/bin/bash
+# smoke-test.sh — Post-deployment validation
+# Usage: ./smoke-test.sh <service-name> [base-url]
+
+SERVICE="$1"
+BASE_URL="${2:-http://localhost:3000}"
+
+echo "=== Production Smoke Test: ${SERVICE} ==="
+echo "Target: ${BASE_URL}"
+echo ""
+
+PASS=0
+FAIL=0
+
+# 1. Server is reachable (HTTP transport)
+echo "--- Server Reachability ---"
+HTTP_CODE=$(curl -s -o /dev/null -w "%{http_code}" -X POST "${BASE_URL}/mcp" \
+  -H "Content-Type: application/json" \
+  -d '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2025-11-25","capabilities":{},"clientInfo":{"name":"smoke-test","version":"1.0.0"}}}')
+
+if [ "$HTTP_CODE" = "200" ]; then
+  echo "✅ Server responds to initialize (HTTP $HTTP_CODE)"
+  PASS=$((PASS + 1))
+else
+  echo "❌ Server unreachable or error (HTTP $HTTP_CODE)"
+  FAIL=$((FAIL + 1))
+fi
+
+# 2. tools/list returns tools
+echo "--- Tool List ---"
+TOOLS_RESPONSE=$(curl -s -X POST "${BASE_URL}/mcp" \
+  -H "Content-Type: application/json" \
+  -d '{"jsonrpc":"2.0","id":2,"method":"tools/list","params":{}}')
+TOOL_COUNT=$(echo "$TOOLS_RESPONSE" | grep -o '"name"' | wc -l | tr -d ' ')
+
+if [ "$TOOL_COUNT" -gt 0 ]; then
+  echo "✅ tools/list returns $TOOL_COUNT tools"
+  PASS=$((PASS + 1))
+else
+  echo "❌ tools/list returned 0 tools"
+  FAIL=$((FAIL + 1))
+fi
+
+# 3. health_check tool responds
+echo "--- Health Check ---"
+HEALTH=$(curl -s -X POST "${BASE_URL}/mcp" \
+  -H "Content-Type: application/json" \
+  -d '{"jsonrpc":"2.0","id":3,"method":"tools/call","params":{"name":"health_check","arguments":{}}}')
+
+if echo "$HEALTH" | grep -q '"status"'; then
+  echo "✅ health_check tool responds"
+  PASS=$((PASS + 1))
+else
+  echo "⚠️  health_check tool not found or error"
+fi
+
+# 4. App HTML files are served (if HTTP)
+echo "--- App Files ---"
+for app_id in $(echo "$TOOLS_RESPONSE" | grep -oP '"name":\s*"\K[^"]+' | head -3); do
+  APP_HTTP=$(curl -s -o /dev/null -w "%{http_code}" "${BASE_URL}/api/mcp-apps?app=${app_id}")
+  if [ "$APP_HTTP" = "200" ]; then
+    echo "✅ App ${app_id} is served"
+  fi
+done
+
+# Summary
+echo ""
+echo "=== Smoke Test Results ==="
+echo "Passed: $PASS"
+echo "Failed: $FAIL"
+[ "$FAIL" -eq 0 ] && echo "✅ SMOKE TEST PASSED" || echo "❌ SMOKE TEST FAILED"
+```
+
+---
+
+## Layer 6: Production Monitoring (Post-Ship)
+
+> *"All testing is pre-ship. There's no guidance on tracking tool correctness, APP_DATA parse success rate, or user satisfaction in production."* — Kofi
+
+Pre-ship testing validates that everything **can** work. Production monitoring validates that everything **does** work, continuously.
+
+### 6.1 — Production Quality Metrics
+
+Track these metrics in production via logging in the chat route and aggregating weekly:
+
+| Metric | Target | How to Measure | Alert Threshold |
+|--------|--------|----------------|-----------------|
+| **APP_DATA Parse Success Rate** | >98% | Log every `parseAppData()` call: success vs fallback vs failure | <95% over 1 hour |
+| **Tool Correctness Sampling** | >95% | Sample 5% of interactions weekly, LLM-judge correctness | <90% in weekly sample |
+| **Time to First App Render** | P50 <3s, P95 <8s | Measure from user message send → app `#content` visible | P95 >12s |
+| **User Retry Rate** | <15% | Count rephrased messages within 30s of previous message | >25% over 1 day |
+| **Thread Completion Rate** | >80% | % of threads where user reaches a data-displaying app state | <60% over 1 week |
+
+### 6.2 — Instrumentation Code
+
+Add to the chat route to collect production metrics:
+
+```typescript
+// lib/production-metrics.ts
+interface MetricEvent {
+  timestamp: string;
+  channel: string;
+  metric: string;
+  value: number;
+  metadata?: Record<string, unknown>;
+}
+
+const metrics: MetricEvent[] = [];
+
+export function trackMetric(channel: string, metric: string, value: number, metadata?: Record<string, unknown>) {
+  metrics.push({
+    timestamp: new Date().toISOString(),
+    channel,
+    metric,
+    value,
+    metadata,
+  });
+  // Flush to file every 100 events
+  if (metrics.length >= 100) flushMetrics();
+}
+
+function flushMetrics() {
+  const fs = require('fs');
+  const path = require('path');
+  const file = path.join(process.cwd(), 'logs', `metrics-${new Date().toISOString().split('T')[0]}.jsonl`);
+  fs.mkdirSync(path.dirname(file), { recursive: true });
+  fs.appendFileSync(file, metrics.map(m => JSON.stringify(m)).join('\n') + '\n');
+  metrics.length = 0;
+}
+
+// Usage in chat route:
+// trackMetric(channelId, 'app_data_parse', success ? 1 : 0, { fallback: usedFallback });
+// trackMetric(channelId, 'tool_call_latency', latencyMs, { tool: toolName });
+// trackMetric(channelId, 'thread_completed', 1);
+```
+
+### 6.3 — Weekly Quality Review
+
+```bash
+#!/bin/bash
+# weekly-quality-report.sh — Aggregate production metrics
+METRICS_DIR="logs"
+WEEK_START=$(date -v-7d +%Y-%m-%d)
+
+echo "=== Weekly Production Quality Report ==="
+echo "Period: ${WEEK_START} to $(date +%Y-%m-%d)"
+echo ""
+
+# APP_DATA parse success rate
+TOTAL_PARSES=$(cat ${METRICS_DIR}/metrics-*.jsonl 2>/dev/null | grep '"app_data_parse"' | wc -l | tr -d ' ')
+SUCCESS_PARSES=$(cat ${METRICS_DIR}/metrics-*.jsonl 2>/dev/null | grep '"app_data_parse"' | grep '"value":1' | wc -l | tr -d ' ')
+if [ "$TOTAL_PARSES" -gt 0 ]; then
+  PARSE_RATE=$((SUCCESS_PARSES * 100 / TOTAL_PARSES))
+  echo "APP_DATA Parse Success: ${PARSE_RATE}% (${SUCCESS_PARSES}/${TOTAL_PARSES})"
+else
+  echo "APP_DATA Parse Success: No data"
+fi
+
+echo ""
+echo "Action items:"
+echo "- Review any channels with parse rate <95%"
+echo "- Check retry rate spikes for system prompt issues"
+echo "- Sample 5 random interactions for manual correctness review"
+```
+
+---
+
+## CI/CD Pipeline Template
+
+Automate the QA pipeline in CI. Save as `.github/workflows/mcp-qa.yml`:
+
+```yaml
+# .github/workflows/mcp-qa.yml
+name: MCP QA Pipeline
+on:
+  push:
+    paths: ['*-mcp/**', 'mcp-servers/**']
+  pull_request:
+    paths: ['*-mcp/**', 'mcp-servers/**']
+
+jobs:
+  qa:
+    runs-on: ubuntu-latest
+    strategy:
+      matrix:
+        node-version: [22]
+    steps:
+      - uses: actions/checkout@v4
+
+      - uses: actions/setup-node@v4
+        with:
+          node-version: ${{ matrix.node-version }}
+          cache: 'npm'
+
+      - name: Install dependencies
+        run: npm ci
+
+      - name: TypeScript build
+        run: npm run build
+
+      - name: Type check
+        run: npx tsc --noEmit
+
+      - name: Jest unit tests
+        run: npx jest --ci --coverage
+        env:
+          NODE_ENV: test
+
+      - name: Install Playwright browsers
+        run: npx playwright install --with-deps
+
+      - name: Playwright visual + accessibility tests
+        run: npx playwright test
+
+      - name: App file size check
+        run: |
+          for f in app-ui/*.html; do
+            if [ -f "$f" ]; then
+              SIZE=$(wc -c < "$f" | tr -d ' ')
+              if [ "$SIZE" -gt 51200 ]; then
+                echo "❌ $(basename $f) exceeds 50KB ($SIZE bytes)"
+                exit 1
+              fi
+              echo "✅ $(basename $f) ($SIZE bytes)"
+            fi
+          done
+
+      - name: Security scan
+        run: |
+          ISSUES=0
+          for f in app-ui/*.html; do
+            for pat in "api_key" "apikey" "secret" "sk_live" "pk_live"; do
+              if grep -qi "$pat" "$f" 2>/dev/null; then
+                echo "❌ $(basename $f): potential key exposure ($pat)"
+                ISSUES=$((ISSUES + 1))
+              fi
+            done
+          done
+          [ "$ISSUES" -eq 0 ] || exit 1
+
+      - name: Upload test results
+        uses: actions/upload-artifact@v4
+        if: always()
+        with:
+          name: test-results
+          path: |
+            test-results/
+            coverage/
+          retention-days: 30
+
+  # Optional: DeepEval tool routing (requires API key)
+  tool-routing:
+    runs-on: ubuntu-latest
+    if: github.event_name == 'push' && github.ref == 'refs/heads/main'
+    needs: qa
+    steps:
+      - uses: actions/checkout@v4
+      - uses: actions/setup-python@v5
+        with:
+          python-version: '3.12'
+      - run: pip install deepeval anthropic
+      - name: Run DeepEval tool routing evaluation
+        run: deepeval test run tests/tool_routing_eval.py
+        env:
+          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
+          DEEPEVAL_API_KEY: ${{ secrets.DEEPEVAL_API_KEY }}
+```
+
+---
+
+## Testing Reality Check
+
+> *What the QA catches vs what it misses — from Kofi's review*
+
+### ✅ What This QA Framework CATCHES (real quality):
+
+| Test | What It Validates | Real-World Impact |
+|------|-------------------|-------------------|
+| TypeScript compilation | Code compiles, types correct | Prevents server crashes |
+| MCP Inspector | Protocol compliance | Server works with any MCP client |
+| Playwright visual tests | Apps render all 3 states, dark theme, responsive | Users see a polished UI |
+| axe-core accessibility | WCAG AA, keyboard nav, screen reader | Accessible to all users |
+| XSS payload testing | No script injection via user data | Security against malicious data |
+| Chaos testing (500 errors, wrong formats, huge data) | Graceful degradation | App doesn't crash under adverse conditions |
+| Static cross-reference | All app IDs consistent across 4 files | No broken routes or missing entries |
+| File size budgets | Apps under 50KB | Fast loading |
+| BackstopJS regression | Visual changes are intentional | No accidental UI regressions |
+| Cold start / latency benchmarks | Performance within targets | Users don't wait too long |
+
+### ❌ What This QA Framework MISSES (gaps to be aware of):
+
+| Gap | Why It Matters | Current State | Mitigation |
+|-----|---------------|---------------|------------|
+| **Tool routing accuracy with real LLM** | THE quality metric — does the AI pick the right tool? | DeepEval added (3.2b) but requires API key + cost | Run DeepEval on main branch pushes, not every PR |
+| **APP_DATA generation quality** | Does the LLM produce valid JSON matching app expectations? | Not fully tested — parser is tested, generator is probabilistic | Few-shot examples in system prompts + Layer 6 monitoring |
+| **Multi-step tool chains** | "Find John's email and send him a meeting invite" — requires 3 tool calls | Not tested — all routing tests are single-tool | Add multi-step fixtures to DeepEval test cases |
+| **Conversation context** | "Show me more details about the second one" — requires memory | Not addressed in any skill | Requires thread state tracking — future work |
+| **Real API response shape drift** | MSW mocks may not match real API | MSW validation note added (3.1) but manual | Quarterly mock validation when credentials available |
+| **Production quality after ship** | Is quality maintained over time? | Layer 6 monitoring added | Implement metric collection + weekly review |
+| **APP_DATA parse failure rate in production** | How often does the LLM produce unparseable JSON? | Layer 6 tracks this now | Set alerting threshold at <95% success |
+
+### The Hard Truth:
+This QA framework is excellent at testing **infrastructure** (server compiles, apps render, accessibility passes, security is clean) — roughly 40% of the user experience. The **AI interaction quality** (tool routing, data generation, multi-step flows) is the other 60%, and it's harder to test deterministically because the LLM is probabilistic. Layer 6 monitoring and DeepEval close this gap but don't eliminate it. **Ship with awareness, monitor in production, iterate on system prompts.**
+
+---
+
+## Test Data Fixtures Library
+
+### Standard Fixture: Dashboard
+
+Save as `test-fixtures/dashboard.json`:
+
+```json
+{
+  "title": "Monthly Performance Overview",
+  "metrics": [
+    { "label": "Total Revenue", "value": "$124,500", "change": "+12.3%", "trend": "up" },
+    { "label": "New Customers", "value": 847, "change": "+5.2%", "trend": "up" },
+    { "label": "Churn Rate", "value": "2.1%", "change": "-0.3%", "trend": "down" },
+    { "label": "Avg Response Time", "value": "1.4h", "change": "-8.5%", "trend": "down" }
+  ],
+  "charts": [
+    {
+      "type": "bar",
+      "title": "Revenue by Month",
+      "data": [
+        { "label": "Sep", "value": 95000 },
+        { "label": "Oct", "value": 102000 },
+        { "label": "Nov", "value": 98000 },
+        { "label": "Dec", "value": 115000 },
+        { "label": "Jan", "value": 124500 }
+      ]
+    }
+  ],
+  "data": {
+    "summary": "Revenue is up 12.3% month-over-month with strong customer acquisition."
+  }
+}
+```
+
+### Standard Fixture: Data Grid
+
+Save as `test-fixtures/data-grid.json`:
+
+```json
+{
+  "title": "Active Contacts",
+  "columns": ["Name", "Email", "Phone", "Status", "Created"],
+  "data": [
+    { "name": "John Doe", "email": "john@acmecorp.com", "phone": "555-0101", "status": "active", "created": "2026-01-15" },
+    { "name": "Jane Smith", "email": "jane@techstart.io", "phone": "555-0102", "status": "active", "created": "2026-01-20" },
+    { "name": "Bob Wilson", "email": "bob@globalinc.com", "phone": "555-0103", "status": "inactive", "created": "2025-12-01" },
+    { "name": "Alice Brown", "email": "alice@startup.co", "phone": "555-0104", "status": "active", "created": "2026-02-01" },
+    { "name": "Charlie Davis", "email": "charlie@enterprise.net", "phone": "555-0105", "status": "pending", "created": "2026-02-03" },
+    { "name": "Diana Evans", "email": "diana@agency.com", "phone": "555-0106", "status": "active", "created": "2025-11-15" },
+    { "name": "Frank Garcia", "email": "frank@solutions.biz", "phone": "555-0107", "status": "active", "created": "2026-01-28" },
+    { "name": "Grace Hill", "email": "grace@design.studio", "phone": "555-0108", "status": "inactive", "created": "2025-10-05" }
+  ],
+  "meta": { "total": 156, "page": 1, "pageSize": 25 }
+}
+```
+
+### Standard Fixture: Timeline
+
+Save as `test-fixtures/timeline.json`:
+
+```json
+{
+  "title": "Contact Activity Timeline",
+  "events": [
+    { "date": "2026-02-04T14:30:00Z", "title": "Email Opened", "description": "Campaign: February Newsletter", "type": "email" },
+    { "date": "2026-02-03T10:15:00Z", "title": "Meeting Scheduled", "description": "Demo call with sales team", "type": "meeting" },
+    { "date": "2026-02-01T09:00:00Z", "title": "Deal Created", "description": "Enterprise Plan — $15,000/yr", "type": "deal" },
+    { "date": "2026-01-28T16:45:00Z", "title": "Form Submitted", "description": "Requested pricing information", "type": "form" },
+    { "date": "2026-01-25T11:30:00Z", "title": "First Visit", "description": "Visited pricing page from Google Ads", "type": "visit" }
+  ]
+}
+```
+
+### Edge Case Fixtures
+
+Save as `test-fixtures/edge-cases.json`:
+
+```json
+{
+  "empty_strings": {
+    "data": [
+      { "name": "", "email": "", "phone": "", "status": "" }
+    ]
+  },
+  "null_values": {
+    "data": [
+      { "name": null, "email": null, "phone": null, "status": null }
+    ]
+  },
+  "extremely_long_text": {
+    "data": [
+      {
+        "name": "Bartholomew Christopherson-Williamsworth III, Esq., Ph.D., M.B.A., J.D., CPA, CFP®, CAIA®, FRM®",
+        "email": "bartholomew.christopherson-williamsworth.the.third.esquire.phd.mba.jd@extremely-long-company-name-international-holdings-corporation-unlimited.com",
+        "phone": "+1 (555) 012-3456 ext. 78901234",
+        "status": "active — pending final review by committee chairperson and board of directors",
+        "notes": "Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum."
+      }
+    ]
+  },
+  "unicode": {
+    "data": [
+      { "name": "田中太郎", "email": "tanaka@例え.jp", "status": "アクティブ" },
+      { "name": "Müller, Günther", "email": "günther@münchen.de", "status": "aktiv" },
+      { "name": "Дмитрий Иванов", "email": "dmitry@компания.ru", "status": "активный" },
+      { "name": "محمد عبدالله", "email": "mohammed@شركة.sa", "status": "نشط" },
+      { "name": "🧑‍💻 Developer", "email": "dev@🏢.com", "status": "✅ Active" }
+    ]
+  },
+  "html_entities": {
+    "data": [
+      { "name": "O'Brien & Sons <LLC>", "email": "info@obrien&sons.com", "notes": 'He said "hello" & left' }
+    ]
+  }
+}
+```
+
+### Adversarial Fixtures
+
+Save as `test-fixtures/adversarial.json`:
+
+```json
+{
+  "xss_payloads": {
+    "data": [
+      { "name": "<script>alert('xss')</script>", "email": "test@test.com" },
+      { "name": "<img src=x onerror=alert(1)>", "email": "\"><script>alert(1)</script>" },
+      { "name": "<svg onload=alert('xss')>", "email": "javascript:alert(1)" },
+      { "name": "{{constructor.constructor('return this')().alert(1)}}", "email": "test@test.com" },
+      { "name": "<details open ontoggle=alert(1)>", "email": "<iframe src='javascript:alert(1)'>" }
+    ]
+  },
+  "sql_injection": {
+    "data": [
+      { "name": "'; DROP TABLE contacts; --", "email": "test@test.com" },
+      { "name": "1' OR '1'='1", "email": "' UNION SELECT * FROM users --" },
+      { "name": "admin'--", "email": "1; UPDATE users SET role='admin'" }
+    ]
+  },
+  "malformed": {
+    "missing_fields": { "data": [{ "id": "1" }] },
+    "wrong_types": { "data": "not an array", "meta": "not an object" },
+    "nested_nulls": { "data": [{ "name": { "first": null, "last": null }, "contacts": [null, null] }] },
+    "circular_attempt": { "data": [{ "self": "[Circular]" }] },
+    "massive_nesting": { "a": { "b": { "c": { "d": { "e": { "f": { "g": "deep" } } } } } } }
+  }
+}
+```
+
+### Scale Fixture Generator
+
+```typescript
+// tests/generate-scale-fixture.ts
+// Run: npx ts-node tests/generate-scale-fixture.ts > test-fixtures/scale-1000.json
+
+function generateScaleData(count: number) {
+  const statuses = ['active', 'inactive', 'pending', 'archived'];
+  const domains = ['gmail.com', 'outlook.com', 'company.co', 'startup.io', 'enterprise.net'];
+  
+  return {
+    title: `Scale Test: ${count} Records`,
+    data: Array.from({ length: count }, (_, i) => ({
+      id: `contact-${String(i).padStart(6, '0')}`,
+      name: `Contact ${i + 1}`,
+      email: `user${i + 1}@${domains[i % domains.length]}`,
+      phone: `555-${String(i).padStart(4, '0')}`,
+      status: statuses[i % statuses.length],
+      created: new Date(2025, 0, 1 + (i % 365)).toISOString().split('T')[0],
+      value: Math.round(Math.random() * 100000) / 100,
+      tags: [`tag-${i % 10}`, `region-${i % 5}`]
+    })),
+    meta: { total: count, page: 1, pageSize: count }
+  };
+}
+
+console.log(JSON.stringify(generateScaleData(1000), null, 2));
+```
+
+---
+
+## Regression Testing Baselines
+
+### Baseline Workflow
+
+```
+1. CAPTURE — First time app is verified correct:
+   backstop reference
+   # Stores golden screenshots in test-baselines/backstop/
+
+2. TEST — On every subsequent QA run:
+   backstop test
+   # Compares current screenshots against baselines
+   # Result: PASS (<5% diff) or FAIL (>5% diff)
+
+3. APPROVE — When intentional changes are made:
+   backstop approve
+   # Updates baselines to reflect new correct state
+
+4. TRACK — Tool routing baselines:
+   # test-fixtures/tool-routing.json is the routing baseline
+   # Update ONLY when intentionally changing tool descriptions
+   # Run routing tests after ANY tool description change
+```
+
+### Screenshot Baseline Structure
+
+```
+test-baselines/
+├── backstop/
+│   ├── {app-name}_thread-panel_data.png
+│   ├── {app-name}_thread-panel_loading.png
+│   ├── {app-name}_thread-panel_empty.png
+│   ├── {app-name}_narrow_data.png
+│   └── {app-name}_wide_data.png
+├── tool-routing.json          # NL → tool mapping baseline
+└── app-data-schemas/          # JSON schemas per app type
+    ├── dashboard.schema.json
+    ├── data-grid.schema.json
+    ├── detail-card.schema.json
+    ├── timeline.schema.json
+    └── pipeline.schema.json
+```
+
+### Programmatic Screenshot Comparison (Without BackstopJS)
+
+```typescript
+// tests/screenshot-diff.ts
+import { PNG } from 'pngjs';
+import * as fs from 'fs';
+import pixelmatch from 'pixelmatch';
+
+function compareScreenshots(
+  baselinePath: string,
+  currentPath: string,
+  diffOutputPath: string
+): { diffPercent: number; pass: boolean } {
+  const baseline = PNG.sync.read(fs.readFileSync(baselinePath));
+  const current = PNG.sync.read(fs.readFileSync(currentPath));
+  
+  const { width, height } = baseline;
+  const diff = new PNG({ width, height });
+  
+  const numDiffPixels = pixelmatch(
+    baseline.data, current.data, diff.data,
+    width, height,
+    { threshold: 0.1 }
+  );
+  
+  const totalPixels = width * height;
+  const diffPercent = (numDiffPixels / totalPixels) * 100;
+  
+  if (diffPercent > 5) {
+    fs.writeFileSync(diffOutputPath, PNG.sync.write(diff));
+  }
+  
+  return {
+    diffPercent: Math.round(diffPercent * 100) / 100,
+    pass: diffPercent <= 5.0
+  };
+}
+```
+
+---
+
+## Automated QA Script (Full)
+
+Save as `scripts/mcp-qa.sh`:
+
+```bash
+#!/bin/bash
+set -euo pipefail
+
+# MCP QA — Automated Testing Pipeline
+# Usage: ./mcp-qa.sh <service-name> [--skip-layer4]
+#
+# Runs all automated layers and generates a persistent report.
+
+SERVICE="$1"
+SKIP_LAYER4="${2:-}"
+DATE=$(date +%Y-%m-%d)
+TIMESTAMP=$(date +%Y%m%d-%H%M%S)
+
+if [ -z "$SERVICE" ]; then
+  echo "Usage: $0 <service-name> [--skip-layer4]"
+  exit 1
+fi
+
+# Persistent report location
+REPORT_DIR="$HOME/.clawdbot/workspace/mcp-factory-reviews/${SERVICE}"
+mkdir -p "$REPORT_DIR"
+REPORT="${REPORT_DIR}/qa-report-${DATE}.md"
+
+# Find server directory
+SERVER_DIR=""
+for d in "${SERVICE}-mcp" "mcp-servers/${SERVICE}" "mcp-diagrams/mcp-servers/${SERVICE}"; do
+  if [ -d "$d" ]; then
+    SERVER_DIR="$d"
+    break
+  fi
+done
+
+if [ -z "$SERVER_DIR" ]; then
+  echo "❌ Server directory not found for ${SERVICE}"
+  exit 1
+fi
+
+cat > "$REPORT" << EOF
+# MCP QA Report: ${SERVICE}
+**Date:** ${DATE}
+**Timestamp:** ${TIMESTAMP}
+**Tester:** Automated QA Pipeline
+**Server:** ${SERVER_DIR}
+
+---
+
+## Quantitative Metrics
+
+| Metric | Target | Actual | Status |
+|--------|--------|--------|--------|
+EOF
+
+TOTAL_PASS=0
+TOTAL_FAIL=0
+TOTAL_WARN=0
+TOTAL_SKIP=0
+
+pass() { TOTAL_PASS=$((TOTAL_PASS + 1)); echo "✅ $1"; }
+fail() { TOTAL_FAIL=$((TOTAL_FAIL + 1)); echo "❌ $1"; }
+warn() { TOTAL_WARN=$((TOTAL_WARN + 1)); echo "⚠️  $1"; }
+skip() { TOTAL_SKIP=$((TOTAL_SKIP + 1)); echo "⏭️  $1"; }
+
+echo ""
+echo "========================================"
+echo "  MCP QA Pipeline: ${SERVICE}"
+echo "  $(date)"
+echo "========================================"
+echo ""
+
+# ─── LAYER 0: Protocol Compliance ───
+echo "--- Layer 0: Protocol Compliance ---"
+echo "" >> "$REPORT"
+echo "## Layer 0: Protocol Compliance" >> "$REPORT"
+
+cd "$SERVER_DIR"
+
+# Build first
+if npm run build 2>&1 | tail -5 > /tmp/mcp-qa-build.log; then
+  pass "TypeScript build succeeded"
+  echo "- ✅ TypeScript build succeeded" >> "$REPORT"
+else
+  fail "TypeScript build FAILED"
+  echo "- ❌ TypeScript build FAILED" >> "$REPORT"
+  cat /tmp/mcp-qa-build.log >> "$REPORT"
+fi
+
+# MCP Inspector (if available)
+if command -v npx &> /dev/null; then
+  echo "Running MCP Inspector..."
+  if timeout 15 npx @modelcontextprotocol/inspector stdio node dist/index.js 2>/tmp/mcp-inspector.log; then
+    pass "MCP Inspector passed"
+    echo "- ✅ MCP Inspector passed" >> "$REPORT"
+  else
+    warn "MCP Inspector had issues (check /tmp/mcp-inspector.log)"
+    echo "- ⚠️  MCP Inspector had issues" >> "$REPORT"
+  fi
+else
+  skip "MCP Inspector (npx not available)"
+  echo "- ⏭️  MCP Inspector skipped" >> "$REPORT"
+fi
+
+cd - > /dev/null
+
+# ─── LAYER 1: Static Analysis ───
+echo ""
+echo "--- Layer 1: Static Analysis ---"
+echo "" >> "$REPORT"
+echo "## Layer 1: Static Analysis" >> "$REPORT"
+
+# TypeScript type check
+cd "$SERVER_DIR"
+if npx tsc --noEmit 2>&1 | tail -3 > /tmp/mcp-qa-typecheck.log; then
+  pass "tsc --noEmit clean"
+  echo "- ✅ Type check clean" >> "$REPORT"
+else
+  fail "tsc --noEmit has errors"
+  echo "- ❌ Type check errors:" >> "$REPORT"
+  cat /tmp/mcp-qa-typecheck.log >> "$REPORT"
+fi
+cd - > /dev/null
+
+# Any types
+ANY_COUNT=$(grep -rn ": any" "$SERVER_DIR/src/" --include="*.ts" 2>/dev/null | grep -cv "catch\|eslint\|node_modules" || echo "0")
+if [ "$ANY_COUNT" -eq 0 ]; then
+  pass "No unintended 'any' types"
+else
+  warn "${ANY_COUNT} 'any' types found"
+fi
+echo "- any types: ${ANY_COUNT}" >> "$REPORT"
+
+# SDK version
+SDK_VER=$(cd "$SERVER_DIR" && node -e "console.log(require('./package.json').dependencies['@modelcontextprotocol/sdk'] || 'NOT FOUND')" 2>/dev/null || echo "UNKNOWN")
+echo "- SDK version: ${SDK_VER}" >> "$REPORT"
+# Warn if SDK is below 1.26.0 (security fix)
+if echo "$SDK_VER" | grep -q "1.25"; then
+  warn "SDK version ${SDK_VER} — should be ^1.26.0+ (security fix GHSA-345p-7cg4-v4c7)"
+  echo "- ⚠️  SDK should be ^1.26.0+ (security fix)" >> "$REPORT"
+fi
+
+# App files
+echo "" >> "$REPORT"
+echo "### App Files" >> "$REPORT"
+APP_COUNT=0
+APP_OVERSIZED=0
+for dir in "$SERVER_DIR/app-ui" "$SERVER_DIR/ui/dist"; do
+  if [ -d "$dir" ]; then
+    for f in "$dir"/*.html; do
+      if [ -f "$f" ]; then
+        SIZE=$(wc -c < "$f" | tr -d ' ')
+        KB=$((SIZE / 1024))
+        APP_COUNT=$((APP_COUNT + 1))
+        if [ "$SIZE" -gt 51200 ]; then
+          APP_OVERSIZED=$((APP_OVERSIZED + 1))
+          echo "- ⚠️  $(basename $f): ${KB}KB (over 50KB budget)" >> "$REPORT"
+        else
+          echo "- ✅ $(basename $f): ${KB}KB" >> "$REPORT"
+        fi
+      fi
+    done
+  fi
+done
+echo "| App File Size | <50KB each | ${APP_OVERSIZED}/${APP_COUNT} over budget | $([ $APP_OVERSIZED -eq 0 ] && echo '✅' || echo '⚠️') |" >> /tmp/mcp-qa-metrics.txt
+
+# ─── LAYER 2: Jest Unit Tests ───
+echo ""
+echo "--- Layer 2: Automated Tests ---"
+echo "" >> "$REPORT"
+echo "## Layer 2: Automated Tests" >> "$REPORT"
+
+cd "$SERVER_DIR"
+if [ -f "jest.config.ts" ] || [ -f "jest.config.js" ] || grep -q '"jest"' package.json 2>/dev/null; then
+  echo "Running Jest tests..."
+  if npx jest --ci --coverage 2>&1 | tee /tmp/mcp-qa-jest.log | tail -10; then
+    pass "Jest tests passed"
+    echo "- ✅ Jest tests passed" >> "$REPORT"
+  else
+    fail "Jest tests FAILED"
+    echo "- ❌ Jest tests failed" >> "$REPORT"
+    tail -20 /tmp/mcp-qa-jest.log >> "$REPORT"
+  fi
+else
+  skip "No Jest config found"
+  echo "- ⏭️  No Jest test suite found" >> "$REPORT"
+fi
+
+# Playwright visual tests
+if [ -f "playwright.config.ts" ] || [ -f "playwright.config.js" ]; then
+  echo "Running Playwright visual tests..."
+  if npx playwright test 2>&1 | tee /tmp/mcp-qa-playwright.log | tail -10; then
+    pass "Playwright tests passed"
+    echo "- ✅ Playwright tests passed" >> "$REPORT"
+  else
+    fail "Playwright tests FAILED"
+    echo "- ❌ Playwright tests failed" >> "$REPORT"
+    tail -20 /tmp/mcp-qa-playwright.log >> "$REPORT"
+  fi
+else
+  skip "No Playwright config found"
+  echo "- ⏭️  No Playwright test suite found" >> "$REPORT"
+fi
+
+# BackstopJS visual regression
+if [ -f "backstop.json" ]; then
+  echo "Running BackstopJS regression..."
+  if backstop test 2>&1 | tee /tmp/mcp-qa-backstop.log | tail -5; then
+    pass "BackstopJS regression passed"
+    echo "- ✅ Visual regression passed" >> "$REPORT"
+  else
+    warn "BackstopJS regression detected differences"
+    echo "- ⚠️  Visual regression diffs detected" >> "$REPORT"
+  fi
+else
+  skip "No backstop.json found"
+  echo "- ⏭️  No BackstopJS config found" >> "$REPORT"
+fi
+
+cd - > /dev/null
+
+# ─── LAYER 4: Live API (optional) ───
+if [ "$SKIP_LAYER4" != "--skip-layer4" ]; then
+  echo ""
+  echo "--- Layer 4: Live API Testing ---"
+  echo "" >> "$REPORT"
+  echo "## Layer 4: Live API Testing" >> "$REPORT"
+
+  if [ -f "$SERVER_DIR/.env" ]; then
+    pass ".env file exists"
+    echo "- ✅ .env credentials found" >> "$REPORT"
+    echo "- ⚠️  Manual verification of live API required" >> "$REPORT"
+  else
+    skip "No .env file — skipping live API tests"
+    echo "- ⏭️  No credentials available" >> "$REPORT"
+  fi
+else
+  skip "Layer 4 skipped (--skip-layer4)"
+  echo "" >> "$REPORT"
+  echo "## Layer 4: Live API Testing — SKIPPED" >> "$REPORT"
+fi
+
+# ─── SECURITY SCAN ───
+echo ""
+echo "--- Layer 4.5: Security Scan ---"
+echo "" >> "$REPORT"
+echo "## Layer 4.5: Security Scan" >> "$REPORT"
+
+SECURITY_ISSUES=0
+for dir in "$SERVER_DIR/app-ui" "$SERVER_DIR/ui/dist"; do
+  if [ -d "$dir" ]; then
+    for f in "$dir"/*.html; do
+      if [ -f "$f" ]; then
+        # Check for potential key exposure
+        for pat in "api.key" "apikey" "api_key" "secret" "sk_live" "pk_live"; do
+          if grep -qi "$pat" "$f" 2>/dev/null; then
+            SECURITY_ISSUES=$((SECURITY_ISSUES + 1))
+            echo "- ❌ $(basename $f): potential key exposure (${pat})" >> "$REPORT"
+          fi
+        done
+      fi
+    done
+  fi
+done
+
+if [ "$SECURITY_ISSUES" -eq 0 ]; then
+  pass "No API key exposure detected"
+  echo "- ✅ No API key exposure detected in app files" >> "$REPORT"
+else
+  fail "${SECURITY_ISSUES} potential security issues"
+fi
+
+# ─── SUMMARY ───
+echo ""
+echo "========================================"
+echo "  SUMMARY"
+echo "========================================"
+echo "  ✅ Passed: ${TOTAL_PASS}"
+echo "  ❌ Failed: ${TOTAL_FAIL}"
+echo "  ⚠️  Warnings: ${TOTAL_WARN}"
+echo "  ⏭️  Skipped: ${TOTAL_SKIP}"
+echo "========================================"
+
+OVERALL="PASS"
+[ "$TOTAL_FAIL" -gt 0 ] && OVERALL="FAIL"
+[ "$TOTAL_FAIL" -eq 0 ] && [ "$TOTAL_WARN" -gt 0 ] && OVERALL="PASS WITH WARNINGS"
+
+cat >> "$REPORT" << EOF
+
+---
+
+## Summary
+
+| Category | Count |
+|----------|-------|
+| ✅ Passed | ${TOTAL_PASS} |
+| ❌ Failed | ${TOTAL_FAIL} |
+| ⚠️  Warnings | ${TOTAL_WARN} |
+| ⏭️  Skipped | ${TOTAL_SKIP} |
+
+## Overall: **${OVERALL}**
+
+---
+
+*Report generated by MCP QA Pipeline v2.0*
+*Saved to: ${REPORT}*
+EOF
+
+echo ""
+echo "Report saved to: $REPORT"
+echo "Overall: ${OVERALL}"
+```
+
+---
+
+## Test Report Template (Full)
+
+Generate this after running all layers. Save to `mcp-factory-reviews/{service}/qa-report-{date}.md`:
+
+```markdown
+# MCP QA Report: {Service Name}
+**Date:** {YYYY-MM-DD}
+**Tester:** {agent/human}
+**Server:** {service}-mcp v{version}
+**Apps:** {count} apps tested
+**Credential Status:** {has-creds|needs-creds|sandbox|no-sandbox}
+
+---
+
+## Quantitative Metrics
+
+| Metric | Target | Actual | Status |
+|--------|--------|--------|--------|
+| MCP Protocol Compliance | 100% | __%  | ✅/❌ |
+| Tool Correctness Rate | >95% | __/20 (__%) | ✅/❌ |
+| Task Completion Rate | >90% | __/10 (__%) | ✅/❌ |
+| APP_DATA Schema Match | 100% | __/__ (__%) | ✅/❌ |
+| Response Latency P50 | <3s | __s | ✅/❌ |
+| Response Latency P95 | <8s | __s | ✅/❌ |
+| App Render Success | 100% | __/__ | ✅/❌ |
+| Accessibility Score | >90 | __% | ✅/❌ |
+| Cold Start Time | <2s | __ms | ✅/❌ |
+| App File Size (max) | <50KB | __KB | ✅/❌ |
+| Security (critical) | 0 | __ | ✅/❌ |
+
+## Layer Results
+
+| Layer | Status | Issues | Details |
+|-------|--------|--------|---------|
+| 0 — Protocol | ✅/⚠️/❌ | {count} | {notes} |
+| 1 — Static | ✅/⚠️/❌ | {count} | {notes} |
+| 2 — Visual | ✅/⚠️/❌ | {count} | {notes} |
+| 2.5 — Accessibility | ✅/⚠️/❌ | {count} | {notes} |
+| 3 — Functional | ✅/⚠️/❌ | {count} | {notes} |
+| 3.5 — Performance | ✅/⚠️/❌ | {count} | {notes} |
+| 4 — Live API | ✅/⚠️/❌/⏭️ | {count} | {notes} |
+| 4.5 — Security | ✅/⚠️/❌ | {count} | {notes} |
+| 5 — Integration | ✅/⚠️/❌ | {count} | {notes} |
+
+## Overall: {PASS / PASS WITH WARNINGS / FAIL}
+
+---
+
+## Issues Found
+
+### Critical (must fix before ship)
+1. {issue}: {description} — {file:line}
+
+### Warnings (should fix)
+1. {issue}: {description}
+
+### Notes (nice to have)
+1. {observation}
+
+---
+
+## App-by-App Results
+
+### {app-id-1}
+- Visual: ✅/❌ — {notes}
+- Accessibility: Score __% — {violations}
+- Data flow: ✅/❌ — {notes}
+- States (loading/empty/data): ✅/❌
+- File size: __KB
+- XSS test: ✅/❌
+- Screenshot: {path}
+
+---
+
+## Tool Invocation Results
+
+| # | NL Message | Expected Tool | Actual Tool | Correct? | Latency |
+|---|-----------|---------------|-------------|----------|---------|
+| 1 | "Show me all contacts" | list_contacts | | ✅/❌ | ms |
+| 2 | "Find John Smith" | search_contacts | | ✅/❌ | ms |
+| ... | | | | | |
+| 20 | | | | | |
+
+**Tool Correctness Rate: __/20 = __%**
+
+---
+
+## E2E Scenario Results
+
+| # | Scenario | Steps | Completed? | Latency | Notes |
+|---|----------|-------|-----------|---------|-------|
+| 1 | {name} | {n} | ✅/❌ | ms | |
+| ... | | | | | |
+| 10 | | | | | |
+
+**Task Completion Rate: __/10 = __%**
+
+---
+
+## Trend (vs Previous Report)
+
+| Metric | Previous | Current | Change |
+|--------|----------|---------|--------|
+| Tool Correctness | __% | __% | +/-__% |
+| Task Completion | __% | __% | +/-__% |
+| Accessibility | __% | __% | +/-__% |
+| Avg Latency | __s | __s | +/-__s |
+
+---
+
+## Recommendations
+1. {what to fix/improve before shipping}
+2. {items for next QA cycle}
+
+---
+
+*Report saved to: mcp-factory-reviews/{service}/qa-report-{date}.md*
+*Previous reports in same directory for trending.*
+```
+
+### Report Trending Script
+
+```bash
+#!/bin/bash
+# Aggregate QA trends across reports
+# Usage: ./qa-trend.sh <service-name>
+
+SERVICE="$1"
+REPORT_DIR="$HOME/.clawdbot/workspace/mcp-factory-reviews/${SERVICE}"
+
+if [ ! -d "$REPORT_DIR" ]; then
+  echo "No reports found for ${SERVICE}"
+  exit 1
+fi
+
+echo "=== QA Trend: ${SERVICE} ==="
+echo ""
+echo "| Date | Overall | Pass | Fail | Warn |"
+echo "|------|---------|------|------|------|"
+
+for report in $(ls -1 "$REPORT_DIR"/qa-report-*.md 2>/dev/null | sort); do
+  DATE=$(basename "$report" | sed 's/qa-report-//' | sed 's/.md//')
+  OVERALL=$(grep "^## Overall:" "$report" 2>/dev/null | head -1 | sed 's/.*\*\*//' | sed 's/\*\*.*//')
+  PASS=$(grep "✅ Passed" "$report" 2>/dev/null | grep -o '[0-9]*' | head -1 || echo "?")
+  FAIL=$(grep "❌ Failed" "$report" 2>/dev/null | grep -o '[0-9]*' | head -1 || echo "?")
+  WARN=$(grep "⚠️" "$report" 2>/dev/null | grep -o '[0-9]*' | head -1 || echo "?")
+  echo "| ${DATE} | ${OVERALL} | ${PASS} | ${FAIL} | ${WARN} |"
+done
+```
+
+---
+
+## Quick Reference Commands
+
+```bash
+# ─── LAYER 0 ───
+# MCP Inspector (protocol compliance)
+npx @modelcontextprotocol/inspector stdio node dist/index.js
+
+# ─── LAYER 1 ───
+# Quick compile + type check
+cd {service}-mcp && npm run build && npx tsc --noEmit
+
+# ─── LAYER 2 ───
+# Run Playwright visual tests
+npx playwright test tests/visual.test.ts
+
+# Run BackstopJS regression
+backstop test
+
+# Capture new baselines
+backstop reference
+
+# ─── LAYER 2.5 ───
+# Run accessibility tests
+npx playwright test tests/accessibility.test.ts
+
+# ─── LAYER 3 ───
+# Run Jest unit tests
+npx jest --verbose
+
+# Run tool routing tests
+npx jest tests/tool-routing.test.ts
+
+# Validate APP_DATA schemas
+npx ts-node tests/app-data-validator.ts
+
+# ─── LAYER 3.5 ───
+# Cold start benchmark
+time echo '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2025-11-25","capabilities":{},"clientInfo":{"name":"perf","version":"1.0"}}}' | timeout 10 node dist/index.js | head -1
+
+# File size audit
+for f in app-ui/*.html; do echo "$(wc -c < "$f" | tr -d ' ') $f"; done | sort -n
+
+# ─── LAYER 4 ───
+# Start server for manual testing
+node dist/index.js
+
+# ─── LAYER 4.5 ───
+# Security scan
+grep -rn "apikey\|api_key\|secret\|sk_live" app-ui/ --include="*.html"
+
+# ─── LAYER 5 ───
+# Full automated pipeline
+./scripts/mcp-qa.sh {service-name}
+
+# Trend report
+./scripts/qa-trend.sh {service-name}
+
+# ─── BROWSER TOOLS ───
+# Screenshot via browser tool
+# browser → open → http://192.168.0.25:3000 → navigate → screenshot
+
+# Monitor postMessages in browser console
+# window.addEventListener('message', e => console.log('[PM]', e.data.type, e.data))
+
+# axe-core in browser console (paste the snippet from Layer 2.5.2)
+```
+
+---
+
+## Common Issues & Fixes
+
+| Symptom | Layer | Cause | Fix |
+|---------|-------|-------|-----|
+| App shows blank white screen | 2 | HTML file not found or wrong path | Check APP_NAME_MAP + APP_DIRS in route.ts |
+| App shows loading forever | 3 | postMessage not received | Check data block format: `<!--APP_DATA:{...}:END_APP_DATA-->` |
+| App renders but wrong data | 3 | APP_DATA JSON shape mismatch | Compare tool response fields with app's render() expectations |
+| Tool not triggered by NL | 3 | Poor tool description | Add "do NOT use when" disambiguation |
+| Wrong tool triggered | 3 | Similar tool descriptions | Add negative examples to both competing tools |
+| Thread panel empty | 3 | Thread state not persisted | Check localStorage `lb-threads` key |
+| Console error: CORS | 2 | iframe cross-origin issue | Ensure app served from same origin |
+| Dark theme wrong | 2 | Hardcoded light colors | Audit CSS for `#fff`, `white`, `#f` colors |
+| Overflow at narrow width | 2 | Fixed widths in CSS | Use `max-width: 100%`, `overflow-x: auto`, flex/grid |
+| axe-core contrast fail | 2.5 | Text color too dim | Use #b0b2b8+ for secondary text (not #96989d) |
+| MCP Inspector fails | 0 | Protocol error in server | Check initialize handler, verify JSON-RPC framing |
+| Cold start >2s | 3.5 | Heavy imports at startup | Use lazy loading for tool groups |
+| structuredContent mismatch | 0 | Output doesn't match outputSchema | Validate tool return against declared schema |
+| APP_DATA parse fails | 3 | LLM produced invalid JSON | Use robust parser with newline stripping + trailing comma fix |
+| XSS detected | 4.5 | Missing escapeHtml on field | Add escapeHtml() to all dynamic text insertions |
+| Key exposure | 4.5 | API key in HTML file | Move to server-side only, never send to client |
+
+---
+
+## Project Setup: Adding Tests to an Existing Server
+
+When adding this test framework to a server that doesn't have it yet:
+
+```bash
+cd {service}-mcp
+
+# 1. Install test dependencies
+npm install -D jest ts-jest @types/jest msw playwright @playwright/test @axe-core/playwright ajv pngjs pixelmatch backstopjs
+
+# 2. Add Jest config
+cat > jest.config.ts << 'EOF'
+export default {
+  preset: 'ts-jest',
+  testEnvironment: 'node',
+  testPathPattern: 'tests/.*\\.test\\.ts$',
+  setupFilesAfterSetup: ['./tests/setup.ts'],
+};
+EOF
+
+# 3. Add Playwright config
+cat > playwright.config.ts << 'EOF'
+import { defineConfig, devices } from '@playwright/test';
+export default defineConfig({
+  testDir: './tests',
+  testMatch: ['visual.test.ts', 'accessibility.test.ts', 'chaos.test.ts'],
+  projects: [
+    { name: 'chromium', use: { ...devices['Desktop Chrome'] } },
+    { name: 'firefox', use: { ...devices['Desktop Firefox'] } },
+    { name: 'webkit', use: { ...devices['Desktop Safari'] } },
+  ],
+});
+EOF
+
+# 4. Create directory structure
+mkdir -p tests test-fixtures test-baselines/backstop test-baselines/app-data-schemas test-results/screenshots
+
+# 5. Create initial fixture files
+# (copy from the fixtures library section above)
+
+# 6. Add scripts to package.json
+npm pkg set scripts.test="jest"
+npm pkg set scripts.test:visual="playwright test"
+npm pkg set scripts.test:a11y="playwright test tests/accessibility.test.ts"
+npm pkg set scripts.test:all="jest && playwright test"
+npm pkg set scripts.qa="../../scripts/mcp-qa.sh $(basename $(pwd) -mcp)"
+
+# 7. Install Playwright browsers
+npx playwright install
+```
diff --git a/skills/mcp-server-builder/SKILL.md b/skills/mcp-server-builder/SKILL.md
new file mode 100644
index 0000000..db7e4c4
--- /dev/null
+++ b/skills/mcp-server-builder/SKILL.md
@@ -0,0 +1,2609 @@
+# MCP Server Builder — Phase 2: Build the MCP Server
+
+**When to use this skill:** You have a completed `{service}-api-analysis.md` from Phase 1 and need to produce a fully compiled MCP server. This skill contains every pattern, template, and standard needed to build from scratch.
+
+**What this covers:** Project scaffolding, TypeScript MCP server with Feb 2026 SDK standards (annotations, `title`, `outputSchema`, `structuredContent`, lazy loading, Zod validation), auth patterns, error handling, rate limiting, circuit breaker, structured logging, pagination strategies, request timeouts, and tool description optimization.
+
+**Pipeline position:** Phase 2 of 6 → Input from `mcp-api-analyzer` (Phase 1), output feeds `mcp-app-designer` (Phase 3) and `mcp-localbosses-integrator` (Phase 4)
+
+**MCP Spec Compliance:** 2025-11-25 spec, TypeScript SDK `^1.26.0`
+
+---
+
+## 1. Inputs & Outputs
+
+**Input:** `{service}-api-analysis.md` (from Phase 1)
+**Output:** Complete MCP server directory:
+
+```
+{service}-mcp/
+├── src/
+│   ├── index.ts              # Server entry, transport selection, orchestration
+│   ├── client.ts             # API client (auth, timeouts, circuit breaker, retry, rate limiting)
+│   ├── logger.ts             # Structured JSON logging on stderr
+│   ├── tools/
+│   │   ├── index.ts          # Tool registry + lazy loader
+│   │   ├── health.ts         # health_check tool (always included)
+│   │   ├── {group1}.ts       # Tool group: definitions + handlers
+│   │   ├── {group2}.ts       # Tool group: definitions + handlers
+│   │   └── ...
+│   └── types.ts              # Shared TypeScript interfaces
+├── app-ui/                   # (Created in Phase 3)
+├── dist/                     # Compiled output
+├── package.json
+├── tsconfig.json
+├── .env.example
+├── .gitignore
+└── README.md
+```
+
+**When to use one-file pattern instead:** If the analysis doc shows ≤15 tools total, put everything in `src/index.ts`. Split into modules only when there are 15+ tools or multiple tool groups.
+
+**Reference template:** `mcp-diagrams/mcp-servers/template/` — use as starting point, then customize.
+
+---
+
+## 2. Template Variable Reference
+
+**IMPORTANT:** All templates use placeholder variables that MUST be replaced before use. Search-and-replace all of these:
+
+| Pattern | Convention | Example | Used In |
+|---------|-----------|---------|---------|
+| `{service}` | lowercase, hyphenated | `calendly` | directory names, package name, MCP name |
+| `{SERVICE}` | UPPER_SNAKE_CASE | `CALENDLY` | environment variable names |
+| `{Service}` | PascalCase | `Calendly` | class names, display titles |
+| `{Service Name}` | Title Case with spaces | `Calendly` | README headings, descriptions |
+| `{group}` | lowercase | `contacts` | tool group filenames |
+| `{group_name}` | lowercase with underscores | `contact_management` | group identifiers |
+| `{resources}` | lowercase plural | `contacts` | tool names, API endpoints |
+| `{resource}` | lowercase singular | `contact` | tool names, API endpoints |
+| `{Resource}` | PascalCase singular | `Contact` | TypeScript type names |
+
+**Verification step:** After building, run `grep -r '{service}\|{SERVICE}\|{Service}\|{group}\|{resource}\|{Resource}' src/` — output should be empty.
+
+---
+
+## 3. Project Scaffolding
+
+### Step 1: Create directory and init
+
+```bash
+mkdir -p {service}-mcp/src/tools
+cd {service}-mcp
+
+# Initialize package.json
+cat > package.json << 'EOF'
+{
+  "name": "mcp-server-{service}",
+  "version": "1.0.0",
+  "type": "module",
+  "main": "dist/index.js",
+  "bin": {
+    "mcp-server-{service}": "dist/index.js"
+  },
+  "scripts": {
+    "build": "tsc",
+    "start": "node dist/index.js",
+    "start:http": "MCP_TRANSPORT=http node dist/index.js",
+    "dev": "tsx src/index.ts"
+  },
+  "dependencies": {
+    "@modelcontextprotocol/sdk": "^1.26.0",
+    "zod": "^3.25.0"
+  },
+  "devDependencies": {
+    "@types/node": "^22.0.0",
+    "tsx": "^4.7.0",
+    "typescript": "^5.5.0"
+  }
+}
+EOF
+
+npm install
+```
+
+> **Security Note (Feb 2026):** v1.26.0 fixes GHSA-345p-7cg4-v4c7 (cross-client data leak in shared transport instances). Always use ≥1.26.0.
+>
+> **SDK v2 Warning:** The TypeScript SDK v2 is in pre-alpha (stable expected Q1 2026). Pin to v1.x for production. v1.x will receive bug fixes for 6+ months after v2 ships.
+>
+> **Zod v4 Warning:** Do NOT use Zod v4.x with MCP SDK v1.x — known incompatibility (issue #1429, `w._parse is not a function`). The `^3.25.0` pin is correct and will not pull in Zod v4.
+
+### Step 2: TypeScript config
+
+```bash
+cat > tsconfig.json << 'EOF'
+{
+  "compilerOptions": {
+    "target": "ES2022",
+    "module": "NodeNext",
+    "moduleResolution": "NodeNext",
+    "outDir": "./dist",
+    "rootDir": "./src",
+    "strict": true,
+    "esModuleInterop": true,
+    "skipLibCheck": true,
+    "forceConsistentCasingInFileNames": true,
+    "declaration": true,
+    "sourceMap": true,
+    "resolveJsonModule": true
+  },
+  "include": ["src/**/*"],
+  "exclude": ["node_modules", "dist", "app-ui"]
+}
+EOF
+```
+
+### Step 3: .env.example
+
+```bash
+cat > .env.example << 'EOF'
+# {Service Name} MCP Server Configuration
+{SERVICE}_API_KEY=your_api_key_here
+# {SERVICE}_API_SECRET=your_secret_here        # If OAuth2
+# {SERVICE}_BASE_URL=https://api.example.com   # Override for sandbox
+# {SERVICE}_ACCOUNT_ID=your_account_id         # If multi-tenant
+
+# Transport (optional — default: stdio)
+# MCP_TRANSPORT=http
+# MCP_HTTP_PORT=3000
+EOF
+```
+
+### Step 4: .gitignore
+
+```bash
+cat > .gitignore << 'EOF'
+node_modules/
+dist/
+.env
+*.log
+EOF
+```
+
+---
+
+## 4. Core Files — Templates
+
+### 4.1 `src/types.ts` — Shared Types
+
+```typescript
+// Types derived from the API analysis document
+
+export interface PaginationParams {
+  page?: number;
+  pageSize?: number;
+}
+
+export interface PaginatedResponse<T> {
+  data: T[];
+  meta: {
+    total: number;
+    page: number;
+    pageSize: number;
+    hasMore: boolean;
+  };
+}
+
+export interface ToolGroup {
+  name: string;
+  tools: ToolDefinition[];
+  handlers: Record<string, ToolHandler>;
+  loaded: boolean;
+}
+
+export interface ToolDefinition {
+  name: string;
+  title: string;
+  description: string;
+  inputSchema: {
+    type: "object";
+    properties: Record<string, unknown>;
+    required?: string[];
+  };
+  outputSchema?: Record<string, unknown>;
+  annotations?: {
+    readOnlyHint?: boolean;
+    destructiveHint?: boolean;
+    idempotentHint?: boolean;
+    openWorldHint?: boolean;
+  };
+  icons?: Array<{ src: string; mimeType: string }>;
+}
+
+export type ToolHandler = (args: Record<string, unknown>) => Promise<{
+  content: Array<{ type: string; text: string } | { type: "resource_link"; uri: string; name: string; mimeType?: string }>;
+  structuredContent?: unknown;
+  isError?: boolean;
+}>;
+```
+
+### 4.2 `src/logger.ts` — Structured Logging
+
+```typescript
+// Structured JSON logger — all output to stderr (stdout reserved for MCP protocol)
+// Logs: tool invocations, API calls, errors, with request IDs and timing
+
+import { randomUUID } from "crypto";
+
+type LogLevel = "debug" | "info" | "warn" | "error";
+
+interface LogEntry {
+  ts: string;
+  level: LogLevel;
+  event: string;
+  requestId?: string;
+  durationMs?: number;
+  [key: string]: unknown;
+}
+
+class Logger {
+  private serverName: string;
+
+  constructor(serverName: string) {
+    this.serverName = serverName;
+  }
+
+  private write(level: LogLevel, event: string, data: Record<string, unknown> = {}): void {
+    const entry: LogEntry = {
+      ts: new Date().toISOString(),
+      level,
+      event,
+      server: this.serverName,
+      ...data,
+    };
+    console.error(JSON.stringify(entry));
+  }
+
+  debug(event: string, data?: Record<string, unknown>): void {
+    this.write("debug", event, data);
+  }
+
+  info(event: string, data?: Record<string, unknown>): void {
+    this.write("info", event, data);
+  }
+
+  warn(event: string, data?: Record<string, unknown>): void {
+    this.write("warn", event, data);
+  }
+
+  error(event: string, data?: Record<string, unknown>): void {
+    this.write("error", event, data);
+  }
+
+  // Generate a request ID for tracing
+  requestId(): string {
+    return randomUUID().slice(0, 8);
+  }
+
+  // Time an async operation
+  async time<T>(event: string, fn: () => Promise<T>, data?: Record<string, unknown>): Promise<T> {
+    const requestId = this.requestId();
+    const start = performance.now();
+    this.info(`${event}.start`, { requestId, ...data });
+    try {
+      const result = await fn();
+      const durationMs = Math.round(performance.now() - start);
+      this.info(`${event}.done`, { requestId, durationMs, ...data });
+      return result;
+    } catch (error) {
+      const durationMs = Math.round(performance.now() - start);
+      this.error(`${event}.error`, {
+        requestId,
+        durationMs,
+        error: error instanceof Error ? error.message : String(error),
+        stack: error instanceof Error ? error.stack : undefined,
+        ...data,
+      });
+      throw error;
+    }
+  }
+}
+
+export const logger = new Logger("{service}");
+```
+
+### 4.3 `src/client.ts` — API Client with Timeouts, Circuit Breaker, and Pluggable Pagination
+
+```typescript
+// API Client for {Service}
+// Handles auth, request timeouts, circuit breaker, retry, rate limiting, and pagination
+
+import { logger } from "./logger.js";
+
+const DEFAULT_BASE_URL = "https://api.example.com";
+const MAX_RETRIES = 3;
+const RETRY_BASE_DELAY = 1000; // ms
+const DEFAULT_TIMEOUT_MS = 30_000; // 30 seconds
+
+// ============================================
+// CIRCUIT BREAKER
+// ============================================
+type CircuitState = "closed" | "open" | "half-open";
+
+class CircuitBreaker {
+  private state: CircuitState = "closed";
+  private failureCount = 0;
+  private lastFailureTime = 0;
+  private halfOpenLock = false; // Mutex: only ONE request passes in half-open
+  private readonly failureThreshold: number;
+  private readonly resetTimeoutMs: number;
+
+  constructor(failureThreshold = 5, resetTimeoutMs = 60_000) {
+    this.failureThreshold = failureThreshold;
+    this.resetTimeoutMs = resetTimeoutMs;
+  }
+
+  canExecute(): boolean {
+    if (this.state === "closed") return true;
+    if (this.state === "open") {
+      if (Date.now() - this.lastFailureTime >= this.resetTimeoutMs) {
+        // Only allow ONE request through in half-open
+        if (!this.halfOpenLock) {
+          this.halfOpenLock = true;
+          this.state = "half-open";
+          logger.info("circuit_breaker.half_open");
+          return true;
+        }
+        return false; // Another request already testing
+      }
+      return false;
+    }
+    // half-open: already locked, reject additional requests
+    return false;
+  }
+
+  recordSuccess(): void {
+    this.halfOpenLock = false;
+    if (this.state !== "closed") {
+      logger.info("circuit_breaker.closed", { previousFailures: this.failureCount });
+    }
+    this.failureCount = 0;
+    this.state = "closed";
+  }
+
+  recordFailure(): void {
+    this.halfOpenLock = false;
+    this.failureCount++;
+    this.lastFailureTime = Date.now();
+    if (this.failureCount >= this.failureThreshold || this.state === "half-open") {
+      this.state = "open";
+      logger.warn("circuit_breaker.open", {
+        failureCount: this.failureCount,
+        resetAfterMs: this.resetTimeoutMs,
+      });
+    }
+  }
+
+  getState(): CircuitState {
+    return this.state;
+  }
+}
+
+// ============================================
+// PAGINATION STRATEGIES
+// ============================================
+// Pluggable pagination — each tool specifies which strategy its endpoint uses
+
+export type PaginationStrategy =
+  | { type: "offset"; pageParam?: string; pageSizeParam?: string }
+  | { type: "cursor"; cursorParam?: string; cursorPath?: string }
+  | { type: "keyset"; afterParam?: string; afterField?: string }
+  | { type: "link-header" }
+  | { type: "next-url"; nextUrlPath?: string };
+
+// ============================================
+// API CLIENT
+// ============================================
+export class APIClient {
+  private apiKey: string;
+  private baseUrl: string;
+  private rateLimitRemaining: number = Infinity;
+  private rateLimitReset: number = 0;
+  private circuitBreaker: CircuitBreaker;
+  private timeoutMs: number;
+
+  constructor(apiKey: string, baseUrl?: string, timeoutMs?: number) {
+    this.apiKey = apiKey;
+    this.baseUrl = baseUrl || DEFAULT_BASE_URL;
+    this.timeoutMs = timeoutMs || DEFAULT_TIMEOUT_MS;
+    this.circuitBreaker = new CircuitBreaker();
+  }
+
+  // === Core request with timeout + circuit breaker + retry + rate limit ===
+  async request<T = unknown>(
+    endpoint: string,
+    options: RequestInit = {}
+  ): Promise<T> {
+    // Circuit breaker check
+    if (!this.circuitBreaker.canExecute()) {
+      throw new Error(
+        `Circuit breaker is open — API is unavailable. Retry after ${Math.ceil(60)} seconds.`
+      );
+    }
+
+    // Wait if rate limited
+    await this.waitForRateLimit();
+
+    let lastError: Error | null = null;
+
+    for (let attempt = 0; attempt < MAX_RETRIES; attempt++) {
+      try {
+        const url = `${this.baseUrl}${endpoint}`;
+
+        // AbortController for request timeout
+        const controller = new AbortController();
+        const timeoutId = setTimeout(() => controller.abort(), this.timeoutMs);
+
+        const requestId = logger.requestId();
+        const start = performance.now();
+
+        logger.debug("api_request.start", {
+          requestId,
+          method: options.method || "GET",
+          endpoint,
+          attempt: attempt + 1,
+        });
+
+        try {
+          const response = await fetch(url, {
+            ...options,
+            signal: controller.signal,
+            headers: {
+              "Authorization": `Bearer ${this.apiKey}`,
+              "Content-Type": "application/json",
+              "Accept": "application/json",
+              ...options.headers,
+            },
+          });
+
+          const durationMs = Math.round(performance.now() - start);
+
+          // Track rate limit headers
+          this.updateRateLimits(response);
+
+          // Handle rate limit response
+          if (response.status === 429) {
+            const retryAfter = parseInt(
+              response.headers.get("Retry-After") || "5",
+              10
+            );
+            logger.warn("api_request.rate_limited", { requestId, retryAfter, endpoint });
+            await this.delay(retryAfter * 1000);
+            continue;
+          }
+
+          // Handle server errors (retry)
+          if (response.status >= 500) {
+            this.circuitBreaker.recordFailure();
+            lastError = new Error(
+              `Server error: ${response.status} ${response.statusText}`
+            );
+            logger.warn("api_request.server_error", {
+              requestId, durationMs, status: response.status, endpoint, attempt: attempt + 1,
+            });
+            const baseDelay = RETRY_BASE_DELAY * Math.pow(2, attempt);
+            const jitter = Math.random() * baseDelay * 0.5; // 0-50% random jitter
+            await this.delay(baseDelay + jitter);
+            continue;
+          }
+
+          // Handle client errors (don't retry)
+          if (!response.ok) {
+            const errorBody = await response.text();
+            logger.error("api_request.client_error", {
+              requestId, durationMs, status: response.status, endpoint, body: errorBody.slice(0, 500),
+            });
+            throw new Error(
+              `API error ${response.status}: ${response.statusText} — ${errorBody}`
+            );
+          }
+
+          // Success — record with circuit breaker
+          this.circuitBreaker.recordSuccess();
+
+          logger.debug("api_request.done", {
+            requestId, durationMs, status: response.status, endpoint,
+          });
+
+          // Handle empty responses (204 No Content)
+          if (response.status === 204) {
+            return { success: true } as T;
+          }
+
+          return (await response.json()) as T;
+        } finally {
+          clearTimeout(timeoutId);
+        }
+      } catch (error) {
+        if (error instanceof Error && error.name === "AbortError") {
+          this.circuitBreaker.recordFailure();
+          lastError = new Error(`Request timeout after ${this.timeoutMs}ms: ${endpoint}`);
+          logger.error("api_request.timeout", { endpoint, timeoutMs: this.timeoutMs });
+          continue;
+        }
+        if (error instanceof Error && !error.message.startsWith("Server error")) {
+          throw error; // Don't retry client errors
+        }
+        lastError = error instanceof Error ? error : new Error(String(error));
+      }
+    }
+
+    throw lastError || new Error("Request failed after retries");
+  }
+
+  // === Convenience methods ===
+  async get<T = unknown>(endpoint: string): Promise<T> {
+    return this.request<T>(endpoint, { method: "GET" });
+  }
+
+  async post<T = unknown>(endpoint: string, data: unknown): Promise<T> {
+    return this.request<T>(endpoint, {
+      method: "POST",
+      body: JSON.stringify(data),
+    });
+  }
+
+  async put<T = unknown>(endpoint: string, data: unknown): Promise<T> {
+    return this.request<T>(endpoint, {
+      method: "PUT",
+      body: JSON.stringify(data),
+    });
+  }
+
+  async patch<T = unknown>(endpoint: string, data: unknown): Promise<T> {
+    return this.request<T>(endpoint, {
+      method: "PATCH",
+      body: JSON.stringify(data),
+    });
+  }
+
+  async delete<T = unknown>(endpoint: string): Promise<T> {
+    return this.request<T>(endpoint, { method: "DELETE" });
+  }
+
+  // === Pluggable pagination ===
+  async paginate<T>(
+    endpoint: string,
+    params: {
+      page?: number;
+      pageSize?: number;
+      extraParams?: Record<string, string>;
+      strategy?: PaginationStrategy;
+    } = {}
+  ): Promise<{ data: T[]; meta: { total: number; page: number; pageSize: number; hasMore: boolean } }> {
+    const { page = 1, pageSize = 25, extraParams = {}, strategy } = params;
+    const paginationStrategy = strategy || { type: "offset" as const };
+
+    switch (paginationStrategy.type) {
+      // === Offset/page-number pagination (most common) ===
+      case "offset": {
+        const pageParam = paginationStrategy.pageParam || "page";
+        const sizeParam = paginationStrategy.pageSizeParam || "pageSize";
+        const queryParams = new URLSearchParams({
+          [pageParam]: String(page),
+          [sizeParam]: String(Math.min(pageSize, 100)),
+          ...extraParams,
+        });
+        const result = await this.get<any>(`${endpoint}?${queryParams}`);
+        const data = Array.isArray(result) ? result : result.data || result.items || result.results || [];
+        const total = result.meta?.total || result.total || result.totalCount || data.length;
+        return { data, meta: { total, page, pageSize, hasMore: page * pageSize < total } };
+      }
+
+      // === Cursor-based pagination (Slack, Facebook, etc.) ===
+      case "cursor": {
+        const cursorParam = paginationStrategy.cursorParam || "cursor";
+        const cursorPath = paginationStrategy.cursorPath || "meta.nextCursor";
+        const queryParams = new URLSearchParams({
+          limit: String(Math.min(pageSize, 100)),
+          ...extraParams,
+        });
+        // If page > 1, caller must supply cursor via extraParams
+        const result = await this.get<any>(`${endpoint}?${queryParams}`);
+        const data = Array.isArray(result) ? result : result.data || result.items || result.results || [];
+        const nextCursor = this.getNestedValue(result, cursorPath);
+        const total = result.meta?.total || result.total || data.length;
+        return {
+          data,
+          meta: { total, page, pageSize, hasMore: !!nextCursor },
+        };
+      }
+
+      // === Keyset pagination (Stripe-style: starting_after=obj_xxx) ===
+      case "keyset": {
+        const afterParam = paginationStrategy.afterParam || "starting_after";
+        const queryParams = new URLSearchParams({
+          limit: String(Math.min(pageSize, 100)),
+          ...extraParams,
+        });
+        const result = await this.get<any>(`${endpoint}?${queryParams}`);
+        const data = Array.isArray(result) ? result : result.data || result.items || [];
+        const hasMore = result.has_more ?? result.hasMore ?? data.length >= pageSize;
+        return {
+          data,
+          meta: { total: -1, page, pageSize, hasMore },
+        };
+      }
+
+      // === Link-header pagination (GitHub-style) ===
+      case "link-header": {
+        const queryParams = new URLSearchParams({
+          per_page: String(Math.min(pageSize, 100)),
+          page: String(page),
+          ...extraParams,
+        });
+        const url = `${this.baseUrl}${endpoint}?${queryParams}`;
+        const controller = new AbortController();
+        const timeoutId = setTimeout(() => controller.abort(), this.timeoutMs);
+        try {
+          const response = await fetch(url, {
+            signal: controller.signal,
+            headers: {
+              "Authorization": `Bearer ${this.apiKey}`,
+              "Accept": "application/json",
+            },
+          });
+          this.updateRateLimits(response);
+          const data = await response.json() as T[];
+          const linkHeader = response.headers.get("Link") || "";
+          const hasMore = linkHeader.includes('rel="next"');
+          return {
+            data: Array.isArray(data) ? data : [],
+            meta: { total: -1, page, pageSize, hasMore },
+          };
+        } finally {
+          clearTimeout(timeoutId);
+        }
+      }
+
+      // === Next-URL pagination (API returns full URL for next page) ===
+      case "next-url": {
+        const nextUrlPath = paginationStrategy.nextUrlPath || "next";
+        const queryParams = new URLSearchParams({
+          limit: String(Math.min(pageSize, 100)),
+          ...extraParams,
+        });
+        const result = await this.get<any>(`${endpoint}?${queryParams}`);
+        const data = Array.isArray(result) ? result : result.data || result.items || result.results || [];
+        const nextUrl = this.getNestedValue(result, nextUrlPath);
+        const total = result.count || result.total || data.length;
+        return {
+          data,
+          meta: { total, page, pageSize, hasMore: !!nextUrl },
+        };
+      }
+
+      default:
+        throw new Error(`Unknown pagination strategy: ${(paginationStrategy as any).type}`);
+    }
+  }
+
+  // Helper: access nested object values by dot path
+  private getNestedValue(obj: any, path: string): any {
+    return path.split(".").reduce((o, k) => o?.[k], obj);
+  }
+
+  // === Health check: validate connectivity + auth ===
+  async healthCheck(): Promise<{ reachable: boolean; authenticated: boolean; latencyMs: number; error?: string }> {
+    const start = performance.now();
+    try {
+      const controller = new AbortController();
+      const timeoutId = setTimeout(() => controller.abort(), 10_000);
+      try {
+        const response = await fetch(this.baseUrl, {
+          signal: controller.signal,
+          headers: {
+            "Authorization": `Bearer ${this.apiKey}`,
+            "Accept": "application/json",
+          },
+        });
+        const latencyMs = Math.round(performance.now() - start);
+        return {
+          reachable: true,
+          authenticated: response.status !== 401 && response.status !== 403,
+          latencyMs,
+          ...(response.status >= 400 ? { error: `Status ${response.status}` } : {}),
+        };
+      } finally {
+        clearTimeout(timeoutId);
+      }
+    } catch (error) {
+      return {
+        reachable: false,
+        authenticated: false,
+        latencyMs: Math.round(performance.now() - start),
+        error: error instanceof Error ? error.message : String(error),
+      };
+    }
+  }
+
+  // === Rate limit helpers ===
+  private updateRateLimits(response: Response): void {
+    const remaining = response.headers.get("X-RateLimit-Remaining");
+    const reset = response.headers.get("X-RateLimit-Reset");
+
+    if (remaining) this.rateLimitRemaining = parseInt(remaining, 10);
+    if (reset) this.rateLimitReset = parseInt(reset, 10) * 1000;
+  }
+
+  private async waitForRateLimit(): Promise<void> {
+    if (this.rateLimitRemaining <= 1 && this.rateLimitReset > Date.now()) {
+      const waitMs = this.rateLimitReset - Date.now() + 100;
+      logger.warn("rate_limit.waiting", { waitMs: Math.min(waitMs, 30000) });
+      await this.delay(Math.min(waitMs, 30000));
+    }
+  }
+
+  private delay(ms: number): Promise<void> {
+    return new Promise((resolve) => setTimeout(resolve, ms));
+  }
+}
+```
+
+### 4.4 `src/tools/index.ts` — Tool Registry with Lazy Loading
+
+```typescript
+import { z } from "zod";
+import type { APIClient } from "../client.js";
+import type { ToolDefinition, ToolHandler, ToolGroup } from "../types.js";
+
+// Import tool group loaders (lazy — they return definitions + handlers)
+// Each group file exports: getTools(client) => { tools, handlers }
+
+export class ToolRegistry {
+  private groups: Map<string, ToolGroup> = new Map();
+  private toolToGroup: Map<string, string> = new Map();
+  private client: APIClient;
+
+  // Group loader functions — add one per tool group from the analysis
+  private groupLoaders: Record<
+    string,
+    () => Promise<{ tools: ToolDefinition[]; handlers: Record<string, ToolHandler> }>
+  > = {};
+
+  constructor(client: APIClient) {
+    this.client = client;
+    this.registerGroupLoaders();
+  }
+
+  private registerGroupLoaders(): void {
+    // Register lazy loaders for each tool group
+    // These import() calls only execute when the group is first needed
+    this.groupLoaders = {
+      health: async () => {
+        const mod = await import("./health.js");
+        return mod.getTools(this.client);
+      },
+      contacts: async () => {
+        const mod = await import("./contacts.js");
+        return mod.getTools(this.client);
+      },
+      deals: async () => {
+        const mod = await import("./deals.js");
+        return mod.getTools(this.client);
+      },
+      // ... add one per group from analysis doc
+    };
+  }
+
+  // Load a specific group on demand
+  private async loadGroup(groupName: string): Promise<void> {
+    if (this.groups.has(groupName) && this.groups.get(groupName)!.loaded) {
+      return; // Already loaded
+    }
+
+    const loader = this.groupLoaders[groupName];
+    if (!loader) {
+      throw new Error(`Unknown tool group: ${groupName}`);
+    }
+
+    const { tools, handlers } = await loader();
+
+    this.groups.set(groupName, {
+      name: groupName,
+      tools,
+      handlers,
+      loaded: true,
+    });
+
+    // Map tool names to their group for handler lookup
+    for (const tool of tools) {
+      this.toolToGroup.set(tool.name, groupName);
+    }
+  }
+
+  // Load ALL groups (for ListTools — must show all available tools)
+  async loadAllGroups(): Promise<void> {
+    await Promise.all(
+      Object.keys(this.groupLoaders).map((name) => this.loadGroup(name))
+    );
+  }
+
+  // Get all tool definitions (loads all groups if needed)
+  async getAllTools(): Promise<ToolDefinition[]> {
+    await this.loadAllGroups();
+    const allTools: ToolDefinition[] = [];
+    for (const group of this.groups.values()) {
+      allTools.push(...group.tools);
+    }
+    return allTools;
+  }
+
+  // Get handler for a specific tool
+  async getHandler(toolName: string): Promise<ToolHandler> {
+    // Ensure the tool's group is loaded
+    const groupName = this.toolToGroup.get(toolName);
+    if (!groupName) {
+      // Group might not be loaded yet — load all and retry
+      await this.loadAllGroups();
+      const retryGroup = this.toolToGroup.get(toolName);
+      if (!retryGroup) {
+        throw new Error(`Unknown tool: ${toolName}`);
+      }
+      const group = this.groups.get(retryGroup)!;
+      const handler = group.handlers[toolName];
+      if (!handler) throw new Error(`No handler for tool: ${toolName}`);
+      return handler;
+    }
+
+    await this.loadGroup(groupName);
+    const group = this.groups.get(groupName)!;
+    const handler = group.handlers[toolName];
+    if (!handler) throw new Error(`No handler for tool: ${toolName}`);
+    return handler;
+  }
+}
+```
+
+### 4.5 `src/tools/health.ts` — Health Check Tool (Always Included)
+
+```typescript
+// Health check tool — validates environment, API connectivity, and auth
+// Always include this tool in every MCP server
+
+import type { APIClient } from "../client.js";
+import type { ToolDefinition, ToolHandler } from "../types.js";
+import { logger } from "../logger.js";
+
+function getToolDefinitions(): ToolDefinition[] {
+  return [
+    {
+      name: "health_check",
+      title: "Health Check",
+      description:
+        "Validate server health: checks that environment variables are set, the API is reachable, and authentication is valid. Use when diagnosing connection issues or verifying server setup.",
+      inputSchema: {
+        type: "object",
+        properties: {},
+      },
+      outputSchema: {
+        type: "object",
+        properties: {
+          status: { type: "string", enum: ["healthy", "degraded", "unhealthy"] },
+          checks: {
+            type: "object",
+            properties: {
+              envVars: { type: "object", properties: { ok: { type: "boolean" }, missing: { type: "array", items: { type: "string" } } } },
+              apiReachable: { type: "boolean" },
+              authValid: { type: "boolean" },
+              latencyMs: { type: "number" },
+            },
+          },
+          error: { type: "string" },
+        },
+        required: ["status", "checks"],
+      },
+      annotations: {
+        readOnlyHint: true,
+        destructiveHint: false,
+        idempotentHint: true,
+        openWorldHint: false,
+      },
+    },
+  ];
+}
+
+function getToolHandlers(client: APIClient): Record<string, ToolHandler> {
+  return {
+    health_check: async () => {
+      const checks: Record<string, unknown> = {};
+
+      // Check 1: Required environment variables
+      const requiredEnvVars = ["{SERVICE}_API_KEY"];
+      const missing = requiredEnvVars.filter((v) => !process.env[v]);
+      checks.envVars = { ok: missing.length === 0, missing };
+
+      // Check 2: API reachability + auth
+      const healthResult = await client.healthCheck();
+      checks.apiReachable = healthResult.reachable;
+      checks.authValid = healthResult.authenticated;
+      checks.latencyMs = healthResult.latencyMs;
+
+      // Determine overall status
+      let status: "healthy" | "degraded" | "unhealthy";
+      if (missing.length > 0 || !healthResult.reachable) {
+        status = "unhealthy";
+      } else if (!healthResult.authenticated) {
+        status = "degraded";
+      } else {
+        status = "healthy";
+      }
+
+      const result = {
+        status,
+        checks,
+        ...(healthResult.error ? { error: healthResult.error } : {}),
+      };
+
+      logger.info("health_check", { status, checks });
+
+      return {
+        content: [{ type: "text", text: JSON.stringify(result, null, 2) }],
+        structuredContent: result,
+      };
+    },
+  };
+}
+
+export function getTools(client: APIClient) {
+  return {
+    tools: getToolDefinitions(),
+    handlers: getToolHandlers(client),
+  };
+}
+```
+
+### 4.6 `src/tools/{group}.ts` — Tool Group Template
+
+```typescript
+// Tool group: {group_name}
+// Generated from {service}-api-analysis.md
+
+import { z } from "zod";
+import type { APIClient } from "../client.js";
+import type { ToolDefinition, ToolHandler } from "../types.js";
+import { logger } from "../logger.js";
+
+// === Zod Schemas ===
+const ListContactsSchema = z.object({
+  page: z.number().optional().default(1).describe("Page number (default 1)"),
+  pageSize: z.number().optional().default(25).describe("Results per page (default 25, max 100)"),
+  query: z.string().optional().describe("Search by name, email, or phone"),
+  status: z.enum(["active", "inactive", "all"]).optional().describe("Filter by status"),
+});
+
+const GetContactSchema = z.object({
+  contact_id: z.string().describe("Contact ID"),
+});
+
+const CreateContactSchema = z.object({
+  name: z.string().describe("Contact full name"),
+  email: z.string().email().optional().describe("Contact email address"),
+  phone: z.string().optional().describe("Contact phone number"),
+});
+
+const UpdateContactSchema = z.object({
+  contact_id: z.string().describe("Contact ID"),
+  name: z.string().optional().describe("Updated name"),
+  email: z.string().email().optional().describe("Updated email"),
+  phone: z.string().optional().describe("Updated phone"),
+});
+
+const DeleteContactSchema = z.object({
+  contact_id: z.string().describe("Contact ID to delete"),
+});
+
+// === Tool Definitions ===
+// Note: Every tool MUST have: name, title, description, inputSchema, outputSchema, annotations
+// See Section 11 (Token Budget) for description length targets
+function getToolDefinitions(): ToolDefinition[] {
+  return [
+    {
+      name: "list_contacts",
+      title: "List Contacts",
+      description:
+        "List contacts with optional filters and pagination. Returns name, email, phone, and status. Use when the user wants to browse or filter contacts. Do NOT use to search by keyword (use search_contacts) or get one contact's details (use get_contact).",
+      inputSchema: {
+        type: "object",
+        properties: {
+          page: { type: "number", description: "Page number (default 1)" },
+          pageSize: { type: "number", description: "Results per page (default 25, max 100)" },
+          query: { type: "string", description: "Search by name, email, or phone" },
+          status: { type: "string", enum: ["active", "inactive", "all"], description: "Filter by status" },
+        },
+      },
+      outputSchema: {
+        type: "object",
+        properties: {
+          data: {
+            type: "array",
+            items: {
+              type: "object",
+              properties: {
+                id: { type: "string" },
+                name: { type: "string" },
+                email: { type: "string" },
+                phone: { type: "string" },
+                status: { type: "string" },
+              },
+            },
+          },
+          meta: {
+            type: "object",
+            properties: {
+              total: { type: "number" },
+              page: { type: "number" },
+              pageSize: { type: "number" },
+              hasMore: { type: "boolean" },
+            },
+          },
+        },
+        required: ["data", "meta"],
+      },
+      annotations: {
+        readOnlyHint: true,
+        destructiveHint: false,
+        idempotentHint: true,
+        openWorldHint: false,
+      },
+    },
+    {
+      name: "get_contact",
+      title: "Get Contact Details",
+      description:
+        "Get full details for a specific contact by ID. Returns all fields including activity history and tags. Use when the user references a known contact or needs detailed info. Do NOT use to browse multiple contacts (use list_contacts).",
+      inputSchema: {
+        type: "object",
+        properties: {
+          contact_id: { type: "string", description: "Contact ID" },
+        },
+        required: ["contact_id"],
+      },
+      outputSchema: {
+        type: "object",
+        properties: {
+          id: { type: "string" },
+          name: { type: "string" },
+          email: { type: "string" },
+          phone: { type: "string" },
+          status: { type: "string" },
+          tags: { type: "array", items: { type: "string" } },
+          created_at: { type: "string" },
+          updated_at: { type: "string" },
+        },
+        required: ["id", "name"],
+      },
+      annotations: {
+        readOnlyHint: true,
+        destructiveHint: false,
+        idempotentHint: true,
+        openWorldHint: false,
+      },
+    },
+    {
+      name: "create_contact",
+      title: "Create Contact",
+      description:
+        "Create a new contact. Returns the created contact with assigned ID. Use when the user wants to add a new person to the system.",
+      inputSchema: {
+        type: "object",
+        properties: {
+          name: { type: "string", description: "Contact full name" },
+          email: { type: "string", description: "Contact email address" },
+          phone: { type: "string", description: "Contact phone number" },
+        },
+        required: ["name"],
+      },
+      outputSchema: {
+        type: "object",
+        properties: {
+          id: { type: "string" },
+          name: { type: "string" },
+          email: { type: "string" },
+          phone: { type: "string" },
+          status: { type: "string" },
+          created_at: { type: "string" },
+        },
+        required: ["id", "name"],
+      },
+      annotations: {
+        readOnlyHint: false,
+        destructiveHint: false,
+        idempotentHint: false,
+        openWorldHint: false,
+      },
+    },
+    {
+      name: "update_contact",
+      title: "Update Contact",
+      description:
+        "Update an existing contact's fields. Only include fields to change. Use when the user wants to modify contact information.",
+      inputSchema: {
+        type: "object",
+        properties: {
+          contact_id: { type: "string", description: "Contact ID" },
+          name: { type: "string", description: "Updated name" },
+          email: { type: "string", description: "Updated email" },
+          phone: { type: "string", description: "Updated phone" },
+        },
+        required: ["contact_id"],
+      },
+      outputSchema: {
+        type: "object",
+        properties: {
+          id: { type: "string" },
+          name: { type: "string" },
+          email: { type: "string" },
+          phone: { type: "string" },
+          status: { type: "string" },
+          updated_at: { type: "string" },
+        },
+        required: ["id"],
+      },
+      annotations: {
+        readOnlyHint: false,
+        destructiveHint: false,
+        idempotentHint: true,
+        openWorldHint: false,
+      },
+    },
+    {
+      name: "delete_contact",
+      title: "Delete Contact",
+      description:
+        "Permanently delete a contact. Cannot be undone. Use only when the user explicitly asks to delete a contact.",
+      inputSchema: {
+        type: "object",
+        properties: {
+          contact_id: { type: "string", description: "Contact ID to delete" },
+        },
+        required: ["contact_id"],
+      },
+      outputSchema: {
+        type: "object",
+        properties: {
+          success: { type: "boolean" },
+          deleted_id: { type: "string" },
+        },
+        required: ["success"],
+      },
+      annotations: {
+        readOnlyHint: false,
+        destructiveHint: true,
+        idempotentHint: true,
+        openWorldHint: false,
+      },
+    },
+  ];
+}
+
+// === Tool Handlers ===
+// Every handler returns BOTH content (text fallback) AND structuredContent (typed JSON)
+function getToolHandlers(client: APIClient): Record<string, ToolHandler> {
+  return {
+    list_contacts: async (args) => {
+      const params = ListContactsSchema.parse(args);
+      const result = await logger.time("tool.list_contacts", () =>
+        client.paginate("/contacts", {
+          page: params.page,
+          pageSize: params.pageSize,
+          extraParams: {
+            ...(params.query ? { query: params.query } : {}),
+            ...(params.status ? { status: params.status } : {}),
+          },
+        })
+      , { tool: "list_contacts" });
+
+      return {
+        content: [{
+          type: "text",
+          text: JSON.stringify(result, null, 2),
+          annotations: { audience: ["user", "assistant"], priority: 0.7 },
+        }],
+        structuredContent: result,
+      };
+    },
+
+    get_contact: async (args) => {
+      const { contact_id } = GetContactSchema.parse(args);
+      const result = await logger.time("tool.get_contact", () =>
+        client.get(`/contacts/${contact_id}`)
+      , { tool: "get_contact", contact_id });
+
+      return {
+        content: [
+          {
+            type: "text",
+            text: JSON.stringify(result, null, 2),
+            annotations: { audience: ["user"], priority: 0.8 },
+          },
+          // resource_link — allows clients to subscribe to updates for this contact
+          {
+            type: "resource_link" as const,
+            uri: `{service}://contacts/${contact_id}`,
+            name: `Contact ${contact_id}`,
+            mimeType: "application/json",
+          },
+        ],
+        structuredContent: result,
+      };
+    },
+
+    create_contact: async (args) => {
+      const data = CreateContactSchema.parse(args);
+      const result = await logger.time("tool.create_contact", () =>
+        client.post("/contacts", data)
+      , { tool: "create_contact" });
+
+      return {
+        content: [{
+          type: "text",
+          text: JSON.stringify(result, null, 2),
+          annotations: { audience: ["user"], priority: 0.9 },
+        }],
+        structuredContent: result,
+      };
+    },
+
+    update_contact: async (args) => {
+      const { contact_id, ...updateData } = UpdateContactSchema.parse(args);
+      const result = await logger.time("tool.update_contact", () =>
+        client.patch(`/contacts/${contact_id}`, updateData)
+      , { tool: "update_contact", contact_id });
+
+      return {
+        content: [{
+          type: "text",
+          text: JSON.stringify(result, null, 2),
+          annotations: { audience: ["user"], priority: 0.9 },
+        }],
+        structuredContent: result,
+      };
+    },
+
+    delete_contact: async (args) => {
+      const { contact_id } = DeleteContactSchema.parse(args);
+      await logger.time("tool.delete_contact", () =>
+        client.delete(`/contacts/${contact_id}`)
+      , { tool: "delete_contact", contact_id });
+
+      const result = { success: true, deleted_id: contact_id };
+      return {
+        content: [{
+          type: "text",
+          text: JSON.stringify(result, null, 2),
+          annotations: { audience: ["user"], priority: 1.0 },
+        }],
+        structuredContent: result,
+      };
+    },
+  };
+}
+
+// === Export: getTools(client) ===
+export function getTools(client: APIClient) {
+  return {
+    tools: getToolDefinitions(),
+    handlers: getToolHandlers(client),
+  };
+}
+```
+
+### 4.7 `src/index.ts` — Server Entry Point (Stdio + Streamable HTTP)
+
+```typescript
+#!/usr/bin/env node
+import { Server } from "@modelcontextprotocol/sdk/server/index.js";
+import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
+import {
+  CallToolRequestSchema,
+  ListToolsRequestSchema,
+} from "@modelcontextprotocol/sdk/types.js";
+import { z } from "zod";
+import { APIClient } from "./client.js";
+import { ToolRegistry } from "./tools/index.js";
+import { logger } from "./logger.js";
+
+// ============================================
+// CONFIGURATION
+// ============================================
+const MCP_NAME = "{service}";
+const MCP_VERSION = "1.0.0";
+
+// ============================================
+// SERVER SETUP
+// ============================================
+async function main() {
+  // Validate environment variables
+  const apiKey = process.env["{SERVICE}_API_KEY"];
+  if (!apiKey) {
+    logger.error("startup.missing_env", { variable: "{SERVICE}_API_KEY" });
+    console.error("Error: {SERVICE}_API_KEY environment variable required");
+    console.error("Copy .env.example to .env and fill in your credentials");
+    process.exit(1);
+  }
+
+  const baseUrl = process.env["{SERVICE}_BASE_URL"];
+
+  // Initialize client and tool registry
+  const client = new APIClient(apiKey, baseUrl);
+  const registry = new ToolRegistry(client);
+
+  // Create MCP server — only declare capabilities that are actually implemented
+  const server = new Server(
+    { name: `${MCP_NAME}-mcp`, version: MCP_VERSION },
+    {
+      capabilities: {
+        tools: { listChanged: false },
+        logging: {},
+        // Enable these ONLY when the server actually implements them:
+        // resources: { subscribe: false, listChanged: false },
+        // prompts: { listChanged: false },
+      },
+    }
+  );
+
+  // List all available tools
+  server.setRequestHandler(ListToolsRequestSchema, async () => {
+    const tools = await registry.getAllTools();
+    logger.info("tools.list", { count: tools.length });
+    return { tools };
+  });
+
+  // Handle tool execution
+  server.setRequestHandler(CallToolRequestSchema, async (request) => {
+    const { name, arguments: args } = request.params;
+    const requestId = logger.requestId();
+
+    logger.info("tool.call.start", { requestId, tool: name, args });
+    const start = performance.now();
+
+    try {
+      const handler = await registry.getHandler(name);
+      const result = await handler(args || {});
+
+      const durationMs = Math.round(performance.now() - start);
+      logger.info("tool.call.done", { requestId, tool: name, durationMs, isError: false });
+
+      return result;
+    } catch (error) {
+      const durationMs = Math.round(performance.now() - start);
+
+      // === Error Classification ===
+      // Protocol Errors: JSON-RPC codes for structural issues (unknown tool, malformed request)
+      // Tool Execution Errors: isError=true for API/validation/business failures
+      //   → Input validation errors are Tool Execution Errors (enables LLM self-correction)
+
+      let message: string;
+      if (error instanceof z.ZodError) {
+        // Input validation error → Tool Execution Error (NOT protocol error)
+        // Returning this as isError lets the LLM self-correct the input
+        message = `Validation error: ${error.errors.map(e => `${e.path.join(".")}: ${e.message}`).join(", ")}`;
+        logger.warn("tool.call.validation_error", {
+          requestId, tool: name, durationMs, errors: error.errors,
+        });
+      } else if (error instanceof Error) {
+        message = error.message;
+        logger.error("tool.call.error", {
+          requestId, tool: name, durationMs, error: message, stack: error.stack,
+        });
+      } else {
+        message = String(error);
+        logger.error("tool.call.error", { requestId, tool: name, durationMs, error: message });
+      }
+
+      return {
+        content: [{ type: "text", text: `Error: ${message}` }],
+        structuredContent: { error: message, tool: name },
+        isError: true,
+      };
+    }
+  });
+
+  // === Transport Selection ===
+  // stdio: For local use (Claude Desktop, Cursor, direct subprocess spawning)
+  // Streamable HTTP: For remote/production deployment (network-accessible server)
+  const transportMode = process.env.MCP_TRANSPORT || "stdio";
+
+  if (transportMode === "http") {
+    await startHttpTransport(server);
+  } else {
+    await startStdioTransport(server);
+  }
+}
+
+// === Stdio Transport (default — local use) ===
+async function startStdioTransport(server: Server) {
+  const transport = new StdioServerTransport();
+  await server.connect(transport);
+  logger.info("server.started", { transport: "stdio", name: MCP_NAME });
+}
+
+// === Streamable HTTP Transport (remote/production deployment) ===
+// Use when: deploying as a network service, multi-client access, load balancing
+// Requires: MCP_TRANSPORT=http, optional MCP_HTTP_PORT (default 3000)
+async function startHttpTransport(server: Server) {
+  // Dynamic import — only load HTTP transport when needed
+  const { StreamableHTTPServerTransport } = await import(
+    "@modelcontextprotocol/sdk/server/streamableHttp.js"
+  );
+  const { createServer } = await import("http");
+
+  const port = parseInt(process.env.MCP_HTTP_PORT || "3000", 10);
+
+  // Session management with TTL, max sessions, and cleanup
+  const sessions = new Map<string, { transport: StreamableHTTPServerTransport; lastActivity: number }>();
+  const MAX_SESSIONS = 100;
+  const SESSION_TTL_MS = 30 * 60 * 1000; // 30 minutes
+
+  // Session cleanup interval — evict expired sessions every 60s
+  const cleanupInterval = setInterval(() => {
+    const now = Date.now();
+    for (const [id, session] of sessions.entries()) {
+      if (now - session.lastActivity > SESSION_TTL_MS) {
+        logger.info("session.expired", { sessionId: id });
+        sessions.delete(id);
+      }
+    }
+  }, 60_000);
+
+  // Evict oldest session if at capacity
+  function evictOldestSession(): void {
+    let oldest: string | null = null;
+    let oldestTime = Infinity;
+    for (const [id, s] of sessions.entries()) {
+      if (s.lastActivity < oldestTime) {
+        oldestTime = s.lastActivity;
+        oldest = id;
+      }
+    }
+    if (oldest) {
+      logger.info("session.evicted", { sessionId: oldest });
+      sessions.delete(oldest);
+    }
+  }
+
+  const httpServer = createServer(async (req, res) => {
+    const url = new URL(req.url || "/", `http://localhost:${port}`);
+
+    // Health endpoint (non-MCP)
+    if (url.pathname === "/health") {
+      res.writeHead(200, { "Content-Type": "application/json" });
+      res.end(JSON.stringify({ status: "ok", server: MCP_NAME, activeSessions: sessions.size }));
+      return;
+    }
+
+    // MCP endpoint
+    if (url.pathname === "/mcp") {
+      const sessionId = req.headers["mcp-session-id"] as string | undefined;
+
+      if (req.method === "POST") {
+        // New or existing session
+        let transport: StreamableHTTPServerTransport;
+
+        if (sessionId && sessions.has(sessionId)) {
+          const session = sessions.get(sessionId)!;
+          session.lastActivity = Date.now();
+          transport = session.transport;
+        } else {
+          // Enforce max sessions — evict oldest if at capacity
+          if (sessions.size >= MAX_SESSIONS) {
+            evictOldestSession();
+          }
+
+          transport = new StreamableHTTPServerTransport({
+            sessionIdGenerator: () => crypto.randomUUID(),
+          });
+          await server.connect(transport);
+          // Store session after connection
+          const newSessionId = transport.sessionId;
+          if (newSessionId) {
+            sessions.set(newSessionId, { transport, lastActivity: Date.now() });
+          }
+        }
+
+        await transport.handleRequest(req, res);
+        return;
+      }
+
+      if (req.method === "GET") {
+        // SSE stream for server-initiated messages
+        if (sessionId && sessions.has(sessionId)) {
+          const session = sessions.get(sessionId)!;
+          session.lastActivity = Date.now();
+          await session.transport.handleRequest(req, res);
+          return;
+        }
+        res.writeHead(400, { "Content-Type": "application/json" });
+        res.end(JSON.stringify({ error: "No session. Send POST first." }));
+        return;
+      }
+
+      if (req.method === "DELETE") {
+        // Session cleanup
+        if (sessionId && sessions.has(sessionId)) {
+          const session = sessions.get(sessionId)!;
+          await session.transport.handleRequest(req, res);
+          sessions.delete(sessionId);
+          return;
+        }
+      }
+    }
+
+    res.writeHead(404);
+    res.end();
+  });
+
+  // Clean up on server shutdown
+  process.on("SIGTERM", () => {
+    clearInterval(cleanupInterval);
+    sessions.clear();
+  });
+
+  httpServer.listen(port, () => {
+    logger.info("server.started", { transport: "http", name: MCP_NAME, port, endpoint: `/mcp` });
+  });
+}
+
+main().catch((error) => {
+  logger.error("server.fatal", { error: error instanceof Error ? error.message : String(error) });
+  process.exit(1);
+});
+```
+
+---
+
+## 5. Auth Patterns
+
+Choose the pattern from the analysis doc and use the corresponding client code:
+
+### Pattern A: API Key (most common)
+```typescript
+headers: {
+  "Authorization": `Bearer ${this.apiKey}`,
+  // OR: "X-API-Key": this.apiKey,
+  // OR: "Api-Key": this.apiKey,
+}
+```
+
+### Pattern B: OAuth2 Client Credentials
+```typescript
+export class APIClient {
+  private clientId: string;
+  private clientSecret: string;
+  private accessToken: string | null = null;
+  private tokenExpiry: number = 0;
+  private refreshPromise: Promise<string> | null = null; // Mutex: share one refresh across concurrent callers
+
+  constructor(clientId: string, clientSecret: string) {
+    this.clientId = clientId;
+    this.clientSecret = clientSecret;
+  }
+
+  private async getAccessToken(): Promise<string> {
+    // Return cached token if valid (5 min buffer)
+    if (this.accessToken && Date.now() < this.tokenExpiry - 300_000) {
+      return this.accessToken;
+    }
+
+    // If already refreshing, wait for that to complete (prevents thundering herd)
+    if (this.refreshPromise) {
+      return this.refreshPromise;
+    }
+
+    // Start a new refresh and let all concurrent callers share it
+    this.refreshPromise = this._doRefresh();
+    try {
+      const token = await this.refreshPromise;
+      return token;
+    } finally {
+      this.refreshPromise = null;
+    }
+  }
+
+  private async _doRefresh(): Promise<string> {
+    const controller = new AbortController();
+    const timeoutId = setTimeout(() => controller.abort(), 30_000);
+
+    try {
+      const response = await fetch("https://auth.example.com/oauth/token", {
+        method: "POST",
+        signal: controller.signal,
+        headers: { "Content-Type": "application/x-www-form-urlencoded" },
+        body: new URLSearchParams({
+          grant_type: "client_credentials",
+          client_id: this.clientId,
+          client_secret: this.clientSecret,
+        }),
+      });
+
+      if (!response.ok) {
+        throw new Error(`Auth failed: ${response.status} ${response.statusText}`);
+      }
+
+      const data = await response.json();
+      this.accessToken = data.access_token;
+      this.tokenExpiry = Date.now() + data.expires_in * 1000;
+      return this.accessToken!;
+    } finally {
+      clearTimeout(timeoutId);
+    }
+  }
+
+  async request<T>(endpoint: string, options: RequestInit = {}): Promise<T> {
+    const token = await this.getAccessToken();
+    // ... use token in Authorization header, with AbortController timeout
+  }
+}
+```
+
+### Pattern C: Basic Auth
+```typescript
+headers: {
+  "Authorization": `Basic ${Buffer.from(`${this.username}:${this.password}`).toString("base64")}`,
+}
+```
+
+### Pattern D: API Key + Account ID (multi-tenant)
+```typescript
+headers: {
+  "Authorization": `Bearer ${this.apiKey}`,
+  "X-Account-ID": this.accountId,
+}
+```
+
+---
+
+## 6. MCP Annotations (Feb 2026 Standard)
+
+**EVERY tool MUST have annotations.** The annotations object goes on each tool definition:
+
+```typescript
+{
+  name: "tool_name",
+  title: "Tool Display Name",
+  description: "...",
+  inputSchema: { ... },
+  outputSchema: { ... },
+  annotations: {
+    readOnlyHint: boolean,      // true if tool only reads data (GET)
+    destructiveHint: boolean,   // true if tool deletes data (DELETE)
+    idempotentHint: boolean,    // true if repeated calls have same effect (GET, PUT, DELETE)
+    openWorldHint: boolean,     // true if affects systems outside this API (rare)
+  }
+}
+```
+
+### Decision matrix:
+
+| Operation | readOnly | destructive | idempotent | openWorld |
+|-----------|----------|-------------|------------|-----------|
+| GET / list / search | `true` | `false` | `true` | `false` |
+| POST / create | `false` | `false` | `false` | `false` |
+| PUT / update / upsert | `false` | `false` | `true` | `false` |
+| PATCH / partial update | `false` | `false` | `true` | `false` |
+| DELETE | `false` | `true` | `true` | `false` |
+| Send email / SMS | `false` | `false` | `false` | `true` |
+| Trigger webhook | `false` | `false` | `false` | `true` |
+
+---
+
+## 7. Tool Definition Standards (2025-11-25 Spec)
+
+Every tool definition MUST include these fields:
+
+```typescript
+{
+  // REQUIRED
+  name: "list_contacts",                    // machine name, snake_case
+  title: "List Contacts",                   // human-readable display name
+  description: "...",                        // routing signal for LLM (see Section 8)
+  inputSchema: { type: "object", ... },     // JSON Schema for input parameters
+  
+  // REQUIRED (2025-06-18+)
+  outputSchema: {                            // JSON Schema 2020-12 for structured output
+    type: "object",
+    properties: { ... },
+    required: [ ... ],
+  },
+  
+  // REQUIRED
+  annotations: { ... },                     // behavioral hints (see Section 6)
+
+  // OPTIONAL — for rich UI clients
+  icons: [                                   // icon for display in tool lists/palettes
+    { src: "https://example.com/icon.svg", mimeType: "image/svg+xml" },
+  ],
+}
+```
+
+### outputSchema guidelines (JSON Schema 2020-12):
+
+- Declare the shape of `structuredContent` returned by the tool
+- Use standard JSON Schema types: `string`, `number`, `boolean`, `object`, `array`
+- Include `required` array for non-optional fields
+- Keep schemas concise — only document fields the client needs to consume
+- The SDK validates `structuredContent` against `outputSchema` when both are present
+
+```typescript
+// Example: List endpoint outputSchema
+outputSchema: {
+  type: "object",
+  properties: {
+    data: {
+      type: "array",
+      items: {
+        type: "object",
+        properties: {
+          id: { type: "string" },
+          name: { type: "string" },
+          email: { type: "string" },
+          status: { type: "string" },
+        },
+      },
+    },
+    meta: {
+      type: "object",
+      properties: {
+        total: { type: "number" },
+        page: { type: "number" },
+        pageSize: { type: "number" },
+        hasMore: { type: "boolean" },
+      },
+    },
+  },
+  required: ["data", "meta"],
+},
+
+// Example: Single entity outputSchema
+outputSchema: {
+  type: "object",
+  properties: {
+    id: { type: "string" },
+    name: { type: "string" },
+    email: { type: "string" },
+    phone: { type: "string" },
+    status: { type: "string" },
+    created_at: { type: "string" },
+  },
+  required: ["id", "name"],
+},
+
+// Example: Delete/action outputSchema
+outputSchema: {
+  type: "object",
+  properties: {
+    success: { type: "boolean" },
+    deleted_id: { type: "string" },
+  },
+  required: ["success"],
+},
+```
+
+### icons (optional):
+
+```typescript
+// SVG preferred for crisp scaling at any size
+icons: [
+  { src: "https://cdn.example.com/contacts-icon.svg", mimeType: "image/svg+xml" },
+],
+
+// Or PNG for raster icons
+icons: [
+  { src: "https://cdn.example.com/contacts-icon.png", mimeType: "image/png" },
+],
+```
+
+Icons are used by rich MCP clients (VS Code, Claude Desktop) to display tools in palettes and menus. Optional but improves discoverability. Use one icon per tool — prefer SVG.
+
+---
+
+## 8. Tool Description Best Practices for LLM Routing
+
+The description is the MOST IMPORTANT field. It determines whether the LLM picks the right tool.
+
+### Formula:
+```
+{What it does in one sentence}. {What it returns — 2-3 key fields}. 
+{When to use it — user intents}. {When NOT to use it — disambiguation}.
+```
+
+### Good examples:
+```
+"List contacts with optional filters and pagination. Returns name, email, phone, and status. 
+Use when the user wants to browse or filter contacts. Do NOT use to search by keyword 
+(use search_contacts) or get one contact's details (use get_contact)."
+
+"Get full details for a specific contact by ID. Returns all fields including activity history 
+and tags. Use when the user references a known contact. Do NOT use to browse multiple contacts 
+(use list_contacts)."
+
+"Create a new contact. Returns the created contact with assigned ID. 
+Use when the user wants to add a new person to the system."
+
+"Permanently delete a contact. Cannot be undone. 
+Use only when the user explicitly asks to delete a contact."
+```
+
+### Bad examples:
+```
+"Gets contacts"                    // What contacts? How? When?
+"Contact management tool"         // Not actionable
+"CRUD operations for contacts"    // Technical jargon, no routing signal
+"Fetches contact data from API"   // Implementation detail, not user intent
+```
+
+### For similar tools, create clear differentiation:
+```
+list_contacts: "...browse or filter contacts. Do NOT use for keyword search."
+search_contacts: "...full-text search. Use when searching by specific keyword."
+get_contact: "...single contact by ID. Use for one specific contact's details."
+```
+
+---
+
+## 9. Tool Result Standards (structuredContent)
+
+**Every tool handler MUST return both `content` (text fallback) and `structuredContent` (typed JSON).**
+
+This is required by the MCP 2025-06-18 spec. `content` is the universal text fallback for clients that don't support structured output. `structuredContent` is the typed JSON that matches `outputSchema`.
+
+### Standard return pattern:
+
+```typescript
+// Basic: return both text and structured content
+const result = await client.get(`/contacts/${contact_id}`);
+return {
+  content: [{ type: "text", text: JSON.stringify(result, null, 2) }],
+  structuredContent: result,
+};
+
+// With resource_link: tool result includes a link to a subscribable MCP Resource
+const result = await client.get(`/contacts/${contact_id}`);
+return {
+  content: [
+    { type: "text", text: JSON.stringify(result, null, 2) },
+    {
+      type: "resource_link",
+      uri: `{service}://contacts/${contact_id}`,
+      name: `Contact ${result.name}`,
+      mimeType: "application/json",
+    },
+  ],
+  structuredContent: result,
+};
+
+// Error: also use structuredContent for error responses
+return {
+  content: [{ type: "text", text: `Error: ${message}` }],
+  structuredContent: { error: message, tool: name },
+  isError: true,
+};
+```
+
+### Content annotations on tool results
+
+Content blocks support `annotations` with `audience` and `priority` to control routing:
+
+```typescript
+// Content annotation pattern — add to every content block
+{
+  type: "text",
+  text: JSON.stringify(result, null, 2),
+  annotations: {
+    audience: ["user", "assistant"],  // Who should see this content
+    priority: 0.7,                     // 0.0-1.0, higher = more prominent
+  },
+}
+```
+
+Use the content annotation planning from the analysis doc (Section 6b) to set appropriate values per tool type. See Section 4.6 for handler examples with annotations.
+
+> **Note on HTML escaping in apps:** If building apps that render user-supplied text, use a regex-based `escapeHtml()` — it's ~10x faster than DOM-based approaches (`document.createElement('div').textContent`), especially for large datasets.
+
+### When to include `resource_link`:
+
+- GET single entity tools (get_contact, get_deal, get_invoice)
+- The `uri` should follow `{service}://{resource_type}/{id}` convention
+- Allows clients to subscribe to resource updates via MCP Resources
+- Don't include on list/search tools (too many links) or write tools
+
+---
+
+## 10. Error Handling Standards
+
+### Protocol Errors vs Tool Execution Errors
+
+The MCP spec (2025-11-25) formally distinguishes two error categories:
+
+| Category | When | How | LLM Behavior |
+|----------|------|-----|---------------|
+| **Protocol Errors** | Unknown tool, malformed JSON-RPC, server crash | JSON-RPC error codes (-32600 to -32603, -32700) | LLM cannot self-correct |
+| **Tool Execution Errors** | API failure, validation error, business logic | `isError: true` in result content | LLM CAN self-correct |
+
+**Critical rule: Input validation errors are Tool Execution Errors, NOT Protocol Errors.** Returning validation errors as `isError: true` lets the LLM read the error, fix its input, and retry — enabling self-correction.
+
+### Three-level error handling:
+
+#### Client-level (in `client.ts`):
+- Retry on 429 (rate limit) and 5xx (server error)
+- Don't retry on 4xx (client error — bad request, not found, unauthorized)
+- Circuit breaker prevents hammering a down service
+- Request timeout via AbortController prevents indefinite hangs
+- Parse error body for useful messages
+- Track rate limit headers
+
+#### Handler-level (in tool handlers):
+- Zod validation catches bad input before API call
+- Catch specific error types for better messages
+
+#### Server-level (in `index.ts`):
+- Never crash — always return an error response
+- Use `isError: true` flag for tool execution errors
+- Include the original error message so LLM can self-correct
+- Return `structuredContent` with error info
+
+```typescript
+// In the CallToolRequest handler:
+try {
+  const handler = await registry.getHandler(name);
+  const result = await handler(args || {});
+  return result;
+} catch (error) {
+  let message: string;
+  if (error instanceof z.ZodError) {
+    // Input validation → Tool Execution Error (LLM self-corrects)
+    message = `Validation error: ${error.errors.map(e => `${e.path.join(".")}: ${e.message}`).join(", ")}`;
+  } else if (error instanceof Error) {
+    message = error.message;
+  } else {
+    message = String(error);
+  }
+  return {
+    content: [{ type: "text", text: `Error: ${message}` }],
+    structuredContent: { error: message, tool: name },
+    isError: true,
+  };
+}
+```
+
+---
+
+## 11. Token Budget Awareness
+
+**This is the real performance bottleneck.** Each tool definition consumes 50–1000 tokens depending on schema complexity. Tool definitions are sent to the LLM on every request.
+
+### Budget targets:
+
+| Metric | Target | Why |
+|--------|--------|-----|
+| Tokens per tool description | **< 200 tokens** | Prevents context bloat |
+| Total tool definition tokens (per server) | **< 5,000 tokens** | Keeps 97.5% of context free |
+| Max tools per server | **~25 active** | Above this, accuracy degrades |
+| Max tools per interaction | **15–20** | Optimal accuracy range |
+
+### Token optimization techniques:
+
+1. **Concise descriptions** — Cut filler words. "List contacts with optional filters" not "This tool allows you to list contacts with various optional filtering parameters."
+
+2. **Minimal inputSchema** — Only document parameters the LLM needs to set. Don't include internal/computed params.
+
+3. **Short property descriptions** — `"Page number (default 1)"` not `"The page number for paginated results. If not provided, defaults to 1."`
+
+4. **Combine similar tools** — If `list_contacts` and `search_contacts` differ by one optional param, merge them. Fewer tools = better routing.
+
+5. **outputSchema brevity** — Include key fields, not exhaustive response bodies. The LLM doesn't need to know about every field the API returns.
+
+### Token counting helper
+
+Run this after building to verify token budgets:
+
+```bash
+# Approximate token count per tool (words × 1.3)
+node -e "
+  const fs = require('fs');
+  const src = fs.readFileSync('dist/index.js', 'utf8');
+  // Extract tool definitions — look for name/description pairs
+  const toolRegex = /name:\s*['\"](\w+)['\"][\s\S]*?description:\s*['\"]([^'\"]+)['\"]/g;
+  let match, total = 0;
+  while ((match = toolRegex.exec(src)) !== null) {
+    const tokens = Math.ceil(match[2].split(/\s+/).length * 1.3);
+    total += tokens;
+    const status = tokens > 200 ? '⚠️' : '✅';
+    console.log(\`\${status} \${match[1]}: ~\${tokens} tokens\`);
+  }
+  console.log(\`\nTotal description tokens: ~\${total}\`);
+  console.log(total > 5000 ? '⚠️  Over 5,000 token budget!' : '✅ Within token budget');
+"
+```
+
+### Warning: Large servers
+
+A server with 50+ tools at 200 tokens each = **10,000+ tokens** consumed from context window before any conversation begins. For these servers:
+- Implement selective tool registration based on channel/context
+- Group tools and only register the relevant group per session
+- Consider splitting into multiple focused servers
+
+---
+
+## 12. Zod Validation Standards
+
+Every tool handler MUST validate its input with Zod before making API calls:
+
+```typescript
+import { z } from "zod";
+
+// Define schema with descriptions (they appear in error messages)
+const ListContactsSchema = z.object({
+  page: z.number().int().positive().optional().default(1),
+  pageSize: z.number().int().min(1).max(100).optional().default(25),
+  query: z.string().optional(),
+  status: z.enum(["active", "inactive", "all"]).optional(),
+  sortBy: z.enum(["created", "updated", "name"]).optional(),
+  createdAfter: z.string().datetime().optional(),
+});
+
+// In handler:
+async (args) => {
+  const params = ListContactsSchema.parse(args);
+  // params is now fully typed and validated
+}
+```
+
+### Common Zod patterns:
+
+```typescript
+// Required string
+z.string().describe("Contact ID")
+
+// Optional with default
+z.number().optional().default(25)
+
+// Enum
+z.enum(["active", "inactive", "all"])
+
+// Email
+z.string().email()
+
+// ISO date
+z.string().datetime()
+
+// Constrained number
+z.number().int().min(1).max(100)
+
+// Optional object
+z.record(z.unknown()).optional()
+
+// Array of strings
+z.array(z.string()).optional()
+```
+
+---
+
+## 13. Transport Selection Guide
+
+### Stdio (default — local use)
+```typescript
+import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
+const transport = new StdioServerTransport();
+await server.connect(transport);
+```
+
+**Use when:**
+- Running as a local subprocess (Claude Desktop, Cursor, CLI tools)
+- Single-client access (one client spawns one server process)
+- No network exposure needed
+- Development/testing
+
+### Streamable HTTP (remote/production)
+```typescript
+import { StreamableHTTPServerTransport } from "@modelcontextprotocol/sdk/server/streamableHttp.js";
+```
+
+**Use when:**
+- Deploying as a network service
+- Multiple clients need to connect simultaneously
+- Running behind a load balancer or gateway
+- Production deployment with monitoring
+- Docker/containerized deployment
+
+**Key characteristics:**
+- HTTP POST for client→server messages (JSON-RPC)
+- HTTP GET with SSE for server→client notifications
+- Session management via `MCP-Session-Id` header
+- Resumability via `Last-Event-ID`
+- Supports concurrent clients
+
+**Note:** Legacy SSE transport is deprecated. Use Streamable HTTP for all new remote deployments.
+
+The `src/index.ts` template (Section 4.7) includes both transports, selected via `MCP_TRANSPORT` env var.
+
+---
+
+## 14. Pagination Strategies
+
+The API client supports pluggable pagination. Each tool specifies which strategy its endpoint uses:
+
+### Strategy: Offset (most common)
+```typescript
+// ?page=2&pageSize=25
+const result = await client.paginate<Contact>("/contacts", {
+  page: 2, pageSize: 25,
+  strategy: { type: "offset", pageParam: "page", pageSizeParam: "pageSize" },
+});
+```
+
+### Strategy: Cursor (Slack, Facebook, GraphQL)
+```typescript
+// ?cursor=eyJsYXN0SWQiOiIxMjMifQ==&limit=25
+const result = await client.paginate<Contact>("/contacts", {
+  pageSize: 25,
+  strategy: { type: "cursor", cursorParam: "cursor", cursorPath: "meta.nextCursor" },
+  extraParams: { cursor: previousCursor },
+});
+```
+
+### Strategy: Keyset (Stripe — starting_after=obj_xxx)
+```typescript
+// ?starting_after=con_abc123&limit=25
+const result = await client.paginate<Contact>("/contacts", {
+  pageSize: 25,
+  strategy: { type: "keyset", afterParam: "starting_after" },
+  extraParams: { starting_after: lastItemId },
+});
+```
+
+### Strategy: Link Header (GitHub-style)
+```typescript
+// Reads Link: <url>; rel="next" from response headers
+const result = await client.paginate<Contact>("/contacts", {
+  page: 1, pageSize: 25,
+  strategy: { type: "link-header" },
+});
+```
+
+### Strategy: Next URL (API returns full URL for next page)
+```typescript
+// API response: { results: [...], next: "https://api.example.com/contacts?offset=50" }
+const result = await client.paginate<Contact>("/contacts", {
+  pageSize: 25,
+  strategy: { type: "next-url", nextUrlPath: "next" },
+});
+```
+
+**Choosing a strategy:** Check the API analysis doc. The pagination section should specify which pattern the API uses. Default to `offset` if not specified. Document the strategy choice in the tool group file.
+
+---
+
+## 15. Tasks (Async Operations) for Long-Running Tools
+
+The 2025-11-25 spec adds experimental Tasks support (SEP-1686). For tools where the operation may take >10 seconds, declare task support so clients can poll for results instead of waiting.
+
+### When to use Tasks:
+- **Report generation** — compiling analytics, PDFs, exports (30-120s)
+- **Bulk operations** — updating 100+ records, mass imports (10-60s)
+- **External processing** — waiting on third-party webhooks, payment processing
+- **Data migration** — moving large datasets between systems
+
+### Tool definition with task support:
+
+```typescript
+{
+  name: "export_report",
+  title: "Export Report",
+  description: "Generate and export an analytics report. May take 30-120 seconds. Use when user requests a full report or data export.",
+  inputSchema: { ... },
+  outputSchema: { ... },
+  annotations: { readOnlyHint: true, destructiveHint: false, idempotentHint: true, openWorldHint: false },
+  execution: {
+    taskSupport: "optional",  // "required" | "optional" | "forbidden"
+  },
+}
+```
+
+### Server capabilities with Tasks:
+
+```typescript
+capabilities: {
+  tools: { listChanged: false },
+  logging: {},
+  tasks: {
+    list: {},
+    cancel: {},
+    requests: { tools: { call: {} } },
+  },
+}
+```
+
+### Task-aware handler pattern:
+
+```typescript
+// For task-enabled tools, the handler can return immediately with a task reference
+// The SDK manages the task lifecycle — the handler just does the work
+async function handleExportReport(args: Record<string, unknown>): Promise<ToolResult> {
+  const params = ExportReportSchema.parse(args);
+  
+  // Long-running operation
+  const result = await generateReport(params);
+  
+  return {
+    content: [{
+      type: "text",
+      text: JSON.stringify(result, null, 2),
+      annotations: { audience: ["user"], priority: 0.8 },
+    }],
+    structuredContent: result,
+  };
+}
+```
+
+> **Note:** Tasks support is experimental in the 2025-11-25 spec. Implement only for tools identified as task candidates in the analysis doc (Section 10). Most tools should NOT use tasks — only long-running operations that would otherwise hit timeout limits.
+
+---
+
+## 16. One-File Pattern (for ≤15 tools)
+
+If the analysis shows 15 or fewer tools, skip the modular structure and use a single `src/index.ts`. Still include all standards: `title`, `outputSchema`, `structuredContent`, logging, health check, timeouts, circuit breaker.
+
+```typescript
+#!/usr/bin/env node
+import { Server } from "@modelcontextprotocol/sdk/server/index.js";
+import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
+import {
+  CallToolRequestSchema,
+  ListToolsRequestSchema,
+} from "@modelcontextprotocol/sdk/types.js";
+import { z } from "zod";
+
+const MCP_NAME = "{service}";
+const MCP_VERSION = "1.0.0";
+const API_BASE_URL = "https://api.example.com";
+const REQUEST_TIMEOUT_MS = 30_000;
+
+// === STRUCTURED LOGGER (inline) ===
+function log(level: string, event: string, data: Record<string, unknown> = {}) {
+  console.error(JSON.stringify({ ts: new Date().toISOString(), level, event, server: MCP_NAME, ...data }));
+}
+
+// === CIRCUIT BREAKER (inline) ===
+let cbFailures = 0;
+let cbLastFailure = 0;
+let cbState: "closed" | "open" | "half-open" = "closed";
+const CB_THRESHOLD = 5;
+const CB_RESET_MS = 60_000;
+
+function cbCanExecute(): boolean {
+  if (cbState === "closed") return true;
+  if (cbState === "open" && Date.now() - cbLastFailure >= CB_RESET_MS) { cbState = "half-open"; return true; }
+  return cbState === "half-open";
+}
+function cbSuccess() { cbFailures = 0; cbState = "closed"; }
+function cbFailure() { cbFailures++; cbLastFailure = Date.now(); if (cbFailures >= CB_THRESHOLD) cbState = "open"; }
+
+// === API CLIENT (inline) ===
+class APIClient {
+  constructor(private apiKey: string, private baseUrl = API_BASE_URL) {}
+
+  async request<T>(endpoint: string, options: RequestInit = {}): Promise<T> {
+    if (!cbCanExecute()) throw new Error("Circuit breaker open — API unavailable");
+    const controller = new AbortController();
+    const timeoutId = setTimeout(() => controller.abort(), REQUEST_TIMEOUT_MS);
+    try {
+      const response = await fetch(`${this.baseUrl}${endpoint}`, {
+        ...options,
+        signal: controller.signal,
+        headers: { "Authorization": `Bearer ${this.apiKey}`, "Content-Type": "application/json", ...options.headers },
+      });
+      if (response.status >= 500) { cbFailure(); throw new Error(`Server error: ${response.status}`); }
+      if (!response.ok) { const body = await response.text(); throw new Error(`API error ${response.status}: ${body}`); }
+      cbSuccess();
+      if (response.status === 204) return { success: true } as T;
+      return (await response.json()) as T;
+    } catch (error) {
+      if (error instanceof Error && error.name === "AbortError") { cbFailure(); throw new Error(`Timeout after ${REQUEST_TIMEOUT_MS}ms`); }
+      throw error;
+    } finally {
+      clearTimeout(timeoutId);
+    }
+  }
+
+  async get<T>(endpoint: string): Promise<T> { return this.request<T>(endpoint); }
+  async post<T>(endpoint: string, data: unknown): Promise<T> { return this.request<T>(endpoint, { method: "POST", body: JSON.stringify(data) }); }
+  async patch<T>(endpoint: string, data: unknown): Promise<T> { return this.request<T>(endpoint, { method: "PATCH", body: JSON.stringify(data) }); }
+  async delete<T>(endpoint: string): Promise<T> { return this.request<T>(endpoint, { method: "DELETE" }); }
+
+  async paginate<T>(endpoint: string, params: { page?: number; pageSize?: number; extraParams?: Record<string, string> } = {}) {
+    const { page = 1, pageSize = 25, extraParams = {} } = params;
+    const qs = new URLSearchParams({ page: String(page), pageSize: String(Math.min(pageSize, 100)), ...extraParams });
+    const result = await this.get<any>(`${endpoint}?${qs}`);
+    const data = Array.isArray(result) ? result : result.data || result.items || result.results || [];
+    const total = result.meta?.total || result.total || data.length;
+    return { data, meta: { total, page, pageSize, hasMore: page * pageSize < total } };
+  }
+
+  async healthCheck() {
+    const start = performance.now();
+    try {
+      const controller = new AbortController();
+      const tid = setTimeout(() => controller.abort(), 10_000);
+      try {
+        const r = await fetch(this.baseUrl, { signal: controller.signal, headers: { "Authorization": `Bearer ${this.apiKey}` } });
+        return { reachable: true, authenticated: r.status !== 401 && r.status !== 403, latencyMs: Math.round(performance.now() - start) };
+      } finally { clearTimeout(tid); }
+    } catch (e) {
+      return { reachable: false, authenticated: false, latencyMs: Math.round(performance.now() - start), error: String(e) };
+    }
+  }
+}
+
+// === ZOD SCHEMAS ===
+const ListItemsSchema = z.object({
+  page: z.number().optional().default(1),
+  pageSize: z.number().optional().default(25),
+});
+// ...add more schemas
+
+// === TOOL DEFINITIONS ===
+const tools = [
+  {
+    name: "health_check",
+    title: "Health Check",
+    description: "Validate server health: env vars set, API reachable, auth valid. Use to diagnose connection issues.",
+    inputSchema: { type: "object" as const, properties: {} },
+    outputSchema: {
+      type: "object", properties: {
+        status: { type: "string" }, checks: { type: "object" },
+      }, required: ["status", "checks"],
+    },
+    annotations: { readOnlyHint: true, destructiveHint: false, idempotentHint: true, openWorldHint: false },
+  },
+  {
+    name: "list_items",
+    title: "List Items",
+    description: "List items with pagination. Returns name, status. Use to browse items. Do NOT use to get one item's details.",
+    inputSchema: {
+      type: "object" as const,
+      properties: {
+        page: { type: "number", description: "Page number (default 1)" },
+        pageSize: { type: "number", description: "Results per page (default 25)" },
+      },
+    },
+    outputSchema: {
+      type: "object", properties: {
+        data: { type: "array", items: { type: "object" } },
+        meta: { type: "object" },
+      }, required: ["data", "meta"],
+    },
+    annotations: { readOnlyHint: true, destructiveHint: false, idempotentHint: true, openWorldHint: false },
+  },
+  // ...add more tools
+];
+
+// === TOOL HANDLER ===
+async function handleTool(client: APIClient, name: string, args: Record<string, unknown>) {
+  switch (name) {
+    case "health_check": {
+      const required = ["{SERVICE}_API_KEY"];
+      const missing = required.filter(v => !process.env[v]);
+      const hc = await client.healthCheck();
+      const status = missing.length > 0 || !hc.reachable ? "unhealthy" : !hc.authenticated ? "degraded" : "healthy";
+      const result = { status, checks: { envVars: { ok: !missing.length, missing }, ...hc } };
+      return { content: [{ type: "text", text: JSON.stringify(result, null, 2) }], structuredContent: result };
+    }
+    case "list_items": {
+      const params = ListItemsSchema.parse(args);
+      const result = await client.paginate("/items", { page: params.page, pageSize: params.pageSize });
+      return { content: [{ type: "text", text: JSON.stringify(result, null, 2) }], structuredContent: result };
+    }
+    // ...add more cases
+    default:
+      throw new Error(`Unknown tool: ${name}`);
+  }
+}
+
+// === SERVER ===
+async function main() {
+  const apiKey = process.env["{SERVICE}_API_KEY"];
+  if (!apiKey) { console.error("Error: {SERVICE}_API_KEY required"); process.exit(1); }
+
+  const client = new APIClient(apiKey);
+  const server = new Server(
+    { name: `${MCP_NAME}-mcp`, version: MCP_VERSION },
+    {
+      capabilities: {
+        tools: { listChanged: false },
+        logging: {},
+        // Enable ONLY when implemented:
+        // resources: { subscribe: false, listChanged: false },
+        // prompts: { listChanged: false },
+      },
+    }
+  );
+
+  server.setRequestHandler(ListToolsRequestSchema, async () => {
+    log("info", "tools.list", { count: tools.length });
+    return { tools };
+  });
+
+  server.setRequestHandler(CallToolRequestSchema, async (request) => {
+    const { name, arguments: args } = request.params;
+    const start = performance.now();
+    log("info", "tool.call.start", { tool: name });
+    try {
+      const result = await handleTool(client, name, args || {});
+      log("info", "tool.call.done", { tool: name, durationMs: Math.round(performance.now() - start) });
+      return result;
+    } catch (error) {
+      const message = error instanceof Error ? error.message : String(error);
+      log("error", "tool.call.error", { tool: name, error: message, durationMs: Math.round(performance.now() - start) });
+      return {
+        content: [{ type: "text", text: `Error: ${message}` }],
+        structuredContent: { error: message, tool: name },
+        isError: true,
+      };
+    }
+  });
+
+  const transport = new StdioServerTransport();
+  await server.connect(transport);
+  log("info", "server.started", { transport: "stdio" });
+}
+
+main().catch(console.error);
+```
+
+---
+
+## 17. README Template
+
+````markdown
+# {Service Name} MCP Server
+
+MCP server for {Service Name} API integration. Provides {N} tools across {M} groups for {brief description of capabilities}.
+
+## Setup
+
+1. **Get API credentials:** {Instructions to get API key from service}
+2. **Configure environment:**
+   ```bash
+   cp .env.example .env
+   # Edit .env with your credentials
+   ```
+3. **Build and run:**
+   ```bash
+   npm install
+   npm run build
+   npm start          # stdio transport (default, for Claude Desktop)
+   npm run start:http  # HTTP transport (for remote/production)
+   ```
+
+## Environment Variables
+
+| Variable | Required | Description |
+|----------|----------|-------------|
+| `{SERVICE}_API_KEY` | Yes | Your API key from {service dashboard URL} |
+| `{SERVICE}_BASE_URL` | No | Override base URL (default: {default URL}) |
+| `MCP_TRANSPORT` | No | `stdio` (default) or `http` |
+| `MCP_HTTP_PORT` | No | HTTP server port (default: 3000) |
+
+## Available Tools
+
+### Health
+| Tool | Description |
+|------|-------------|
+| `health_check` | Validate server connectivity and auth |
+
+### {Group 1}: {Group Description}
+| Tool | Description |
+|------|-------------|
+| `list_{resources}` | List with filters and pagination |
+| `get_{resource}` | Get by ID |
+| `create_{resource}` | Create new |
+| `update_{resource}` | Update existing |
+| `delete_{resource}` | Delete |
+
+{Repeat for each group}
+
+## Transport Options
+
+### Stdio (Local — Claude Desktop, Cursor)
+```json
+{
+  "mcpServers": {
+    "{service}": {
+      "command": "node",
+      "args": ["{absolute-path}/dist/index.js"],
+      "env": {
+        "{SERVICE}_API_KEY": "your_key_here"
+      }
+    }
+  }
+}
+```
+
+### Streamable HTTP (Remote — Production)
+```bash
+MCP_TRANSPORT=http MCP_HTTP_PORT=3000 node dist/index.js
+```
+Then connect clients to `http://your-server:3000/mcp`.
+````
+
+---
+
+## 18. Quality Gate Checklist
+
+Before passing the server to Phase 3/4, verify:
+
+### Core Requirements
+- [ ] **`npm run build` succeeds** — tsc compiles clean, zero errors
+- [ ] **No template variables remain** — `grep -r '{service}\|{SERVICE}\|{Service}' src/` returns empty
+- [ ] **SDK pinned to `^1.26.0`** — security fix GHSA-345p-7cg4-v4c7, ensures 2025-11-25 spec support
+- [ ] **Zod pinned to `^3.25.0`** — compatible with SDK v1.x (do NOT use Zod v4 — issue #1429)
+
+### Tool Definitions (2025-11-25 Spec)
+- [ ] **Every tool has `title`** — human-readable display name
+- [ ] **Every tool has `outputSchema`** — JSON Schema 2020-12 declaring output shape
+- [ ] **Every tool has `annotations`** — readOnlyHint, destructiveHint, idempotentHint, openWorldHint
+- [ ] **Every tool description follows the formula** — what/returns/when/when-NOT
+- [ ] **Every tool description under 200 tokens** — concise for token budget
+- [ ] **Total tool definitions under 5,000 tokens** — prevents context bloat
+
+### Tool Results
+- [ ] **Every handler returns `structuredContent`** — typed JSON alongside text `content`
+- [ ] **`structuredContent` matches `outputSchema`** — validate shapes match
+- [ ] **GET single-entity tools return `resource_link`** — subscribable MCP Resource URIs
+- [ ] **Error responses include `isError: true`** — with both `content` and `structuredContent`
+
+### Resilience
+- [ ] **All fetch calls have AbortController timeout** — 30s default, no indefinite hangs
+- [ ] **Circuit breaker is active** — fails fast when API is down, auto-recovers
+- [ ] **Retry logic on 429 and 5xx** — with exponential backoff
+- [ ] **Rate limit headers tracked** — proactive wait before hitting limits
+
+### Server
+- [ ] **Only implemented capabilities declared** — tools + logging (add resources/prompts only when implemented)
+- [ ] **`health_check` tool is included** — validates env vars, API reach, auth
+- [ ] **Structured logging on stderr** — JSON-formatted, with request IDs and timing
+- [ ] **Both transports available** — stdio (default) + Streamable HTTP (via MCP_TRANSPORT=http)
+
+### Standard Files
+- [ ] **All required env vars validated on startup** — clear error messages if missing
+- [ ] **`.env.example` lists ALL variables** — with descriptive comments
+- [ ] **README documents setup, tool list, both transports** — copy-paste ready
+- [ ] **Every tool has Zod input validation** — schemas parse before API calls
+- [ ] **Pagination uses the correct strategy** — matches API's pagination pattern
+- [ ] **No `any` types** — strict TypeScript (except unavoidable API response parsing)
+- [ ] **Tool names follow `verb_noun` convention** — snake_case, descriptive
+
+---
+
+## 19. Execution Workflow
+
+```
+1. Read {service}-api-analysis.md
+2. Determine pattern: one-file (≤15 tools) vs modular (15+ tools)
+3. Scaffold project structure (mkdir, package.json, tsconfig.json)
+4. Create logger.ts (structured JSON logging)
+5. Build API client with correct auth pattern, timeouts, circuit breaker
+6. Create health.ts (health_check tool — always included)
+7. Create tool group files (one per group from analysis)
+   - Every tool: name, title, description (with disambiguation), inputSchema, outputSchema, annotations
+   - Every handler: Zod validation → API call → return { content, structuredContent }
+   - GET single-entity handlers: include resource_link in content
+8. Wire up tool registry with lazy loading
+9. Create server entry point with both transports
+10. Create .env.example and README.md
+11. Run `npm install && npm run build`
+12. Fix any compilation errors
+13. Run token counting helper (Section 11) — verify <200 tokens/tool, <5,000 total
+14. Run quality gate checklist
+15. Output: compiled MCP server ready for Phase 3/4
+```
+
+**Estimated time:** 30-60 minutes for small servers, 1-2 hours for large ones.
+
+**Agent model recommendation:** Sonnet — well-defined patterns, code generation. Escalate to Opus only if auth pattern is unusual or 25+ tools require careful description disambiguation.
+
+---
+
+*This skill is Phase 2 of the MCP Factory pipeline. It takes an analysis document and produces a compiled, production-ready MCP server conforming to the MCP 2025-11-25 spec.*
diff --git a/skills/mcp-server-development/2026-BLUEPRINT.md b/skills/mcp-server-development/2026-BLUEPRINT.md
new file mode 100644
index 0000000..bf9e324
--- /dev/null
+++ b/skills/mcp-server-development/2026-BLUEPRINT.md
@@ -0,0 +1,723 @@
+# MCP Server Blueprint — February 2026
+
+**This is the definitive template for building production-ready MCP servers in 2026.**
+
+Use this checklist for EVERY new MCP server. No skipping steps. These patterns ensure your server is:
+- ✅ Usable (not just functional)
+- ✅ Fast (lazy loading, efficient queries)
+- ✅ Discoverable (labels, descriptions)
+- ✅ Interactive (MCP Apps where appropriate)
+- ✅ Debuggable (logging, progress)
+- ✅ Production-ready (error handling, deployment)
+
+---
+
+## Phase 1: Planning (Before Writing Code)
+
+### 1.1 Define Server Scope
+- [ ] What API/service are you integrating?
+- [ ] What are the 5-10 most important operations?
+- [ ] Who is the target user? (developers, business users, etc.)
+- [ ] What data is most frequently accessed?
+
+### 1.2 Identify Tool Categories
+Label your tools by category. Common patterns:
+- [ ] **CRUD operations** (create, read, update, delete)
+- [ ] **Search/Filter** (find data with queries)
+- [ ] **Analytics/Reporting** (stats, dashboards, summaries)
+- [ ] **Workflows** (multi-step operations)
+- [ ] **Admin** (configuration, settings)
+
+### 1.3 Identify UI Opportunities
+Which operations benefit from visual display?
+- [ ] **Data grids** — Contact lists, search results, tables
+- [ ] **Dashboards** — Metrics, KPIs, analytics
+- [ ] **Cards** — Detail views (invoices, opportunities, profiles)
+- [ ] **Timelines** — Activity feeds, history
+- [ ] **Forms** — Quick actions (booking, creating records)
+- [ ] **Kanban** — Pipeline views, project boards
+
+If you have 3+ UI opportunities, plan for MCP Apps.
+
+---
+
+## Phase 2: Core Server Setup
+
+### 2.1 Project Structure
+```bash
+mkdir mcp-server-myservice
+cd mcp-server-myservice
+npm init -y
+npm install @modelcontextprotocol/sdk
+npm install -D typescript @types/node tsx fs-extra @types/fs-extra
+```
+
+### 2.2 File Structure
+```
+mcp-server-myservice/
+├── src/
+│   ├── index.ts              # Main server (or server.ts)
+│   ├── clients/
+│   │   └── api-client.ts     # API client
+│   ├── apps/                 # If using MCP Apps
+│   │   └── index.ts          # Apps manager
+│   ├── ui/                   # If using MCP Apps
+│   │   ├── contact-grid.html
+│   │   └── dashboard.html
+│   └── types/
+│       └── index.ts          # Shared types
+├── dist/                     # Build output
+├── scripts/
+│   └── copy-ui.js            # UI build script
+├── package.json
+├── tsconfig.json
+├── .env.example
+├── .gitignore
+├── .npmignore
+├── Dockerfile
+├── railway.json
+└── README.md
+```
+
+### 2.3 Package Configuration
+```json
+{
+  "name": "mcp-server-myservice",
+  "version": "1.0.0",
+  "type": "module",
+  "main": "dist/index.js",
+  "bin": {
+    "mcp-server-myservice": "dist/index.js"
+  },
+  "scripts": {
+    "build": "npm run build:ts && npm run build:ui",
+    "build:ts": "tsc",
+    "build:ui": "node scripts/copy-ui.js",
+    "dev": "tsx src/index.ts",
+    "start": "node dist/index.js"
+  },
+  "files": ["dist", "README.md", "LICENSE"],
+  "keywords": ["mcp", "mcp-server", "model-context-protocol", "myservice"]
+}
+```
+
+---
+
+## Phase 3: Tool Design (The Most Important Phase)
+
+### 3.1 Tool Naming Convention
+✅ **Use:** `verb_noun` (snake_case)
+❌ **Avoid:** camelCase, PascalCase, kebab-case
+
+**CRUD patterns:**
+- `list_contacts` (with pagination + filters)
+- `get_contact` (by ID)
+- `create_contact` (returns created object)
+- `update_contact` (partial updates)
+- `delete_contact` (confirm before delete)
+- `search_contacts` (full-text search if different from list)
+
+**Other patterns:**
+- `send_email`, `schedule_appointment`, `export_report`, `analyze_pipeline`
+
+### 3.2 Tool Metadata & Labels ⭐ CRITICAL
+Every tool MUST have `_meta` with labels:
+
+```typescript
+{
+  name: "search_contacts",
+  description: "Search contacts with filters. Returns paginated results.",
+  inputSchema: { /* ... */ },
+  _meta: {
+    labels: {
+      category: "contacts",           // Group by feature
+      access: "read",                 // read | write | delete
+      complexity: "simple",           // simple | complex | batch
+    },
+  },
+}
+```
+
+**Label categories to use:**
+- `category`: contacts, deals, analytics, calendar, email, admin, workflows
+- `access`: read, write, delete
+- `complexity`: simple (1 API call), complex (multiple calls), batch (loops)
+- `sensitivity`: public, internal, confidential (optional)
+
+### 3.3 Input Schemas — Best Practices
+```typescript
+inputSchema: {
+  type: "object" as const,
+  properties: {
+    // Always describe parameters
+    page: { 
+      type: "number", 
+      description: "Page number (default 1, starts at 1)" 
+    },
+    pageSize: { 
+      type: "number", 
+      description: "Results per page (default 50, max 100)" 
+    },
+    // Use enums for fixed options
+    status: { 
+      type: "string", 
+      description: "Filter by status",
+      enum: ["active", "inactive", "pending"],
+    },
+    // ISO 8601 for dates
+    createdAfter: { 
+      type: "string", 
+      description: "Filter created after (ISO 8601: 2026-02-03T14:00:00Z)" 
+    },
+  },
+  // Mark required fields explicitly
+  required: ["contactId"],
+}
+```
+
+### 3.4 Pagination (Mandatory for List Operations)
+Every `list_` or `search_` tool MUST support pagination:
+
+```typescript
+{
+  name: "list_contacts",
+  description: "List contacts with pagination and filters",
+  inputSchema: {
+    type: "object" as const,
+    properties: {
+      page: { type: "number", description: "Page number (default 1)" },
+      pageSize: { type: "number", description: "Results per page (default 50, max 100)" },
+      query: { type: "string", description: "Search query (optional)" },
+    },
+  },
+  _meta: {
+    labels: { category: "contacts", access: "read", complexity: "simple" },
+  },
+}
+```
+
+**In handler:**
+```typescript
+case "list_contacts": {
+  const { page = 1, pageSize = 50, query } = args;
+  const params = new URLSearchParams();
+  params.append("page", String(page));
+  params.append("pageSize", String(Math.min(Number(pageSize), 100))); // Cap at API max
+  if (query) params.append("query", query);
+  
+  return await client.get(`/contacts?${params}`);
+}
+```
+
+---
+
+## Phase 4: Lazy-Loaded Resources ⭐ NEW
+
+### 4.1 When to Use Resources vs Tools
+**Use resources for:**
+- Large datasets (contact lists, transaction history)
+- Frequently changing data (real-time dashboards)
+- Reference data (documentation, schemas)
+- User-specific data (per-user settings, dashboards)
+
+**Use tools for:**
+- Operations with parameters (search, filter, create)
+- One-time fetches
+- Mutations (create, update, delete)
+
+### 4.2 Resource Setup
+```typescript
+import { ListResourcesRequestSchema, ReadResourceRequestSchema } from "@modelcontextprotocol/sdk/types.js";
+
+// Declare resources capability
+const server = new Server(
+  { name: "myservice-mcp", version: "1.0.0" },
+  { capabilities: { tools: {}, resources: {} } } // ✅ Enable resources
+);
+
+// List available resources (metadata only)
+server.setRequestHandler(ListResourcesRequestSchema, async () => {
+  return {
+    resources: [
+      {
+        uri: "myservice://contacts/all",
+        name: "All Contacts",
+        description: "Complete contact database (lazy-loaded)",
+        mimeType: "application/json",
+      },
+      {
+        uri: "myservice://analytics/dashboard",
+        name: "Analytics Dashboard",
+        description: "Real-time analytics data",
+        mimeType: "application/json",
+      },
+    ],
+  };
+});
+
+// Fetch resource content on-demand
+server.setRequestHandler(ReadResourceRequestSchema, async (request) => {
+  const { uri } = request.params;
+
+  switch (uri) {
+    case "myservice://contacts/all": {
+      const contacts = await client.get("/contacts?limit=1000"); // Fetch when requested
+      return {
+        contents: [{
+          uri,
+          mimeType: "application/json",
+          text: JSON.stringify(contacts, null, 2),
+        }],
+      };
+    }
+
+    case "myservice://analytics/dashboard": {
+      const analytics = await client.get("/analytics/dashboard");
+      return {
+        contents: [{
+          uri,
+          mimeType: "application/json",
+          text: JSON.stringify(analytics, null, 2),
+        }],
+      };
+    }
+
+    default:
+      throw new Error(`Unknown resource: ${uri}`);
+  }
+});
+```
+
+### 4.3 Resource Templates (Dynamic URIs)
+```typescript
+server.setRequestHandler(ListResourcesRequestSchema, async () => {
+  return {
+    resourceTemplates: [
+      {
+        uriTemplate: "myservice://contact/{id}",
+        name: "Contact Details",
+        description: "Full contact record by ID",
+        mimeType: "application/json",
+      },
+    ],
+  };
+});
+
+server.setRequestHandler(ReadResourceRequestSchema, async (request) => {
+  const { uri } = request.params;
+  
+  const contactMatch = uri.match(/^myservice:\/\/contact\/(.+)$/);
+  if (contactMatch) {
+    const contactId = contactMatch[1];
+    const contact = await client.get(`/contacts/${contactId}`);
+    return {
+      contents: [{
+        uri,
+        mimeType: "application/json",
+        text: JSON.stringify(contact, null, 2),
+      }],
+    };
+  }
+
+  throw new Error(`Unknown resource: ${uri}`);
+});
+```
+
+---
+
+## Phase 5: MCP Apps (If Applicable)
+
+### 5.1 Should You Build Apps?
+Build MCP Apps if you have:
+- ✅ Visual data (grids, cards, dashboards)
+- ✅ 3+ UI opportunities identified in Phase 1
+- ✅ Complex data relationships (better shown than described)
+- ✅ Interactive workflows (drag-drop, forms)
+
+Skip apps if:
+- ❌ Simple CRUD operations only
+- ❌ All operations return small JSON objects
+- ❌ No visual benefit
+
+### 5.2 App Architecture
+See `mcp-apps-integration` skill for full details. Quick checklist:
+
+- [ ] Create `src/apps/index.ts` — MCPAppsManager class
+- [ ] Create `src/ui/` directory — HTML components
+- [ ] Register resource handlers for UI files
+- [ ] Add app tools with `_meta.ui.resourceUri`
+- [ ] Implement `ListResourcesRequestSchema` handler
+- [ ] Implement `ReadResourceRequestSchema` handler
+- [ ] Add `build:ui` script to copy HTML to `dist/app-ui/`
+
+### 5.3 App Tool Naming
+**Pattern:** `view_` or `show_` prefix
+
+```typescript
+{
+  name: "view_contact_grid",
+  description: "Display contact search results in a data grid (visual UI component)",
+  inputSchema: { /* ... */ },
+  _meta: {
+    labels: { category: "contacts", access: "read", complexity: "simple" },
+    ui: { resourceUri: "ui://myservice/contact-grid" },
+  },
+}
+```
+
+### 5.4 Common App Patterns
+- **Contact Grid** — Search results table
+- **Dashboard** — Multi-widget analytics view
+- **Pipeline Board** — Kanban with drag-drop
+- **Opportunity Card** — Detail view for single record
+- **Calendar View** — Appointment/event calendar
+- **Timeline** — Activity feed
+
+Reference: 11 production GHL apps in `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/ghl-mcp-apps-only/`
+
+---
+
+## Phase 6: Progress & Logging
+
+### 6.1 Progress Notifications (For Long Operations)
+Any operation taking >5 seconds MUST send progress updates:
+
+```typescript
+if (name === "import_contacts") {
+  const progressToken = request.params._meta?.progressToken;
+  
+  if (progressToken) {
+    await server.notification({
+      method: "notifications/progress",
+      params: {
+        progressToken,
+        progress: 0.3, // 30%
+        total: 1.0,
+      },
+    });
+  }
+  
+  // ... do work
+  
+  if (progressToken) {
+    await server.notification({
+      method: "notifications/progress",
+      params: { progressToken, progress: 1.0, total: 1.0 },
+    });
+  }
+}
+```
+
+### 6.2 Structured Logging
+Log important operations for debugging:
+
+```typescript
+import { LoggingLevel } from "@modelcontextprotocol/sdk/types.js";
+
+await server.notification({
+  method: "notifications/message",
+  params: {
+    level: LoggingLevel.Info,
+    logger: "myservice",
+    data: {
+      operation: "create_contact",
+      contactId: newContact.id,
+      timestamp: new Date().toISOString(),
+    },
+  },
+});
+```
+
+**When to log:**
+- Info: Successful operations (create, update, delete)
+- Warning: Rate limits, retries, fallbacks
+- Error: API failures, validation errors
+- Debug: Detailed request/response data (dev only)
+
+---
+
+## Phase 7: Error Handling (Production-Ready)
+
+### 7.1 Tool Handler Error Wrapping
+```typescript
+server.setRequestHandler(CallToolRequestSchema, async (request) => {
+  const { name, arguments: args } = request.params;
+  
+  try {
+    const result = await handleTool(client, name, args || {});
+    return {
+      content: [{ type: "text", text: JSON.stringify(result, null, 2) }],
+    };
+  } catch (error) {
+    const message = error instanceof Error ? error.message : String(error);
+    
+    // Log the error
+    await server.notification({
+      method: "notifications/message",
+      params: {
+        level: LoggingLevel.Error,
+        logger: "myservice",
+        data: { tool: name, error: message },
+      },
+    });
+    
+    return {
+      content: [{ type: "text", text: `Error: ${message}` }],
+      isError: true,
+    };
+  }
+});
+```
+
+### 7.2 API Client Error Handling
+```typescript
+async request(endpoint: string, options: RequestInit = {}) {
+  const response = await fetch(url, options);
+
+  if (!response.ok) {
+    const errorText = await response.text();
+    
+    // Parse API error if JSON
+    try {
+      const errorJson = JSON.parse(errorText);
+      throw new Error(
+        `API error: ${response.status} - ${errorJson.message || errorJson.error || errorText}`
+      );
+    } catch {
+      throw new Error(
+        `API error: ${response.status} ${response.statusText} - ${errorText}`
+      );
+    }
+  }
+
+  return response.json();
+}
+```
+
+---
+
+## Phase 8: Prompts (Optional but Recommended)
+
+### 8.1 When to Add Prompts
+Add prompts for:
+- Common analysis workflows (e.g., "Analyze pipeline health")
+- Report generation (e.g., "Generate contact summary")
+- Quick actions (e.g., "Find overdue tasks")
+- Data exploration (e.g., "Show top performers")
+
+### 8.2 Prompt Implementation
+```typescript
+import { ListPromptsRequestSchema, GetPromptRequestSchema } from "@modelcontextprotocol/sdk/types.js";
+
+server.setRequestHandler(ListPromptsRequestSchema, async () => {
+  return {
+    prompts: [
+      {
+        name: "contact_summary",
+        description: "Generate comprehensive contact summary with recent activity",
+        arguments: [
+          { name: "contactId", description: "Contact ID", required: true },
+        ],
+      },
+    ],
+  };
+});
+
+server.setRequestHandler(GetPromptRequestSchema, async (request) => {
+  const { name, arguments: args } = request.params;
+
+  if (name === "contact_summary") {
+    const { contactId } = args;
+    const contact = await client.get(`/contacts/${contactId}`);
+    const activities = await client.get(`/contacts/${contactId}/activities`);
+
+    return {
+      description: `Summary for ${contact.name}`,
+      messages: [{
+        role: "user",
+        content: {
+          type: "text",
+          text: `Generate a comprehensive summary:\n\n${JSON.stringify({ contact, activities }, null, 2)}`,
+        },
+      }],
+    };
+  }
+
+  throw new Error(`Unknown prompt: ${name}`);
+});
+```
+
+---
+
+## Phase 9: Testing Checklist
+
+### 9.1 Local Testing
+- [ ] All tools compile without errors (`npm run build`)
+- [ ] Server starts successfully (`npm start`)
+- [ ] Environment variables validated on startup
+- [ ] Test each tool in Claude Desktop
+- [ ] Test pagination (page 1, page 2)
+- [ ] Test error cases (invalid IDs, missing params)
+- [ ] Test apps render correctly (if applicable)
+- [ ] Check logs in Claude Desktop console
+
+### 9.2 Performance Testing
+- [ ] List operations return in <2 seconds
+- [ ] Lazy-loaded resources only fetch when requested
+- [ ] No unnecessary API calls
+- [ ] Pagination caps at API maximum
+- [ ] Progress notifications for operations >5 seconds
+
+---
+
+## Phase 10: Documentation
+
+### 10.1 README.md Structure
+```markdown
+# MCP Server for MyService
+
+MCP integration for MyService. Enables Claude Desktop to [core value prop].
+
+## Features
+- ✅ List/search/CRUD contacts
+- ✅ Analytics dashboard (MCP App)
+- ✅ Pipeline visualization (MCP App)
+- ✅ Progress tracking for imports
+
+## Installation
+[npx / manual / docker options]
+
+## Configuration
+[Environment variables with .env.example]
+
+## Available Tools
+[List of tools with descriptions]
+
+## MCP Apps (Rich UI)
+[List of app tools with screenshots]
+
+## Development
+[Build/dev instructions]
+```
+
+### 10.2 .env.example
+```bash
+# MyService API Credentials
+MY_SERVICE_API_KEY=your_api_key_here
+MY_SERVICE_API_SECRET=your_secret_here
+
+# Optional: Override base URL
+# MY_SERVICE_BASE_URL=https://sandbox.api.myservice.com
+
+# Optional: Logging
+# LOG_LEVEL=debug
+```
+
+---
+
+## Phase 11: Deployment
+
+### 11.1 Docker
+- [ ] Multi-stage Dockerfile
+- [ ] .dockerignore file
+- [ ] Test build locally
+- [ ] Test run locally
+
+### 11.2 Railway
+- [ ] railway.json with build + start commands
+- [ ] Environment variables documented
+- [ ] Test deployment
+
+### 11.3 npm Publishing
+- [ ] `bin` field in package.json
+- [ ] `files` field includes only dist/
+- [ ] .npmignore excludes src/, .env
+- [ ] Keywords for discoverability
+- [ ] Test `npx` installation locally
+
+### 11.4 GitHub
+- [ ] README.md complete
+- [ ] LICENSE file
+- [ ] .gitignore excludes node_modules, dist, .env
+- [ ] GitHub Actions for CI/CD (optional)
+
+---
+
+## Production Checklist (Final Review)
+
+### Code Quality
+- [ ] All tools have `_meta.labels`
+- [ ] All parameters have descriptions
+- [ ] Required fields marked explicitly
+- [ ] Pagination implemented for list operations
+- [ ] Error handling in all tool handlers
+- [ ] No hardcoded API keys or secrets
+- [ ] Logging for important operations
+
+### Features
+- [ ] Lazy-loaded resources for large datasets
+- [ ] Progress notifications for long operations (>5s)
+- [ ] MCP Apps for visual data (if applicable)
+- [ ] Prompts for common workflows (if applicable)
+
+### Documentation
+- [ ] README with installation instructions
+- [ ] .env.example with all required variables
+- [ ] Tool descriptions clear and helpful
+- [ ] Examples in README
+
+### Deployment
+- [ ] Compiles without errors
+- [ ] Runs in Claude Desktop
+- [ ] Docker image builds (if using Docker)
+- [ ] Railway deploys successfully (if using Railway)
+- [ ] npm package installs via npx (if publishing)
+
+---
+
+## Anti-Patterns to Avoid
+
+❌ **No labels on tools** — Always add `_meta.labels`
+❌ **Loading all data upfront** — Use lazy-loaded resources
+❌ **No pagination** — Every list operation needs page/pageSize
+❌ **Silent failures** — Always log errors and return clear messages
+❌ **No progress for slow ops** — Add progress notifications for >5s operations
+❌ **Building apps when not needed** — Only build apps if visually beneficial
+❌ **Missing descriptions** — Every parameter needs a description
+❌ **No environment validation** — Check env vars on startup
+❌ **Skipping error handling** — Wrap all tool handlers in try-catch
+❌ **Generic error messages** — Be specific ("Contact not found" not "Error")
+
+---
+
+## Reference Materials
+
+- **Skills:**
+  - `mcp-server-development` — Full TypeScript patterns
+  - `mcp-apps-integration` — MCP Apps guide
+  - `mcp-deployment` — Docker/Railway/npm
+
+- **Example Servers:**
+  - `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/mcp-servers/`
+  - 30 production servers with all patterns
+
+- **Example Apps:**
+  - `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/ghl-mcp-apps-only/`
+  - 11 production apps with UI components
+
+---
+
+## TL;DR — The Golden Rules
+
+1. **Labels on every tool** — category, access, complexity
+2. **Lazy-load large datasets** — Use resources, not tools
+3. **Paginate everything** — page/pageSize on all lists
+4. **Progress for slow ops** — >5 seconds = progress notifications
+5. **Apps for visual data** — Grids, dashboards, cards, timelines
+6. **Log important operations** — Info, Warning, Error levels
+7. **Handle errors gracefully** — Clear messages, no silent failures
+8. **Document thoroughly** — README, .env.example, descriptions
+9. **Test before shipping** — All tools work in Claude Desktop
+10. **Deploy with confidence** — Docker, Railway, npm ready to go
+
+**Follow this blueprint and your MCP servers will be production-ready, usable, and optimized for February 2026.**
diff --git a/skills/mcp-server-development/SKILL.md b/skills/mcp-server-development/SKILL.md
new file mode 100644
index 0000000..895cb07
--- /dev/null
+++ b/skills/mcp-server-development/SKILL.md
@@ -0,0 +1,1242 @@
+# MCP Server Development — TypeScript Best Practices
+
+**When to use this skill:** Building TypeScript-based MCP servers from scratch. Use when creating new integrations, API wrappers, or data sources for Claude Desktop.
+
+**What this covers:** Complete TypeScript MCP server patterns extracted from building 30+ production servers (ServiceTitan, Gusto, Mailchimp, Calendly, Toast, Zendesk, Trello, etc.).
+
+---
+
+## 1. Project Structure (Standard Pattern)
+
+```
+my-mcp-server/
+├── src/
+│   └── index.ts          # Main server file
+├── dist/                 # Compiled output (git ignored)
+├── package.json
+├── tsconfig.json
+├── .env.example         # Template for required env vars
+├── .gitignore
+├── Dockerfile           # Optional: for containerization
+├── railway.json         # Optional: for Railway deployment
+└── README.md
+```
+
+### The One-File Pattern (Preferred for Most Servers)
+For most MCP servers, **keep everything in one `src/index.ts` file** unless you have 20+ tools. This includes:
+- Configuration
+- API client class
+- Tool definitions
+- Tool handler function
+- Server setup
+
+**Why:** Easier to read, debug, and maintain. Split into modules only when file exceeds ~500 lines.
+
+---
+
+## 2. File Template (`src/index.ts`)
+
+```typescript
+#!/usr/bin/env node
+import { Server } from "@modelcontextprotocol/sdk/server/index.js";
+import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
+import {
+  CallToolRequestSchema,
+  ListToolsRequestSchema,
+} from "@modelcontextprotocol/sdk/types.js";
+
+// ============================================
+// CONFIGURATION
+// ============================================
+const MCP_NAME = "my-service";
+const MCP_VERSION = "1.0.0";
+const API_BASE_URL = "https://api.example.com";
+
+// ============================================
+// API CLIENT
+// ============================================
+class MyServiceClient {
+  private apiKey: string;
+  private baseUrl: string;
+
+  constructor(apiKey: string, baseUrl: string = API_BASE_URL) {
+    this.apiKey = apiKey;
+    this.baseUrl = baseUrl;
+  }
+
+  async request(endpoint: string, options: RequestInit = {}) {
+    const url = `${this.baseUrl}${endpoint}`;
+    
+    const response = await fetch(url, {
+      ...options,
+      headers: {
+        "Authorization": `Bearer ${this.apiKey}`,
+        "Content-Type": "application/json",
+        ...options.headers,
+      },
+    });
+
+    if (!response.ok) {
+      const errorText = await response.text();
+      throw new Error(`API error: ${response.status} ${response.statusText} - ${errorText}`);
+    }
+
+    return response.json();
+  }
+
+  async get(endpoint: string) {
+    return this.request(endpoint, { method: "GET" });
+  }
+
+  async post(endpoint: string, data: any) {
+    return this.request(endpoint, {
+      method: "POST",
+      body: JSON.stringify(data),
+    });
+  }
+
+  async put(endpoint: string, data: any) {
+    return this.request(endpoint, {
+      method: "PUT",
+      body: JSON.stringify(data),
+    });
+  }
+
+  async delete(endpoint: string) {
+    return this.request(endpoint, { method: "DELETE" });
+  }
+}
+
+// ============================================
+// TOOL DEFINITIONS
+// ============================================
+const tools = [
+  {
+    name: "get_items",
+    description: "List items with optional filters. Returns paginated results.",
+    inputSchema: {
+      type: "object" as const,
+      properties: {
+        page: { type: "number", description: "Page number (default 1)" },
+        pageSize: { type: "number", description: "Results per page (default 50, max 100)" },
+        status: { type: "string", description: "Filter by status: active, inactive, all" },
+        createdAfter: { type: "string", description: "Filter items created after (ISO 8601)" },
+      },
+    },
+  },
+  {
+    name: "get_item",
+    description: "Get detailed information about a specific item by ID.",
+    inputSchema: {
+      type: "object" as const,
+      properties: {
+        item_id: { type: "string", description: "Item ID" },
+      },
+      required: ["item_id"],
+    },
+  },
+  {
+    name: "create_item",
+    description: "Create a new item. Returns the created item with ID.",
+    inputSchema: {
+      type: "object" as const,
+      properties: {
+        name: { type: "string", description: "Item name" },
+        description: { type: "string", description: "Item description" },
+        status: { type: "string", description: "Status: active or inactive" },
+      },
+      required: ["name"],
+    },
+  },
+];
+
+// ============================================
+// TOOL HANDLER
+// ============================================
+async function handleTool(client: MyServiceClient, name: string, args: Record<string, unknown>) {
+  switch (name) {
+    case "get_items": {
+      const { page = 1, pageSize = 50, status, createdAfter } = args;
+      const params = new URLSearchParams();
+      params.append("page", String(page));
+      params.append("pageSize", String(Math.min(Number(pageSize), 100)));
+      if (status) params.append("status", String(status));
+      if (createdAfter) params.append("createdAfter", String(createdAfter));
+      
+      return await client.get(`/items?${params}`);
+    }
+    
+    case "get_item": {
+      const { item_id } = args;
+      return await client.get(`/items/${item_id}`);
+    }
+    
+    case "create_item": {
+      const { name, description, status = "active" } = args;
+      return await client.post("/items", { name, description, status });
+    }
+    
+    default:
+      throw new Error(`Unknown tool: ${name}`);
+  }
+}
+
+// ============================================
+// SERVER SETUP
+// ============================================
+async function main() {
+  const apiKey = process.env.MY_SERVICE_API_KEY;
+  
+  if (!apiKey) {
+    console.error("Error: MY_SERVICE_API_KEY environment variable required");
+    process.exit(1);
+  }
+
+  const client = new MyServiceClient(apiKey);
+
+  const server = new Server(
+    { name: `${MCP_NAME}-mcp`, version: MCP_VERSION },
+    { capabilities: { tools: {} } }
+  );
+
+  server.setRequestHandler(ListToolsRequestSchema, async () => ({
+    tools,
+  }));
+
+  server.setRequestHandler(CallToolRequestSchema, async (request) => {
+    const { name, arguments: args } = request.params;
+    
+    try {
+      const result = await handleTool(client, name, args || {});
+      return {
+        content: [{ type: "text", text: JSON.stringify(result, null, 2) }],
+      };
+    } catch (error) {
+      const message = error instanceof Error ? error.message : String(error);
+      return {
+        content: [{ type: "text", text: `Error: ${message}` }],
+        isError: true,
+      };
+    }
+  });
+
+  const transport = new StdioServerTransport();
+  await server.connect(transport);
+  console.error(`${MCP_NAME} MCP server running on stdio`);
+}
+
+main().catch(console.error);
+```
+
+---
+
+## 3. Package.json Template
+
+```json
+{
+  "name": "mcp-server-myservice",
+  "version": "1.0.0",
+  "type": "module",
+  "main": "dist/index.js",
+  "scripts": {
+    "build": "tsc",
+    "start": "node dist/index.js",
+    "dev": "tsx src/index.ts"
+  },
+  "dependencies": {
+    "@modelcontextprotocol/sdk": "^0.5.0"
+  },
+  "devDependencies": {
+    "@types/node": "^20.10.0",
+    "tsx": "^4.7.0",
+    "typescript": "^5.3.0"
+  }
+}
+```
+
+**Key points:**
+- `"type": "module"` — Always use ESM
+- `"main": "dist/index.js"` — Points to compiled output
+- `build` → Compile TypeScript
+- `start` → Run compiled version
+- `dev` → Run directly with tsx (development)
+
+---
+
+## 4. TypeScript Config (`tsconfig.json`)
+
+```json
+{
+  "compilerOptions": {
+    "target": "ES2022",
+    "module": "ES2022",
+    "moduleResolution": "node",
+    "outDir": "./dist",
+    "rootDir": "./src",
+    "strict": true,
+    "esModuleInterop": true,
+    "skipLibCheck": true,
+    "forceConsistentCasingInFileNames": true,
+    "resolveJsonModule": true,
+    "declaration": true,
+    "declarationMap": true,
+    "sourceMap": true
+  },
+  "include": ["src/**/*"],
+  "exclude": ["node_modules", "dist"]
+}
+```
+
+---
+
+## 5. Tool Naming Conventions
+
+**Pattern:** `verb_noun` (lowercase, snake_case)
+
+**CRUD Operations:**
+- `list_contacts` → List/search with optional filters
+- `get_contact` → Get one item by ID
+- `create_contact` → Create new item
+- `update_contact` → Update existing item
+- `delete_contact` → Delete item
+
+**Other Actions:**
+- `search_contacts` → Full-text search
+- `send_email` → Action-based
+- `schedule_appointment` → Action-based
+- `export_report` → Action-based
+
+**Anti-patterns (avoid):**
+- ❌ `getContacts` (camelCase)
+- ❌ `ContactsList` (PascalCase)
+- ❌ `contacts` (no verb)
+- ❌ `list-contacts` (kebab-case)
+
+---
+
+## 6. Input Schema Best Practices
+
+### Use `type: "object" as const`
+The `as const` ensures TypeScript infers literal types correctly.
+
+### Always describe parameters
+```typescript
+properties: {
+  page: { 
+    type: "number", 
+    description: "Page number (default 1)" // ✅ Good
+  },
+  email: { 
+    type: "string" // ❌ Missing description
+  },
+}
+```
+
+### Mark required fields
+```typescript
+inputSchema: {
+  type: "object" as const,
+  properties: {
+    contact_id: { type: "string", description: "Contact ID" },
+    name: { type: "string", description: "Contact name" },
+  },
+  required: ["contact_id"], // ✅ Explicitly mark required
+}
+```
+
+### Default values in descriptions
+```typescript
+page: { type: "number", description: "Page number (default 1)" },
+pageSize: { type: "number", description: "Results per page (default 50, max 100)" },
+```
+
+### Use enums for fixed options
+```typescript
+status: { 
+  type: "string", 
+  description: "Status: active, inactive, pending",
+  enum: ["active", "inactive", "pending"] // Optional but helpful
+},
+```
+
+---
+
+## 7. API Client Patterns
+
+### OAuth Token Management (Example: ServiceTitan)
+```typescript
+class ServiceTitanClient {
+  private accessToken: string | null = null;
+  private tokenExpiry: number = 0;
+
+  async getAccessToken(): Promise<string> {
+    // Return cached token if still valid (with 5 min buffer)
+    if (this.accessToken && Date.now() < this.tokenExpiry - 300000) {
+      return this.accessToken;
+    }
+
+    // Request new token
+    const response = await fetch(AUTH_URL, {
+      method: "POST",
+      headers: { "Content-Type": "application/x-www-form-urlencoded" },
+      body: new URLSearchParams({
+        grant_type: "client_credentials",
+        client_id: this.clientId,
+        client_secret: this.clientSecret,
+      }),
+    });
+
+    const data = await response.json();
+    this.accessToken = data.access_token;
+    this.tokenExpiry = Date.now() + (data.expires_in * 1000);
+    
+    return this.accessToken!;
+  }
+
+  async request(endpoint: string, options: RequestInit = {}) {
+    const token = await this.getAccessToken();
+    // ... use token in headers
+  }
+}
+```
+
+### API Key Auth (Most Common)
+```typescript
+class MyServiceClient {
+  private apiKey: string;
+
+  async request(endpoint: string, options: RequestInit = {}) {
+    return fetch(url, {
+      ...options,
+      headers: {
+        "Authorization": `Bearer ${this.apiKey}`, // or "X-API-Key"
+        "Content-Type": "application/json",
+        ...options.headers,
+      },
+    });
+  }
+}
+```
+
+### Error Handling Pattern
+```typescript
+async request(endpoint: string, options: RequestInit = {}) {
+  const response = await fetch(url, options);
+
+  if (!response.ok) {
+    const errorText = await response.text();
+    throw new Error(
+      `API error: ${response.status} ${response.statusText} - ${errorText}`
+    );
+  }
+
+  return response.json();
+}
+```
+
+---
+
+## 8. Tool Handler Switch Pattern
+
+**Always use a switch statement** for clarity and type safety:
+
+```typescript
+async function handleTool(client: MyClient, name: string, args: Record<string, unknown>) {
+  switch (name) {
+    case "list_items": {
+      // Destructure with defaults
+      const { page = 1, pageSize = 50, status } = args;
+      
+      // Build query params
+      const params = new URLSearchParams();
+      params.append("page", String(page));
+      params.append("pageSize", String(Math.min(Number(pageSize), 100)));
+      if (status) params.append("status", String(status));
+      
+      return await client.get(`/items?${params}`);
+    }
+    
+    case "get_item": {
+      const { item_id } = args;
+      // No validation needed if required in schema
+      return await client.get(`/items/${item_id}`);
+    }
+    
+    case "create_item": {
+      const { name, description, status = "active" } = args;
+      return await client.post("/items", { name, description, status });
+    }
+    
+    default:
+      throw new Error(`Unknown tool: ${name}`);
+  }
+}
+```
+
+**Pattern notes:**
+- Use block scope `{ }` for each case
+- Destructure args with defaults
+- Build query params explicitly (type-safe)
+- Return API response directly (let caller handle formatting)
+- Always include `default` case with error
+
+---
+
+## 9. Environment Variables
+
+### Required Pattern
+```typescript
+async function main() {
+  const apiKey = process.env.MY_SERVICE_API_KEY;
+  const apiSecret = process.env.MY_SERVICE_API_SECRET;
+  
+  if (!apiKey) {
+    console.error("Error: MY_SERVICE_API_KEY environment variable required");
+    process.exit(1);
+  }
+  
+  if (!apiSecret) {
+    console.error("Error: MY_SERVICE_API_SECRET environment variable required");
+    process.exit(1);
+  }
+
+  const client = new MyServiceClient(apiKey, apiSecret);
+  // ... rest of setup
+}
+```
+
+### .env.example Template
+```bash
+# MyService MCP Server Configuration
+MY_SERVICE_API_KEY=your_api_key_here
+MY_SERVICE_API_SECRET=your_api_secret_here
+
+# Optional: Override base URL for testing
+# MY_SERVICE_BASE_URL=https://sandbox.example.com
+```
+
+**Best practices:**
+- Prefix all env vars with service name
+- Use SCREAMING_SNAKE_CASE
+- Provide `.env.example` with all required vars
+- Document what each var is for
+- Exit with clear error if missing (don't fail silently)
+
+---
+
+## 10. Pagination Handling
+
+### Standard Pattern
+```typescript
+{
+  name: "list_contacts",
+  description: "List contacts with pagination. Use page and pageSize to navigate results.",
+  inputSchema: {
+    type: "object" as const,
+    properties: {
+      page: { 
+        type: "number", 
+        description: "Page number (default 1, starts at 1)" 
+      },
+      pageSize: { 
+        type: "number", 
+        description: "Results per page (default 50, max 100)" 
+      },
+    },
+  },
+}
+```
+
+### In Tool Handler
+```typescript
+case "list_contacts": {
+  const { page = 1, pageSize = 50 } = args;
+  const params = new URLSearchParams();
+  params.append("page", String(page));
+  params.append("pageSize", String(Math.min(Number(pageSize), 100))); // Cap at API max
+  
+  return await client.get(`/contacts?${params}`);
+}
+```
+
+**Why cap pageSize:**
+- Most APIs have max limits (50, 100, 200)
+- Prevents user from requesting 10,000 results
+- Documents the actual API limitation
+
+---
+
+## 11. Error Handling (Server Level)
+
+```typescript
+server.setRequestHandler(CallToolRequestSchema, async (request) => {
+  const { name, arguments: args } = request.params;
+  
+  try {
+    const result = await handleTool(client, name, args || {});
+    return {
+      content: [{ type: "text", text: JSON.stringify(result, null, 2) }],
+    };
+  } catch (error) {
+    const message = error instanceof Error ? error.message : String(error);
+    return {
+      content: [{ type: "text", text: `Error: ${message}` }],
+      isError: true,
+    };
+  }
+});
+```
+
+**Pattern:**
+- Always wrap in try-catch
+- Extract error message safely (`error instanceof Error`)
+- Return with `isError: true`
+- Format response consistently
+
+---
+
+## 12. Common Patterns Across 30+ Servers
+
+### 1. List Operations
+- **Always support pagination:** page, pageSize
+- **Common filters:** status, createdAfter, updatedAfter, search/query
+- **Return format:** Matches API response (usually `{ data: [], total: N, page: 1 }`)
+
+### 2. Get Operations
+- **Single required param:** Usually `id`, `contact_id`, `job_id` etc.
+- **Return full object:** Don't filter fields unless API requires it
+
+### 3. Create/Update Operations
+- **Required vs optional:** Mark clearly in schema
+- **Return created object:** Include new ID in response
+- **Validation:** Let API do validation, don't duplicate
+
+### 4. Search Operations
+- Separate `search_` tool if different from `list_`
+- Support query string + optional filters
+- Document what fields are searched
+
+### 5. Date/Time Handling
+- **Always use ISO 8601:** `"2025-02-03T14:30:00Z"`
+- **Document timezone:** UTC unless specified
+- **Filter params:** `createdAfter`, `createdBefore`, `updatedAfter`, etc.
+
+---
+
+## 13. Build & Run Commands
+
+### Development
+```bash
+# Install dependencies
+npm install
+
+# Run in development (with hot reload)
+npm run dev
+
+# Build TypeScript
+npm run build
+
+# Run compiled version
+npm start
+```
+
+### Add to Claude Desktop
+```json
+{
+  "mcpServers": {
+    "my-service": {
+      "command": "node",
+      "args": ["/absolute/path/to/dist/index.js"],
+      "env": {
+        "MY_SERVICE_API_KEY": "your_key_here"
+      }
+    }
+  }
+}
+```
+
+---
+
+## 14. Testing Checklist
+
+Before considering an MCP server "done":
+
+- [ ] All tools defined with clear descriptions
+- [ ] All parameters documented
+- [ ] Required parameters marked explicitly
+- [ ] Pagination implemented (page, pageSize)
+- [ ] Error handling returns clear messages
+- [ ] Environment variables validated on startup
+- [ ] `.env.example` provided with all required vars
+- [ ] Compiles without errors (`npm run build`)
+- [ ] Runs successfully (`npm start`)
+- [ ] Test at least one tool in Claude Desktop
+- [ ] README.md with setup instructions
+
+---
+
+## 15. When to Split Into Multiple Files
+
+**Keep in one file (`src/index.ts`) if:**
+- Under 500 lines
+- Fewer than 15 tools
+- Single API client
+
+**Split into modules when:**
+- Over 500 lines
+- 20+ tools
+- Multiple API clients (different auth patterns)
+- Shared utilities across tools
+
+**Suggested structure when splitting:**
+```
+src/
+├── index.ts          # Server setup + main
+├── client.ts         # API client class
+├── tools.ts          # Tool definitions array
+├── handlers.ts       # Tool handler function
+└── types.ts          # TypeScript interfaces
+```
+
+---
+
+## 16. Modern MCP Features (Labels, Lazy Loading, Progress)
+
+### Tool Metadata & Labels
+
+**Use `_meta` to add rich metadata to tools:**
+
+```typescript
+{
+  name: "search_contacts",
+  description: "Search contacts with filters",
+  inputSchema: { /* ... */ },
+  _meta: {
+    // Human-readable labels for categorization
+    labels: {
+      category: "contacts",
+      access: "read",
+      complexity: "simple",
+    },
+    // Visibility control
+    visibility: ["app", "model"], // or just ["model"] to hide from apps
+    // UI hints (for MCP Apps)
+    ui: {
+      resourceUri: "ui://myservice/contact-search",
+      preferredPresentation: "inline", // or "modal", "sidebar"
+    },
+  },
+}
+```
+
+**Common label patterns:**
+```typescript
+// By category
+labels: { category: "contacts" | "deals" | "analytics" | "admin" }
+
+// By operation type
+labels: { access: "read" | "write" | "delete" }
+
+// By complexity (for model selection)
+labels: { complexity: "simple" | "complex" | "batch" }
+
+// By data sensitivity
+labels: { sensitivity: "public" | "internal" | "confidential" }
+```
+
+**Benefits:**
+- Hosts can filter/group tools by labels
+- Apps can query only tools they need
+- Models can prioritize simpler tools
+- Better tool discovery in Claude Desktop
+
+---
+
+### Lazy-Loaded Resources
+
+**Pattern:** Resources that load on demand (not all upfront)
+
+```typescript
+import { ListResourcesRequestSchema, ReadResourceRequestSchema } from "@modelcontextprotocol/sdk/types.js";
+
+// Register resource handlers
+server.setRequestHandler(ListResourcesRequestSchema, async () => {
+  // Return resource metadata (not content)
+  return {
+    resources: [
+      {
+        uri: "myservice://contacts/list",
+        name: "Contact List",
+        description: "All contacts in the system",
+        mimeType: "application/json",
+      },
+      {
+        uri: "myservice://analytics/dashboard",
+        name: "Analytics Dashboard",
+        description: "Real-time analytics data",
+        mimeType: "application/json",
+      },
+    ],
+  };
+});
+
+server.setRequestHandler(ReadResourceRequestSchema, async (request) => {
+  const { uri } = request.params;
+
+  switch (uri) {
+    case "myservice://contacts/list": {
+      // Fetch contacts on-demand when requested
+      const contacts = await client.get("/contacts");
+      return {
+        contents: [{
+          uri,
+          mimeType: "application/json",
+          text: JSON.stringify(contacts, null, 2),
+        }],
+      };
+    }
+
+    case "myservice://analytics/dashboard": {
+      // Fetch analytics on-demand
+      const analytics = await client.get("/analytics/dashboard");
+      return {
+        contents: [{
+          uri,
+          mimeType: "application/json",
+          text: JSON.stringify(analytics, null, 2),
+        }],
+      };
+    }
+
+    default:
+      throw new Error(`Unknown resource: ${uri}`);
+  }
+});
+```
+
+**When to use lazy-loaded resources:**
+- Large datasets that shouldn't load upfront
+- Real-time data that changes frequently
+- Expensive API calls
+- User-specific data (load per request)
+
+**Resource URI patterns:**
+- `myservice://type/identifier` — Custom scheme
+- `file:///path/to/data.json` — File system
+- `ui://myservice/component` — UI components (for MCP Apps)
+
+---
+
+### Resource Templates
+
+**Pattern:** Dynamic resource URIs with parameters
+
+```typescript
+server.setRequestHandler(ListResourcesRequestSchema, async () => {
+  return {
+    resourceTemplates: [
+      {
+        uriTemplate: "myservice://contact/{id}",
+        name: "Contact Details",
+        description: "Detailed information for a specific contact",
+        mimeType: "application/json",
+      },
+      {
+        uriTemplate: "myservice://report/{year}/{month}",
+        name: "Monthly Report",
+        description: "Monthly analytics report",
+        mimeType: "application/json",
+      },
+    ],
+  };
+});
+
+server.setRequestHandler(ReadResourceRequestSchema, async (request) => {
+  const { uri } = request.params;
+
+  // Parse template parameters
+  const contactMatch = uri.match(/^myservice:\/\/contact\/(.+)$/);
+  if (contactMatch) {
+    const contactId = contactMatch[1];
+    const contact = await client.get(`/contacts/${contactId}`);
+    return {
+      contents: [{
+        uri,
+        mimeType: "application/json",
+        text: JSON.stringify(contact, null, 2),
+      }],
+    };
+  }
+
+  const reportMatch = uri.match(/^myservice:\/\/report\/(\d{4})\/(\d{2})$/);
+  if (reportMatch) {
+    const [, year, month] = reportMatch;
+    const report = await client.get(`/reports/${year}/${month}`);
+    return {
+      contents: [{
+        uri,
+        mimeType: "application/json",
+        text: JSON.stringify(report, null, 2),
+      }],
+    };
+  }
+
+  throw new Error(`Unknown resource: ${uri}`);
+});
+```
+
+**Use cases:**
+- Per-entity detail views
+- Time-based data (reports, analytics)
+- Filtered datasets
+- Generated documents
+
+---
+
+### Progress Notifications (Long-Running Operations)
+
+**Pattern:** Send progress updates for slow operations
+
+```typescript
+import { 
+  CallToolRequestSchema,
+  LoggingLevel,
+  ProgressNotificationSchema 
+} from "@modelcontextprotocol/sdk/types.js";
+
+server.setRequestHandler(CallToolRequestSchema, async (request) => {
+  const { name, arguments: args, _meta } = request.params;
+
+  if (name === "import_contacts") {
+    const { fileUrl } = args;
+    
+    // Send progress notifications
+    const progressToken = _meta?.progressToken;
+    
+    if (progressToken) {
+      // Download file
+      await server.notification({
+        method: "notifications/progress",
+        params: {
+          progressToken,
+          progress: 0.2,
+          total: 1.0,
+        },
+      });
+
+      // Parse file
+      await server.notification({
+        method: "notifications/progress",
+        params: {
+          progressToken,
+          progress: 0.5,
+          total: 1.0,
+        },
+      });
+
+      // Import records
+      await server.notification({
+        method: "notifications/progress",
+        params: {
+          progressToken,
+          progress: 0.8,
+          total: 1.0,
+        },
+      });
+    }
+
+    // Do the actual work
+    const result = await importContactsFromFile(fileUrl);
+
+    // Final progress
+    if (progressToken) {
+      await server.notification({
+        method: "notifications/progress",
+        params: {
+          progressToken,
+          progress: 1.0,
+          total: 1.0,
+        },
+      });
+    }
+
+    return {
+      content: [{ 
+        type: "text", 
+        text: `Imported ${result.count} contacts successfully` 
+      }],
+    };
+  }
+  
+  // ... other tools
+});
+```
+
+**When to use progress notifications:**
+- Operations taking >5 seconds
+- Multi-step workflows
+- File uploads/downloads
+- Batch operations
+- Data imports/exports
+
+---
+
+### Logging for Debugging
+
+**Pattern:** Send structured logs to host
+
+```typescript
+// In tool handler
+try {
+  await server.notification({
+    method: "notifications/message",
+    params: {
+      level: LoggingLevel.Info,
+      logger: "myservice",
+      data: {
+        operation: "create_contact",
+        contactId: newContact.id,
+        timestamp: new Date().toISOString(),
+      },
+    },
+  });
+
+  const result = await client.post("/contacts", data);
+  
+  return { content: [{ type: "text", text: JSON.stringify(result) }] };
+  
+} catch (error) {
+  await server.notification({
+    method: "notifications/message",
+    params: {
+      level: LoggingLevel.Error,
+      logger: "myservice",
+      data: {
+        operation: "create_contact",
+        error: error.message,
+        timestamp: new Date().toISOString(),
+      },
+    },
+  });
+  
+  throw error;
+}
+```
+
+**Log levels:**
+- `LoggingLevel.Debug` — Detailed debug info
+- `LoggingLevel.Info` — Informational messages
+- `LoggingLevel.Warning` — Warning conditions
+- `LoggingLevel.Error` — Error conditions
+- `LoggingLevel.Critical` — Critical failures
+
+---
+
+### Prompts (for Auto-Completion)
+
+**Pattern:** Provide predefined prompt templates
+
+```typescript
+import { ListPromptsRequestSchema, GetPromptRequestSchema } from "@modelcontextprotocol/sdk/types.js";
+
+server.setRequestHandler(ListPromptsRequestSchema, async () => {
+  return {
+    prompts: [
+      {
+        name: "analyze_pipeline",
+        description: "Analyze sales pipeline health and suggest actions",
+        arguments: [
+          {
+            name: "pipelineId",
+            description: "Pipeline ID to analyze",
+            required: false,
+          },
+        ],
+      },
+      {
+        name: "contact_summary",
+        description: "Generate comprehensive contact summary with activity history",
+        arguments: [
+          {
+            name: "contactId",
+            description: "Contact ID",
+            required: true,
+          },
+        ],
+      },
+    ],
+  };
+});
+
+server.setRequestHandler(GetPromptRequestSchema, async (request) => {
+  const { name, arguments: args } = request.params;
+
+  switch (name) {
+    case "analyze_pipeline": {
+      const { pipelineId } = args || {};
+      
+      // Fetch data for the prompt
+      const pipeline = pipelineId 
+        ? await client.get(`/pipelines/${pipelineId}`)
+        : await client.get("/pipelines/main");
+      
+      const opportunities = await client.get(`/opportunities?pipelineId=${pipeline.id}`);
+
+      return {
+        description: `Analyzing pipeline: ${pipeline.name}`,
+        messages: [
+          {
+            role: "user",
+            content: {
+              type: "text",
+              text: `Please analyze this sales pipeline and suggest actions:\n\n${JSON.stringify({ pipeline, opportunities }, null, 2)}`,
+            },
+          },
+        ],
+      };
+    }
+
+    case "contact_summary": {
+      const { contactId } = args;
+      const contact = await client.get(`/contacts/${contactId}`);
+      const activities = await client.get(`/contacts/${contactId}/activities`);
+
+      return {
+        description: `Summary for ${contact.name}`,
+        messages: [
+          {
+            role: "user",
+            content: {
+              type: "text",
+              text: `Generate a comprehensive summary of this contact:\n\nContact: ${JSON.stringify(contact, null, 2)}\n\nRecent Activity: ${JSON.stringify(activities, null, 2)}`,
+            },
+          },
+        ],
+      };
+    }
+
+    default:
+      throw new Error(`Unknown prompt: ${name}`);
+  }
+});
+```
+
+**Use cases:**
+- Common analysis workflows
+- Report generation
+- Data summarization
+- Quick actions for users
+
+---
+
+### Roots Listing (for File Systems)
+
+**Pattern:** List root directories/containers
+
+```typescript
+import { ListRootsRequestSchema } from "@modelcontextprotocol/sdk/types.js";
+
+server.setRequestHandler(ListRootsRequestSchema, async () => {
+  return {
+    roots: [
+      {
+        uri: "myservice://workspaces/",
+        name: "All Workspaces",
+      },
+      {
+        uri: "myservice://contacts/",
+        name: "Contacts Database",
+      },
+      {
+        uri: "myservice://reports/",
+        name: "Reports Archive",
+      },
+    ],
+  };
+});
+```
+
+**When to use roots:**
+- File system-like data structures
+- Multiple top-level containers
+- Workspace/project organization
+- Document hierarchies
+
+---
+
+### Sampling (for AI Completions)
+
+**Pattern:** Request LLM completions from the host
+
+```typescript
+// NOTE: Most servers don't need this - it's for servers that help with AI tasks
+import { CreateMessageRequestSchema } from "@modelcontextprotocol/sdk/types.js";
+
+// If your server needs to request completions FROM the model
+// (rare - usually the model calls your tools, not the other way around)
+server.setRequestHandler(CreateMessageRequestSchema, async (request) => {
+  const { messages, maxTokens } = request.params;
+  
+  // This would be handled by the HOST, not your server
+  // Only implement if your server orchestrates AI workflows
+  throw new Error("Sampling not implemented");
+});
+```
+
+**When to use:** Almost never. Only for meta-servers that orchestrate AI workflows.
+
+---
+
+## 17. Resources & References
+
+- **MCP SDK Docs:** https://modelcontextprotocol.io
+- **Example Servers:** `/Users/jakeshore/.clawdbot/workspace/mcp-diagrams/mcp-servers/`
+- **Tool Best Practices:** https://modelcontextprotocol.io/docs/tools
+- **Schema Validation:** Use Zod if complex validation needed
+- **Progress Notifications:** https://modelcontextprotocol.io/docs/concepts/progress
+- **Resources Guide:** https://modelcontextprotocol.io/docs/concepts/resources
+
+---
+
+## 18. Quick Start Command
+
+```bash
+# Create new MCP server
+mkdir mcp-server-myservice
+cd mcp-server-myservice
+
+# Init package.json
+npm init -y
+
+# Install deps
+npm install @modelcontextprotocol/sdk
+npm install -D typescript @types/node tsx
+
+# Create structure
+mkdir src dist
+touch src/index.ts tsconfig.json .env.example .gitignore
+
+# Copy template from this skill into src/index.ts
+# Configure package.json scripts
+# Configure tsconfig.json
+# Build and test
+```
+
+---
+
+**Summary:**
+- One-file pattern for most servers
+- Clear tool naming (verb_noun)
+- Comprehensive input schemas with descriptions
+- Tool metadata with labels for categorization
+- Lazy-loaded resources for on-demand data
+- Progress notifications for long operations
+- Structured logging for debugging
+- Prompts for common workflows
+- Environment variable validation
+- Consistent error handling
+- Pagination support
+- ISO 8601 dates
+- Test before shipping
+
+This skill captures patterns from 30+ production MCP servers plus modern MCP features (labels, lazy loading, progress, prompts). Follow these and you'll get it right on attempt 1.
diff --git a/skills/mcp-skill/.clawdhub/origin.json b/skills/mcp-skill/.clawdhub/origin.json
new file mode 100644
index 0000000..2e75374
--- /dev/null
+++ b/skills/mcp-skill/.clawdhub/origin.json
@@ -0,0 +1,7 @@
+{
+  "version": 1,
+  "registry": "https://clawhub.ai",
+  "slug": "mcp-skill",
+  "installedVersion": "1.0.0",
+  "installedAt": 1770110462685
+}
diff --git a/skills/mcp-skill/README.md b/skills/mcp-skill/README.md
new file mode 100644
index 0000000..2503bcc
--- /dev/null
+++ b/skills/mcp-skill/README.md
@@ -0,0 +1 @@
+MCP Skill
diff --git a/skills/mcp-skill/SKILL.md b/skills/mcp-skill/SKILL.md
new file mode 100644
index 0000000..70e49be
--- /dev/null
+++ b/skills/mcp-skill/SKILL.md
@@ -0,0 +1,14 @@
+# MCP Skill
+
+This skill wraps the MCP at https://mcp.exa.ai/mcp for various tools such as web search, deep research, and more.
+
+## Tools Included
+- web_search_exa
+- web_search_advanced_exa
+- get_code_context_exa
+- deep_search_exa
+- crawling_exa
+- company_research_exa
+- linkedin_search_exa
+- deep_researcher_start
+- deep_researcher_check