5.9 KiB

Raw Blame History

Monitor Mode - Key Findings

Test Results Summary

Test 1: Simple Query with Monitor Mode

Command:

./dist/index.js --monitor --debug "What is 2+2? Answer in one sentence."

Log File: logs/claudish_2025-11-10_14-05-42.log

Key Discoveries

1. Claude Code Protocol Structure

Claude Code makes multiple API calls in sequence:

Warmup Call (Haiku 4.5) - Fast model for planning
- Model: claude-haiku-4-5-20251001
- Purpose: Initial context loading and warmup
- Contains full system prompts and project context
Main Call (Sonnet 4.5) - Primary model for execution
- Model: claude-sonnet-4-5-20250929
- Purpose: Actual task execution
- Receives tools and can execute them
Tool Execution Calls (when needed)
- Subsequent calls with tool results
- Streams responses back

2. Request Headers

Claude Code sends comprehensive metadata:

{
  "anthropic-beta": "claude-code-20250219,interleaved-thinking-2025-05-14,fine-grained-tool-streaming-2025-05-14",
  "anthropic-dangerous-direct-browser-access": "true",
  "anthropic-version": "2023-06-01",
  "user-agent": "claude-cli/2.0.36 (external, cli)",
  "x-api-key": "sk-ant-api03-...",
  "x-app": "cli",
  "x-stainless-arch": "arm64",
  "x-stainless-helper-method": "stream",
  "x-stainless-lang": "js",
  "x-stainless-os": "MacOS",
  "x-stainless-package-version": "0.68.0",
  "x-stainless-runtime": "node",
  "x-stainless-runtime-version": "v24.3.0",
  "x-stainless-timeout": "600"
}

Header Notes:

x-stainless-* headers are set by Claude Code's Anthropic SDK (generated by Stainless)
x-stainless-timeout: 600 = 10 minutes (per API call timeout, set by SDK, not configurable)

Key Beta Features:

claude-code-20250219 - Claude Code specific features
interleaved-thinking-2025-05-14 - Thinking mode support
fine-grained-tool-streaming-2025-05-14 - Streaming tool calls

3. Prompt Caching

Claude Code uses extensive prompt caching with cache_control:

{
  "type": "text",
  "text": "...",
  "cache_control": {
    "type": "ephemeral"
  }
}

Caching is applied to:

System prompts
Project context (CLAUDE.md)
Tool definitions
Large context blocks

4. System Prompt Structure

The system prompt includes:

Identity
- "You are Claude Code, Anthropic's official CLI for Claude."
Agent-Specific Instructions
- Different instructions for different agent types
- File search specialist, code reviewer, etc.
Project Context
- Full CLAUDE.md contents
- Wrapped in <system-reminder> tags

Environment Information

Working directory: /path/to/claude-code/mcp/claudish
Platform: darwin
OS Version: Darwin 25.1.0
Today's date: 2025-11-11

Git Status
- Current branch
- Modified files
- Recent commits

5. Tool Definitions

Claude Code provides these tools:

Task - Launch specialized agents
Bash - Execute shell commands
Glob - File pattern matching
Grep - Content search
Read - Read files
Edit - Edit files
Write - Write files
NotebookEdit - Edit Jupyter notebooks
WebFetch - Fetch web content
WebSearch - Search the web
BashOutput - Get output from background shells
KillShell - Kill background shells
Skill - Execute skills
SlashCommand - Execute slash commands

Each tool has complete JSON Schema definitions with detailed descriptions and parameter specifications.

Current Issues

Issue 1: API Key Authentication

Problem: When ANTHROPIC_API_KEY is set to a placeholder (for OpenRouter mode), Claude Code sends that placeholder to Anthropic, which rejects it:

{
  "type": "error",
  "error": {
    "type": "authentication_error",
    "message": "invalid x-api-key"
  }
}

Root Cause: Our OpenRouter mode requires ANTHROPIC_API_KEY=sk-ant-api03-placeholder to prevent Claude Code from showing a dialog, but this same placeholder is used in monitor mode.

Solution Options:

Option A: Don't set ANTHROPIC_API_KEY when using monitor mode
- Pros: Claude Code uses its native authentication
- Cons: May show dialog if not authenticated
Option B: Detect monitor mode in CLI and skip API key validation
- Pros: Clean user experience
- Cons: User still needs to be authenticated with Claude Code
Option C: Allow users to provide their own Anthropic key for monitor mode
- Pros: Explicit control
- Cons: Extra setup step

Recommended: Option B - Skip ANTHROPIC_API_KEY validation in monitor mode, let Claude Code handle authentication naturally.

Next Steps

✅ Fix API key handling in monitor mode
⏳ Test with real Claude Code authentication
⏳ Document tool execution patterns
⏳ Document streaming response format
⏳ Test with complex multi-tool scenarios
⏳ Create comprehensive protocol documentation

Monitor Mode Implementation Status

✅ Working:

API key extraction from headers
Request logging (full JSON)
Headers logging (complete metadata)
Pass-through proxy to Anthropic
Token counting endpoint support

⏳ To Verify:

Streaming response logging
Tool call/result patterns
Multi-turn conversations
Error handling

❌ Issues:

API key placeholder rejection (fixable)

Insights for Proxy Implementation

From these logs, we learned:

Streaming is Always Used - Claude Code requests stream: true by default
Prompt Caching is Critical - Extensive use of ephemeral caching
Beta Features Required - Must support claude-code-20250219 beta
Tool Streaming is Fine-Grained - Uses fine-grained-tool-streaming-2025-05-14
Thinking Mode - Supports interleaved-thinking-2025-05-14
Multiple Models - Haiku for warmup, Sonnet for execution
Rich Metadata - Extensive headers for tracking and debugging

Last Updated: 2025-11-10 Log File: logs/claudish_2025-11-10_14-05-42.log

5.9 KiB Raw Blame History