mirror of https://github.com/tiennm99/claude-central-gateway.git synced 2026-04-17 13:20:56 +00:00

Files

tiennm99 170cdb1324 docs: Add comprehensive documentation suite

- Project overview, system architecture, code standards
- API reference with 15+ examples
- Quick start guide with troubleshooting
- Updated README with feature highlights and compatibility matrix

2026-04-05 11:47:18 +07:00

4.7 KiB

Raw Permalink Blame History

Quick Start Guide

1-Minute Setup

Prerequisites

OpenAI API key (get from platform.openai.com)
Vercel account (optional, for deployment)
Claude Code IDE

Deploy to Vercel

Click the button in the README or:

git clone https://github.com/tiennm99/claude-central-gateway
cd claude-central-gateway
npm install
vercel

Configure Environment Variables

In Vercel Dashboard:

Select your project → Settings → Environment Variables
Add:
- GATEWAY_TOKEN: my-secret-token-abc123def456 (generate a random string)
- OPENAI_API_KEY: Your OpenAI API key (starts with sk-proj-)
- MODEL_MAP: (Optional) claude-sonnet-4-20250514:gpt-4o

Configure Claude Code

Set two environment variables:

export ANTHROPIC_BASE_URL=https://your-project.vercel.app
export ANTHROPIC_AUTH_TOKEN=my-secret-token-abc123def456

Then run Claude Code:

claude

That's it! Claude Code now routes through your gateway.

Verify It Works

Test with curl

curl -X POST https://your-project.vercel.app/v1/messages \
  -H "x-api-key: my-secret-token-abc123def456" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-20250514",
    "max_tokens": 100,
    "messages": [
      {"role": "user", "content": "Say hello!"}
    ]
  }'

Expected response:

{
  "id": "msg_...",
  "type": "message",
  "role": "assistant",
  "content": [
    {"type": "text", "text": "Hello! How can I help you?"}
  ],
  "stop_reason": "end_turn",
  "usage": {"input_tokens": 10, "output_tokens": 7}
}

Health Check

curl https://your-project.vercel.app/

Response:

{
  "status": "ok",
  "name": "Claude Central Gateway"
}

Alternative Deployments

Cloudflare Workers

npm install
npm run deploy:cf

Then set environment variables in wrangler.toml or Cloudflare dashboard.

Local Development

npm install
npm run dev

Gateway runs on http://localhost:5173.

Model Mapping Examples

Mapping to cheaper models:

MODEL_MAP=claude-sonnet-4-20250514:gpt-4-mini,claude-opus:gpt-4-turbo

Single mapping:

MODEL_MAP=claude-sonnet-4-20250514:gpt-4o

No mapping (pass through): Leave MODEL_MAP empty; model names are used as-is (may fail if OpenAI doesn't recognize them).

Troubleshooting

"Unauthorized" Error (401)

Check GATEWAY_TOKEN is set and matches your client's ANTHROPIC_AUTH_TOKEN
Verify header is x-api-key (case-sensitive)

"Not found" Error (404)

Only /v1/messages endpoint is implemented
Health check at / should return 200

OpenAI API Errors (5xx)

Check OPENAI_API_KEY is valid and has available credits
Check MODEL_MAP points to valid OpenAI models
Monitor OpenAI dashboard for rate limits

Streaming not working

Ensure client sends "stream": true in request
Check response has Content-Type: text/event-stream header
Verify client supports Server-Sent Events

Next Steps

Read the API Reference for complete endpoint documentation
Review System Architecture to understand how it works
Set up monitoring for OpenAI API usage and costs
Rotate GATEWAY_TOKEN periodically for security

Cost Optimization Tips

Use MODEL_MAP to route to cheaper models:

MODEL_MAP=claude-sonnet-4-20250514:gpt-4-mini

Set conservative max_tokens limits in Claude Code settings
Monitor OpenAI API dashboard weekly for unexpected usage spikes
Consider usage alerts in OpenAI dashboard

FAQ

Q: Is my token exposed if I use the hosted version? A: The gateway is stateless; tokens are compared server-side. Use a strong random token (32+ characters) and rotate periodically.

Q: Can multiple machines use the same gateway? A: Yes, they all share the same GATEWAY_TOKEN and cost. Not suitable for multi-user scenarios.

Q: What if OpenAI API goes down? A: Gateway will return a 500 error. No built-in fallback or retry logic.

Q: Does the gateway log my requests? A: Hono middleware logs request method/path/status. Request bodies are not logged by default.

Q: Can I use this with other LLM providers? A: Only if they support OpenAI's Chat Completions API format. See penny-pincher-provider for compatible providers.

Q: How do I update the gateway? A: Pull latest changes and redeploy:

git pull origin main
vercel

Getting Help

API questions: See API Reference
Architecture questions: See System Architecture
Issues: Open a GitHub issue with details about your setup and error logs

4.7 KiB Raw Permalink Blame History