# LLM Gateway
Route requests to OpenAI, Anthropic, Google Gemini, and other LLM providers through a unified OpenAI-compatible API.
## What you’ll build
In this tutorial, you’ll:
- Configure Agent Gateway as an LLM proxy
- Connect to your preferred LLM provider (OpenAI, Anthropic, Gemini, etc.)
- Route requests through a unified OpenAI-compatible API
- Optionally set up header-based routing to multiple providers
## Single Provider
### Step 1: Install Agent Gateway

```bash
curl -sL https://raw.githubusercontent.com/agentgateway/agentgateway/refs/heads/main/common/scripts/get-agentgateway | bash
```
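Before moving on, you can confirm the binary is available (the install script's target directory can vary, so make sure it ends up on your `PATH`):

```bash
# Print the install location; if nothing prints, add the
# installer's target directory to your PATH first
which agentgateway
```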
### Step 2: Choose your LLM provider

Each subsection below shows a complete setup for one provider; follow the one that matches yours.

#### OpenAI

**Set your API key**

```bash
export OPENAI_API_KEY=your-api-key
```

**Create the config**
```bash
cat > config.yaml << 'EOF'
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - ai:
          name: openai
          provider:
            openAI:
              model: gpt-4.1-nano
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
        backendAuth:
          key: "$OPENAI_API_KEY"
EOF
```

**Start Agent Gateway**
```bash
agentgateway -f config.yaml
```
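Leave the gateway running in this terminal. From another shell you can check that it came up; the admin UI used later in this tutorial listens on port 15000, so a 200 here means the process is serving (a quick probe, assuming the default admin port):

```bash
# Expect 200 once the gateway's admin UI is up
curl -s -o /dev/null -w '%{http_code}\n' http://localhost:15000/ui/
```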
**Test the API**

```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4.1-nano",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```
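Since the gateway exposes the OpenAI-compatible Chat Completions API, standard request options pass through unchanged. For example, a streaming variant of the same request (a sketch; assumes the upstream model supports the `stream` flag):

```bash
# Same request, but stream the reply as server-sent events
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4.1-nano",
    "stream": true,
    "messages": [{"role": "user", "content": "Count to three."}]
  }'
```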
#### Anthropic

**Set your API key**

```bash
export ANTHROPIC_API_KEY=your-api-key
```

**Create the config**
```bash
cat > config.yaml << 'EOF'
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - ai:
          name: anthropic
          provider:
            anthropic:
              model: claude-sonnet-4-20250514
          routes:
            /v1/messages: messages
            /v1/chat/completions: completions
            /v1/models: passthrough
            "*": passthrough
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
        backendAuth:
          key: "$ANTHROPIC_API_KEY"
EOF
```

**Start Agent Gateway**
```bash
agentgateway -f config.yaml
```

**Test the API**
```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-20250514",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```
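The `routes` table in this config also maps `/v1/messages` to Anthropic's native Messages endpoint, so Anthropic-shaped requests should work through the same port. A sketch, assuming the gateway injects the API key and any required Anthropic headers via `backendAuth`:

```bash
# Native Anthropic Messages API call through the gateway
# (the Messages API requires max_tokens)
curl http://localhost:3000/v1/messages \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-20250514",
    "max_tokens": 256,
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```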
#### Google Gemini

**Set your API key**

```bash
export GEMINI_API_KEY=your-api-key
```

**Create the config**
```bash
cat > config.yaml << 'EOF'
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - ai:
          name: gemini
          provider:
            gemini:
              model: gemini-2.0-flash
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
        backendAuth:
          key: "$GEMINI_API_KEY"
EOF
```

**Start Agent Gateway**
```bash
agentgateway -f config.yaml
```

**Test the API**
```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-2.0-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```
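The full JSON response is verbose. If you have `jq` installed, you can extract just the reply text; this relies only on the OpenAI-compatible response shape shown at the end of this section:

```bash
# Print only the assistant's reply from the OpenAI-style response
curl -s http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-2.0-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }' | jq -r '.choices[0].message.content'
```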
#### xAI

**Set your API key**

```bash
export XAI_API_KEY=your-api-key
```

**Create the config**
```bash
cat > config.yaml << 'EOF'
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - ai:
          name: xai
          hostOverride: api.x.ai:443
          provider:
            openAI:
              model: grok-4-latest
          routes:
            /v1/chat/completions: completions
            /v1/models: models
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
        backendTLS: {}
        backendAuth:
          key: "$XAI_API_KEY"
EOF
```

**Start Agent Gateway**
```bash
agentgateway -f config.yaml
```

**Test the API**
```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "grok-4-latest",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```
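Because this config maps `/v1/models` in the route table, you can also list the models xAI exposes through the same gateway endpoint:

```bash
# List available models via the mapped /v1/models route
curl http://localhost:3000/v1/models
```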
#### AWS Bedrock

**Configure AWS credentials**

```bash
aws configure
```

**Create the config**
```bash
cat > config.yaml << 'EOF'
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - ai:
          name: bedrock
          provider:
            bedrock:
              region: us-west-2
              model: amazon.nova-lite-v1:0
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
EOF
```

**Start Agent Gateway**
```bash
agentgateway -f config.yaml
```

**Test the API**
```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "amazon.nova-lite-v1:0",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```
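Note there is no `backendAuth` block here: the Bedrock backend relies on the AWS credentials you configured above. If the test request fails with an authentication or access error, check which identity those credentials resolve to, and make sure model access for `amazon.nova-lite-v1:0` is enabled for your account in `us-west-2`:

```bash
# Show the AWS identity the gateway's requests will run as
aws sts get-caller-identity
```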
#### Azure OpenAI

**Set your credentials**

```bash
export AZURE_CLIENT_SECRET=your-client-secret
```

**Create the config**
```bash
cat > config.yaml << 'EOF'
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - ai:
          name: azure
          hostOverride: "your-resource.openai.azure.com:443"
          provider:
            openAI:
              model: gpt-4
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
        backendAuth:
          azure:
            explicitConfig:
              clientSecret:
                tenantId: "your-tenant-id"
                clientId: "your-client-id"
                clientSecret: "$AZURE_CLIENT_SECRET"
EOF
```

**Start Agent Gateway**
```bash
agentgateway -f config.yaml
```

**Test the API**
```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```

#### Ollama

**Start Ollama first**
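If the model isn't available locally yet, pull it before starting the server (this config uses `llama3`):

```bash
# Download the llama3 model so Ollama can serve the gateway's requests
ollama pull llama3
```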
```bash
ollama serve
```

**Create the config**
```bash
cat > config.yaml << 'EOF'
binds:
- port: 3000
  listeners:
  - routes:
    - policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
      backends:
      - ai:
          name: ollama
          hostOverride: localhost:11434
          provider:
            openAI:
              model: llama3
EOF
```

**Start Agent Gateway**
```bash
agentgateway -f config.yaml
```

**Test the API**
```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```

You should see a response like:

```json
{
  "id": "chatcmpl-...",
  "object": "chat.completion",
  "choices": [{
    "message": {
      "role": "assistant",
      "content": "Hello! How can I help you today?"
    }
  }]
}
```

## Multiple Providers
Route to different LLM providers based on a header. This lets you switch providers without changing your application code.
### Step 1: Set your API keys

```bash
export OPENAI_API_KEY=your-openai-key
export ANTHROPIC_API_KEY=your-anthropic-key
export GEMINI_API_KEY=your-gemini-key
```

### Step 2: Create the config
```bash
cat > config.yaml << 'EOF'
binds:
- port: 3000
  listeners:
  - routes:
    - name: anthropic
      matches:
      - path:
          pathPrefix: /
        headers:
        - name: x-provider
          value:
            exact: anthropic
      backends:
      - ai:
          name: anthropic
          provider:
            anthropic:
              model: claude-sonnet-4-20250514
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
        backendAuth:
          key: "$ANTHROPIC_API_KEY"
    - name: gemini
      matches:
      - path:
          pathPrefix: /
        headers:
        - name: x-provider
          value:
            exact: gemini
      backends:
      - ai:
          name: gemini
          provider:
            gemini:
              model: gemini-2.0-flash
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
        backendAuth:
          key: "$GEMINI_API_KEY"
    - name: openai-default
      backends:
      - ai:
          name: openai
          provider:
            openAI:
              model: gpt-4.1-nano
      policies:
        cors:
          allowOrigins: ["*"]
          allowHeaders: ["*"]
        backendAuth:
          key: "$OPENAI_API_KEY"
EOF
```

### Step 3: Start Agent Gateway
```bash
agentgateway -f config.yaml
```

### Step 4: Open the UI
Go to http://localhost:15000/ui/ and click Routes to see your configured providers:

The UI shows:
- anthropic - Routes requests with the `x-provider: anthropic` header
- gemini - Routes requests with the `x-provider: gemini` header
- openai-default - Default route for all other requests
Click Backends to see all configured AI providers:

The Backends page shows:
- 3 total backends with 3 AI providers configured
- Each backend displays the provider, model, and policies
- You can see the route each backend is associated with
### Step 5: Test each provider

Use Anthropic:

```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "x-provider: anthropic" \
  -d '{"model": "claude-sonnet-4-20250514", "messages": [{"role": "user", "content": "Hello!"}]}'
```

Use Gemini:
```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "x-provider: gemini" \
  -d '{"model": "gemini-2.0-flash", "messages": [{"role": "user", "content": "Hello!"}]}'
```

Use OpenAI (default, no header needed):
```bash
curl http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4.1-nano", "messages": [{"role": "user", "content": "Hello!"}]}'
```