Routing-based configuration for LLMs

Agentgateway offers two ways to configure LLM providers, each optimized for different use cases.

LLM-based configuration
Routing-based configuration

Choosing between the two

Use this decision tree to choose the right configuration mode.

ℹ️

You can use both configuration modes in the same file if needed, but typically one mode is sufficient for most use cases.

Question	Answer	Recommendation
Are you only routing to LLM providers?	Yes	LLM-based configuration
Do you need model-based routing with header matching?	Yes	LLM-based configuration
Do you need custom, path-based routing (e.g., `/openai`, `/anthropic`)?	Yes	Routing-based configuration
Do you need to route to non-LLM backends?	Yes	Routing-based configuration
Do you need multiple listeners on different ports?	Yes	Routing-based configuration

Simplified LLM configuration

The simplified llm configuration is designed specifically for LLM use cases. Use this approach when your primary goal is to route traffic to LLM providers.

In general, the docs use the simplified LLM configuration.

About

When to use the simplified LLM configuration:

You are building an LLM gateway or AI proxy.
You need to route requests to one or more LLM providers.
You want model-based routing with header matching.
You need LLM-specific policies like JWT authentication or authorization.

The benefits of this approach are:

Concise: Less configuration needed for common LLM scenarios.
Model-centric: Focus on LLM models rather than HTTP routing.
LLM policies: Built-in support for JWT auth and authorization rules.
Easy multi-provider: Simple syntax for routing to multiple providers.

Example

# yaml-language-server: $schema=https://agentgateway.dev/schema/config

llm:
  models:
  - name: "*"
    provider: openAI
    params:
      apiKey: "$OPENAI_API_KEY"

Advanced example with policies and matching

# yaml-language-server: $schema=https://agentgateway.dev/schema/config

llm:
  policies:
    jwtAuth:
      issuer: agentgateway.dev
      audiences: [api.example.com]
      jwks:
        file: ./public-key.json
    authorization:
      rules:
      - 'jwt.email.endsWith("@example.com")'
  models:
  - name: claude-haiku
    provider: anthropic
    params:
      model: claude-3-5-haiku-20241022
      apiKey: "$ANTHROPIC_API_KEY"
    matches:
    - headers:
      - name: x-org
        value:
          exact: engineering
  - name: gpt-4
    provider: openAI
    params:
      model: gpt-4o
      apiKey: "$OPENAI_API_KEY"

Traditional HTTP routing configuration

The traditional binds/listeners/routes configuration provides full control over HTTP routing. Use this approach when you need advanced HTTP routing capabilities or non-LLM backends.

About

When to use the traditional routing-based configuration:

You need complex HTTP routing based on paths, methods, or query parameters.
You are routing to non-LLM backends alongside LLM providers.
You need fine-grained control over listeners and ports.
You require advanced HTTP policies like CORS, rate limiting, or transformations.

The benefits of this approach are:

Flexible routing: Full HTTP routing capabilities with path, method, query, and header matching.
Mixed backends: Route to both LLM and non-LLM backends in the same configuration.
HTTP policies: Access to all HTTP-level policies like CORS, rate limiting, and transformations.
Multiple listeners: Configure different ports and protocols.

Example

# yaml-language-server: $schema=https://agentgateway.dev/schema/config

binds:
- port: 3000
  listeners:
  - protocol: HTTP
    routes:
    - backends:
      - ai:
          name: openai
          provider:
            openAI:
              model: gpt-3.5-turbo
      policies:
        backendAuth:
          key: "$OPENAI_API_KEY"

Advanced example with HTTP routing

# yaml-language-server: $schema=https://agentgateway.dev/schema/config

binds:
- port: 3000
  listeners:
  - protocol: HTTP
    routes:
    # Route OpenAI requests
    - name: openai-route
      matches:
      - path:
          pathPrefix: /openai
      backends:
      - ai:
          name: openai
          provider:
            openAI:
              model: gpt-4o
      policies:
        backendAuth:
          key: "$OPENAI_API_KEY"
    # Route Anthropic requests
    - name: anthropic-route
      matches:
      - path:
          pathPrefix: /anthropic
      backends:
      - ai:
          name: anthropic
          provider:
            anthropic:
              model: claude-3-5-haiku-20241022
      policies:
        backendAuth:
          key: "$ANTHROPIC_API_KEY"
    # Non-LLM backend
    - name: api-route
      matches:
      - path:
          pathPrefix: /api
      backends:
      - http:
          host: api.example.com:443

Content safety and PII protection

Routing-based configuration for LLMs

Choosing between the two

Simplified LLM configuration

About

Example

Advanced example with policies and matching

Traditional HTTP routing configuration

About

Example

Advanced example with HTTP routing

What could be improved?