# Use with Third-Party Providers
Connect Chimera to any LLM backend — cloud APIs, local servers, or
OpenAI-compatible endpoints — by setting a few environment variables or
passing arguments to `create_provider()`.
## How Provider Detection Works

`chimera.create_provider()` resolves a provider in this order:

1. **Explicit `provider_type`** — if you pass `"anthropic"`, `"openai"`, `"ollama"`, `"compatible"`, etc., that is used directly.
2. **Model name prefix** — `claude-*` maps to Anthropic, `gpt-*`/`o1-*`/`o3-*` to OpenAI, `gemini-*` to Google, and `llama*`/`mistral*`/`qwen*`/`phi*` to Ollama.
3. **Provider catalog** — checks the built-in model catalog.
4. **Environment variables** — if `ANTHROPIC_BASE_URL` or `ANTHROPIC_AUTH_TOKEN` is set, falls back to the Anthropic provider. If `OPENAI_API_KEY` is set, falls back to OpenAI.

The model name itself comes from the `model=` argument, or from the
`ANTHROPIC_MODEL` / `OPENAI_MODEL` environment variable when omitted.
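As a quick illustration of the first two rules (a minimal sketch; the model ids are placeholders and assume the matching API keys are set in the environment):

```python
import chimera

# Rule 1: an explicit provider type wins over everything else
openai_provider = chimera.create_provider("openai", model="gpt-4o")

# Rule 2: the "claude-" prefix routes to the Anthropic provider
claude_provider = chimera.create_provider(model="claude-sonnet-4-20250514")
```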
## GLM-5 via api.z.ai

Set the three environment variables that the Anthropic provider reads:

```bash
export ANTHROPIC_BASE_URL="https://api.z.ai/api/anthropic"
export ANTHROPIC_AUTH_TOKEN="your-token"
export ANTHROPIC_MODEL="glm-5"
```

Then create a provider with zero arguments:
```python
import chimera

provider = chimera.create_provider()  # picks up all three env vars
```

Or pass the model explicitly and let the env vars supply the rest:

```python
provider = chimera.create_provider(model="glm-5")
```

Because `glm-5` does not match any known prefix, the factory sees
`ANTHROPIC_BASE_URL` in the environment and routes to the Anthropic provider
with the custom base URL.
## OpenRouter

OpenRouter exposes an Anthropic-compatible API, so the same env vars apply:

```bash
export ANTHROPIC_BASE_URL="https://openrouter.ai/api"
export ANTHROPIC_AUTH_TOKEN="your-openrouter-key"
export ANTHROPIC_MODEL="anthropic/claude-sonnet-4-20250514"
```

```python
import chimera

provider = chimera.create_provider()
```

Model names use OpenRouter's `provider/model` format (e.g.
`google/gemini-2.0-flash`, `meta-llama/llama-3-70b`). Any model listed on
OpenRouter works — the Anthropic provider forwards the name as-is.
OpenRouter handles failover and load balancing on their side.
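To target a non-Anthropic model through OpenRouter, you can pass the OpenRouter id with an explicit provider type (a minimal sketch; forcing `"anthropic"` here is an assumption to keep prefix detection from re-routing the slash-prefixed id):

```python
import chimera

# Assumes ANTHROPIC_BASE_URL / ANTHROPIC_AUTH_TOKEN point at OpenRouter as above
provider = chimera.create_provider("anthropic", model="meta-llama/llama-3-70b")
```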
## vLLM / Local OpenAI-Compatible Endpoints

For servers that expose `/v1/chat/completions` (vLLM, LiteLLM, Together,
Fireworks, Groq), use the `compatible` provider type:

```python
import chimera

provider = chimera.create_provider(
    "compatible",
    model="my-local-model",
    base_url="http://localhost:8000/v1",
)
```

`base_url` is required for `compatible`. An optional `api_key` can be passed
if the server requires authentication:

```python
provider = chimera.create_provider(
    "compatible",
    model="meta-llama/Llama-3-70B",
    base_url="https://api.together.xyz/v1",
    api_key="your-together-key",
)
```

## Ollama
Models whose names start with `llama`, `mistral`, `qwen`, or `phi` are
auto-detected as Ollama:

```python
import chimera

provider = chimera.create_provider(model="llama3.2")
```

For other model names, specify the provider type explicitly:

```python
provider = chimera.create_provider(
    "ollama",
    model="deepseek-r1",
    base_url="http://localhost:11434",
)
```

The default base URL is `http://localhost:11434`, so you can omit it when
Ollama runs on the standard port.
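For example, the same provider can be created without the explicit `base_url` (a sketch relying on that default):

```python
import chimera

# Relies on the default base URL http://localhost:11434
provider = chimera.create_provider("ollama", model="deepseek-r1")
```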
## Built-in Provider Catalog

`chimera.providers.catalog.ProviderCatalog.default()` ships with entries
for popular hosted and local model lines so you can bring up a provider
with a single string. The catalog auto-resolves base URLs, env vars,
context windows, and per-Mtok pricing. New entries added across W11-W15:
| Catalog id | Provider | Source / env vars | Context |
|---|---|---|---|
| `bedrock/claude-sonnet-4`, `bedrock/claude-haiku-3.5` | compatible (AWS Bedrock) | `AWS_BEDROCK_ENDPOINT`, `AWS_BEDROCK_KEY` | 200k |
| `azure/gpt-4o`, `azure/gpt-4o-mini` | compatible (Azure OpenAI) | `AZURE_OPENAI_ENDPOINT`, `AZURE_OPENAI_KEY` | 128k |
| `groq/llama-3.3-70b` | compatible (Groq) | `GROQ_API_KEY` | 128k |
| `deepseek-chat`, `deepseek-reasoner` | compatible (DeepSeek API) | `DEEPSEEK_API_KEY` | 64k |
| `deepseek-v4`, `deepseek-v4-pro` | compatible (DeepSeek API) | `DEEPSEEK_API_KEY` | 128k |
| `deepseek-v4-pro:cloud` | ollama (local cloud passthrough) | `OLLAMA_HOST` | 262k |
| `deepseek-v3.1-terminus`, `deepseek-coder-v3` | compatible (DeepSeek API) | `DEEPSEEK_API_KEY` | 128k |
| `qwen3-coder`, `qwen3-coder-30b`, `qwen3-32b` | ollama | `OLLAMA_HOST` | 131k |
| `glm-4.6`, `glm-5.1` | anthropic (api.z.ai) | `ANTHROPIC_AUTH_TOKEN` | 200k |
| `gpt-oss-120b`, `gpt-oss-20b` | ollama | `OLLAMA_HOST` | 131k |
| `kimi-k2-0905-preview`, `kimi-k2.5` | anthropic (api.moonshot.ai) | `MOONSHOT_API_KEY` | 200k |
| `mistral-codestral-2511` | ollama | `OLLAMA_HOST` | 256k |
| `gemma3-27b-instruct` | ollama | `OLLAMA_HOST` | 131k |
Use a catalog entry directly:

```python
from chimera.providers.catalog import ProviderCatalog

catalog = ProviderCatalog.default()
provider = catalog.create("deepseek-coder-v3")  # reads $DEEPSEEK_API_KEY
```

Or register your own entry:

```python
from chimera.providers.catalog import ModelConfig, ProviderCatalog

catalog = ProviderCatalog.default()
catalog.register(ModelConfig(
    model="acme/internal-llm",
    provider_type="compatible",
    base_url="https://llm.acme.internal/v1",
    api_key_env="ACME_LLM_KEY",
    context_window=64_000,
    cost=(0.50, 1.50),  # USD per Mtok input / output
))
provider = catalog.create("acme/internal-llm")
```

`create_provider()` falls through to the catalog automatically when a
model name matches a registered entry, so most code paths can keep using
`chimera.create_provider(model="deepseek-coder-v3")` and the right
provider/base-url/key combination is wired up for them.
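For instance, the catalog fall-through means the top-level factory call is enough on its own:

```python
import chimera

# Matches the "deepseek-coder-v3" catalog entry, so the DeepSeek base URL
# and $DEEPSEEK_API_KEY are resolved without any extra arguments.
provider = chimera.create_provider(model="deepseek-coder-v3")
```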
## Using .env Files

Keep credentials out of your shell history by storing them in a `.env` file
at the project root:

```bash
ANTHROPIC_BASE_URL="https://api.z.ai/api/anthropic"
ANTHROPIC_AUTH_TOKEN="your-token"
ANTHROPIC_MODEL="glm-5"
```

Source it before running Chimera:

```bash
source .env
chimera code
```
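If you would rather load the file from Python than source it in the shell, a minimal sketch using the third-party python-dotenv package (an assumption; Chimera does not bundle or require it) looks like this:

```python
from dotenv import load_dotenv  # pip install python-dotenv

import chimera

# Reads ANTHROPIC_BASE_URL / ANTHROPIC_AUTH_TOKEN / ANTHROPIC_MODEL from .env
load_dotenv()

provider = chimera.create_provider()  # same zero-argument path as above
```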