Skip to content

Chimera 0.5.0 — Five-Strong Coding-Agent Family

Chimera 0.5.0 — Five-Strong Coding-Agent Family

Section titled “Chimera 0.5.0 — Five-Strong Coding-Agent Family”

Released: 2026-04-30

The 0.5.0 release ships the full mink / otter / ferret / weasel / shrew coding-agent family on a single Chimera substrate. v0.4.0 was cut for the mink wave-2 milestone; 0.5.0 rolls up otter waves 1–2, mink waves 2–3, and the wave-5 ship + wave-6 cross-CLI verification + wave-7 gap closure for the three new CLIs (ferret / weasel / shrew).

All five share the same Agent / AgentLoop / EventSourcedSession / provider factory, the same 26-event EventBus, and the same tool registry — adding a sixth is an additive walk through the same agent allocation pattern.

CLIPostureLines that distinguish it
chimera minkTUI-firstglm-5.1:cloud defaults, 31 slash commands, 11 benchmark adapters
chimera otterServer-firstTUI + HTTP serve + ACP serve --acp, share-by-link, 26 slash commands
chimera ferretSandbox-first / IDE-flagshipthree sandbox modes, three approval presets, IDE-first ACP, cloud bridge
chimera weaselMinimal harness, four modesinteractive / print / RPC / SDK, four-command slash palette, auto-discovered extensions
chimera shrewSmall-model tunedlocal-first provider chain, 11 curated skills, MoE-aware context sizing, Aider Polyglot + GAIA

Composition over rebuild — none of the new CLIs forks Chimera; each is a thin posture on the upstream substrate.

chimera ferret — sandbox-first / IDE-flagship

Section titled “chimera ferret — sandbox-first / IDE-flagship”

Three sandbox modes (read-only / workspace-write / workspace-write-network), three approval presets (read-only / auto / full), an IDE-first ACP transport that is a strict superset of otter’s ACP schema with four extra notification kinds (code/diff, editor/open_file, terminal/output, progress/step), and an optional chimera ferret bridge long-poll HTTPS pipe with bearer auth.

Terminal window
chimera ferret --version # chimera ferret 0.5.0
chimera ferret -p "audit the repo" # default: read-only sandbox + read-only approval
chimera ferret --sandbox workspace-write --approval auto -p "fix tests"
chimera ferret serve # ACP JSON-RPC stdio (default)
chimera ferret serve --http --port 5173 # opt-in HTTP + SSE
chimera ferret bridge --remote-url https://bridge.example.com

13 modules, 303 tests, 8 user docs.

chimera weasel — minimal four-mode harness

Section titled “chimera weasel — minimal four-mode harness”

A single resolver picks interactive / print / rpc / sdk from --mode, -p, or default. Newline-delimited JSON-RPC 2.0 over stdio (four methods: prompt, cancel, get_state, list_models; all five JSON-RPC error codes exported). Embeddable SDK with sync run, async arun, sync / async streaming, multi-turn chat. Auto-discovered extensions from .weasel/extensions/* (project) and ~/.weasel/extensions/ (user).

Terminal window
chimera weasel # interactive REPL
chimera weasel -p "summarize TODOs" # one-shot print
chimera weasel -p "..." --json
chimera weasel -p "..." --stream-json
echo '{"jsonrpc":"2.0","id":1,"method":"list_models","params":{}}' | chimera weasel --mode rpc
python -c "from chimera.weasel.sdk import Agent; print(Agent().run('list files').text)"

Four-command slash palette (/help, /exit, /clear, /model) — no /agent, no /share, no /init by design. 9 modules, 164 tests, 7 user docs.

Layered on top of weasel. Local-first provider chain inversion (probes llama.cpp at $LLAMACPP_BASE_URL and Ollama at $OLLAMA_BASE_URL before any cloud key). Default qwen3.6-35b-a3b on llama.cpp.

11 curated skill markdowns under knowledge/, protocols/, tools/ with stdlib-only frontmatter parsing. Three small-model-fit extensions:

  • moe_offload — MoE-aware context-window sizing.
  • scaffold_fit — sub-13B prompt wrapping (idempotent).
  • tool_filter — sub-9B tool-list trimming.

Aider Polyglot + GAIA benchmark adapters (chimera shrew bench aider-polyglot --bench-limit 5, chimera shrew bench gaia --level 1). --max-steps defaulted to 30, restricted --allowed-tools=Read,Write,Edit,Bash. 14 modules, 193 tests, 7 user docs.

  • chimera otter introduced as a sibling to chimera mink — streaming tool calls, hooks, sessions, and the same provider abstraction.
  • HTTP server (chimera otter serve --port) and ACP server (chimera otter serve --acp) both wired.
  • Slash palette grew to 26 entries; persisted-run inspection (sessions list/show); share-by-link (chimera otter share); preset registry; LSP first-class tools.
  • Full O-WIRE-1..6 set landed: real provider, MCP runtime, plugin → agent registry, rules → system prompt, custom commands → slash registry, LSP default-on.
  • chimera mink runs cost — per-run cost rollups and granular token breakdowns (cache, reasoning, per-step).
  • Tau-bench (#90) end-to-end wireup + SWE-bench Verified (#84) adapter scaffold with IPythonTool + condensation hooks.
  • 11 benchmark adapters under chimera/eval/benchmarks/ (cline, context, dpai, feature, humaneval-plus, livecodebench, math500, mbpp, swe-polybench, swt, tau-bench).
  • TLS termination on otter serve --tls-cert / --tls-key.
  • Cooperative cancellation propagation: POST /session/<id>/cancel drains the agent loop without killing the worker.
  • Server-Sent Events resume-after-disconnect: Last-Event-ID header honored across reconnects.
  • Per-step SSE events plumbed through _drive_agent_streaming over async_run_events; legacy fallback preserved.
  • GET /runs and GET /runs/cost HTTP routes — eventlog inspection and per-run cost rollups reachable over HTTP without dropping into the CLI.
  • New chimera/events/sourcing/ subsystem — registry, projector, sqlite_store, sink, types, export.
  • Append-only event log with file locking, crash recovery, and gap detection; sessions can be reconstructed deterministically from the log alone.
  • SQLite store + snapshot fast-resume in EventSourcedSession — resumed sessions skip log replay when a snapshot is current.
  • snapshot_after_turn wired into otter’s REPL + server.
  • AsyncSSHEnvironment + SFTP transfer + ProxyJump bastion-host support + control-master multiplexing.
  • [ssh] extra; chimera mink --remote ssh://user@host[:port]/path.
  • $HOME expansion fix in checkpoint / restore tar paths so remote-environment snapshots round-trip correctly.
  • SWE-bench Verified scaffold landed (chimera/eval/benchmarks/swe_bench_verified.py) with IPythonTool, SWEBenchConfig, and condensation hooks.
  • IPython tool + LLM condensation plumbed end-to-end through the SWE-bench Verified harness.
  • HumanEval (full 164) live from each of the five CLIs against glm-5.1, kimi-k2.6, deepseek-v4-pro.

Exercised end-to-end on 2026-04-30 against three Ollama-Cloud models:

CLI \ Modelglm-5.1:cloudkimi-k2.6:clouddeepseek-v4-pro:cloud
minkmath + bashmath + bashmath + bash
ottermath (bash flake)math + bashmath + bash
ferretmath + bashmath + bashmath (bash flake)
weaselmath + bashmath + bashmath (bash flake)
shrewmath (small-model bash limit)math (small-model bash limit)math (auto-deny)

Math row 15/15. Bash row 9/15 (failures concentrated on smaller context windows + auto-deny on shrew’s deepseek path; no regressions attributable to wave-7 work). Total wall-clock 813 s across 30 cells.

  • 5654+ tests passing (5651 in the V1 validation pass + 660 ferret / weasel / shrew tests verified independently).
  • 553+ mypy source files clean (Success: no issues found in 553 source files).
  • uv run ruff check chimera/ clean.
  • All five trademark scrubs cleanbash scripts/all_trademark_scrub.sh exits 0 (passed: 5 (mink otter ferret weasel shrew); failed: 0).
  • CI green on Python 3.11 / 3.12 / 3.13 with five per-codename trademark-scrub jobs wired (mink-trademark-scrub, otter-trademark-scrub, ferret-trademark-scrub, weasel-trademark-scrub, shrew-trademark-scrub).

See Migrating from v0.4 to v0.5 for the step-by-step guide, including the AgentPreset.build()CodingAgent.from_preset(...) rewrite, the chimera code REPL default flip (and the --legacy-react escape hatch), and the deprecation timeline through v0.7.0.

  • No breaking API changes. Existing chimera mink users keep every flag and slash command they had at 0.4.0.
  • The four new CLIs (otter, ferret, weasel, shrew) are additive subcommands under the same chimera entry point.
  • The chimera/events/sourcing/ subsystem is opt-in — existing sessions built on Session continue to work; switch to EventSourcedSession when you want crash-recovery + snapshot fast-resume.
  • The [ssh] extra is opt-in; only required if you use --remote ssh://....