2.2.25 — Latest Release

⚒ Anvil

The only AI coding assistant that doesn’t lock you in.

Your providers. Your credentials. Your data. Your cost.

90+Commands

45Tools

7Agent Types

35AI Providers

~22 MBSingle Binary

Get Anvil Free
Browse AnvilHub

Install — macOS / Linux (Homebrew)
brew install culpur/anvil/anvil

Install — macOS / Linux (curl)
curl -fsSL https://anvilhub.culpur.net/install.sh | bash

Install — Windows (PowerShell)
irm https://anvilhub.culpur.net/install.ps1 | iex

Already installed? Update in place
anvil upgrade
SHA256 verified • out-of-band checksum manifest • refuses to install unverified binaries • shell completions included for bash, zsh, fish, PowerShell

Why Anvil

Every other tool comes with a leash

Other AI coding assistants lock you to one vendor’s pipes — one provider, one pricing model, one set of rate limits. Your code, your data, your costs all flow through infrastructure you don’t control.

Anvil is the inverse. Pick your provider. Use your own API keys, or run everything locally through Ollama. Switch freely mid-conversation. When one hits a rate limit, fall over to the next. When the provider does something you don’t like, leave.

🔑

Your Providers

Anthropic, OpenAI, Google, xAI, Ollama local, or Ollama Cloud. Configure priority chains. Automatic failover when one throttles. Never locked in.

🔒

Your Credentials

AES-256-GCM encrypted vault with Argon2id KDF. 21 credential types. Nothing touches disk unencrypted. Per-project scopes.

🛡

Your Data

Single binary. Zero telemetry. Local Ollama support. Run air-gapped. Your prompts and code never leave your machine unless you send them.

💰

Your Cost

Per-provider budgets. Per-session tracking. Hard caps. See every token’s cost before you spend it. Zero-cost inference with Ollama.

📡

Your Access

Type /remote-control and hand any session to any browser. 6-digit pairing. Full bidirectional control. Code from your phone.

🏡

Your Deployment

Run on your laptop. Run on a server. Share a session across devices. Nothing to install on the browser side. Your infrastructure, your rules.

Who it’s for

Built for people who want to own their workflow

👤

Privacy-conscious developers

You don’t want every prompt going to a cloud API — and you can’t afford a $50K local-inference stack. Ollama plus Anvil is the answer.

💼

Consultants & contractors

You juggle credentials across clients and need isolation between projects. Anvil’s vault + per-tab sandboxing solves it.

🌍

Open-source maintainers

You’re tired of single-provider lock-in and pricing changes. Own your keys, own your models, switch when you want.

👥

Small teams

You want deployment choice. Cloud providers, local Ollama, or a mix. Per-tab policy lets each engineer pick what fits the task.

Signature Feature

Live Remote Control

No other AI coding assistant does this. Type /remote-control in your terminal. Open the generated URL on your phone, your tablet, a colleague’s laptop. Enter the 6-digit code. Both sides have full control.

📱

Full bidirectional control

Type messages, run commands, manage tabs from any device. Not a transcript — a live session.

⚡

Real-time streaming

See AI responses token-by-token in the browser. WebSocket relay with automatic reconnection.

⚙

Configure from the browser

Swap providers, change models, manage credentials — 17 config panels with full parity to the TUI.

🔐

Encrypted pairing

6-digit secure pairing code. Encrypted relay. Your session is yours.

2.2.25 — The Security Hardening Release

The most important release since v2

v2.2.25 is a security hardening release. We put Anvil through a deep, adversarially-verified audit across ten attack surfaces — ten specialist reviewers, every finding checked by three independent skeptics — and fixed all fifteen confirmed issues. There were no critical vulnerabilities, and the core cryptography, OAuth flow, and memory safety held up untouched. No new commands, no config migration; a recommended upgrade for every user.

The browser relay now re-verifies pairing at the Rust layer before acting on any privileged message, so nothing can drive a session or approve its own permission prompts without completing this host’s own handshake. On Windows, credential files and the daemon token are written with owner-only ACLs bound at creation, and the daemon pipe checks the connecting client’s identity. The sandbox denies read access to your credential directories and scrubs secret environment variables by default, and denies outbound network under strict mode. The remote-MCP transport is size-capped against memory exhaustion and refuses URLs that resolve to private or link-local addresses. Every fix ships under a full workspace test gate, and the Windows paths were validated against the shipped target. Seven native binaries continue to ship: macOS ARM64, macOS Intel, Linux x86_64, Linux ARM64, Windows x86_64, FreeBSD x86_64, and NetBSD x86_64.

🔐

Relay pairing enforced locally

The daemon re-verifies pairing at the Rust layer before acting on any privileged message. A peer that has not completed this host’s own handshake can no longer drive a session, answer its own permission prompts, change config, or open an SSH connection (CWE-306 / CWE-862).

🔑

Windows credential storage

Secret files and the daemon capability token are written with an owner-only ACL bound at creation, matching the 0600 protection Anvil has always used on Unix. The daemon pipe now verifies the connecting client’s identity, validated against the shipped Windows target (CWE-732 / CWE-284).

🛡

Sandbox denies secrets by default

When the sandbox is active, agent-run commands lose read access to your credential directories and have secret environment variables scrubbed. Under sandbox.require = strict, outbound network is denied too. Verified live: a sandboxed read of a vault secret returns nothing (CWE-522 / CWE-668).

🌐

Remote MCP bounded & SSRF-guarded

A malicious MCP server can no longer exhaust memory or point Anvil at an internal address. Response and line sizes are capped, and outbound URLs are checked against their resolved IP — loopback, private, link-local, and cloud-metadata ranges are refused by default (CWE-400 / CWE-918).

Transparent tools

See exactly what the model is doing — as it does it

Every other AI coding tool shows you a one-line tool summary after it completes. Anvil shows you the actual input the model sent the moment the call fires, in a bordered card you can expand to see the full JSON request and the full result. No more guessing what the agent is searching for, what it’s about to edit, or which file it just opened.

🔎

Tool-call cards

Glob, Grep, Read, Write, Edit, Bash, WebSearch, and every MCP tool render as a bordered card the moment the model invokes them. Pattern, path, command, search query — visible immediately, not after completion.

🔍

Ctrl+O to expand

Hit Ctrl+O on any tool card to expand the full input JSON and the full result. Inspect exactly what the model asked for, and exactly what came back. The complete loop, every time.

📊

Streaming progress indicators

Cards animate in-place from active to done. Errors render with red borders. Long-running Bash commands show elapsed time. You never wonder whether the agent is making progress.

📝

Scrollback correctness

PageUp/Up in HISTORICAL VIEW used to show only the first 1–4 characters of each assistant line (#, ##, **, -). The pending-message line cache wasn’t invalidating on token delta. Fixed. Your full conversation is now scrollable, every line intact.

SSH + continuity

A terminal to your server, in the same window

An AI session in one tab. A live SSH terminal to the machine you’re working on in the next tab. Tell the agent to draft a config change, watch it land in real time, paste the test command into the SSH tab. No context switch, no separate terminal app, no copy-paste between windows.

💻

/ssh modal form

Type /ssh host and a modal opens with fields for host, port, user, auth method, key path, passphrase, and a save-as-alias field. Default key root is ~/.ssh. Ctrl+F opens a bare-name resolver that autocompletes your existing key files.

🔐

Vault-encrypted aliases

Saved connections go into the encrypted vault as HostCredential aliases — AES-256-GCM, Argon2id KDF. Recall any host by name. Passphrases stay encrypted at rest. Per-project scopes supported.

🖥

russh + vt100 rendering

Each SSH tab runs a real russh session with full vt100 terminal emulation. Ctrl+B prefix keys (tmux-style: digit to jump tabs, q to close). Resize is honored. Colors and box-drawing render correctly.

🔁

Session continuity

anvil --continue now honors the model saved in .meta.json, so Ollama sessions reconnect to Ollama instead of failing on default auth. On exit, the last line shows the session ID/name with the exact anvil --resume <name> command to paste. Never lose context.

Quality bar

What it took to get here

4,400+

Workspace tests passing across every crate. Zero failures. Verified on each v2.2.25 release-pipeline run, which cross-builds all seven platform binaries under a full workspace test gate before publishing.

~22–27 MB

Single static binary. No runtime, no Node, no Python, no install prereqs. Drop it on a server and run.

7 platforms

macOS ARM64, macOS Intel, Linux x86_64, Linux ARM64, Windows x86_64 — plus FreeBSD x86_64 and NetBSD x86_64 as of v2.2.16, all carried forward in v2.2.25. Every binary SHA256-verified and signed by the release pipeline.

300+ commits

Since v2.2.5 — the unified arc that became v2.2.13, v2.2.16, v2.2.20, v2.2.23, v2.2.24, and now v2.2.25. Every commit ships under a full workspace test gate. Verified on each release-pipeline run.

Models

35 providers, one terminal

Anthropic

claude-opus-4-8, claude-sonnet-5, claude-haiku-4-5 — the full current model family with OAuth and API-key support.

OpenAI

GPT-5, o3, o4-mini — all current OpenAI models. Bring your own key.

Google Gemini

Gemini 2.5 Pro, Flash — via OpenAI-compatible endpoint.

Recommended

Ollama (Local)

Run any model locally — Llama, Qwen, Mistral, DeepSeek, Phi. Zero cost, full privacy, air-gap ready. Hardware-aware tuning built in.

xAI Grok

Grok-4, Grok-3, Grok-3-mini — xAI’s latest models with competitive pricing.

Ollama Cloud

kimi-k2.6:cloud, gpt-oss:120b-cloud — routed through the local Ollama daemon via ed25519 device key. No separate account; your daemon handles auth.

AWS Bedrock

Anthropic, Meta, Mistral and Amazon models served through AWS — signs requests with your existing AWS_ACCESS_KEY_ID credentials.

GitHub Copilot

Reuse your Copilot subscription for GPT-4o, Claude and o-series models — authenticated via GITHUB_TOKEN.

Azure OpenAI

OpenAI frontier models through your Azure deployment — uses your AZURE_OPENAI_ENDPOINT and per-tenant deployment names.

Cursor

Cursor subscription routed at the API layer — bring your CURSOR_API_KEY and use it from the terminal.

Antigravity

Google Code Assist via OAuth — Gemini family routed through the Code Assist endpoint, no static API key required.

Alibaba DashScope

Qwen3, Qwen2.5 and DashScope-hosted models — OpenAI-compatible mode with your DASHSCOPE_API_KEY.

Groq

Llama, Qwen, Mixtral and Kimi at sub-second latencies on Groq LPU hardware — bring your own key.

Fireworks AI

Production inference for Llama, Qwen, DeepSeek and Mixtral — fast cold-starts, function calling, JSON mode.

Mistral AI

Mistral Large, Codestral and Devstral served direct from Mistral — bring your MISTRAL_API_KEY.

Perplexity

Sonar and online-search-augmented models with built-in web grounding — bring your PERPLEXITY_API_KEY.

DeepSeek

DeepSeek-V3 and DeepSeek-R1 served direct from DeepSeek — strong reasoning and coding at low per-token cost.

Together AI

Llama, Qwen, DeepSeek and 100+ open-weight models on Together’s inference fleet — bring your own key.

DeepInfra

OpenAI-compatible hosting for Llama, Qwen, Mixtral and other open-weight models — pay-per-token, no minimums.

Cerebras

Llama and Qwen on Cerebras wafer-scale silicon — the fastest tokens-per-second of any provider on the list.

NVIDIA NIM

NVIDIA Inference Microservices for Llama, Nemotron, Mixtral and Mistral — bring your NVIDIA_API_KEY.

HuggingFace

Inference API for thousands of open-weight models — authenticate with your HF_TOKEN.

Moonshot AI

Kimi K2 and Moonshot V1 served direct from Moonshot — long-context Chinese-bilingual frontier model.

Nebius

Nebius AI Studio — OpenAI-compatible serving for Llama, Qwen, DeepSeek and Mixtral on Nebius infrastructure.

Scaleway

Scaleway Generative APIs — Llama, Mistral and Qwen hosted in EU data centres.

STACKIT

STACKIT Model Serving — OpenAI-compatible endpoint for sovereign-EU model hosting on Schwarz Group infrastructure.

Baseten

Custom and open-weight model deployments on Baseten — OpenAI-compatible inference for production workloads.

Cortecs

Cortecs inference platform — OpenAI-compatible serving for open-weight LLMs with usage-based pricing.

302.AI

302.AI aggregator — one API key, hundreds of frontier and open-weight models behind an OpenAI-compatible facade.

Zai

GLM-4 and Kimi family served by Z.ai — recognised by the ‘kimi’ and ‘glm’ slugs in the model picker.

OpenRouter

One key for hundreds of models from every major vendor — pay-as-you-go routing with automatic fallback.

LM Studio

Run any model on your own machine via the LM Studio desktop app — OpenAI-compatible local server, no auth required.

Chutes

Chutes serverless GPU inference — OpenAI-compatible endpoint for open-weight LLMs at competitive token rates.

MiniMax

MiniMax abab and M1 frontier models — long-context Chinese-English bilingual served direct from MiniMax.

OpenCode

Community OpenCode inference endpoint — OpenAI-compatible serving for coding-tuned open-weight models.

OpenCode-Go

Community OpenCode-Go endpoint — mirrors the OpenCode shape on a separate inference fleet.

Type /provider list in Anvil for the full TAB-completable picker.

Also in the box

Everything you expect, none of the lock-in

🔌

MCP integration

Full Model Context Protocol support. Auto-discover MCP tools at startup. /mcp list for live status. Plug in any MCP server.

🌐

Per-tab parallel inference

Each tab is an independent runtime — own model, own provider, own conversation. Fire prompts concurrently. Tab bar shows unread and pending-permission markers live.

💻

SSH tabs

/ssh host opens a vt100 terminal tab via russh, right next to your AI sessions. Connections saved to the encrypted vault as named aliases.

⚙

90+ slash commands

Deep hierarchical autocomplete. /vault, /mcp, /fork, /share, /daily, /ollama — full palette for AI-assisted development.

🎨

37-widget status line

16 presets. Interactive visual editor in TUI and web. Drag-and-drop. Per-widget category colors. Build your perfect status bar.

🤖

7 agent types

Multi-agent orchestration for complex workflows. Spawn background agents, track progress, review output — with a reviewer-agent approval gate.

📦

AnvilHub marketplace

Skills, plugins, agents, and themes. Install from the TUI or the web viewer. Free to use, free to contribute.

🔥

Security-hardened by default

Locally-enforced relay pairing, owner-only credential storage on every platform, a sandbox that denies secrets and network by default, and a bounded, SSRF-guarded MCP transport. Audited across ten attack surfaces with every finding independently verified. 4,400+ tests passing across a seven-platform release gate.

Downloads

Every platform. No account. No sign-in.

macOS ARM64 (Apple Silicon)
macOS Intel (x86_64)
Linux x86_64
Linux ARM64
Windows x86_64
FreeBSD x86_64
NetBSD x86_64

Own your workflow

One binary. 35 providers. No lock-in. No telemetry. No account required. Free and open to use.

Download 2.2.25
Full Changelog