Beta

A portable open-source operating system for agents. ~6 ms coldstarts, 32x cheaper than sandboxes. Powered by WebAssembly and V8 isolates.

Works with
PiPi
Claude CodeClaude Code*
CodexCodex*
OpenCodeOpenCode*
AmpAmp*
*Coming Soon
agents.ts
import { AgentOs } from "@agent-os/core";
 
const vm = await AgentOs.create();
 
// Create an agent session
const { sessionId } = await vm.createSession("pi");
 
// Stream events (tool calls, text output, etc.)
vm.onSessionEvent(sessionId, (event) => console.log(event));
 
// Send a prompt and wait for the response
await vm.prompt(sessionId, "Write a Python script that calculates pi");
Read the Docs
View Package Registry

A new operating system architecture.

Built from the ground up for lightweight agents. Agent OS provides the flexibility of Linux with lower overhead than sandboxes.

WebAssembly

WebAssembly + V8 Isolates

High-performance virtualization without specialized infrastructure. The same battle-hardened isolation technology that powers Google Chrome.

Battle-tested technology

You're probably using this technology right now to view this page. Bring the same power to your agents. No VMs, no containers, no overhead.

Performance benchmarks

Agent OS vs. traditional sandboxes. Booting an agent in a container takes a full process and hundreds of milliseconds; Agent OS starts one in a lightweight isolate in about 5 ms — and packs far more into the same memory.

Cold Start lower is better
0xfaster
Containers — one process each
~0 ms
Agent OS — one shared process
~0 ms

Same host. Each container boots its own process before code can run; Agent OS runs every agent in one shared process — the first instruction executes in ~6 ms vs ~3,150 ms (516× faster). Toggle the percentile to compare median vs tail latency.

Memory Per Instance lower is better
0xsmaller
reserved
~1024 MBreserved / instance
~0 MB
~0 MBused / instance
Agent OSWhat's measured: Memory footprint added per concurrent execution.

Why the gap: In-process isolates share the host's memory. Each additional execution only adds its own heap and stack. Sandboxes allocate a dedicated environment with a minimum memory reservation, even if the code inside uses far less.

Sandbox baseline: Daytona, the cheapest mainstream sandbox provider as of March 30, 2026. Default sandbox: 1 vCPU + 1 GiB RAM.

Agent OS: ~131 MB for a full Pi coding agent session with MCP servers and file system mounts.
~131 MB
Cheapest sandbox~1024 MB

Sandboxes reserve idle RAM per agent; Agent OS isolates share the host.

Cost Per Execution-Second lower is better
0xcheaper
One server — AWS ARM4 executions
One per sandbox
~1 GB reserved

$0.0084/hr server ÷ 4 executions = $0.00000058/s per execution-second

Agent OSWhat's measured: server price per second / concurrent executions per server

Why it's cheaper: Each execution uses ~131 MB instead of a ~1024 MB sandbox minimum. And you run on your own hardware, which is significantly cheaper than per-second sandbox billing.

Sandbox baseline: Daytona, the cheapest mainstream sandbox provider as of March 30, 2026. Default sandbox: 1 vCPU + 1 GiB RAM at $0.0504/vCPU-h + $0.0162/GiB-h.

Agent OS: ~131 MB baseline per execution, assuming 70% utilization (industry-standard HPA scaling threshold). Select a hardware tier above to compare.
$0.00000000/s
Cheapest sandbox$0.000018/s

Assumes one agent per sandbox, needed for isolation.

Measured on Intel i7-12700KF. Cold start baseline: E2B, the fastest mainstream sandbox provider as of March 30, 2026. Cost baseline: Daytona, the cheapest mainstream sandbox provider as of March 30, 2026 (1 vCPU + 1 GiB default). Cost assumes 70% utilization on self-hosted hardware vs. per-second sandbox billing. Benchmark document

Everything routes through the harness.

The harness is the kernel of every agent session — brokering requests and responses between your tools and MCP resources, session state, the sandbox where code runs, and the orchestration layer that ties agents together. Each piece stays isolated, yet composable.

Meet your agent's new operating system.

Low overhead and cost.

No VMs to boot. No containers to pull. Start in milliseconds with minimal memory footprint.

Traditional sandboxes take seconds to spin up and consume hundreds of megabytes. Agent OS starts instantly and runs lean, so you can scale to thousands of agents without the cost. More details in benchmarks below.

~6msp99 coldstart
32xcheaper than sandboxes

Embed in your backend.

Your APIs. Your toolchains. No complex agent authentication needed. Just JavaScript functions or hooks.

Mount anything as a file system.

S3, SQLite, Google Drive, or the host file system. No per-agent credentials needed.

Agents think in files. Agent OS lets you expose any storage backend as a familiar directory tree. The host handles credential scoping, so agents never see API keys or secrets.

Granular security.

Fully configurable network and file system security. Control rate limits, bandwidth limits, and file system permissions. Set precise CPU and memory limitations per agent.

Your laptop, your infra, or on-prem.

Railway, Vercel, Kubernetes, and more. Deploy wherever your code already runs.

Agent OS is just an npm package. No vendor lock-in, no special infrastructure. Your agents run in your stack, on your terms.

RailwayVercelKubernetesECSLambdaGoogle Cloud Run

Agents that just work.

Every agent deserves a runtime that understands it.

Supports Claude Code, Codex, OpenCode, Amp, and more

Run any coding agent with a single unified API. Swap agents without changing your infrastructure.

Simple sessions API

Create, manage, and resume agent sessions with a few lines of code. State persists automatically.

Coming Soon

Embedded LLM metering

Track token usage, cost, and latency per agent. No per-agent API keys needed. The host handles credential scoping.

Universal transcript format

One transcript format across all agents. Powered by ACP. Compare, debug, and audit any session.

Automatic transcript persistence

Every conversation is saved. Replay sessions, audit behavior, and build on past context without extra code.

Infrastructure that disappears.

Deploy anywhere. Scale to anything. Forget about servers.

Runs on your infra

Managed hosting or self-hosted. Same API, same experience, your choice of where it runs.

Easy to deploy on prem

A single npm package. No Kubernetes operators, no sidecar containers. Just install and run.

Low overhead

No VMs to boot. No containers to pull. Start in milliseconds with minimal memory footprint.

Mount anything as a file system

S3, GitHub, databases. No per-agent credentials needed. The host handles access scoping.

Extend with a sandbox when needed

Agent OS handles most tasks, but pairs seamlessly with sandboxes for heavier workloads.

Orchestration without complexity.

Coordinate agents, humans, and systems out of the box.

Authentication

Authenticate agent connections with your existing auth model. Validate credentials and attach user state on connect.

Webhooks

Receive external events and route them into agents with lightweight HTTP handlers and durable queues.

Multiplayer & Realtime

Multiple clients can observe and collaborate with the same agent environment in real time.

Agent-to-Agent

Let agents delegate work to other agents through host-defined tools and shared orchestration flows.

Workflows

Chain agent tasks into durable workflows with retries, branching, and resumable execution built in.

Queues

Serialize agent work with durable queues for backpressure, async processing, and ordered execution.

SQLite

Give agents access to a persistent SQLite database through host tools for structured state and queryable memory.

Security without compromise.

The same isolation technology trusted by browsers worldwide.

Restrict CPU and memory granularly

Set precise resource limits per agent. No runaway processes, no noisy neighbors.

Programmatic network control

Allow, deny, or proxy any outbound connection. Full control over what your agents can reach.

Custom authentication

Bring your own auth. API keys, OAuth, JWTs. Agents authenticate on your terms.

Isolated private network

Each agent runs in its own network namespace. No cross-talk between tenants.

Powered by WebAssembly and V8 isolates

The same sandboxing technology behind Google Chrome. Battle-tested at planet scale.

After
Before
Unix Operators
Agent OS Operators

Left: Unix timesharing, UW-Madison, 1978. Right: "Data flock (digits)" by Philipp Schmitt, CC BY-SA 4.0

From humans to agents

The operating system is changing for the next generation of software operators.

Learn more