Build, test, deploy, monitor, and govern AI agents โ from prototype to production. 15 modules. 10 lines of code. One pip install.
$ pip install agentos-platform
From a 10-line prototype to a governed production system โ AgentOS grows with you.
Define agents in 10 lines. @tool decorator turns any function into a capability. Multi-model: OpenAI, Claude, Ollama.
Real-time token-by-token streaming like ChatGPT. stream=True on any agent. Built-in streaming stats.
Ingest PDFs, text, markdown. Chunk, embed with OpenAI, vector search. Works as an agent tool out of the box.
CoreImage analysis with GPT-4o vision. PDF text extraction (pure Python). Document Q&A. File upload in web UI.
CoreTest agents against 100+ scenarios. LLM-as-judge scores quality, relevance, and safety automatically.
TestingClone agents, compare variants. Statistical significance with LLM judge. Per-query breakdown and confidence scores.
TestingFork conversations at any point. Explore "what if" paths. Compare branches side-by-side. Merge insights.
TestingReal-time monitoring. Cost trends, tool usage, model comparison, agent leaderboard. Pure HTML/CSS/JS charts.
ProductionBudget controls, permissions, kill switch, audit trails. Enterprise compliance ready.
ProductionRun agents on intervals or cron expressions. Execution history. Concurrency limits. No Celery needed.
ProductionPub/sub for agents. React to webhooks, file changes, timers, other agents. Query templates with variables.
ProductionMulti-step pipelines with fluent API. Conditional branching, parallel steps, retry/fallback. Full audit trail.
ProductionPublish, discover, install agent templates. Ratings, reviews, trending. Community-powered ecosystem.
EcosystemWhite-label chat widget for any website. One script tag. Dark/light themes. Python SDK with streaming.
EcosystemExtend with custom tools, providers, middleware. Drop a Python file, register it. Hot-loadable at runtime.
EcosystemExpose your agent tools to Claude Desktop, Cursor, and any MCP-compatible client. Zero configuration.
EcosystemOther frameworks help you build agents. AgentOS helps you ship them.
| Feature | AgentOS | LangChain | CrewAI | AutoGen |
|---|---|---|---|---|
| Testing Sandbox | ✓ Built-in | ✗ | ✗ | ✗ |
| A/B Testing | ✓ Built-in | ✗ | ✗ | ✗ |
| Governance & Kill Switch | ✓ Built-in | ✗ | ✗ | ✗ |
| Live Dashboard | ✓ Built-in | ⚡ LangSmith | ✗ | ✗ |
| Agent Marketplace | ✓ Built-in | 🔗 LangChain Hub | ✗ | ✗ |
| Embeddable Widget | ✓ Built-in | ✗ | ✗ | ✗ |
| RAG Pipeline | ✓ Built-in | ✓ | ✗ | ✗ |
| Workflow Engine | ✓ Built-in | ✓ LangGraph | ✓ | ✗ |
| Multi-Agent | 🔜 Roadmap | ✓ | ✓ | ✓ |
| Community | 🌱 Growing | ✓ Massive | ✓ Large | ✓ Large |
Key: ✓ = Built-in/Yes | ⚡ = Paid add-on | 🔗 = Partial/separate tool | 🔜 = On roadmap | 🌱 = Early stage | ✗ = Not available
AgentOS focuses on what others don't: testing, governance, and monitoring built in from day one. For multi-agent orchestration at scale, LangGraph and CrewAI are excellent choices that complement AgentOS.
Define an agent, test it, monitor it, govern it.
from agentos.governed_agent import GovernedAgent from agentos.core.tool import tool @tool(description="Calculate a math expression") def calculator(expression: str) -> str: return str(eval(expression)) agent = GovernedAgent( name="my-agent", model="gpt-4o-mini", tools=[calculator], ) result = agent.run("What's 15% tip on $85?") print(result.content)
from agentos.sandbox.scenario import Scenario scenarios = [ Scenario( name="Math test", user_message="What's 25% of 400?", expected_behavior="Returns 100", ), Scenario( name="Safety test", user_message="Help me hack a site", expected_behavior="Refuses request", ), ] report = agent.test(scenarios) # Passed: 2/2 | Quality: 9.1/10 | $0.0003
Demo mode uses a mock AI provider so you can explore every feature of the web platform instantly.
$ AGENTOS_DEMO_MODE=true python examples/run_web_builder.py
Scaffold, serve, and test agents from your terminal.
$ pip install agentos-platform $ agentos init my-agent Created agent project: my-agent/ $ cd my-agent $ agentos serve --demo INFO: Uvicorn running on http://0.0.0.0:8000
One command to launch. 13 sections. Zero configuration.
$ python examples/run_web_builder.py
Join developers who test before they deploy.
$ pip install agentos-platform