- Agent performance: We optimize static operations (instantiation, memory footprint) and runtime operations (tool calls, memory updates, history management).
- System performance: The AgentOS API is async by default and has a minimal memory footprint. The system is stateless and horizontally scalable, with a focus on preventing memory leaks. It handles parallel and batch embedding generation during knowledge ingestion, metrics collection in background tasks, and other system-level optimizations.
- Agent reliability and accuracy: Monitored through evals, which we’ll explore later.
Agent Performance
Let’s measure the time it takes to instantiate an Agent and the memory footprint of an Agent. Here are the numbers (last measured in Oct 2025, on an Apple M4 MacBook Pro):- Agent instantiation: ~3μs on average
- Memory footprint: ~6.6Kib on average
Your run-time will be mostly bottlenecked by inference, but we must do everything possible to minimize overhead, reduce memory usage, and parallelize tool calls.
Instantiation Time
Let’s measure instantiation time for an Agent with 1 tool. We’ll run the evaluation 1000 times to get a baseline measurement. We’ll compare Agno to LangGraph, CrewAI and Pydantic AI.The code for this benchmark is available here. You should run the evaluation yourself on your own machine, please, do not take these results at face value.
Memory Usage
To measure memory usage, we use thetracemalloc
library. We first calculate a baseline memory usage by running an empty function, then run the Agent 1000x times and calculate the difference. This gives a (reasonably) isolated measurement of the memory usage of the Agent.
We recommend running the evaluation yourself on your own machine, and digging into the code to see how it works. If we’ve made a mistake, please let us know.
Results
Taking Agno as the baseline, we can see that:Metric | Agno | Langgraph | PydanticAI | CrewAI |
---|---|---|---|---|
Time (seconds) | 1× | 529× slower | 57× slower | 70× slower |
Memory (MiB) | 1× | 24× higher | 4× higher | 10× higher |
Metric | Agno | Langgraph | PydanticAI | CrewAI |
---|---|---|---|---|
Time (seconds) | 0.000003 | 0.001587 | 0.000170 | 0.000210 |
Memory (MiB) | 0.006642 | 0.161435 | 0.028712 | 0.065652 |
Agno agents are designed for performance and while we share benchmarks against other frameworks, we should be mindful that accuracy and reliability are more important than speed.