Performance

If you’re building with Agno, you’re guaranteed best-in-class performance by default. Our obsession with performance is necessary because even simple AI workflows can spawn hundreds of Agents and because many tasks are long-running — stateless, horizontal scalability is key for success. At Agno, we optimize performance across 3 dimensions:

Agent performance: We optimize static operations (instantiation, memory footprint) and runtime operations (tool calls, memory updates, history management).
System performance: The AgentOS API is async by default and has a minimal memory footprint. The system is stateless and horizontally scalable, with a focus on preventing memory leaks. It handles parallel and batch embedding generation during knowledge ingestion, metrics collection in background tasks, and other system-level optimizations.
Agent reliability and accuracy: Monitored through evals, which we’ll explore later.

Agent Performance

Let’s measure the time it takes to instantiate an Agent and the memory footprint of an Agent. Here are the numbers (last measured in Oct 2025, on an Apple M4 MacBook Pro):

Agent instantiation: ~3μs on average
Memory footprint: ~6.6Kib on average

We’ll show below that Agno Agents instantiate 529× faster than Langgraph, 57× faster than PydanticAI, and 70× faster than CrewAI. Agno Agents also use 24× lower memory than Langgraph, 4× lower than PydanticAI, and 10× lower than CrewAI.

Run time performance is bottlenecked by inference and hard to benchmark accurately, so we focus on minimizing overhead, reducing memory usage, and parallelizing tool calls.

Instantiation Time

Let’s measure instantiation time for an Agent with 1 tool. We’ll run the evaluation 1000 times to get a baseline measurement. We’ll compare Agno to LangGraph, CrewAI and Pydantic AI.

The code for this benchmark is available here. You should run the evaluation yourself on your own machine, please, do not take these results at face value.

# Setup virtual environment
./scripts/perf_setup.sh
source .venvs/perfenv/bin/activate

# Agno
python cookbook/evals/performance/instantiate_agent_with_tool.py

# LangGraph
python cookbook/evals/performance/comparison/langgraph_instantiation.py
# CrewAI
python cookbook/evals/performance/comparison/crewai_instantiation.py
# Pydantic AI
python cookbook/evals/performance/comparison/pydantic_ai_instantiation.py

LangGraph is on the right, let’s start it first and give it a head start. Then CrewAI and Pydantic AI follow, and finally Agno. Agno obviously finishes first, but let’s see by how much.

Memory Usage

To measure memory usage, we use the tracemalloc library. We first calculate a baseline memory usage by running an empty function, then run the Agent 1000x times and calculate the difference. This gives a (reasonably) isolated measurement of the memory usage of the Agent. We recommend running the evaluation yourself on your own machine, and digging into the code to see how it works. If we’ve made a mistake, please let us know.

Results

Taking Agno as the baseline, we can see that:

Metric	Agno	Langgraph	PydanticAI	CrewAI
Time (seconds)	1×	529× slower	57× slower	70× slower
Memory (MiB)	1×	24× higher	4× higher	10× higher

Exact numbers from the benchmark:

Metric	Agno	Langgraph	PydanticAI	CrewAI
Time (seconds)	0.000003	0.001587	0.000170	0.000210
Memory (MiB)	0.006642	0.161435	0.028712	0.065652

Agno agents are designed for performance and while we share benchmarks against other frameworks, we should be mindful that accuracy and reliability are more important than speed.

Introduction

Learn

Help

Performance

Agent Performance

Instantiation Time

Memory Usage

Results

Introduction

Learn

Help

​Agent Performance

​Instantiation Time

​Memory Usage

​Results

Agent Performance

Instantiation Time

Memory Usage

Results