Accuracy with Tools

Code

This example shows an evaluation that runs the provided agent with the provided input and then evaluates the answer that the agent gives.

Code

from typing import Optional

from agno.agent import Agent
from agno.eval.accuracy import AccuracyEval, AccuracyResult
from agno.models.openai import OpenAIChat
from agno.tools.calculator import CalculatorTools

evaluation = AccuracyEval(
    name="Tools Evaluation",
    model=OpenAIChat(id="o4-mini"),
    agent=Agent(
        model=OpenAIChat(id="gpt-5-mini"),
        tools=[CalculatorTools()],
    ),
    input="What is 10!?",
    expected_output="3628800",
)

result: Optional[AccuracyResult] = evaluation.run(print_results=True)
assert result is not None and result.avg_score >= 8

Accuracy with Given Answer Accuracy with Teams

⌘I

Overview

Use Cases

Concepts

Models

Accuracy with Tools

Code

Overview

Use Cases

Concepts

Models

​Code

Code