Skip to main content

Code

cookbook/11_models/mistral/image_transcribe_document_agent.py
from agno.agent import Agent
from agno.media import Image
from agno.models.mistral.mistral import MistralChat

agent = Agent(
    model=MistralChat(id="pixtral-12b-2409"),
    markdown=True,
)

agent.print_response(
    "Transcribe this document.",
    images=[
        Image(url="https://ciir.cs.umass.edu/irdemo/hw-demo/page_example.jpg"),
    ],
)

Usage

1

Set up your virtual environment

uv venv --python 3.12
source .venv/bin/activate
2

Set your API key

export MISTRAL_API_KEY=xxx
3

Install dependencies

uv pip install -U mistralai agno
4

Run Agent

python cookbook/11_models/mistral/image_transcribe_document_agent.py