Code

cookbook/models/mistral/image_transcribe_document_agent.py
from agno.agent import Agent
from agno.media import Image
from agno.models.mistral.mistral import MistralChat

agent = Agent(
    model=MistralChat(id="pixtral-12b-2409"),
    markdown=True,
)

agent.print_response(
    "Transcribe this document.",
    images=[
        Image(url="https://ciir.cs.umass.edu/irdemo/hw-demo/page_example.jpg"),
    ],
)

Usage

1

Create a virtual environment

Open the Terminal and create a python virtual environment.
python3 -m venv .venv
source .venv/bin/activate
2

Set your API key

export MISTRAL_API_KEY=xxx
3

Install libraries

pip install -U mistralai agno
4

Run Agent

python cookbook/models/mistral/image_transcribe_document_agent.py