from pathlib import Path

from agno.agent import Agent
from agno.media import Image
from agno.models.openai import OpenAIChat

agent = Agent(
    model=OpenAIChat(id="gpt-5-mini"),
    markdown=True,
)

image_path = Path(__file__).parent.joinpath("sample.jpg")
agent.print_response(
    "Write a 3 sentence fiction story about the image",
    images=[Image(filepath=image_path)],
)
Create a virtual environment
python3 -m venv .venv
source .venv/bin/activate
Install libraries
pip install -U agno openai
Run Agent
python cookbook/agents/multimodal/image_to_text.py
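The script expects a file named sample.jpg next to it. If that file may be missing, a small guard along the following lines could run before the agent call, so the failure is reported locally instead of during the model request. This is only a sketch: `resolve_image` is a hypothetical helper, not part of agno, and it checks only that the file exists.

```python
from pathlib import Path


def resolve_image(path: Path) -> Path:
    """Return the absolute path to an image, or raise if the file is missing.

    Hypothetical helper for illustration; not part of the agno library.
    """
    path = Path(path)
    if not path.is_file():
        raise FileNotFoundError(f"Image not found: {path}")
    return path.resolve()
```

In the example script, the checked path would then be passed on as before, for instance `Image(filepath=resolve_image(image_path))`.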