Image to Text Analysis

Code
Usage

This example demonstrates how to create an agent that can analyze images and generate creative text content based on the visual content.

Code

image_to_text.py

from pathlib import Path

from agno.agent import Agent
from agno.media import Image
from agno.models.openai import OpenAIChat

agent = Agent(
    model=OpenAIChat(id="gpt-5-mini"),
    markdown=True,
)

image_path = Path(__file__).parent.joinpath("sample.jpg")
agent.print_response(
    "Write a 3 sentence fiction story about the image",
    images=[Image(filepath=image_path)],
)

Usage

Create a virtual environment

Open the Terminal and create a python virtual environment.

python3 -m venv .venv
source .venv/bin/activate

Install libraries

pip install -U agno openai

Export your OpenAI API key

  export OPENAI_API_KEY="your_openai_api_key_here"

Create a Python file

Create a Python file and add the above code.

touch image_to_text.py

Run Agent

python image_to_text.py

Find All Cookbooks

Explore all the available cookbooks in the Agno repository. Click the link below to view the code on GitHub:Agno Cookbooks on GitHub

Structured Input with Pydantic Models Image to Structured Output

⌘I

Overview

Use Cases

Concepts

Models

Image to Text Analysis

Code

Usage

Overview

Use Cases

Concepts

Models

​Code

​Usage

Code

Usage