Gemini
Image Agent
Examples
- Introduction
- Getting Started
- Agents
- Teams
- Workflows
- Applications
Agent Concepts
- Multimodal
- RAG
- Knowledge
- Memory
- Async
- Hybrid Search
- Storage
- Tools
- Vector Databases
- Embedders
Models
- Anthropic
- AWS Bedrock
- AWS Bedrock Claude
- Azure AI Foundry
- Azure OpenAI
- Cohere
- DeepInfra
- DeepSeek
- Fireworks
- Gemini
- Basic Agent
- Streaming Agent
- Agent with Structured Outputs
- Agent with Tools
- Agent with Storage
- Agent with Knowledge
- Image Agent
- Flash Thinking Agent
- Audio Input (Bytes Content)
- Audio Input (Upload the file)
- Audio Input (Local file)
- Agent with PDF Input (Local file)
- Agent with PDF Input (URL)
- Video Input (Bytes Content)
- Video Input (File Upload)
- Video Input (Local File Upload)
- Groq
- Hugging Face
- Mistral
- NVIDIA
- Ollama
- OpenAI
- Perplexity
- Together
- xAI
- IBM
- LM Studio
- LiteLLM
- LiteLLM OpenAI
Gemini
Image Agent
Code
cookbook/models/google/gemini/image_input.py
from agno.agent import Agent
from agno.media import Image
from agno.models.google import Gemini
from agno.tools.duckduckgo import DuckDuckGoTools
agent = Agent(
model=Gemini(id="gemini-2.0-flash-exp"),
tools=[DuckDuckGoTools()],
markdown=True,
)
agent.print_response(
"Tell me about this image and give me the latest news about it.",
images=[
Image(
url="https://upload.wikimedia.org/wikipedia/commons/b/bf/Krakow_-_Kosciol_Mariacki.jpg"
),
],
stream=True,
)
Usage
1
Create a virtual environment
Open the Terminal
and create a python virtual environment.
python3 -m venv .venv
source .venv/bin/activate
2
Set your API key
export GOOGLE_API_KEY=xxx
3
Install libraries
pip install -U google-genai duckduckgo-search agno
4
Run Agent
python cookbook/models/google/gemini/image_input.py