> ## Documentation Index
> Fetch the complete documentation index at: https://docs.agno.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Deep Research Multimodal Input (Interactions)

Deep Research accepts images and documents (PDFs) as input, then conducts web-based research grounded in that content. Pass them as Agno `Image` / `File` objects with a URL. GCS and Gemini URIs pass through; regular HTTP URLs are downloaded and base64-encoded automatically.

## Code

```python cookbook/90_models/google/gemini_interactions/deep_research_multimodal.py theme={null}
from agno.agent import Agent
from agno.media import File, Image
from agno.models.google import GeminiInteractions

agent = Agent(
    model=GeminiInteractions(
        agent="deep-research-preview-04-2026",
        thinking_summaries="auto",
    ),
    markdown=True,
)

if __name__ == "__main__":
    agent.print_response(
        "Analyze the interspecies dynamics in this image and research the "
        "symbiotic relationships shown.",
        images=[
            Image(
                url="https://storage.googleapis.com/generativeai-downloads/images/generated_elephants_giraffes_zebras_sunset.jpg"
            )
        ],
    )

    agent.print_response(
        "What is this document about, and how does it relate to current "
        "research trends?",
        files=[File(url="https://arxiv.org/pdf/1706.03762")],
    )
```

## Usage

<Steps>
  <Snippet file="create-venv-step.mdx" />

  <Step title="Set your API key">
    ```bash theme={null}
    export GOOGLE_API_KEY=xxx
    ```
  </Step>

  <Step title="Install dependencies">
    ```bash theme={null}
    uv pip install -U "google-genai>=2.0" agno
    ```
  </Step>

  <Step title="Run Agent">
    ```bash theme={null}
    python cookbook/90_models/google/gemini_interactions/deep_research_multimodal.py
    ```
  </Step>
</Steps>
