Provider Categories
OpenAI
GPT models with tools, vision, audio, and reasoning.
Anthropic
Sonnet and Opus with extended thinking and prompt caching.
Google
Gemini with native video, audio, search, and Imagen generation.
Open Source
Llama, Mistral, DeepSeek via Groq, Together, Fireworks.
Enterprise
Azure OpenAI, AWS Bedrock, Vertex AI, NVIDIA, IBM watsonx.
Local
Ollama, vLLM, LMStudio for private, offline inference.
All Providers
| Provider | Import |
|---|---|
| OpenAI | from agno.models.openai import OpenAIResponses |
| Anthropic | from agno.models.anthropic import Claude |
| Google | from agno.models.google import Gemini |
| xAI | from agno.models.xai import xAI |
| Perplexity | from agno.models.perplexity import Perplexity |
| Groq | from agno.models.groq import Groq |
| DeepSeek | from agno.models.deepseek import DeepSeek |
| Mistral | from agno.models.mistral import MistralChat |
| Together | from agno.models.together import Together |
| Fireworks | from agno.models.fireworks import Fireworks |
| Cohere | from agno.models.cohere import CohereChat |
| Cerebras | from agno.models.cerebras import Cerebras |
| Azure OpenAI | from agno.models.azure import AzureOpenAI |
| Azure AI Foundry | from agno.models.azure import AzureAIFoundry |
| AWS Bedrock | from agno.models.aws import BedrockChat |
| Vertex AI | from agno.models.vertexai import VertexAI |
| NVIDIA | from agno.models.nvidia import NVIDIAChat |
| IBM watsonx | from agno.models.ibm import WatsonX |
| Ollama | from agno.models.ollama import Ollama |
| vLLM | from agno.models.vllm import vLLM |
| LMStudio | from agno.models.lmstudio import LMStudio |
| llama.cpp | from agno.models.llamacpp import LlamaCpp |
| LiteLLM | from agno.models.litellm import LiteLLM |
| OpenRouter | from agno.models.openrouter import OpenRouter |
| Portkey | from agno.models.portkey import Portkey |
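
When the provider is chosen at runtime (for example from a config value), the table above can be turned into a small lookup. The following is a minimal sketch, not part of the library: `PROVIDER_IMPORTS`, `import_line`, and `load_model_class` are hypothetical names introduced here for illustration, and only a subset of the rows is copied in. It assumes that `load_model_class` is called in an environment where the `agno` package and the chosen provider's SDK are installed.

```python
import importlib

# Module path and class name copied from the table above (subset shown).
# The dictionary and the helper names below are illustrative assumptions.
PROVIDER_IMPORTS = {
    "openai": ("agno.models.openai", "OpenAIResponses"),
    "anthropic": ("agno.models.anthropic", "Claude"),
    "groq": ("agno.models.groq", "Groq"),
    "ollama": ("agno.models.ollama", "Ollama"),
}


def import_line(provider: str) -> str:
    """Return the import statement listed in the table for a provider key."""
    module_path, class_name = PROVIDER_IMPORTS[provider.lower()]
    return f"from {module_path} import {class_name}"


def load_model_class(provider: str):
    """Import the provider module lazily and return its model class.

    Raises KeyError for an unknown provider key and ImportError if the
    agno package (or the provider's SDK) is not installed.
    """
    module_path, class_name = PROVIDER_IMPORTS[provider.lower()]
    module = importlib.import_module(module_path)
    return getattr(module, class_name)
```

Calling `import_line("Anthropic")` returns the same import string shown in the Anthropic row. `load_model_class` defers the actual import until a provider is selected, so unused provider SDKs do not need to be installed.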