Skip to main content

Code

cookbook/11_models/vllm/tool_use.py
"""Build a Web Search Agent using xAI."""

from agno.agent import Agent
from agno.models.vllm import VLLM
from agno.tools.hackernews import HackerNewsTools

agent = Agent(
    model=VLLM(
        id="NousResearch/Nous-Hermes-2-Mistral-7B-DPO", top_k=20, enable_thinking=False
    ),
    tools=[HackerNewsTools()],
    markdown=True,
)
agent.print_response("Whats happening in France?")

Usage

1

Set up your virtual environment

uv venv --python 3.12
source .venv/bin/activate
2

Install Libraries

uv pip install -U agno openai vllm
3

Start vLLM server

vllm serve NousResearch/Nous-Hermes-2-Mistral-7B-DPO \
    --enable-auto-tool-choice \
    --tool-call-parser hermes \
    --dtype float16 \
    --max-model-len 8192 \
    --gpu-memory-utilization 0.9
4

Run Agent

python cookbook/11_models/vllm/tool_use.py