from agno.agent import Agent from agno.models.vllm import VLLM agent = Agent( model=VLLM(id="Qwen/Qwen2.5-7B-Instruct", top_k=20, enable_thinking=False), markdown=True, ) agent.print_response("Share a 2 sentence horror story")
Create a virtual environment
Terminal
python3 -m venv .venv source .venv/bin/activate
Setup vLLM Server
pip install vllm python -m vllm.entrypoints.openai.api_server \ --model Qwen/Qwen2.5-7B-Instruct \ --port 8000
Install libraries
pip install -U openai agno
Run Agent
python cookbook/models/vllm/basic.py