Text-to-SQL Agent

A self-learning SQL agent that queries Formula 1 data (1950-2020) and improves through accumulated knowledge. Customize and connect it to your own data to build a powerful text-to-SQL agent.

What Makes This Different

Most Text-to-SQL tutorials show you how to generate SQL from natural language. This one goes further:

Knowledge-Based Query Generation: The agent searches a knowledge base before writing SQL, ensuring consistent patterns
Data Quality Handling: Instead of cleaning messy data, the agent learns to handle inconsistencies (mixed types, date formats, naming conventions)
Self-Learning Loop: Users can save validated queries, which the agent retrieves for similar future questions

What You’ll Learn

Concept	Description
Semantic Model	Define table metadata to guide query generation
Knowledge Base	Store and retrieve query patterns and data quality notes
Data Quality Handling	Handle type mismatches and inconsistencies without ETL
Self-Learning	Save validated queries to improve future responses
Agentic Memory	Remember user preferences across sessions

Prerequisites

Python 3.12+
Docker (for PostgreSQL with pgvector)
OpenAI API key

Setup

Clone the repository

git clone https://github.com/agno-agi/agno.git
cd agno

Create and activate virtual environment

uv venv .venvs/text-to-sql --python 3.12
source .venvs/text-to-sql/bin/activate

Install dependencies

uv pip install -r cookbook/01_showcase/01_agents/text_to_sql/requirements.in

Set environment variables

export OPENAI_API_KEY=your-openai-key

Start PostgreSQL with pgvector

./cookbook/scripts/run_pgvector.sh

This starts a PostgreSQL container with pgvector on port 5532.

Check setup

python cookbook/01_showcase/01_agents/text_to_sql/scripts/check_setup.py

Load F1 data and knowledge base

python cookbook/01_showcase/01_agents/text_to_sql/scripts/load_f1_data.py
python cookbook/01_showcase/01_agents/text_to_sql/scripts/load_knowledge.py

Run the Agent

Basic Queries

Run simple aggregation and filtering queries:

python cookbook/01_showcase/01_agents/text_to_sql/examples/basic_queries.py

Demonstrates:

Simple aggregation queries (counts, sums)
Filtering by year or driver
Using the semantic model to find relevant tables

Self-Learning Loop

See how the agent saves validated queries to improve:

python cookbook/01_showcase/01_agents/text_to_sql/examples/learning_loop.py

Demonstrates:

Query execution and validation
Saving queries to the knowledge base
How saved queries improve future responses

Edge Cases

Test complex queries and error handling:

python cookbook/01_showcase/01_agents/text_to_sql/examples/edge_cases.py

Evaluate Accuracy

Run automated accuracy testing:

python cookbook/01_showcase/01_agents/text_to_sql/examples/evaluate.py

Agent Configuration

sql_agent = Agent(
    name="SQL Agent",
    model=OpenAIResponses(id="gpt-5.2"),
    db=sql_agent_db,
    knowledge=sql_agent_knowledge,
    system_message=system_message,
    tools=[
        SQLTools(db_url=DB_URL),
        ReasoningTools(add_instructions=True),
        save_validated_query,
    ],
    add_datetime_to_context=True,
    enable_agentic_memory=True,
    search_knowledge=True,
    add_history_to_context=True,
    num_history_runs=5,
    read_chat_history=True,
    read_tool_call_history=True,
    markdown=True,
)

Parameter	Purpose
`model`	GPT-5.2 for query generation and reasoning
`db`	PostgreSQL connection for query execution
`knowledge`	Vector database storing query patterns and table metadata
`SQLTools`	Execute SQL queries against the database
`ReasoningTools`	Think step-by-step before query construction
`save_validated_query`	Custom tool to persist successful queries
`enable_agentic_memory`	Remember user preferences across sessions
`search_knowledge`	Search knowledge base before writing SQL
`add_history_to_context`	Include recent conversation for context

How It Works

Query Workflow

User asks a question
Agent searches knowledge base for similar queries
Agent identifies tables from semantic model
Agent constructs and executes SQL
Agent validates results and presents answer
Agent offers to save the query for future use

Knowledge Base

The knowledge base stores three types of information:

Type	Purpose
Table metadata	Column names, types, and descriptions
Query patterns	Reusable SQL patterns for common operations
Validated queries	User-approved queries saved for retrieval

Semantic Model

The semantic model provides high-level context about available tables:

{
    "tables": [
        {
            "table_name": "drivers_championship",
            "table_description": "Driver championship standings (1950 to 2020).",
            "use_cases": [
                "Driver standings by year",
                "Comparing driver points across seasons"
            ]
        }
    ]
}

Self-Learning Loop

Agent executes a query and validates results
Agent asks: “Would you like to save this query?”
If confirmed, stores the question, SQL, and explanation
Future similar questions retrieve this pattern automatically

Troubleshooting

Database connection refused

Ensure PostgreSQL is running:

docker ps | grep pgvector

If not running:

./cookbook/scripts/run_pgvector.sh

Query returns wrong results for position

The position column in drivers_championship is TEXT, not INT. Use string comparison:

WHERE position = '1'  -- Correct
WHERE position = 1    -- Incorrect

Knowledge base not found

Ensure you’ve loaded the knowledge base:

python cookbook/01_showcase/01_agents/text_to_sql/scripts/load_knowledge.py

Get Started

Basics

Advanced

Production

Providers

Other

Additional Resources

Text-to-SQL Agent

What Makes This Different

What You’ll Learn

Prerequisites

Setup

Run the Agent

Basic Queries

Self-Learning Loop

Edge Cases

Evaluate Accuracy

Agent Configuration

How It Works

Query Workflow

Knowledge Base

Semantic Model

Self-Learning Loop

Troubleshooting

Source Code

Get Started

Basics

Advanced

Production

Providers

Other

Additional Resources

​What Makes This Different

​What You’ll Learn

​Prerequisites

​Setup

​Run the Agent

​Basic Queries

​Self-Learning Loop

​Edge Cases

​Evaluate Accuracy

​Agent Configuration

​How It Works

​Query Workflow

​Knowledge Base

​Semantic Model

​Self-Learning Loop

​Troubleshooting

​Source Code

What Makes This Different

What You’ll Learn

Prerequisites

Setup

Run the Agent

Basic Queries

Self-Learning Loop

Edge Cases

Evaluate Accuracy

Agent Configuration

How It Works

Query Workflow

Knowledge Base

Semantic Model

Self-Learning Loop

Troubleshooting

Source Code