Turn any input (text, audio, image, video, or file) into structured, reviewed data. Agno is natively multi-modal and type-safe, so a labeling stack is a few lines of Python. Add anDocumentation Index
Fetch the complete documentation index at: https://docs.agno.com/llms.txt
Use this file to discover all available pages before exploring further.
output_schema and the agent returns data in that shape:
cookbook/data_labeling/_01_text_classification/basic.py
Classification object. Swap the schema and instructions and the same pattern covers extraction, span labeling, scoring, and preference ranking.
Data labeling workflows
Pick the page that matches the shape of label you need.| Workload | Input | Output | Page |
|---|---|---|---|
| Structured extraction | Any modality | Typed Pydantic object | Structured extraction |
| Classification | Any modality | One label, label set, or spans | Classification |
| Scoring / evaluation | Prompt + response | Rubric scores | LLM as judge |
| Preference ranking | Prompt + two responses | Winner + rationale | Preference data |
| Non-text input | Image, audio, video, PDF | Any of the above | Multimodal inputs |
| Reviewed labels | Any input | Adjudicated label + audit trail | Quality pipeline |
Guided paths
| You have | You want | Start with |
|---|---|---|
| Free-form text or documents | Typed records for a warehouse | Structured extraction |
| Model outputs to grade | A score per output | LLM as judge |
| Response pairs | RLHF / DPO training data | Preference data |
| Quality requirements | Two-model agreement and adjudication | Quality pipeline |
Model choice
Each workload uses the model that fits the modality. For example:| Modality | Default in the cookbook |
|---|---|
| Text, document/PDF | openai:gpt-5.5 |
| Image | openai:gpt-5.5 |
| Audio, video | gemini-3-flash-preview |
| Second labeler in the quality pipeline | claude-sonnet-4-5 |
Explore
Structured extraction
Turn any modality into a typed object, with optional per-field confidence.
Classification
Single-label, multi-label, hierarchical, and span labeling.
LLM as judge
Score outputs against a rubric. The same machinery, used for evals.
Preference data
Rank A vs B for RLHF and DPO datasets.
Multimodal inputs
Feed images, audio, video, and PDFs into any labeler.
Quality pipeline
Two labelers, a reviewer, and an adjudicator with an audit trail.