Translation Agent

An emotion-aware translation agent that translates text, analyzes the emotional tone, selects an appropriate voice, and generates localized audio output using Cartesia TTS.

Multi-Step Workflow

The agent follows a sequential workflow:

Identify: Parse text and target language
Translate: Convert text preserving meaning
Analyze Emotion: Detect emotional tone
Get Language Code: Map to 2-letter code
List Voices: Get available Cartesia voices
Select Voice: Choose voice matching language + emotion
Localize Voice: Create language-specific clone
Generate Audio: Create TTS output

Prerequisites

Python 3.12+
OpenAI API key
Cartesia API key

Setup

Clone the repository

git clone https://github.com/agno-agi/agno.git
cd agno

Create and activate virtual environment

uv venv --python 3.12
source .venv/bin/activate

Install dependencies

uv pip install -r cookbook/01_showcase/01_agents/translation_agent/requirements.in

Get Cartesia API key

Set environment variables

export OPENAI_API_KEY=sk-***
export CARTESIA_API_KEY=your-cartesia-api-key
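
Before running the examples, it can help to confirm that both keys are visible to Python. The following is a minimal sketch and not part of the cookbook; `REQUIRED_KEYS` and `missing_keys` are hypothetical names introduced here for illustration.

```python
import os

# Variable names taken from the export commands above.
REQUIRED_KEYS = ("OPENAI_API_KEY", "CARTESIA_API_KEY")


def missing_keys(env=None):
    """Return the required variables that are unset or empty (hypothetical helper)."""
    env = os.environ if env is None else env
    return [key for key in REQUIRED_KEYS if not env.get(key)]


if __name__ == "__main__":
    missing = missing_keys()
    if missing:
        print("Missing environment variables: " + ", ".join(missing))
    else:
        print("All required API keys are set.")
```

Passing a plain dictionary instead of relying on `os.environ` keeps the check easy to exercise without touching the real environment.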

Run the Agent

Basic Translation

Translate text with voice generation:

python cookbook/01_showcase/01_agents/translation_agent/examples/basic_translation.py

Demonstrates:

Text translation
Voice selection
Audio file generation

Emotional Content

Handle emotionally nuanced text:

python cookbook/01_showcase/01_agents/translation_agent/examples/emotional_content.py

Demonstrates:

Emotion detection
Voice matching for emotional tone
Preserving sentiment in translation

Batch Translate

Process multiple translations:

python cookbook/01_showcase/01_agents/translation_agent/examples/batch_translate.py

Agent Configuration

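from agno.agent import Agent
from agno.models.openai import OpenAIResponses
from agno.tools.cartesia import CartesiaTools

# AGENT_INSTRUCTIONS holds the step-by-step workflow prompt; it is defined elsewhere in the example code.
translation_agent = Agent(
    name="Translation Agent",
    description="Translates text and generates localized voice notes",
    instructions=AGENT_INSTRUCTIONS,
    model=OpenAIResponses(id="gpt-5.2"),
    tools=[CartesiaTools()],  # voice listing, localization, and TTS
    add_datetime_to_context=True,
    add_history_to_context=True,
    num_history_runs=5,  # include the last 5 runs in the context
    enable_agentic_memory=True,
    markdown=True,
)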

Parameter	Purpose
`model`	GPT-5.2 for translation and emotion analysis
`instructions`	Step-by-step workflow for translation
`tools`	`CartesiaTools` for voice listing, localization, and TTS

How It Works

Translation Workflow

Identify text and target language
Translate text accurately
Analyze emotion of translated text
Get language code (fr, es, de, etc.)
List available Cartesia voices
Select voice matching language + emotion
Create localized voice clone
Generate audio with localized voice
Return translated text + audio
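
The sequential shape of this workflow can be sketched as a chain of steps that each read the accumulated state and add their own result. This is an illustration only: in the actual agent the model carries out the steps by following its instructions and calling tools. `run_steps` and the placeholder step functions below are hypothetical, and the "translate" step is a stand-in that does not translate anything.

```python
def run_steps(steps, state):
    """Apply each (name, function) step in order, merging its output into the state."""
    trace = []
    for name, step in steps:
        state = {**state, **step(state)}
        trace.append(name)
    return {**state, "trace": trace}


# Placeholder steps covering the first part of the workflow (hypothetical stand-ins).
steps = [
    ("identify", lambda s: {"text": s["request"]["text"], "target": s["request"]["target"]}),
    ("translate", lambda s: {"translated": f"[{s['target']}] {s['text']}"}),  # not a real translation
    ("language_code", lambda s: {"code": {"French": "fr"}[s["target"]]}),
]

result = run_steps(steps, {"request": {"text": "Hello", "target": "French"}})
print(result["trace"])
print(result["code"])
```

Because every step sees the output of the previous ones, a later step such as voice selection can depend on both the language code and the detected emotion.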

Emotion-Voice Mapping

Emotion	Voice Characteristics
Neutral	Clear, professional, moderate pace
Happy	Upbeat, energetic, slightly faster
Sad	Slower, softer, lower energy
Angry	Stronger, more intense
Excited	High energy, dynamic, faster
Calm	Soothing, steady, relaxed
Professional	Formal, clear, authoritative
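
The table above can be expressed as a simple lookup. This is a sketch for illustration, not the agent's internal logic: the agent makes this choice through its instructions, and the `VOICE_TRAITS` dictionary, the `voice_traits` helper, and the fallback to "neutral" for unlisted emotions are all assumptions made here.

```python
# Emotion -> voice characteristics, copied from the table above.
VOICE_TRAITS = {
    "neutral": "clear, professional, moderate pace",
    "happy": "upbeat, energetic, slightly faster",
    "sad": "slower, softer, lower energy",
    "angry": "stronger, more intense",
    "excited": "high energy, dynamic, faster",
    "calm": "soothing, steady, relaxed",
    "professional": "formal, clear, authoritative",
}


def voice_traits(emotion: str) -> str:
    """Return the traits for an emotion; unknown emotions fall back to neutral (an assumption)."""
    return VOICE_TRAITS.get(emotion.strip().lower(), VOICE_TRAITS["neutral"])


print(voice_traits("Sad"))
```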

Supported Languages

Language	Code
French	fr
Spanish	es
German	de
Italian	it
Portuguese	pt
Japanese	ja
Chinese	zh
Korean	ko
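
The "Get Language Code" step amounts to a name-to-code lookup over this table. A minimal sketch, assuming case-insensitive language names; `LANGUAGE_CODES` and `language_code` are hypothetical names, and raising an error for unlisted languages is a design choice made here rather than documented agent behavior.

```python
# Language name -> 2-letter code, copied from the table above.
LANGUAGE_CODES = {
    "french": "fr",
    "spanish": "es",
    "german": "de",
    "italian": "it",
    "portuguese": "pt",
    "japanese": "ja",
    "chinese": "zh",
    "korean": "ko",
}


def language_code(name: str) -> str:
    """Map a language name to its 2-letter code, ignoring case and surrounding spaces."""
    key = name.strip().lower()
    if key not in LANGUAGE_CODES:
        raise ValueError(f"Unsupported language: {name!r}")
    return LANGUAGE_CODES[key]


print(language_code("Japanese"))
```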

Troubleshooting

Cartesia API errors

Verify your API key:

echo $CARTESIA_API_KEY

Check your Cartesia dashboard for usage limits.

Voice not available for language

The agent will select the closest available voice and localize it. Some language combinations may have limited voice options.
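
The fallback described above can be sketched as follows: prefer voices in the target language, and if none exist, pick the closest match from all voices and mark it for localization. This is an illustration under stated assumptions, not the agent's implementation. The voice records are plain dictionaries with hypothetical `id`, `language`, and `description` fields, which may not match what Cartesia actually returns, and `pick_voice` is a name introduced here.

```python
def pick_voice(voices, language_code, emotion_keywords=()):
    """Return (voice, needs_localization) for the best keyword match.

    Voices in the target language are preferred; if none exist, the best
    match from any language is returned and flagged for localization.
    """
    def score(voice):
        description = voice.get("description", "").lower()
        return sum(1 for word in emotion_keywords if word.lower() in description)

    same_language = [v for v in voices if v.get("language") == language_code]
    pool = same_language or voices
    if not pool:
        raise ValueError("no voices available")
    return max(pool, key=score), not same_language


# Hypothetical sample data for illustration only.
sample_voices = [
    {"id": "a", "language": "en", "description": "Calm, soothing narrator"},
    {"id": "b", "language": "fr", "description": "Upbeat and energetic"},
    {"id": "c", "language": "fr", "description": "Soft, slow"},
]

voice, needs_localization = pick_voice(sample_voices, "de", ["calm"])
print(voice["id"], needs_localization)
```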

Audio not generated

Check the response object for audio content:

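# `response` is the result returned by running the agent; `audio` is empty when no TTS output was produced.
if response.audio:
    print(f"Audio bytes: {len(response.audio[0].content)}")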
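
Once audio content is present, it can be written to disk for playback. A minimal sketch, assuming the content is raw `bytes`; if it arrives in another form, such as a base64 string, it would need decoding first. `save_audio` is a hypothetical helper, not part of the cookbook.

```python
from pathlib import Path


def save_audio(content: bytes, path: str) -> int:
    """Write audio bytes to `path`, creating parent folders; return the number of bytes written."""
    if not content:
        raise ValueError("audio content is empty")
    target = Path(path)
    target.parent.mkdir(parents=True, exist_ok=True)
    return target.write_bytes(content)
```

With a response in hand, this could be called with `response.audio[0].content` and an output path whose extension matches the audio format that was generated.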
