Skip to main content
ExampleDescription
Audio Input OutputFetch the audio file and convert it to a base64 encoded string.
Audio Sentiment AnalysisGive a sentiment analysis of this audio conversation. Use speaker A, speaker B to identify speakers.
Audio StreamingMono (Change to 2 if Stereo).
Audio To TextGive a transcript of this audio conversation. Use speaker A, speaker B to identify speakers.
Image To AudioConvert image descriptions to audio output.
Image To ImageTransform images using agent-driven processing.
Image To Structured OutputExtract structured data from images.
Image To TextImage to Text Example.
Media Input For ToolExample showing how tools can access media (images, videos, audio, files) passed to the agent.
Video CaptionGenerate captions from video content.