- Image generation and analysis with contextual responses
- Audio transcription and generation
- Video processing and generation
- Document understanding
- Multi-tool and multi-turn interactions

Process and generate images, audio, video, and files with agents and teams.