Voxtral TTS Demo
Voxtral TTS is a text-to-speech model that can synthesize realistic speech. This release includes an open-weight model with fixed voices, and our proprietary model with voice customization capabilities.
Test the full extent of our Voxtral TTS model in this demo space, or visit our AI Studio for a better experience. For our open-weights release, learn more about it here.
Fixed Voices
Enter text to synthesize and select a predefined voice available through our AI Studio.
Select a predefined voice
Examples