Gradio

Voxtral TTS Demo

Please sign-in to this space by clicking on "Sign in with Hugging Face" above.

Voxtral TTS is a text-to-speech model that can synthesize realistic speech. This release includes an open-weight model with fixed voices, and our proprietary model with voice customization capabilities.

Test the full extent of our Voxtral TTS model in this demo space, or visit our AI Studio for a better experience. For our open-weights release, learn more about it here.

Voxtral TTS Demo

Please sign-in to this space by clicking on "Sign in with Hugging Face" above.

Fixed Voices