Google is rolling out Gemini 3.1 Flash TTS, its latest text-to-speech AI model, promising more natural and expressive synthesized voices. The update adds granular control over vocal performance and is aimed at developers and enterprises building next-generation audio applications.
First detailed by Google DeepMind, the model posts an Elo score of 1,211 on the Artificial Analysis TTS leaderboard, which indicates a strong human preference for its output quality.
