Google is releasing Gemini 3.5 Live Translate, a new audio model aiming to deliver fluid, natural-sounding voice translations in over 70 languages. This advancement builds on two decades of Google's machine learning efforts in language translation.
The system automatically detects languages and generates speech that mirrors the original speaker's intonation, pacing, and pitch. Unlike traditional turn-by-turn translation, Gemini 3.5 Live Translate continuously generates audio, maintaining a few seconds lag to ensure contextual accuracy while staying in sync with the speaker.
Broader Rollout and Developer Access
The technology is rolling out across Google products. Developers can access it via the Gemini Live API and Google AI Studio. Enterprise users will see it in private preview within Google Meet starting this month.
