Gemini 3.5 Live Translate is our latest audio model, delivering near real-time speech-to-speech translation in over 70 languages — preserving intonation, pacing, and pitch.
Gemini 3.5 Live Translate is available in public preview via the Gemini Live API and Google AI Studio, in private preview for Google Meet, and rolling out globally in Google Translate on Android and iOS.
Public preview for developers. Build real-time voice translation apps with the Gemini Live API. Integrate in minutes.
Speech translation in video meetings with 70+ languages and 2000+ language combinations. Private preview for Workspace customers.
Rolling out globally. Connect headphones for seamless tone-preserving translation. New listening mode on Android.
Platforms like Agora, Fishjam, LiveKit, Pipecat, and Vision Agents enable developers to build custom voice translation apps.
Unlike traditional translation systems that wait for a speaker to finish, Gemini 3.5 Live Translate processes speech as it is streamed — detecting languages automatically, handling multilingual input, and staying just seconds behind the speaker.
The model ingests live audio in real time, automatically detecting the source language from 70+ supported languages without any manual configuration.
Gemini 3.5 processes meaning and intent, not just words. It preserves intonation, pacing, and pitch so the translated speech sounds natural and expressive.
The model generates smooth, natural-sounding translated speech output that maintains the speaker's vocal character — all within seconds of the original utterance.
All generated audio is imperceptibly watermarked with SynthID, ensuring AI-generated content remains detectable and preventing misuse.
From automatic language detection to noise robustness, the model is built for real-world voice translation scenarios.
Processes speech as it's streamed, staying just a few seconds behind the speaker — no awkward pauses.
Automatic detection and translation across 70+ languages without manual language selection.
Maintains the speaker's intonation, pacing, and pitch — translation that sounds like you.
Built for loud environments. Handles background noise and multiple speakers with ease.
Every output carries an imperceptible SynthID watermark for responsible AI deployment.
Public preview on the Gemini Live API. Integrate real-time translation into any application.
In Google Meet, supports translation between any of 70+ languages — not just to and from English.
On Android, hold phone to ear like a call for private, headphone-free translation.
Partners and developers testing Gemini 3.5 Live Translate share their first impressions.
Quick answers about Gemini 3.5 Live Translate, its capabilities, and availability.