Google Meet's AI Translates Conversations in Real-Time

New AI-powered feature enables instant spoken language translation, connecting global teams faster than expected.

Google Meet now offers real-time speech translation, powered by AI, allowing seamless communication across language barriers. This feature, developed by Google Meet, DeepMind, and Research teams, significantly reduces translation latency, making natural conversations possible for users.

Katie Rowan

By Katie Rowan

September 23, 2025

4 min read

Google Meet's AI Translates Conversations in Real-Time

Key Facts

  • Google Meet now offers real-time language translation for spoken conversations.
  • The feature is currently available in Italian, Portuguese, German, and French.
  • The development, a collaboration between Google Meet, DeepMind, and Research teams, took two years instead of an estimated five.
  • New 'one-shot' large models reduced translation latency from 10-20 seconds to 2-3 seconds.
  • The system aims to make simultaneous conversation across different languages feasible.

Why You Care

Ever been in a video call where language was a barrier? Imagine understanding everyone, instantly. Google Meet has launched a new AI-powered feature that translates spoken language in near real-time. This means you can communicate effortlessly with colleagues or clients worldwide. How will this change your global collaborations?

This creation is a big deal for anyone working in diverse teams or across international borders. It removes a significant hurdle in virtual communication. Your meetings can now be more inclusive and productive, regardless of the languages spoken.

What Actually Happened

Google Meet now offers real-time language translation, a feature enabling communication across different languages. This capability translates speech in near real-time, allowing you to connect with anyone on your call. The company reports that this feature is currently available in Italian, Portuguese, German, and French.

This creation resulted from a collaboration between Google Meet, Google DeepMind, and Google Research teams. Fredric Lindstrom, who leads the audio engineering team in Meet, noted the rapid acceleration due to AI. His team initially estimated a five-year creation timeline for instantaneous translation, as he explains. However, the project was completed in just two years, much faster than anticipated, according to the announcement.

Why This Matters to You

This real-time translation capability can significantly enhance your virtual meeting experience. Think of it as having a personal interpreter built directly into your video calls. It fosters better understanding and stronger connections with international colleagues.

For example, imagine you are presenting to a French-speaking client. The system translates your English speech into French for them, and their French responses into English for you. This creates a fluid, natural dialogue. What opportunities does this open up for your business or personal connections?

Key Benefits of Real-Time Translation:

  • Reduced Latency: Conversations flow more naturally with minimal delay.
  • Enhanced Inclusion: Everyone can participate fully, regardless of their native language.
  • Improved Productivity: Misunderstandings are minimized, leading to more efficient meetings.
  • Global Reach: Connect with a wider audience without language being a barrier.

Huib Kleinhout, who leads product management for audio quality, highlighted the core creation. He stated, “You send audio in and almost immediately, the model starts outputting audio.” This drastically cuts down the time lag, making conversations feel more natural for you.

The Surprising Finding

The most unexpected aspect of this creation was the speed of its creation. Initially, the team believed real-time translation would take five years to achieve. However, as Fredric Lindstrom explained, “As things go with AI, things just went faster and faster.” The project concluded in just two years, a remarkable acceleration.

Previous audio translation methods involved a multi-step process: transcribing speech, translating text, then converting it back to speech. This older approach caused significant delays, often 10-20 seconds, making natural conversation impossible, the research shows. The new approach, using “large models” capable of “one-shot” translation, bypassed these steps. This reduced latency to a mere two to three seconds, which the team found to be the “sweet spot” for natural conversation, as detailed in the blog post. This rapid creation and technical leap truly surprised the engineers themselves.

What Happens Next

Expect to see this feature expand to more languages in the coming months. The company reports that the Google community, including engineers from Pixel, Cloud, and Chrome, continues to refine the system. This ongoing collaboration with Google DeepMind aims to improve translation quality and expand its reach.

For example, you might soon be able to join a meeting with participants speaking five different languages, all understanding each other in real-time. Actionable advice for you: explore how this tool can integrate into your existing workflows. Consider piloting it for international team meetings or client presentations. This system will likely become standard in virtual communication platforms, further flattening global communication barriers. The team revealed that their focus remains on ensuring high-quality translation despite challenges like accents or background noise.

Ready to start creating?

Create Voiceover

Transcribe Speech

Create Dialogues

Create Visuals

Clone a Voice