Google Meet's AI Translation: Bridging Language Gaps in Real-Time

New AI-powered feature enables near real-time spoken translation in Google Meet, connecting global teams.

Google Meet now offers real-time spoken language translation, powered by AI. This innovation allows users to communicate seamlessly across different languages during video calls. The feature currently supports Italian, Portuguese, German, and French.

Katie Rowan

By Katie Rowan

September 23, 2025

3 min read

Google Meet's AI Translation: Bridging Language Gaps in Real-Time

Key Facts

  • Google Meet has launched a real-time spoken language translation feature.
  • The feature is currently available in Italian, Portuguese, German, and French.
  • The development was a collaboration between Google Meet, DeepMind, and Google Research teams.
  • Previous translation methods had 10-20 second latency; the new method aims for 2-3 second latency.
  • The project was completed in two years, much faster than the initial five-year estimate due to AI advancements.

Why You Care

Ever been on a video call where language barriers made collaboration a headache? Imagine effortlessly understanding colleagues speaking a different language. Google Meet just rolled out a new AI-powered feature that makes real-time spoken translation a reality. This means your global meetings can now flow more smoothly than ever before. How will this change your international collaborations?

What Actually Happened

Google Meet, in collaboration with DeepMind and Google Research teams, has launched a real-time spoken translation feature. This creation allows users to communicate across language differences during live video calls, according to the announcement. The feature translates speech in near real-time. Currently, speech translation in Google Meet is available in Italian, Portuguese, German, and French, the company reports. This creation aims to connect people globally, making virtual meetings more inclusive.

Fredric Lindstrom, who leads the audio engineering team in Meet, highlighted the rapid pace of AI creation. His team began working on Speech Translation about two years ago. At that time, existing models could only handle offline translation, as detailed in the blog post. The main challenge was achieving instantaneous translation for live calls.

Why This Matters to You

This new capability directly impacts how you interact in a globalized world. No more waiting for delayed translations or struggling to grasp nuances. The goal is to make conversations feel natural, almost as if a human interpreter were present. Think of it as having a universal translator in your pocket for every meeting.

Key Benefits of Real-Time Translation:

  • Enhanced Collaboration: Break down communication barriers instantly.
  • Increased Efficiency: Reduce time spent clarifying misunderstandings.
  • Greater Inclusivity: Empower more team members to participate fully.
  • Natural Conversation Flow: Experience near-instantaneous translation.

“You send audio in and almost immediately, the model starts outputting audio,” Huib Kleinhout, who leads product management for audio quality, explained. This drastically reduces latency. This means you can focus on the discussion, not the language. How will this improved communication impact your team’s productivity?

The Surprising Finding

Initially, the Google team anticipated a much longer creation timeline for real-time speech translation. “When we started, we thought, ‘Maybe this will take five years,’” Fredric Lindstrom explained. However, the rapid advancement of AI significantly accelerated this process. Two years later, the feature is live. This accelerated timeline challenges the common assumption that complex AI solutions require extensive, multi-year creation cycles. “As things go with AI,” Lindstrom noted, “things just went faster and faster.” The swift progress underscores AI’s evolving capability to solve intricate problems more quickly than expected.

Previous audio translation technologies faced significant latency issues. They involved a multi-step process: transcribing speech, translating text, then converting it back to speech. This often resulted in delays of 10-20 seconds, making natural conversation impossible, the technical report explains. The new approach, using ‘one-shot’ translation models, bypasses these delays.

What Happens Next

While currently available in four languages, expect Google to expand the real-time translation feature to more languages in the coming months. The team will continue refining the system, addressing challenges like accents and background noise. For example, imagine a multinational company conducting quarterly reviews where executives from different continents can speak freely. The system will handle the language conversion seamlessly. Your role in global teams could become much simpler.

This creation suggests a future where language is no longer a significant barrier in virtual communication. Businesses should consider how to integrate this system to foster more inclusive and efficient remote work environments. The industry implication is a move towards truly global, frictionless digital interactions. This could set a new standard for collaboration platforms.

Ready to start creating?

Create Voiceover

Transcribe Speech

Create Dialogues

Create Visuals

Clone a Voice