Google Boosts AI Creation with Veo 2 and Imagen 3

New versions of Google's video and image generation models promise enhanced realism and creative control for users.

Google has unveiled Veo 2 and Imagen 3, advanced AI models for video and image generation. These updates are integrated into Google Labs tools, offering users improved realism, cinematic understanding, and diverse artistic styles for their creative projects.

By Sarah Kline

December 5, 2025

Key Facts

  • Google has released updated versions of its video and image generation models: Veo 2 and Imagen 3.
  • Veo 2 generates high-quality videos at up to 4K resolution, with improved realism and a better understanding of cinematography.
  • Imagen 3 produces brighter, better-composed images with more diverse art styles.
  • These models are available in Google Labs tools: VideoFX, ImageFX, and a new tool called Whisk.
  • Veo 2 reduces the frequency of 'hallucinated' unwanted details in video outputs.

Why You Care

Ever wished you could direct a Hollywood-quality scene with just a few words? Or conjure an image that perfectly matches your vision? Google is making both more accessible. It has just rolled out significant updates to its AI-powered creation tools. This means your next video project or visual content could look more professional and polished than before.

What Actually Happened

Google has officially announced new versions of its generative AI models, Veo 2 and Imagen 3. These updates are now available through Google Labs tools, specifically VideoFX and ImageFX, according to the announcement. A new experimental tool called Whisk also integrates these capabilities. Veo 2 focuses on creating high-quality videos, offering improved realism and a deeper understanding of cinematography. Meanwhile, Imagen 3 aims to produce brighter, better composed images with a wider range of art styles, as detailed in the blog post. These advancements build upon earlier iterations of Veo and Imagen, which were introduced earlier this year.

Why This Matters to You

These new models are a significant upgrade to your creative toolkit. Imagine generating video content that follows your cinematic requests, as if you had a virtual film crew at your fingertips. For example, you could ask Veo 2 for a “low-angle tracking shot that glides through the middle of a scene” and, according to Google, it will deliver. This level of detail was previously difficult to achieve with AI. The company reports that Veo 2 also reduces unwanted details, like extra fingers, which can plague AI-generated content.

Key Improvements for Creators

Feature | Veo 2 (Video) | Imagen 3 (Image)
Realism | Improved understanding of physics and movement | Brighter, better composed images
Cinematography | Recognizes genres, lenses, cinematic effects | More diverse art styles
Resolution | Up to 4K, extended to minutes in length | Enhanced visual quality
Detail Control | Reduces “hallucinated” unwanted elements | Greater control over image composition

How will these enhanced capabilities change your daily content creation workflow? Elias Roman, Senior Director of Product Management at Google Labs, highlighted the potential, stating, “it’s been exciting to watch people bring their ideas to life with help from these models.” This indicates a focus on empowering users.

The Surprising Finding

One particularly interesting aspect of Veo 2 is its understanding of cinematography. You might assume AI struggles with nuanced creative directions. However, the technical report explains that Veo 2 comprehends specific cinematic language. For instance, asking for an “18mm lens” in your prompt will result in the wide-angle shot associated with that lens. Similarly, suggesting “shallow depth of field” will blur the background and focus on your subject. This goes beyond simple object generation. It demonstrates a surprising grasp of artistic intent and technical camera work. It challenges the common assumption that AI only produces generic outputs. This level of control offers creators artistic precision.
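To make the prompting idea concrete, here is a minimal sketch of how a creator might assemble a prompt from the kinds of cinematic terms quoted above. It only builds a text string and calls no Google API; the function name `build_video_prompt`, its parameters, and the example subject are hypothetical and chosen purely for illustration.

```python
def build_video_prompt(subject, shot=None, lens=None, effects=()):
    """Join a subject with optional cinematic terms into one prompt string.

    Hypothetical helper for illustration only; it is not part of any Google tool.
    """
    parts = [subject]
    if shot:
        parts.append(shot)            # e.g. "low-angle tracking shot"
    if lens:
        parts.append(f"{lens} lens")  # e.g. "18mm" becomes "18mm lens"
    parts.extend(effects)             # e.g. ["shallow depth of field"]
    return ", ".join(parts)


prompt = build_video_prompt(
    "a cyclist crossing a rainy city street",  # made-up example subject
    shot="low-angle tracking shot",
    lens="18mm",
    effects=["shallow depth of field"],
)
print(prompt)
```

The resulting text could then be pasted into a tool such as VideoFX. Keeping the shot, lens, and effects as separate arguments makes it easy to vary one cinematic choice at a time and compare the results.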

What Happens Next

These updated models are already accessible through Google Labs tools. We can expect further refinements and broader integration in the coming months. For example, content creators might see these capabilities integrated into popular video editing software by early next year. Your actionable takeaway is to experiment with VideoFX, ImageFX, and Whisk now and familiarize yourself with their expanded creative potential. The industry implications are clear: AI-powered content generation is becoming increasingly user-friendly. This will likely democratize high-quality content production. Aäron van den Oord, a Research Scientist at Google DeepMind, emphasized the ongoing commitment to safety and responsible creation, as mentioned in the release. This suggests that ethical considerations will continue to guide future advancements.
