SIMA 2: AI Agent Learns, Reasons, and Interacts in 3D Worlds

DeepMind's SIMA 2, powered by Gemini, evolves into an interactive gaming companion with advanced reasoning.

DeepMind has unveiled SIMA 2, an AI agent that can not only follow instructions but also reason, converse, and improve itself within virtual 3D environments. Built on Gemini models, it is presented as a significant step toward more general and helpful AI, with implications for future robotics and embodied AI.

Sarah Kline

December 6, 2025

4 min read

Key Facts

  • DeepMind introduced SIMA 2, an advanced AI agent for 3D virtual worlds.
  • SIMA 2 integrates Gemini models, enabling reasoning, conversation, and self-improvement.
  • The agent can understand high-level goals and execute goal-oriented actions.
  • It was trained using human demonstration videos and Gemini-generated labels.
  • DeepMind describes SIMA 2 as a significant step towards Artificial General Intelligence (AGI).

Why You Care

Ever wished your video game companion could actually understand you and help out intelligently? What if an AI could not just follow commands but also think, chat, and learn alongside you in virtual worlds? This isn’t science fiction anymore. DeepMind has just introduced SIMA 2, an AI agent that promises to change how we interact with virtual environments. It could change your gaming experiences and even shape the future of AI in robotics.

What Actually Happened

DeepMind, a leading AI research company, recently announced SIMA 2 (Scalable Instructable Multiworld Agent). This new version builds upon the original SIMA, which was designed to follow basic instructions across various virtual environments. According to the announcement, SIMA 2 integrates the capabilities of the company’s Gemini models. This integration allows SIMA 2 to evolve from a simple instruction-follower into a more interactive gaming companion. The team said that SIMA 2 can now think about its goals, converse with users, and improve itself over time. DeepMind frames this as a significant step towards Artificial General Intelligence (AGI), with broad implications for robotics and embodied AI in general.

Why This Matters to You

SIMA 2 brings a new level of intelligence to virtual interactions. Imagine an AI that doesn’t just react but actively understands your intentions and goals. The company reports that SIMA 2 can now describe what it intends to do and detail the steps it is taking to accomplish its goals. This means a more collaborative and less frustrating experience for you.

For example, if you’re playing a complex open-world game, SIMA 2 could help you strategize or complete intricate tasks. Think of it as having a highly intelligent, communicative partner in your virtual adventures. How might this change the way you approach gaming or even virtual collaboration?

Key Capabilities of SIMA 2:

  • Reasoning: Understands high-level goals and performs complex thinking.
  • Conversation: Can chat with users about actions and environments.
  • Self-improvement: Learns and gets better over time in virtual worlds.
  • Goal-Oriented Action: Skillfully executes actions to achieve objectives.

As mentioned in the release, SIMA 2’s new architecture integrates Gemini’s reasoning abilities. This helps it understand a user’s high-level goal, reason about how to pursue that goal, and skillfully execute goal-oriented actions within games.

The Surprising Finding

The most striking aspect of SIMA 2 is its move beyond simple instruction following. The first SIMA could perform over 600 language-following skills, like “turn left” or “climb the ladder,” as detailed in the blog post. However, SIMA 2 can now think and reason about instructions, not just execute them. This is a crucial distinction. According to the blog post, embedding a Gemini model as the agent’s core lets SIMA 2 do more than just respond to commands: it can understand a goal and plan how to reach it. This challenges the common assumption that AI agents are merely automatons and suggests they are becoming more capable collaborators.

For instance, in games it had never seen before, SIMA 2 successfully completed tasks where SIMA 1 struggled. Examples include finding a campfire in ‘ASKA’ and completing a task in ‘MineDojo’. This ability to generalize and reason in novel environments is a significant leap forward.

What Happens Next

The arrival of SIMA 2 points to exciting future applications, which could plausibly begin to emerge over the next 12-24 months. We may see more AI companions in virtual reality and gaming. For example, future versions might assist in educational simulations or even complex professional training environments. The team said that they trained SIMA 2 using a mixture of human demonstration videos with language labels and Gemini-generated labels. Generating labels with Gemini reduces the reliance on human annotation, which may help the agent learn faster.

This work also has major implications for robotics. As the company reports, it brings us closer to embodied AI. Imagine robots that can understand your complex verbal commands and reason through tasks in the real world. For you, this means potentially more intuitive and helpful AI assistants in your daily life. Keep an eye on DeepMind’s progress; SIMA 2 raises the bar for intelligent virtual interaction.
