Google DeepMind launches Genie 3 to create interactive 3D worlds from text
The new Genie 3 model from Google DeepMind introduces frame-by-frame generation, visual memory, and live modifications to AI-created 3D environments.
Google DeepMind has introduced Genie 3, an AI world model capable of generating explorable 3D environments in real time from a simple text prompt.
Unlike earlier versions, it supports several minutes of continuous interaction, basic visual memory, and real-time changes such as altering weather or adding characters.
The system allows users to navigate these spaces at 24 frames per second in 720p resolution, retaining object placement for about a minute.
Users can trigger events within the virtual world by typing new instructions, making Genie 3 suitable for applications ranging from education and training to video games and robotics.
Genie 3’s improvements over Genie 2 include frame-by-frame generation with memory tracking and dynamic scene creation without relying on pre-built 3D assets.
However, the AI model still has limits, including the inability to replicate real-world locations with geographic accuracy and restricted interaction capabilities. Multi-agent features are still in development.
Currently offered as a limited research preview to select academics and creators, Genie 3 will be made more widely available over time.
Google DeepMind has noted that safety and responsibility remain central concerns during the gradual rollout.
Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!