18 Oct 2025

Veo 3.1 brings audio and control to AI filmmaking

With Veo 3.1, creators gain full artistic control, combining reference images, frames and audio to build immersive AI-driven video scenes.

Google DeepMind has unveiled Veo 3.1, the newest upgrade to its video generation model, bringing more artistic freedom, realism and sound integration to its AI filmmaking tool, Flow.

The update gives creators advanced scene control and introduces generated audio across existing features like ‘Ingredients to Video’, ‘Frames to Video’ and ‘Extend’.

Users can now fine-tune visuals by combining multiple reference images, seamlessly link frames into longer clips, and edit scenes with new insert and removal tools that handle shadows and lighting automatically.

Flow’s new precision tools mark a significant step toward cinematic-level storytelling powered by AI.

Veo 3.1 is also accessible through the Gemini API, Vertex AI and the Gemini app, broadening its availability to developers and enterprises alike.

These enhancements signal Google’s ongoing ambition to push the boundaries of generative video technology for creative and professional applications.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!