Google is pushing deeper into AI-generated content with Gemini Omni, a new model designed to turn input in various formats into editable videos through natural language instructions.
The move, announced at Google I/O 2026, extends Gemini beyond text and image generation to multimodal video creation. Gemini Omni combines Gemini’s reasoning and generation tools to create video output from text, image, audio, and video inputs.
The first model in the family, Gemini Omni Flash, is deployed through the Gemini app, Google Flow, and YouTube Shorts. Additional output formats, including images and audio, will be supported in the coming months.
Also read: Google brings Gemini a unique 24/7 AI assistant that integrates Gmail and Workspace
From prompts to conversational editing
Unlike traditional AI video tools that often require repeated prompts or separate editing steps, Gemini Omni is built around conversational editing. Users can continue to improve their videos across multiple instructions while maintaining continuity from previous changes.
Characters remain consistent across scenes, and edits preserve the context of previous prompts, allowing you to make changes to your video without having to restart the creative process. Users can modify the environment, change actions, add objects, or introduce entirely new elements while maintaining the flow of the original scene.
The system also aims to bring more realism to the generated content by applying a broader understanding of physics and contextual knowledge.
Combine multiple inputs into one video
Gemini Omni can work with multiple formats of media simultaneously. You can use existing videos, images, sketches, and audio files as references and convert them into a single output.
This model also leverages Gemini’s broad understanding of historical, scientific, and cultural backgrounds to create explanations and visual storytelling formats in parallel with creative content generation.
Google also introduced an avatar feature that allows users to create digital versions of AI-generated videos using their own voices.
Gemini Omni Flash is rolling out globally to Google AI Plus, Pro, and Ultra subscribers, and will also be available in YouTube Shorts and YouTube Create.
First publication date
