Google’s Gemini Omni brings AI video generation and conversational editing

AI Video & Visuals


Google is pushing deeper into AI-generated content with Gemini Omni, a new model designed to turn input in various formats into editable videos through natural language instructions.

The move, announced at Google I/O 2026, extends Gemini beyond text and image generation to multimodal video creation. Gemini Omni combines Gemini’s reasoning and generation tools to create video output from text, image, audio, and video inputs.

The first model in the family, Gemini Omni Flash, is deployed through the Gemini app, Google Flow, and YouTube Shorts. Additional output formats, including images and audio, will be supported in the coming months.

Also read: Google brings Gemini a unique 24/7 AI assistant that integrates Gmail and Workspace

From prompts to conversational editing

Unlike traditional AI video tools that often require repeated prompts or separate editing steps, Gemini Omni is built around conversational editing. Users can continue to improve their videos across multiple instructions while maintaining continuity from previous changes.

Characters remain consistent across scenes, and edits preserve the context of previous prompts, allowing you to make changes to your video without having to restart the creative process. Users can modify the environment, change actions, add objects, or introduce entirely new elements while maintaining the flow of the original scene.

The system also aims to bring more realism to the generated content by applying a broader understanding of physics and contextual knowledge.

Combine multiple inputs into one video

Gemini Omni can work with multiple formats of media simultaneously. You can use existing videos, images, sketches, and audio files as references and convert them into a single output.

This model also leverages Gemini’s broad understanding of historical, scientific, and cultural backgrounds to create explanations and visual storytelling formats in parallel with creative content generation.

Google also introduced an avatar feature that allows users to create digital versions of AI-generated videos using their own voices.

Gemini Omni Flash is rolling out globally to Google AI Plus, Pro, and Ultra subscribers, and will also be available in YouTube Shorts and YouTube Create.

Follow Storyboard18 on Google for the latest brand marketing and industry updates, as well as in-depth coverage of digital news. Get the latest perspectives only on Storyboard18.

First publication date May 20, 2026, 09:47:48 IST



Source link