Google is testing a new omni model for video generation

AI Video & Visuals


Google appears to be preparing a new Gemini video generation tool called omni. A recently published screenshot of Gemini’s video generation tab includes the following line: “Start with an idea or try a template. Powered by Omni.” The placement is important because Omni appears near where “Toucan,” the currently active video generation tool powered by Veo discovered ahead of Google I/O 2025, is mentioned.

Currently, Gemini’s video generation flow is shown as utilizing Veo 3.1, but image generation is tied to Nano Banana 2 and Nano Banana Pro, with Google describing Nano Banana Pro as built on Gemini 3 and Nano Banana 2 as Gemini 3.1 Flash Image. The open question is whether Omni is a new wrapper around Veo, a new Gemini video model, or an early step towards a Gemini Omni model that can process images and video within a single system. Omni appears in visible UI strings as well as hidden references, so it can also be used as a public product name.

If true (as it’s still pretty speculative), the Gemini would be the first top-of-the-line Omni model to feature a video output.

Google is currently implementing a split model strategy using Veo for video and the Gemini-based Nano Banana model for image generation. Omni could bring those trucks closer together. It also comes in the race for AI video, with ByteDance’s Seedance 2.0 topping video generation benchmarks.

Perhaps the most notable launch window is Google I/O 2026. Google says the event will be held from May 19th to 20th and will likely set the stage for a big reveal for the Gemini media generation, as it will include Gemini and broader AI updates.

sauce



Source link