Introducing Gemini Omni

Last year, Nano Banana brought Gemini intelligence to image generation and editing. Since then, it’s helped millions of people restore old photos, design from sketches, and visualize ideas in ways that weren’t possible before. We built Gemini from the beginning to be natively multimodal, and now we’re taking the next step.

Introducing Gemini Omni, where Gemini’s reasoning ability and creativity meet. Omni is a new model that lets you create anything from any input, including video. Omni allows you to combine images, audio, video, and text as input to produce high-quality videos based on Gemini’s real-world knowledge. You can also easily edit videos through conversations.

Today, we’re rolling out Gemini Omni Flash, the first model in the Omni family, to Gemini apps, Google Flow, and YouTube Shorts. Over time, output formats such as images and audio will also be supported. Here’s what’s special about Omni:

Edit videos through conversations

Gemini Omni allows you to easily edit videos using natural language. Every instruction builds on the last one. Characters are consistent, physics is maintained, and scenes remember what came before.

Change the world around you. Change certain things or change everything. Your video will be the starting point for something you could never have filmed on your own.

Source link