Google deploys Gemini Omni AI for video generation


Google has officially introduced Gemini Omni, a multimodal AI model that combines reasoning and creative generation across video, image, audio, and text inputs. The rollout starts with Gemini Omni Flash, which is available immediately to all Google AI Plus, Pro, and Ultra subscribers worldwide through the Gemini app and Google Flow. Users of the YouTube Shorts and YouTube Create apps will also get free access, and developers and businesses will gain access via the API in the coming weeks.

Gemini Omni stands out for its ability to generate high-quality, context-aware videos from natural language instructions and reference media. Users can edit video conversationally while maintaining scene continuity and character consistency across multiple editing steps. The model's improved understanding of physics allows for more realistic visual effects and scene changes, supporting both creative storytelling and technical explanation. Gemini Omni embeds a SynthID watermark in all outputs for verification and transparency.

This release marks a significant expansion of Google’s multimodal AI services, targeting a wide audience that includes content creators, educators, and enterprise users seeking advanced video and media generation tools. By offering Gemini Omni Flash across its AI subscription tiers and popular creator platforms, Google aims to leverage its existing user base and its responsible AI development expertise to compete directly with other AI generation tools on the market.
