Google DeepMind launches new AI model V2A that can generate video soundtracks and dialogue | Technology News

AI Video & Visuals


Google DeepMind, Google's AI research lab, recently announced V2A, a new model that can generate audio from video.

Google DeepMind V2A | Video to Voice AI | DeepmindGoogle has no plans to make V2A publicly available in the near future. (Image source: Google)

Video generation models such as Sora, Dream Machine, Veo, and Kling are rapidly improving, allowing users to generate videos from text prompts. However, the majority of these systems are limited to silent videos. Google DeepMind appears to recognize this problem, and is currently working on developing a new large-scale language model that can generate soundtracks and dialogue for videos.

In a blog post, the tech giant's AI research lab unveiled V2A (Video to Audio), a new AI model in development that combines video pixels with natural language text prompts to generate a rich soundscape for the on-screen action.

V2A is compatible with Veo, the text-to-video conversion model the company announced at the recently concluded Google I/O 2024, and can be used to add dramatic music, realistic sound effects, and dialogue that matches the mood of a video. Google says the new large-scale language model can also be used with “traditional footage,” such as silent films and archival material.

YouTube Poster

The new V2A model can generate “an unlimited number of soundtracks” for any video, features optional “positive prompts” and “negative prompts” so you can tailor the output to your preferences, and also watermarks the generated audio with SynthID technology.

DeepMind's V2A technology takes audio descriptions as input and uses a diffusion model trained on a combination of audio, transcripts, and videos. The model hasn't been trained on many videos, so the output can be distorted. Google also said it won't open V2A to the public anytime soon to prevent it from being misused.


© IE Online Media Services, Inc.

First uploaded: 18 Jun 2024 17:10 IST



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *