Google DeepMind launches new AI model V2A that can generate video soundtracks and dialogue

Google DeepMind, Google's AI research lab, recently announced V2A, a new model that can generate audio from video.

Google has no plans to make V2A publicly available in the near future. (Image source: Google)

Video generation models such as Sora, Dream Machine, Veo, and Kling are improving rapidly, letting users generate videos from text prompts. However, most of these systems can only produce silent videos. Google DeepMind appears to have recognized this gap and is working on a new AI model that can generate soundtracks and dialogue for videos.

In a blog post, the tech giant's AI research lab unveiled V2A (Video to Audio), a new AI model in development that combines video pixels with natural language text prompts to generate a rich soundscape for the on-screen action.

V2A can be paired with Veo, the text-to-video model the company announced at the recently concluded Google I/O 2024, to add dramatic music, realistic sound effects, and dialogue that matches the mood of a video. Google says the new model can also be used with “traditional footage,” such as silent films and archival material.

The new V2A model can generate “an unlimited number of soundtracks” for any video and supports optional “positive prompts” and “negative prompts” so users can steer the output toward or away from particular sounds. It also watermarks the generated audio with SynthID technology.

DeepMind's V2A technology takes a text description of the desired sound as input and uses a diffusion model trained on a combination of audio, transcripts, and videos. Because the model has not been trained on many videos, the output can sometimes be distorted. Google also said it will not open V2A to the public anytime soon, to prevent misuse.

First uploaded: 18 Jun 2024 17:10 IST


