Categories: Technology

DeepMind’s ‘V2A’ AI technology generates video soundtracks from pixels and text prompts

Google’s DeepMind team has developed new technology that can generate soundtracks for videos using video-to-audio (V2A) technology. This technology can create music, sound effects, and speech both from text prompts and from the video’s pixels. This advancement opens up new possibilities for soundtrack composers and can be applied to both automatic video generation services and existing footage such as archive material and silent movies.

One interesting aspect of this technology is the ability to input both ‘positive prompts’ to guide the audio in a certain direction and ‘negative prompts’ to avoid certain elements. This means that users can create a wide variety of different soundtracks for the same video clip. The system can also generate audio using just video pixels, eliminating the need for text prompts if desired.

While V2A currently has some limitations, such as the quality of the audio being dependent on the video quality and imperfect lip synchronization when generating speech, Google DeepMind is working on further research to address these issues. To learn more and see additional examples of V2A technology in action, visit the Google DeepMind website.

If you’re interested in staying up to date on the latest music and gear news, reviews, deals, and features, you can sign up to receive updates directly to your inbox.

Share
Published by

Recent Posts

Jeff Bezos, Amazon’s founder, intends to sell $5 billion in shares.

Amazon founder Jeff Bezos has announced plans to sell 25 million shares in the tech…

2 mins ago

First case of tularemia confirmed in Jefferson County by Colorado health officials

Jefferson County Public Health has reported the first case of tularemia in a Wheat Ridge…

11 mins ago

Shares of SmartETFs Advertising & Marketing Technology ETF (MRAD) on NYSE Arca Increase by 0.9%

SmartETFs Advertising & Marketing Technology ETF (NYSEARCA:MRAD – Get Free Report) saw its share price…

15 mins ago

WNW (Meiwu Technology) Stock Price Falls by 0.6% on NYSE

Meiwu Technology Company Limited (NYSE: WNW) saw its share price decrease by 0.6% in trading…

21 mins ago

Cultural Shock at Camp STEAMology: Museum of Discovery and Science.

School may be out, but MODS' Camp STEAMology: Culture Shock is here to keep kids…

25 mins ago

DFHTU Stock in Deerfield Healthcare Technology Acquisitions Drops by 2.2% on OTCMKTS

Deerfield Healthcare Technology Acquisitions Corp. (OTCMKTS:DFHTU) saw a 2.2% drop in its share price on…

27 mins ago