Google DeepMind will be able to generate music and sounds for silent videos

Google DeepMind has unveiled a technology that can generate background music and sound effects for silent videos. This video-to-audio (V2A) system is designed to simplify video editing, especially for content creators.

How the technology works

User input. Content creators upload a silent video and can provide keywords or phrases to guide the AI in creating a soundtrack. For a video of a person walking in the dark, for example, cues such as “movies, horror, music, suspense, footsteps on concrete” help the AI understand the mood and setting.

AI processing. DeepMind’s model first analyzes the visual content of the video and combines that analysis with the user’s text cues. Using a diffusion model, the AI then iteratively refines its output until it produces background sound that complements the video.

Customization of the audio track. The model can create several audio options for a single video, so creators can pick the one that best fits their project. The system also takes the emotional tone of the cues into account: cues emphasizing “tension” can produce tense background music, whereas cues like “joyful celebration” lead to more upbeat sounds.
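
The three steps above can be sketched in a few lines of Python. This is only a toy illustration of the general idea (start from random noise, then refine it step by step under the guidance of video and text features), not DeepMind's actual model or API. All function names, the element-wise averaging of features, and the fixed refinement strength are assumptions made for the example; a real diffusion model uses a trained neural network and a noise schedule.

```python
import random


def combine_conditioning(video_features, text_features):
    """Hypothetical fusion step: average the two feature vectors element-wise."""
    return [(v + t) / 2.0 for v, t in zip(video_features, text_features)]


def toy_denoiser(sample, conditioning, strength):
    """Stand-in for a learned network: nudge the sample toward the conditioning."""
    return [x + strength * (c - x) for x, c in zip(sample, conditioning)]


def generate_audio(video_features, text_features, steps=20, strength=0.2, seed=0):
    """Start from random noise and refine it iteratively (toy version)."""
    rng = random.Random(seed)
    conditioning = combine_conditioning(video_features, text_features)
    # Initial "audio" is pure Gaussian noise, one value per feature.
    sample = [rng.gauss(0.0, 1.0) for _ in conditioning]
    for _ in range(steps):
        sample = toy_denoiser(sample, conditioning, strength)
    return sample


# Made-up feature vectors standing in for the analyzed video and the text cues.
video = [0.2, 0.8, 0.5]
text = [0.6, 0.4, 0.1]

# Different seeds give slightly different results, loosely mirroring the
# "several audio options for a single video" behavior described above.
option_a = generate_audio(video, text, seed=1)
option_b = generate_audio(video, text, seed=2)
print(option_a)
print(option_b)
```

Each pass removes a fixed fraction of the remaining noise, so the output ends up close to the combined conditioning while still depending on the random starting point.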

Future Development

DeepMind is actively improving the technology. The company plans for the AI to generate sounds automatically, based solely on video content, which would eliminate the need for user prompts. Work is also underway to improve the synchronization of generated dialogue with the lip movements of characters in the video.

This video-to-audio technology could significantly change video editing, especially for content creators who lack access to professional audio tools or expertise. DeepMind is taking steps to make video creation more accessible and efficient.
