MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

MuseTalk It is a real-time high-quality audio-driven lip-sync model developed by Tencent Music Tianqin Lab, which is specifically used for virtual mouth shape generation. The model can automatically adjust the facial image of the digital character according to the input audio signal, so that its lip shape is highly synchronized with the audio content, thereby achieving the effect of matching the lip shape with the sound. MuseTalk performs well in lip shape generation, and can generate accurate lip shapes with good picture consistency, especially for real-person video generation.

The main features of MuseTalk include:

  1. Real-time performance: Real-time inference at more than 30 frames per second can be achieved on NVIDIA Tesla V100.
  2. Multi-language support: supports audio input in multiple languages such as Chinese, English and Japanese, which enables it to provide services to users in different countries and regions.
  3. High-precision lip sync: Through Latent Space Inpainting technology, high-precision lip modification can be performed on a 256 x 256 pixel facial area.
  4. High picture consistency: The generated lip shape matches the sound accurately and the picture consistency is good.
  5. Wide range of application scenarios: Suitable for a variety of video content processing needs, such as self-media production, virtual anchors, etc.

However, the deployment process of MuseTalk is rather cumbersome and difficult for novice users, and it has high requirements for computer graphics cards and memory. Fortunately, Google launched Google Colab, with which we can quickly, free and easily deploy MuseTalk. Google Colab (also known as Colaboratory) is a free cloud development environment provided by Google, mainly used for tasks such as data analysis, machine learning and deep learning. It is based on Jupyter Notebook, and users can directly write and execute Python code through the browser, and can share and collaborate on editing code with others.

First, open this address:

https://colab.research.google.com/github/camenduru/MuseTalk-jupyter/blob/main/MuseTalk_jupyter.ipynb

Click the upper right corner, change the runtime type, and select T4GPU

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans
MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

You can see that Google Colab has allocated us free 12G memory, 78G hard disk, and GPU computing resources;

Click the small triangle to run the code:

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

After about 3 minutes, the operation is successful.

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

When you see the line Running on public URL, it means that MuseTalk has been successfully deployed, then click this URL:

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

Upload an audio and a reference video:

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

It takes more than 10 seconds to process the video after it is uploaded

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

Then click: Generate

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

If: Error appears, Connection errored out.

You can shorten the video and audio duration to about 20 seconds, and then run it again;

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

The last step takes more time, usually more than 20 minutes;

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans
MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

When the video appears on the right, the processing is complete:

MuseTalk offline lip-sync digital human tool allows beginners to quickly and for free deploy AI digital humans

Then click download in the upper right corner to download the processed video.

 

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.
TutorialEncyclopedia

AI face-changing local one-click AI video face-changing integration package, Facefusion 2.6.1 does not need to be deployed and one-click start

2024-7-21 9:10:05

Encyclopedia

How to use Midjourney to generate sexy beauty pictures? One Midjourney prompt word AI generates 10 sexy beauty portrait styles

2024-7-21 11:29:53

Search