According to Microsoft's official press release, the company today announced VASA-1, a framework for generating video from a single image. The AI framework needs only a real-life portrait photo and a voice audio clip, and from them it can generate accurate, realistic lip-synced talking-face videos that are said to be particularly natural in their facial expressions and head movements.
Much of the related research in the industry currently focuses on lip syncing, while facial dynamics and head movement are usually ignored. As a result, the generated faces tend to look stiff and unconvincing, falling into the uncanny valley.
Microsoft's VASA-1 framework overcomes these limitations of earlier face-generation technology. The researchers trained a diffusion Transformer model on holistic facial dynamics and head movement. The model treats all facial dynamics, including lip motion, expressions, eye gaze, and blinking, as a single latent variable (that is, it generates the entire, highly detailed face in one pass), and is said to produce 512×512-resolution video at 40 FPS in real time.
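To make the idea concrete, the sketch below shows what "denoising a single holistic facial-dynamics latent conditioned on audio" could look like in code. This is not Microsoft's implementation; the module names, latent sizes, noise schedule, and training step are illustrative assumptions only.

```python
# Minimal conceptual sketch (assumed design, not VASA-1's actual code):
# a Transformer denoiser that predicts noise on a sequence of holistic
# facial-dynamics latents, conditioned on per-frame audio features.
import torch
import torch.nn as nn

LATENT_DIM = 256   # assumed size of the holistic face-dynamics latent
AUDIO_DIM = 128    # assumed size of per-frame audio features
SEQ_LEN = 40       # one second of motion at the reported 40 FPS

class HolisticMotionDenoiser(nn.Module):
    """Transformer that denoises a sequence of holistic motion latents."""
    def __init__(self):
        super().__init__()
        self.in_proj = nn.Linear(LATENT_DIM + AUDIO_DIM + 1, 512)
        layer = nn.TransformerEncoderLayer(d_model=512, nhead=8, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, num_layers=4)
        self.out_proj = nn.Linear(512, LATENT_DIM)

    def forward(self, noisy_latents, audio_feats, t):
        # noisy_latents: (B, T, LATENT_DIM); audio_feats: (B, T, AUDIO_DIM)
        # t: (B,) diffusion timestep, broadcast to every frame as conditioning
        t_emb = t.view(-1, 1, 1).expand(-1, noisy_latents.size(1), 1).float()
        x = torch.cat([noisy_latents, audio_feats, t_emb], dim=-1)
        return self.out_proj(self.backbone(self.in_proj(x)))

# One illustrative DDPM-style training step: add noise, predict it back.
model = HolisticMotionDenoiser()
clean = torch.randn(2, SEQ_LEN, LATENT_DIM)     # stand-in ground-truth latents
audio = torch.randn(2, SEQ_LEN, AUDIO_DIM)      # stand-in aligned audio features
t = torch.randint(0, 1000, (2,))
noise = torch.randn_like(clean)
alpha = (1.0 - t.float() / 1000).view(-1, 1, 1)  # toy noise schedule
noisy = alpha.sqrt() * clean + (1 - alpha).sqrt() * noise
pred = model(noisy, audio, t)
loss = nn.functional.mse_loss(pred, noise)
loss.backward()
print(f"toy diffusion loss: {loss.item():.4f}")
```

The point of modeling lips, expression, gaze, and blinking as one latent sequence, rather than as separate streams, is that the denoiser can keep them mutually consistent frame by frame.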
Microsoft also used 3D technology to help annotate facial features and designed an additional loss function, claiming that VASA-1 not only generates high-quality facial videos but also effectively captures and reproduces the 3D structure of the face.
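The press release does not spell out the loss, but an auxiliary 3D term is commonly combined with a pixel-level reconstruction term. The sketch below is an assumed composite objective for illustration only: an L1 image loss plus a weighted penalty on the distance between 3D facial landmarks of generated and real frames.

```python
# Illustrative composite loss (an assumption, not the paper's actual objective):
# pixel reconstruction plus an auxiliary 3D-landmark consistency term.
import torch
import torch.nn.functional as F

def composite_loss(gen_frame, real_frame, gen_landmarks3d, real_landmarks3d,
                   landmark_weight=0.1):
    """gen_frame/real_frame: (B, 3, H, W); landmarks: (B, N, 3) 3D keypoints."""
    recon = F.l1_loss(gen_frame, real_frame)              # pixel-level term
    geom = F.mse_loss(gen_landmarks3d, real_landmarks3d)  # 3D-structure term
    return recon + landmark_weight * geom

# Toy usage with random tensors standing in for generated / ground-truth data.
gen = torch.rand(2, 3, 512, 512)
real = torch.rand(2, 3, 512, 512)
gen_lm, real_lm = torch.randn(2, 68, 3), torch.randn(2, 68, 3)
print(composite_loss(gen, real, gen_lm, real_lm).item())
```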