Alibaba Cloudannounced on February 25th that its visual lifeinto a base modelgazillion-phase 2.1 (Wan) Open Source.
This open source uses the most lenient Apache 2.0 protocolThe newest version of the program is the "14B" and "1.3B" parameter specifications, which are open source and support both text-generated video and graph-generated video tasks, and can be downloaded from Github, HuggingFace, and the Magic Hitch community.
It is reported that the 14B Wanphase model excels in command following, complex motion generation, physical modeling, and text and video generation in the review set VBench.With a total score of 86.22%, Manphase 2.1 outperforms domestic and international models such as Sora, Luma, and Pika.Version 1.3B not only outperforms larger open-source models and even approaches some closed-source models, but also runs on consumer graphics cards, claiming "Generate 480P video with only 8.2GB of video memory" for secondary model development and academic research.
In terms of algorithm design, Maxthon has developed efficient causal 3D VAE and scalable pre-training strategies based on the mainstream DiT architecture and the Flow Matching paradigm for linear noise trajectories. Taking 3D VAE as an example, in order to efficiently support the encoding and decoding of videos of arbitrary length, Maxthon implements a feature caching mechanism in the causal convolution module of 3D VAE, which replaces the direct end-to-end coding and decoding process of long videos, and realizes the efficient coding and decoding of unlimited-length 1080P videos. In addition, by advancing the spatial downsampling compression, the memory footprint of 29% during inference is further reduced without performance loss.
The experimental results of the Wanxiang team show that in the 14 main dimensions and 26 sub-dimension tests of motion quality, visual quality, style and multi-targetingThe 10,000 phases have achieved industry-leading performance and five firsts..
With open source address:
-
Github:https://github.com/Wan-Video
-
HuggingFace:https://huggingface.co/Wan-AI
-
Magic Match Community:https://modelscope.cn/organization/Wan-AI