News on August 28: Zhipu AI has open-sourced the CogVideoX-5B video generation model. Compared with the previously open-sourced CogVideoX-2B, the company says its video generation quality is higher and its visual effects are better.
According to the official statement, the model's inference performance has been greatly optimized and the hardware threshold for inference greatly lowered: CogVideoX-2B can run on older graphics cards such as the GTX 1080 Ti, and the CogVideoX-5B model can run on mainstream "sweet-spot" desktop cards such as the RTX 3060.
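For reference, below is a minimal inference sketch using the Hugging Face diffusers library, which ships a CogVideoX pipeline. The memory-saving options shown (CPU offload, VAE tiling and slicing) are what make running on consumer GPUs feasible; the prompt and sampling settings are illustrative assumptions, not official recommendations.

```python
# Minimal sketch: generating a clip with CogVideoX-5B via diffusers.
# CPU offload and VAE tiling/slicing reduce peak VRAM so the model can run
# on consumer GPUs; the prompt and sampling settings are illustrative only.
import torch
from diffusers import CogVideoXPipeline
from diffusers.utils import export_to_video

pipe = CogVideoXPipeline.from_pretrained(
    "THUDM/CogVideoX-5b",
    torch_dtype=torch.bfloat16,
)
pipe.enable_model_cpu_offload()  # keep only the active sub-module on the GPU
pipe.vae.enable_tiling()         # decode the latent video in tiles to save memory
pipe.vae.enable_slicing()

video = pipe(
    prompt="A panda playing guitar in a bamboo forest, cinematic lighting",
    num_inference_steps=50,
    guidance_scale=6.0,
    num_frames=49,
).frames[0]

export_to_video(video, "cogvideox_sample.mp4", fps=8)
```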
CogVideoX is a large-scale DiT (diffusion transformer) model for text-to-video tasks. It mainly uses the following techniques:
- 3D causal VAE: compresses video data into a latent space along both the spatial and temporal dimensions, enabling efficient video reconstruction.
- Expert Transformer: concatenates text and video embeddings, uses 3D-RoPE as the positional encoding, applies expert adaptive LayerNorm to normalize the two modalities separately, and uses full 3D attention for joint spatiotemporal modeling (a conceptual sketch of 3D-RoPE follows this list).
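To make the 3D-RoPE idea concrete, here is a conceptual sketch rather than the project's actual implementation: each video token's channel dimension is split into three groups, and a standard 1D rotary embedding is applied to each group using that token's temporal, height, and width coordinates respectively. The dimensions and the 2:1:1 channel split below are assumptions for illustration.

```python
# Conceptual sketch of 3D-RoPE: split each token's channels into three groups and
# rotate each group with a 1D rotary embedding driven by the token's (t, h, w)
# coordinate. Dimensions and the 2:1:1 channel split are illustrative assumptions.
import torch

def rope_1d(x: torch.Tensor, pos: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Apply a standard 1D rotary embedding to x using integer positions pos."""
    dim = x.shape[-1]                                   # must be even
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2, dtype=torch.float32) / dim))
    angles = pos[:, None].float() * inv_freq[None, :]   # (tokens, dim/2)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., 0::2], x[..., 1::2]                 # interleaved channel pairs
    out = torch.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out

def rope_3d(x: torch.Tensor, t: torch.Tensor, h: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
    """x: (tokens, dim) video tokens; t/h/w: (tokens,) coordinates of each token."""
    dim = x.shape[-1]
    d_t, d_h = dim // 2, dim // 4                       # assumed 2:1:1 split
    return torch.cat([
        rope_1d(x[:, :d_t], t),                         # temporal axis
        rope_1d(x[:, d_t:d_t + d_h], h),                # height axis
        rope_1d(x[:, d_t + d_h:], w),                   # width axis
    ], dim=-1)

# Example: tokens from a 4-frame, 8x8 latent grid with 64 channels.
T, H, W, D = 4, 8, 8, 64
coords = torch.stack(torch.meshgrid(
    torch.arange(T), torch.arange(H), torch.arange(W), indexing="ij"), dim=-1).reshape(-1, 3)
tokens = torch.randn(T * H * W, D)
rotated = rope_3d(tokens, coords[:, 0], coords[:, 1], coords[:, 2])
print(rotated.shape)  # torch.Size([256, 64])
```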
The detailed parameters of CogVideoX-5B and CogVideoX-2B are as follows:
Related links:
- Code repository: https://github.com/THUDM/CogVideo
- Model download: https://huggingface.co/THUDM/CogVideoX-5b
- Paper link: https://arxiv.org/pdf/2408.06072