Yisu: China's first large model for generating ultra-long Sora-level videos

Yisu: China's first large model for generating ultra-long Sora-level videos

A glimpse into the world YiSu is a video generation system developed by Beijing Jijiashijie Technology Co., Ltd. and the Department of Automation of Tsinghua University.Large ModelThis model can generate videos longer than 1 minute and has advantages such as large movements and strong expressiveness. In addition, the YiSu model is cheaper and faster, making it suitable for large-scale product applications.YisuIt is not just a video generation model, but also an important step towards a world model. The world model is crucial for general intelligence in the physical world such as autonomous driving general robots, and plays a key role in data generation, closed-loop simulation, and end-to-end solutions. YiSu demonstrated the same architecture based on video generation, and the effect of using it for autonomous driving and robot scene world models.

YiSu Function

  1. Multimodal fusion capability: The Yisu model is not limited to processing single text or image data, it also has the ability of multimodal fusion. This means that the model can simultaneously understand and generate video content containing multiple information such as text, images, audio, etc. This multimodal fusion capability makes the Yisu model more widely applicable in the field of video generation.
  2. Efficient training and reasoning: By optimizing algorithms and architectures, the Yisu model has achieved significant improvements in both training and reasoning speed. This enables the model to generate video content more quickly and improves the efficiency of video generation. At the same time, the efficient training process also enables the Yisu model to adapt to new data and scenarios more quickly.
  3. Terminal-side operation capability: The Yisu model has the ability to run directly on the terminal device without relying on cloud support. This allows users to quickly generate video content on local devices without waiting for cloud processing time, improving the convenience and flexibility of video generation.
  4. High cost-effectiveness: Compared with other video generation solutions, the Yisu model is lower in cost, faster in speed, and extremely cost-effective. This makes the Yisu model more suitable for various application scenarios, especially those that are cost-sensitive or require fast generation of video content.
  5. Continuous iteration and optimization: The Yisu team is committed to continuous iteration and optimization of the model. They plan to grow and evolve rapidly at the rate of one small version per week and one large version per month. In the future, the Yisu model will achieve significant improvements in video duration, controllability, reasoning speed, operating cost, and understanding of the physical world, providing users with better video generation services.
  6. Ultra-long duration: Yisu natively supports 16-second video generation and has the ability to expand to more than 1 minute, breaking the duration limitation of traditional video generation models.
  7. High performance: The model has a large range of motion, strong expressiveness, and can understand the laws of the physical world, making the generated videos more realistic, natural, and dynamic.

Technical features:

Self-developed architecture: Yisu adopts the video generation large model technology independently developed by the team, combining the advantages of LLM and diffusion model to achieve efficient video generation.

Multimodal fusion: The model is optimized for processing multimodal data and can better understand and generate video content containing multiple information such as text, images, audio, etc.

Efficient training and inference: By optimizing algorithms and architecture, Yisu has achieved significant improvements in both training and inference speeds, improving the efficiency of video generation.

Official website address:https://world-dreamer.github.io/ 

 

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.
productvideo

Viggle: Free AI video generation tool that can generate videos with pictures and text

2024-6-28 9:35:45

productvideo

Meitu MoKi: Meitu's AI short video creation tool allows everyone to become a short film director

2024-6-28 10:08:01

Search