Yuanxiang (XVERSE) has released the XVERSE-MoE-A4.2B large model, built on a Mixture-of-Experts (MoE) architecture with 4.2B activated parameters and delivering performance comparable to a 13B dense model. It is open source and free for commercial use, allowing small and medium-sized enterprises, researchers, and developers to deploy it at low cost.
The model's two major advantages are extreme compression and strong performance. Through sparse activation, it surpasses many leading models in the industry and approaches the quality of much larger ones. Yuanxiang developed its MoE technology in-house, with innovations including efficient fused operators, a fine-grained expert design, and a load-balancing loss term; the released model adopts the architecture configuration of Experiment 4 from its internal experiments.
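To make the mechanism concrete, here is a minimal PyTorch sketch of a sparsely activated MoE layer with many small ("fine-grained") experts, top-k routing, and a Switch-Transformer-style load-balancing auxiliary loss. All sizes (`d_model`, `n_experts`, `top_k`, `balance_coef`) and the exact loss formulation are illustrative assumptions, not XVERSE's published implementation, and the efficient fused operators mentioned above are omitted for clarity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    """Sparse MoE layer: each token is routed to its top-k experts.

    Hyperparameters are hypothetical, chosen only for illustration.
    """
    def __init__(self, d_model=512, d_ff=1024, n_experts=16, top_k=2,
                 balance_coef=0.01):
        super().__init__()
        self.top_k = top_k
        self.n_experts = n_experts
        self.balance_coef = balance_coef
        self.router = nn.Linear(d_model, n_experts, bias=False)
        # "Fine-grained" experts: many small FFNs instead of a few large ones.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                       # x: (tokens, d_model)
        logits = self.router(x)                 # (tokens, n_experts)
        probs = F.softmax(logits, dim=-1)
        topk_p, topk_i = probs.topk(self.top_k, dim=-1)
        topk_p = topk_p / topk_p.sum(dim=-1, keepdim=True)  # renormalize

        out = torch.zeros_like(x)
        for e in range(self.n_experts):
            mask = (topk_i == e)                # tokens that chose expert e
            token_idx, slot = mask.nonzero(as_tuple=True)
            if token_idx.numel() == 0:
                continue
            weight = topk_p[token_idx, slot].unsqueeze(-1)
            out[token_idx] += weight * self.experts[e](x[token_idx])

        # Load-balancing auxiliary loss: penalize correlation between the
        # fraction of tokens dispatched to each expert and the router's
        # mean probability for that expert, so no expert is starved.
        frac_tokens = F.one_hot(topk_i, self.n_experts).float().sum((0, 1))
        frac_tokens = frac_tokens / frac_tokens.sum()
        mean_probs = probs.mean(dim=0)
        aux_loss = (self.balance_coef * self.n_experts
                    * (frac_tokens * mean_probs).sum())
        return out, aux_loss

# Usage: only top_k of n_experts expert FFNs run per token, which is how a
# model with a large total parameter count keeps activated parameters small.
layer = MoELayer()
y, aux = layer(torch.randn(8, 512))   # 8 tokens
print(y.shape, aux.item())            # torch.Size([8, 512]), scalar loss
```

In training, `aux_loss` would be added to the language-modeling loss so the router learns to spread tokens evenly across experts.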
For commercial applications, the Yuanxiang large model has partnered closely with multiple Tencent products to deliver innovative user experiences in culture, entertainment, tourism, and finance.
- Hugging Face: https://huggingface.co/xverse/XVERSE-MoE-A4.2B
- ModelScope: https://modelscope.cn/models/xverse/XVERSE-MoE-A4.2B
- GitHub: https://github.com/xverse-ai/XVERSE-MoE-A4.2B