Tencent Launches Hunyuan-Large: 389B Total Parameters, the Industry's Largest Open-Source Transformer-Based MoE Model

Tencent has officially announced the launch of the Hunyuan-Large model, currently the largest open-source Transformer-based MoE (Mixture of Experts) model in the industry, with 389 billion total parameters (389B) and 52 billion active parameters (52B).
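
The gap between total and active parameters comes from how a MoE layer routes each token to only a few experts. The following is a minimal sketch of top-k routing, not Tencent's implementation: the function names, the router scores, and the toy layer sizes are all hypothetical and chosen only for illustration. Only the 389B and 52B figures come from the article.

```python
import math

def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and return (index, weight) pairs."""
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    chosen = ranked[:k]
    # Softmax over the chosen experts only, so their weights sum to 1.
    peak = max(router_logits[i] for i in chosen)
    exps = [math.exp(router_logits[i] - peak) for i in chosen]
    norm = sum(exps)
    return [(i, e / norm) for i, e in zip(chosen, exps)]

def active_fraction(num_experts, k, expert_params, shared_params):
    """Fraction of all parameters that a single token actually uses."""
    total = shared_params + num_experts * expert_params
    active = shared_params + k * expert_params
    return active / total

# Toy numbers (hypothetical), NOT the Hunyuan-Large configuration.
routes = top_k_route([0.1, 2.0, -1.0, 1.5], k=2)
toy_fraction = active_fraction(num_experts=16, k=1,
                               expert_params=10, shared_params=40)

# Active / total, using the two figures reported in the article.
reported_fraction = 52 / 389

print(routes)
print(toy_fraction, round(reported_fraction, 3))
```

With the article's figures, roughly 13% of the parameters take part in processing any single token, which is why a MoE model can be far cheaper to run than a dense model of the same total size.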

Tencent has open-sourced three models on Hugging Face: Hunyuan-A52B-Pretrain, Hunyuan-A52B-Instruct, and Hunyuan-A52B-Instruct-FP8. It has also released a technical report and a training and inference manual that describe the model's capabilities and how to run training and inference.

The model's main technical advantages are as follows:

High-quality synthetic data: By augmenting training with synthetic data, Hunyuan-Large learns richer representations, handles long-context inputs, and generalizes better to unseen data.
KV cache compression: Grouped Query Attention (GQA) and Cross-Layer Attention (CLA) significantly reduce the memory footprint and computational overhead of the KV cache, improving inference throughput.
Expert-specific learning rate scaling: Setting different learning rates for different experts ensures that each sub-model learns effectively from the data and contributes to overall performance.
Long-context processing capability: The pre-trained model supports text sequences of up to 256K and the Instruct model up to 128K, which significantly improves performance on long-context tasks.
Extensive benchmarking: Extensive experiments across multiple languages and tasks validate the effectiveness and safety of Hunyuan-Large in real-world applications.
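
The KV cache item above can be made concrete with some back-of-the-envelope arithmetic. The sketch below assumes a simple model of cache size: GQA shrinks the number of key/value heads, and CLA lets a group of adjacent layers share one cache. The function name, the `cla_share` parameter, and every layer and head count are hypothetical; they are not the published Hunyuan-Large configuration.

```python
import math

def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   bytes_per_elem=2, cla_share=1, batch=1):
    """Estimate KV cache size in bytes.

    cla_share is the number of adjacent layers that share one KV cache
    (1 means every layer keeps its own cache).
    """
    layers_with_cache = math.ceil(num_layers / cla_share)
    # Factor 2: one tensor for keys and one for values.
    return (2 * layers_with_cache * num_kv_heads * head_dim
            * seq_len * bytes_per_elem * batch)

# Hypothetical configuration: 64 layers, 64 query heads, head_dim 128,
# 16-bit cache entries, and a 128K-token context.
SEQ_LEN = 128 * 1024
GIB = 2 ** 30

mha = kv_cache_bytes(64, num_kv_heads=64, head_dim=128, seq_len=SEQ_LEN)
gqa = kv_cache_bytes(64, num_kv_heads=8, head_dim=128, seq_len=SEQ_LEN)
gqa_cla = kv_cache_bytes(64, num_kv_heads=8, head_dim=128,
                         seq_len=SEQ_LEN, cla_share=2)

print(mha // GIB, "GiB with full multi-head attention")  # 256 GiB
print(gqa // GIB, "GiB with GQA (8 KV heads)")           # 32 GiB
print(gqa_cla // GIB, "GiB with GQA + CLA (share 2)")    # 16 GiB
```

Under these assumed numbers the two techniques compound: GQA cuts the cache by 8x and CLA halves it again, for a 16x reduction overall. The real savings depend on the actual head counts and sharing pattern, which are documented in the technical report.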

The relevant links are as follows:

Paper: https://arxiv.org/pdf/2411.02265
GitHub: https://github.com/Tencent/Tencent-Hunyuan-Large
Hugging Face: https://huggingface.co/tencent/Tencent-Hunyuan-Large
Tencent Cloud: https://cloud.tencent.com/product/hunyuan
