Tencent releases self-developed deep-thinking model Hunyuan T1: fast output, replies within seconds, strong at ultra-long text processing

1AI learned from the Tencent Hunyuan WeChat official account that on March 21, Tencent Hunyuan launched the official version of its self-developed deep-thinking model, Hunyuan T1.

According to the official announcement, T1 generates text quickly, can respond within seconds, and is also good at processing ultra-long texts, making it a strong reasoning model developed in-house by Tencent. Through large-scale reinforcement learning, combined with targeted optimization for hard science and engineering problems in math, logical reasoning, science, and code, the official version of Hunyuan T1 further improves its reasoning ability.

On common benchmarks that reflect the foundational capabilities of reasoning models, such as MMLU-PRO, an enhanced evaluation dataset for large language models, Hunyuan T1 scored 87.2 points, second only to o1. In CEval, AIME, Zebra Logic, and other public benchmarks covering Chinese and English knowledge and competition-level mathematical and logical reasoning, Hunyuan T1 also scored at the level of the industry's leading reasoning models.

T1 also demonstrated strong adaptability across multiple alignment tasks, instruction-following tasks, and tool-use tasks.

According to Tencent, the official version of Hunyuan T1 follows the architecture of Hunyuan Turbo S and adopts the Hybrid-Mamba-Transformer fusion model. Tencent describes this as the first time in the industry that a hybrid Mamba architecture has been applied losslessly to an ultra-large reasoning model. The architecture reduces the computational complexity of the traditional Transformer structure and the memory consumed by the KV-Cache, which significantly lowers training and inference costs.
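
To give a rough sense of why replacing some attention layers shrinks the KV-Cache, here is a back-of-the-envelope sketch. Every model dimension below is hypothetical; the article does not disclose Hunyuan T1's actual configuration. The sketch assumes that only attention layers keep a cache that grows with sequence length, and it ignores the fixed-size state that Mamba layers carry.

```python
def kv_cache_bytes(attention_layers: int, kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_value: int = 2) -> int:
    """Estimate KV-Cache size for one sequence.

    Each attention layer stores one key and one value vector per token,
    hence the leading factor of 2.
    """
    return 2 * attention_layers * kv_heads * head_dim * seq_len * bytes_per_value


# Hypothetical dimensions, chosen only for illustration.
TOTAL_LAYERS = 64
KV_HEADS = 8
HEAD_DIM = 128
SEQ_LEN = 128_000

# Pure Transformer: every layer is an attention layer.
full = kv_cache_bytes(TOTAL_LAYERS, KV_HEADS, HEAD_DIM, SEQ_LEN)

# Hypothetical hybrid: only one layer in four is attention, the rest are Mamba.
hybrid = kv_cache_bytes(TOTAL_LAYERS // 4, KV_HEADS, HEAD_DIM, SEQ_LEN)

if __name__ == "__main__":
    gib = 1024 ** 3
    print(f"full attention: {full / gib:.2f} GiB")
    print(f"hybrid (1/4 attention): {hybrid / gib:.2f} GiB")
```

Under these assumptions the cache scales linearly with the number of attention layers, so the one-in-four ratio used here (an illustrative choice, not a published figure) cuts it to a quarter.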

Tencent also said that Hunyuan T1 shows unique strengths in ultra-long-text reasoning. Building on its strong long-text capture capability, Hunyuan T1 can effectively address the context loss and long-distance information dependency problems that are common in long-text reasoning. At the same time, the hybrid Mamba architecture is specifically optimized for long-sequence processing: through more efficient computation, it preserves the ability to capture long-text information while significantly reducing resource consumption, achieving a 2x increase in decoding speed with a similar number of activated parameters.

Tencent Hunyuan T1 is now live: https://llm.hunyuan.tencent.com/#/chat/hy-t1

For API use, Hunyuan T1 is already available on the Tencent Cloud website, priced at 1 yuan per million input tokens and 4 yuan per million output tokens.
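
As a quick illustration of what per-million-token pricing means for a single request, the sketch below applies the two quoted rates (1 for input, 4 for output, in the currency unit quoted above). The function name and the example token counts are hypothetical, and any free quotas, discounts, or minimum charges that Tencent Cloud may apply are not modeled.

```python
from decimal import Decimal

# Rates quoted in the article, per one million tokens.
INPUT_PRICE_PER_MILLION = Decimal("1")
OUTPUT_PRICE_PER_MILLION = Decimal("4")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost of one request in the quoted currency unit."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 200,000 input tokens and 50,000 output tokens.
    print(estimate_cost(200_000, 50_000))
```

Because output tokens cost four times as much as input tokens, a long reasoning trace in the response can dominate the bill even when the prompt is much larger.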
