Tencent releases self-developed deep-thinking model Hunyuan T1: fast output, replies within seconds, strong at ultra-long text processing

1AI learned from the Tencent Hunyuan WeChat public account that on March 21, Tencent Hunyuan officially launched the official version of its self-developed deep-thinking model, Hunyuan T1.

According to the official announcement, T1 generates text quickly, can reply within seconds, and is also good at processing ultra-long texts. Tencent describes it as a strong reasoning model developed in-house. Through large-scale reinforcement learning, combined with targeted optimization for science and engineering problems such as math, logical reasoning, science, and code, the official version of Hunyuan T1 further improves its reasoning ability.

On common benchmarks that reflect the foundational capabilities of reasoning models, Hunyuan T1 performs strongly. For example, on MMLU-PRO, an enhanced evaluation dataset for large language models, Hunyuan T1 scored 87.2 points, second only to o1. On CEval, AIME, Zebra Logic, and other public benchmarks covering Chinese and English knowledge and competition-level mathematical and logical reasoning, Hunyuan T1 also scored at the level of the industry's leading reasoning models.

T1 also showed strong adaptability across multiple alignment tasks, instruction-following tasks, and tool-use tasks.

According to Tencent, the official version of Hunyuan T1 continues the architecture of Hunyuan Turbo S and adopts a Hybrid-Mamba-Transformer fusion design. Tencent says this is the first time in the industry that a hybrid Mamba architecture has been applied losslessly to an ultra-large reasoning model. The architecture reduces the computational complexity of the traditional Transformer structure and the memory footprint of the KV cache, thereby significantly lowering training and inference costs.
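To make the KV-cache point concrete, here is a rough back-of-the-envelope sketch in Python. It uses the standard size estimate for a Transformer KV cache (one key tensor and one value tensor per attention layer) and compares a model where every layer is attention with one where only some layers are. All dimensions below (layer counts, head counts, sequence length) are hypothetical illustration values, not Hunyuan T1's actual configuration, which the announcement does not give. The fixed-size state kept by Mamba-style layers is ignored because it does not grow with sequence length.

```python
def kv_cache_bytes(seq_len, attn_layers, kv_heads, head_dim,
                   bytes_per_value=2, batch=1):
    """Estimate KV-cache size in bytes for the attention layers only.

    Each attention layer stores one key and one value vector per token
    and per KV head, hence the leading factor of 2.
    bytes_per_value=2 assumes 16-bit storage (an assumption).
    """
    return 2 * batch * attn_layers * kv_heads * head_dim * seq_len * bytes_per_value


GIB = 2 ** 30

# Hypothetical pure-Transformer model: all 64 layers are attention layers.
full = kv_cache_bytes(seq_len=32_768, attn_layers=64, kv_heads=8, head_dim=128)

# Hypothetical hybrid model: only 16 of the 64 layers are attention layers;
# the remaining layers are assumed to be Mamba-style layers with no KV cache.
hybrid = kv_cache_bytes(seq_len=32_768, attn_layers=16, kv_heads=8, head_dim=128)

print(f"all-attention: {full / GIB:.1f} GiB")
print(f"hybrid:        {hybrid / GIB:.1f} GiB")
```

Under these assumed numbers the cache shrinks in proportion to the share of layers that still use attention. The real savings depend on the actual layer mix, which is not stated in the source.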

Tencent also says that Hunyuan T1 shows particular strengths in ultra-long text reasoning. Building on its strong long-text capture capability, Hunyuan T1 can effectively address the context loss and long-distance information dependency problems common in long-text reasoning. At the same time, the hybrid Mamba architecture is specifically optimized for long-sequence processing: efficient computation preserves the ability to capture long-text information while significantly reducing resource consumption, and decoding is 2x faster at a similar number of activated parameters.

Tencent Hunyuan T1 is now live: https://llm.hunyuan.tencent.com/#/chat/hy-t1

For API use, Hunyuan T1 is already available on the Tencent Cloud website, priced at 1 yuan per million input tokens and 4 yuan per million output tokens.
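As a quick illustration of what per-million-token pricing means in practice, the following Python sketch computes the cost of a single request from the two quoted rates (1 per million input tokens, 4 per million output tokens, in the currency unit the prices are quoted in). The function name and the example token counts are made up for illustration, and the sketch ignores any free quotas, discounts, or billing rules that Tencent Cloud may apply.

```python
def request_cost(input_tokens, output_tokens,
                 input_price_per_m=1.0, output_price_per_m=4.0):
    """Return the cost of one request, given prices per million tokens.

    The defaults mirror the rates quoted in the article; the result is in
    the same currency unit as those rates.
    """
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Hypothetical request: a 200,000-token prompt with a 50,000-token answer.
cost = request_cost(input_tokens=200_000, output_tokens=50_000)
print(f"cost: {cost:.2f}")
```

Because output tokens cost four times as much as input tokens, a long reasoning trace in the answer can dominate the bill even when the prompt is much larger.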
