-
AliCloud Launches New QwQ-32B Reasoning Model with Only 1/20 Parameters, Comparable to DeepSeek R1
March 6, 2011 - Research has shown that reinforcement learning can significantly improve the inference capabilities of models such as DeepSeek-R1, which achieves state-of-the-art performance by integrating cold-start data and multi-stage training, enabling deep thinking and complex inference. AliCloud Tongyi Qianqi officials today announced the availability of their latest inference model, QwQ-32B, a model with 32 billion parameters that rivals the performance of DeepSeek-R1 with 671 billion parameters, 37 billion of which are activated. This achievement highlights the importance of bringing reinforcement learning to ...- 1.9k
-
Amazon is developing inference model Nova: fast and capable of deep thinking, sources say
March 4, 2011 - Amazon is developing an artificial intelligence model with advanced "reasoning" capabilities, Business Insider reported this evening. According to a source directly involved in the project, the new product is scheduled to be released as early as June under the brand name "Nova," a series of generative AI models that Amazon launched late last year. The person said Amazon wants the new models to use a "hybrid reasoning" approach, which can give quick answers, but also more complex thinking in the same system. Reasoning models have become an AI field in recent years...- 763
-
Tencent Yuanbao: Mixed Yuan T1 Deep Thinking Model is open for unlimited use by all users
February 19th news, tencent meta official tonight announced that its deep thinking model "mixed yuan T1" can now be used by all users unlimited (mixed yuan + DeepSeek two models free unlimited). According to the introduction, Tencent mixed yuan T1 and DeepSeek-R1 are both reasoning models, can understand the multiple dimensions of the problem and the underlying logical relationships, especially suitable for completing complex tasks. In addition, Tencent Yuanbao can not only use DeepSeek-R1 full-blooded version and Mixed Yuan T1 for deep thinking, but also can use DeepSeek-...- 2.4k
-
For Less Than $50 to Train, Researchers Build an Inference Model That Rivals OpenAI o1
Feb. 6 (Bloomberg) -- Artificial intelligence researchers at Stanford University and the University of Washington have successfully trained an AI model with "reasoning" capabilities for less than $50 (note: currently about 364 yuan) in cloud computing, according to a study released on Friday. The model, called s1, performed similarly to top reasoning models such as OpenAI's o1 and DeepSeek's r1 in tests of mathematical and programming ability. The s1 model and the data and code used to train it are now available on GitHub ...- 3.1k
-
NetEaseYouDao launches "ZiYao-o1", the first reasoning model that outputs step-by-step explanations in China
January 22 news, NetEaseYouDao today announced the launch of the first domestic output step-by-step explanation of the reasoning model "ZiYao-o1", and officially open source. According to the official introduction, ZiYao-o1 is a 14B lightweight single model, support in the deployment of consumer-grade graphics cards, the use of chain of thought technology, able to provide a detailed problem solving process, to strong logic and reasoning ability, to achieve higher problem solving accuracy, and to provide the Chinese logical reasoning. NetEase said that there are not many open source models available for deployment, and the parameter scale is large, which can't be run on consumer-grade graphics cards with low graphics memory, even with low-bit quantization technology, making...- 1.5k
-
OpenAI o1 Inference Modeling API goes live, open only to select developers
December 18, 2012 - Entering the ninth day of its "12 Days of OpenAI" campaign, OpenAI today announced that its "inference" AI model, o1, has been officially opened to some developers via APIs, and has been updated with a number of developer tools, including GPT-4o, real-time APIs, and fine-tuning APIs. OpenAI today announced that its "inference" AI model o1 is officially available to some developers via APIs, with simultaneous updates to developer tools including GPT-4o, real-time APIs and fine-tuning APIs. It is reported that the first batch of developers who can use the o1 API are OpenAI's "Level 5" users. To reach this level, developers need to spend at least $1,000 on the OpenAI platform (...- 2.6k
-
With only 0.25B parameters, Chengdu Humanoid Robotics Innovation Center debuts R-DDIRM high-speed inference model.
According to Chengdu Municipal Bureau of Science and Technology, on November 22, Chengdu Humanoid Robotics Innovation Center made a breakthrough in technological innovation, and recently launched the first diffusion-based high-speed reasoning model for humanoid robots, R-DDIRM (Denoising Diffusion Implicit Robot Model). Following the launch of China's first diffusion-based humanoid robot task generative model R-DDPRM (Denoising Diffusion Probability Robot Model) in May this year, Chengdu Humanoid Robotics Innovation Center has made a breakthrough in technological innovation.- 2.8k
-
Preview of inference model DeepSeek-R1-Lite goes live, claims to rival OpenAI o1-preview
November 21st, DeepSeek announced that the preview version of its newly developed inference model DeepSeek-R1-Lite is officially online. Officially, the DeepSeek R1 series of models are trained using reinforcement learning, and the reasoning process includes a great deal of reflection and verification, and the chain of thought can be tens of thousands of words long. The series of models have achieved reasoning results comparable to OpenAI o1-preview in math, code, and a variety of complex logical reasoning tasks, and have shown users the complete thinking process that o1 has not disclosed. DeepSe...- 5.3k