-
Alibaba Tongyi Qianwen Launches Visual Reasoning Model QVQ-Max: Analyzes and Reasons About Image and Video Content
March 28 news: early this morning, Alibaba's Tongyi Qianwen (Qwen) team announced QVQ-Max, a new-generation visual reasoning model. According to the official introduction, QVQ-Max can not only understand image and video content but also analyze and reason about it. Beyond analysis and reasoning, it can design illustrations, generate short video scripts, and create role-playing content to suit users' needs. Core capabilities: from observation to reasoning. The capabilities of QVQ-Max fall into three areas: detailed observation, in-depth reasoning, and flexible application. Here are the respective...
-
Alibaba Tongyi Qianwen Open-Sources Visual Reasoning Model QVQ-72B-Preview: Thinks Like a Physicist
Alibaba's Tongyi Qianwen (Qwen) team published a blog post today (December 25) announcing QVQ-72B-Preview, an open-source visual reasoning model built on Qwen2-VL-72B. Faced with complex physics problems, it can calmly work toward a solution through logical reasoning, like a master physicist. The Qwen team evaluated QVQ-72B-Preview on four datasets, and 1AI attaches the relevant introduction as follows: MMMU: a university-level multidisciplinary, multimodal evaluation set designed to examine the model's visual...