SenseTime Jueying launches the industry's first native multi-modal large model vehicle-side deployment: 8 billion parameters, 40 tokens per second

SenseTimeCo-founder and chief scientist Wang Xiaogang announced on the 17th,Shang Tang Jue YingThe first in the industry to achieve nativeMultimodal large modelThe vehicle-side 8B model has a first packet delay of less than 300 milliseconds, an inference speed of 40 Tokens/second, and covers mainstream computing platforms.

SenseTime has created a computing engine called "HyperPPL" for multimodal large models. It currently expands and supports mainstream in-vehicle computing hardware, is compatible with a variety of mainstream operating systems, and adapts to the deployment platforms of multiple in-vehicle chips.

SenseTime Jueying said that HyperPPL is optimized for multi-person scenarios in vehicles, so that when there are multiple people in the vehicle at the same time, the model reasoning efficiency of the multi-modal large model on the vehicle side is not significantly reduced compared to a single person.

SenseTime previously stated that Shenzhen’s first autonomous driving bus line uses its vehicles and technology, and all driving operations do not require human intervention.

Next year, automotive chips (NVIDIA Thor) with a computing power of over 1,000 TOPS will be available. Based on a computing platform with higher computing power, SenseTime expects that the first packet delay of Jueying's multi-modal large model vehicle-side deployment solution will be greatly reduced, and the inference speed will be further improved.

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.

{{userData.name}}Verify

SenseTime Jueying launches the industry's first native multi-modal large model vehicle-side deployment: 8 billion parameters, 40 tokens per second

Proton launches AI email writing assistant that can run locally or on the server

The U.S. Department of Commerce is spending $400 million to boost chip production

AI Weibo

AI Applications

5000+ AI applications! Updated daily

AIAICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

Proton launches AI email writing assistant that can run locally or on the server

The U.S. Department of Commerce is spending $400 million to boost chip production

Tsinghua University and Zhejiang University launch open source alternatives to GPT-4V! Open source visual models such as LLaVA and CogAgent explode

Zhipu open-sources the next-generation multimodal large model CogVLM2

SenseTime’s AI video generation platform launches CCTV reporter Wang Bingbing’s AI digital human “AI Bingbing”

SenseTime releases Vimi, the first "controllable" large model for generating character videos, to create a 1-minute character video from a photo

AI Applications

5000+ AI applications! Updated daily

AIAICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow