Visual Model

DeepSeek-VL2 AI visual model open-sourced: supports dynamic resolution, processes scientific research charts, parses various terrain maps, and more

DeepSeek announced in an official blog post yesterday (December 13) that it has open-sourced the DeepSeek-VL2 model, which has achieved highly competitive results on various evaluation benchmarks, and stated that its visual models have now entered the Mixture of Experts (MoE) era. Citing the official press release, 1AI lists the highlights of DeepSeek-VL2 as follows: Data: double the quality of training data compared with the first-generation DeepSeek-VL, plus the introduction of terrain understanding, visual localization, visual storytelling...
Information
- 786
12/14
Apple launches all-around visual model 4M-21 that can handle 21 different modalities

Researchers from Apple and the Swiss Federal Institute of Technology in Lausanne (EPFL) have jointly developed a single any-to-any modality model that can be trained on dozens of highly diverse modalities and co-trained on large-scale multimodal datasets and text corpora. The model, named 4M-21, is trained on 21 different modalities and completes at least 3 times more tasks than existing models without losing performance. The study used the 4M pre-training scheme to scale up the model and dataset, increase the types and number of modalities involved in training, and train on multiple datasets...
Information
- 3.5k
6/26
Meta launches SceneScript AI visual model, which uses a programmable language to predict and build 3D scenes in real time

According to Meta's official press release, the company has developed a visual model called "SceneScript", which it claims can use a programmable language to quickly "build" scenes, infer room geometry in real time, and convert the related data into architectural approximations. Meta claims that the method can build indoor 3D models efficiently and with a small footprint, stating that "only a few KB of memory are needed to generate clear and complete geometric shapes", that the resulting shape data is "interpretable", and that users can easily read...
Information
- 1.7k
3/26


