-
DeepSeek-VL2 AI visual model open-sourced: supports dynamic resolution, processes scientific research charts, parses various memes, and more
DeepSeek published an official blog post yesterday (December 13) announcing the open-source release of the DeepSeek-VL2 model, which has achieved strong results across a range of evaluation benchmarks. The company said that its visual models have formally entered the Mixture of Experts (MoE) era. Citing the official press release, 1AI lists the highlights of DeepSeek-VL2 as follows: Data: twice as much high-quality training data as the first-generation DeepSeek-VL, and the introduction of meme understanding, visual grounding, visual storytelling...
-
Apple launches all-around visual model 4M-21 that can handle 21 different modalities
Researchers from Apple and the Swiss Federal Institute of Technology in Lausanne (EPFL) have jointly developed a single any-to-any modality model that can be trained on dozens of highly diverse modalities and co-trained on large-scale multimodal datasets and text corpora. The model, named 4M-21, is trained on 21 different modalities and handles at least three times as many tasks as existing models without a loss in performance. The study used the 4M pre-training scheme to scale up the model and dataset sizes, increase the type and number of modalities involved in training, and train on multiple datasets...
-
Meta launches SceneScript AI visual model, which uses a programmable language to predict and build 3D scenes in real time
According to Meta's official press release, the company has developed a visual model called "SceneScript", which it claims can use a programmable language to quickly "build" scenes, infer room geometry in real time, and convert the resulting data into architectural approximations. Meta claims that the method can build indoor 3D models efficiently and with a small footprint, stating that "only a few KB of memory are needed to generate clear and complete geometric shapes", that the resulting shape data is "interpretable", and that users can easily read...