-
Apple launches all-around visual model 4M-21 that can handle 21 different modalities
Researchers from Apple and the Swiss Federal Institute of Technology in Lausanne (EPFL) have jointly developed a single any-to-any modality model that can be trained on dozens of highly diverse modalities and co-trained on large-scale multimodal datasets and text corpora. The model, named 4M-21, is trained on 21 different modalities and completes at least 3 times more tasks than existing models without losing performance. The study used the 4M pre-training scheme to expand the scale of the model and dataset, increase the type and number of modalities involved in training the model, and train on multiple datasets...- 2.8k
-
Meta launches SceneScript AI visual model, using programmable language to predict and build 3D scenes in real time
According to Meta's official press release, the company has developed a visual model called "SceneScript", which claims to be able to use a programmable language to quickly "build" scenes, infer room geometry in real time, and convert related data into architectural approximations. Image source Meta's official press release Meta claims that the relevant method can efficiently and lightly build indoor 3D models, claiming that "only a few KB of memory are needed to generate clear and complete geometric shapes", and the relevant shape data is "interpretable", and users can easily read...- 1.5k
❯
Search
Scan to open current page
Top
Checking in, please wait
Click for today's check-in bonus!
You have earned {{mission.data.mission.credit}} points today!
My Coupons
-
¥CouponsLimitation of useExpired and UnavailableLimitation of use
before
Limitation of usePermanently validCoupon ID:×Available for the following products: Available for the following products categories: Unrestricted use:Available for all products and product types
No coupons available!
Unverify
Daily tasks completed: