-
Zhipu AI open-sources video understanding model CogVLM2-Video, which can answer time-related questions
Zhipu AI announced that it has trained a new video understanding model CogVLM2-Video and made it open source. It is reported that most current video understanding models use frame averaging and video tag compression methods, which leads to the loss of temporal information and cannot accurately answer time-related questions. Some models that focus on time question-answering datasets are too limited to specific formats and applicable fields, making the models lose a wider range of question-answering capabilities. ▲ Official effect demonstration Zhipu AI proposed a method for constructing automatic time positioning data based on visual models, generating 30,000 time-related…
❯
Search
Scan to open current page
Top
Checking in, please wait
Click for today's check-in bonus!
You have earned {{mission.data.mission.credit}} points today!
My Coupons
-
¥CouponsLimitation of useExpired and UnavailableLimitation of use
before
Limitation of usePermanently validCoupon ID:×Available for the following products: Available for the following products categories: Unrestricted use:Available for all products and product types
No coupons available!
Unverify
Daily tasks completed: