All Tags

CogVLM2-Video

Zhipu AI open-sources video understanding model CogVLM2-Video, which can answer time-related questions

Zhipu AI announced that it has trained a new video understanding model CogVLM2-Video and made it open source. It is reported that most current video understanding models use frame averaging and video tag compression methods, which leads to the loss of temporal information and cannot accurately answer time-related questions. Some models that focus on time question-answering datasets are too limited to specific formats and applicable fields, making the models lose a wider range of question-answering capabilities. ▲ Official effect demonstration Zhipu AI proposed a method for constructing automatic time positioning data based on visual models, generating 30,000 time-related…
Information
- 3k
7/13

❯

Search

Checking in, please wait

Click for today's check-in bonus!

You have earned {{mission.data.mission.credit}} points today!

Check-in

Leaderboard

{{item.credit}}

Lasted {{item.count}} days

More

My Coupons

_￥_Coupons

Limitation of useExpired and Unavailable

Limitation of use
before

Limitation of usePermanently valid

Coupon ID:
×

Available for the following products: Available for the following products categories: Unrestricted use:

[{{ct.name}}]

Available for all products and product types

No coupons available!

Cart

×

Delete

Shopping Cart is Empty!

Empty Cart Checkout

You have a new message

No new messages

Write a new message More