All Tags

VoxBlink2

Wuhan University and China Mobile's Jiutian AI team jointly open-sourced the audio and video speaker recognition dataset VoxBlink2

Wuhan University, China Mobile's Jiutian AI team, and Duke Kunshan University have jointly released VoxBlink2, an open-source audio and video speaker recognition dataset of more than 110,000 hours based on YouTube data. The dataset contains 9,904,382 high-quality audio clips and their corresponding video clips from 111,284 users on YouTube. It is currently the largest publicly available audio and video speaker recognition dataset. The release of the dataset aims to enrich the open-source speech corpus and support the training of large voiceprint models. The VoxBlink2 dataset is mined through the following steps: Candidate…
Information
- 5.6k
7/26

❯

Search

Checking in, please wait

Click for today's check-in bonus!

You have earned {{mission.data.mission.credit}} points today!

Check-in

Leaderboard

{{item.credit}}

Lasted {{item.count}} days

More

My Coupons

_￥_Coupons

Limitation of useExpired and Unavailable

Limitation of use
before

Limitation of usePermanently valid

Coupon ID:
×

Available for the following products: Available for the following products categories: Unrestricted use:

[{{ct.name}}]

Available for all products and product types

No coupons available!

Cart

×

Delete

Shopping Cart is Empty!

Empty Cart Checkout

You have a new message

No new messages

Write a new message More