-
Harvard, Google release 1 million public domain books to provide legitimate data for AI training
December 13, 2011 - Harvard University and Google announced the joint release of 1 million public domain books as an AI training dataset, TechCrunch reported on December 12th. Image source Pexels The data required for AI training is costly, but more suitable for well-funded tech companies. As a result, Harvard plans to release a dataset of about 1 million public domain books covering a wide range of genres, languages, and authors, including classic authors such as Dickens, Dante, and Shakespeare that are no longer under copyright, due to the fact that the copyrights on these works...- 589
-
Wuhan University and China Mobile's Jiutian AI team jointly open-sourced the audio and video speaker recognition dataset VoxBlink2
Wuhan University, China Mobile's Jiutian AI team, and Duke Kunshan University have jointly released VoxBlink2, an open-source audio and video speaker recognition dataset of more than 110,000 hours based on YouTube data. The dataset contains 9,904,382 high-quality audio clips and their corresponding video clips from 111,284 users on YouTube. It is currently the largest publicly available audio and video speaker recognition dataset. The release of the dataset aims to enrich the open-source speech corpus and support the training of large voiceprint models. The VoxBlink2 dataset is mined through the following steps: Candidate…- 6.8k
-
The world's largest Oracle "dataset" is open source
The "Digital Oracle Bone Co-creation Center" officially opened the world's largest oracle bone inscription multimodal dataset today, which contains a total of 10,000 oracle bone rubbings and copies, the corresponding positions of oracle bone words, corresponding character heads, corresponding interpretations, as well as word grouping and interpretation order. It is reported that all researchers can develop algorithms such as oracle bone detection, recognition, copy generation, glyph matching and interpretation based on this dataset to accelerate the intelligentization of oracle bone research. The Digital Oracle Bone Co-creation Center is composed of the Ministry of Education Oracle Bone Information Processing Laboratory of Anyang Normal University, Tencent SSV Digital Culture Laboratory, Tencent Youtu Laboratory, and the Chinese Academy of Social Sciences Oracle Bone Research Center.- 3.6k
❯
Search
Scan to open current page
Top
Checking in, please wait
Click for today's check-in bonus!
You have earned {{mission.data.mission.credit}} points today!
My Coupons
-
¥CouponsLimitation of useExpired and UnavailableLimitation of use
before
Limitation of usePermanently validCoupon ID:×Available for the following products: Available for the following products categories: Unrestricted use:Available for all products and product types
No coupons available!
Unverify
Daily tasks completed: