The world's largest Oracle "dataset" is open source

The "Digital Oracle Co-creation Center" was officially launched todayOpen SourceThe world's largestOracleMultimodalityDataset, contains a total of 10,000 rubbings and copies of oracle bones, data such as the corresponding positions of oracle bone characters, corresponding character heads, corresponding interpretations, as well as word groupings, and interpretation order.

The world's largest Oracle "dataset" is open source

It is reported that all researchers can develop algorithms such as oracle bone detection, recognition, copy generation, glyph matching and interpretation based on this dataset to accelerate the process of intelligent oracle bone research.

The Digital Oracle Co-creation Center was jointly initiated by the Ministry of Education Oracle Information Processing Laboratory of Anyang Normal University, Tencent SSV Digital Culture Laboratory, Tencent Youtu Laboratory, the Oracle Bone Studies and Shang History Research Center of the Chinese Academy of Social Sciences, Anyang Workstation of the Institute of Archaeology, Chinese Academy of Social Sciences, the Ministry of Education Key Laboratory of Multimedia Trusted Perception and Efficient Computing of Xiamen University, and the Center for Chinese Character Civilization Research of Zhengzhou University, and has received support from global universities and research institutions such as the Institute of Ancient History of the Chinese Academy of Social Sciences, the University of Cambridge in the UK, the École des Hautes Études en Supérieures, France, Ritsumeikan University in Japan, Rutgers University in the United States, and the University of California, Los Angeles.

Tencent Youtu Lab, Tencent SSV Digital Culture Lab, Xiamen University, and Anyang Normal University jointly developed AI model technology:

  • Oracle bone character detection model:The labeling accuracy exceeds 90%

  • Model generation:Pixel-by-pixel alignment of copy and rubbing

  • Glyph matching model:Automatically match similar words

  • Oracle bone calibration model:Realize "removing duplicate copies" and "tracing the origin of rubbings" among a large number of rubbings and copies

The world's largest oracle multimodal dataset has beenOracle AI Collaboration Platform" is now online. The platform can also query oracle bone inscriptions and oracle bone fragments. You can visit and experience the specific functions yourself:

https://www.jgwlbq.org.cn/home

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.
Information

Zhipu released and open-sourced the fourth generation of CodeGeeX, a large model for code generation, claiming to have the best performance for scales below 10 billion

2024-7-6 8:54:03

Information

Alibaba Cloud CTO Zhou Jingren: Tongyi open source model downloads exceed 20 million, firmly embrace open source

2024-7-6 8:55:38

Search