-
Chinese Internet Corpus AI Resource Platform Released: 27 Datasets, Total 2.7T
January 11 news, China Association for Cyberspace Security issued a notice on January 9, for the community to release the Chinese Internet corpus resource platform to support industry sectors, content modality, volume scale and other labels classification, easy for users to download and use. The Association said that under the guidance of the Central Internet Information Office, together with the National Internet Emergency Response Center, in the early release of the Chinese Internet basic corpus 1.0, based on the corpus building and sharing mechanism established by the ad hoc committee, to bring together a number of new high-quality and credible data, after a series of rigorous and detailed data processing and processing, such as source screening, content filtering, data de-emphasis, and so on, the...- 3.9k