Guideline to develop AI-backed Chinese language database

Digitalization of ancient texts promotes cultural heritage, Mandarin learning

By Zhao Yimeng | China Daily | Updated: 2025-04-01 09:10

China is accelerating the digitalization of ancient texts and boosting access to oracle bone script data, aiming to integrate cultural heritage with digital Chinese, officials said on Monday.

The Ministry of Education, the National Language Commission and the Cyberspace Administration of China issued a guideline to promote the digitalization of the Chinese language and characters. The focus is on developing national language resources and large-scale Chinese language models to support artificial intelligence.

The guideline aims to establish a national corpus and strategic language resources information database by 2027. By 2035, the country hopes it will have significantly expanded the presence of the Chinese language in global digital and generative AI scenarios.

Liu Peijun, head of the Department of Language Information Management at the Ministry of Education, said the guideline calls for the digitalization of linguistic and cultural heritage, while promoting the construction of a national digital language and script museum.

It emphasizes advancing key technologies for ancient text digitalization, enhancing the accessibility of oracle bone script data and launching a multilingual digital education program to facilitate Chinese language learning globally, Liu said at a news conference.

A key aspect of this initiative is the development of large-scale linguistic data resources. The guideline outlines a plan to build a national corpus with extensive Chinese language datasets to support AI applications.

Among the pilot projects, Beijing Normal University has launched a large-scale Classical Chinese language model, an AI-driven initiative that sets a new benchmark in the field, Liu said.

Kang Zhen, vice-president of BNU, said the university has developed a range of digital language databases, including a comprehensive holographic Chinese character database, a digital resource of the ancient Chinese dictionary Shuowen Jiezi, and repositories for ancient inscriptions and handwritten texts.

These resources have played a crucial role in linguistic research and cultural preservation, Kang added.

The university's AI Taiyan, a Classical Chinese large language model with 1.8 billion parameters, is designed for high-accuracy interpretation of ancient texts, supporting tasks such as word and phrase explanations, as well as classical-to-modern Chinese translation.

China is also spearheading the construction of a new national corpus to strengthen linguistic infrastructure in the AI era, said Wang Hui, deputy head of the Ministry of Education's Department of Language Application and Administration.

"Currently, most linguistic datasets remain limited to single-text formats and specific academic domains, lacking the scale and diversity required for AI applications," Wang said.

The department has begun planning for the corpus this year, seeking to launch two flagship databases: the Chinese civilization corpus for AI-assisted teaching and research, and the Chinese grand reading system corpus, Wang said.
