Research interests: natural language processing, machine translation
Graduate course: Introduction to Natural Language Processing
Email: michenggang@gmail.com
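Chenggang Mi (米成刚) holds a Ph.D. and is a lecturer and master's supervisor. He was selected as a Young Outstanding Talent under the "Xiwai Scholar" (西外学者) program and heads the Language Big Data Research Center. He is a member of the Technical Committee on Natural Language Processing of the China Computer Federation (CCF). His current work focuses on applications of large language models in digital humanities and language teaching. He has led one National Natural Science Foundation of China project and one provincial or ministerial-level project, and has participated as a core member in several national-level projects, including the National Key R&D Program, a Ministry of Science and Technology major special project, and a Key Program project of the National Natural Science Foundation of China. In recent years he has published more than twenty papers in leading international journals and conferences in artificial intelligence and natural language processing, including Neural Networks, Neurocomputing, Computer Speech & Language, ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), Machine Translation, the International Conference on Computational Linguistics (COLING), the International Conference on Tools with Artificial Intelligence (ICTAI), and the International Conference on Language Resources and Evaluation (LREC). He has co-applied for and been granted six software copyrights, and his research results have been put into practical use in several regions.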
1. Selected publications (2018 to present; * denotes corresponding author)
Journal articles
[1] Chenggang Mi, Shaoliang Xie. Language relatedness evaluation for multilingual neural machine translation[J]. Neurocomputing, 2024, 570: 127115.
[2] Chenggang Mi, Shaoliang Xie, and Yi Fan. 2024. Multi-granularity Knowledge Sharing in Low-resource Neural Machine Translation. ACM Transactions on Asian and Low-Resource Language Information Processing. 23, 2, Article 31 (February 2024), 19 pages. https://doi.org/10.1145/3639930
[3] Chenggang Mi. 2023. Improving the Robustness of Loanword Identification in Social Media Texts. ACM Transactions on Asian and Low-Resource Language Information Processing. 22, 4, Article 101 (April 2023), 19 pages. https://doi.org/10.1145/3572773
[4] Chenggang Mi. 2023. Loanword identification based on web resources: A case study on Wikipedia. Computer Speech & Language. Vol 81 (March 2023). https://doi.org/10.1016/j.csl.2023.101517
[5] Chenggang Mi, Lei Xie and Yanning Zhang. Improving Data Augmentation for Low Resource Speech-to-Text Translation with Diverse Paraphrasing [J]. Neural Networks, 2022, 148: 194-205.
[6] Chenggang Mi, Lei Xie and Yanning Zhang. Loanword Identification in Low-resource Languages with Minimal Supervision. In ACM Transactions on Asian and Low-Resource Language Information Processing, 2020.
[7] Chenggang Mi, Lei Xie and Yanning Zhang. Improving Adversarial Neural Machine Translation for Morphologically Rich Language. In IEEE Transactions on Emerging Topics in Computational Intelligence, 2020.
[8] Zhu, S., Mi, C.(*), Li, T. et al. Improving bilingual word embeddings mapping with monolingual context information. Machine Translation (2021). https://doi.org/10.1007/s10590-021-09274-0
Conference papers
[1] Chenggang Mi, Shaolin Zhu, Yi Fan and Lei Xie. Incorporating Typological Features into Language Selection for Multilingual Neural Machine Translation. In APWeb-WAIM 2021: Proceedings of the 5th APWeb-WAIM International Joint Conference on Web and Big Data, Guangzhou, China, August 23-25, 2021.
[2] Shaolin Zhu, Chenggang Mi (*), and Xiayang Shi. An explainable evaluation of unsupervised transfer learning for parallel sentences mining. In APWeb-WAIM 2021: Proceedings of the 5th APWeb-WAIM International Joint Conference on Web and Big Data, Guangzhou, China, August 23-25, 2021.
[3] Shaolin Zhu, Chenggang Mi (*), and Linlin Zhang. Inducing bilingual word representations for non-isomorphic spaces by an unsupervised way. In KSEM 2021: Proceedings of the 14th International Conference on Knowledge Science, Engineering and Management.
[4] Chenggang Mi, Yating Yang, Lei Wang, Xi Zhou and Tonghai Jiang. Toward Better Loanword Identification in Uyghur Using Cross-lingual Word Embeddings. In COLING 2018: Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico, USA, August 20-26, 2018.
[5] Chenggang Mi, Yating Yang, Xi Zhou, Lei Wang and Tonghai Jiang. Improved Spoken Uyghur Segmentation for Neural Machine Translation. In ICTAI 2018: Proceedings of the 30th International Conference on Tools with Artificial Intelligence, Volos, Greece, November 5-7, 2018.
[6] Chenggang Mi, Yating Yang, Lei Wang, Xi Zhou and Tonghai Jiang. A Neural Network Based Model for Loanword Identification in Uyghur. In LREC 2018: Proceedings of the 11th edition of the Language Resources and Evaluation Conference, Miyazaki, Japan, 7-12 May 2018.
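2. Research projects as principal investigator
[1] National Natural Science Foundation of China, Young Scientists Fund: "Research on Knowledge Transfer Methods for Neural Machine Translation of Low-Resource Languages" (2020.01-2022.12);
[2] Chinese Academy of Sciences "Western Young Scholars" (西部青年学者) Category B project: "Research on a Bilingual Joint Segmentation Model for Uyghur-Chinese Machine Translation" (2015.09-2019.08).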