Zhuoyuan Mao   毛 卓远   毛 卓遠


Updates stopped from April 1, 2024.

About

  1. From Apr. 2021 to Mar. 2024, I was a Ph.D. student at IST of Graduate School of Informatics, Kyoto University, and a member of Natural Language Processing Lab. (Kurohashi & Chu & Murawaki Lab.). My research advisors were Prof. Sadao Kurohashi and Associate Prof. Chenhui Chu.

Research Interest

  1. Natural Language Processing, Machine Translation, Machine Learning in NLP, Multilinguality

Education

  1. Ph.D. student, Informatics Apr. 2021 -- Mar. 2024
    Kyoto University, Kyoto, Japan
  2. Master of Informatics, Intelligence Science and Technology Apr. 2019 -- Mar. 2021
    Kyoto University, Kyoto, Japan
  3. Bachelor of Science, Mathematics Aug. 2013 -- Jul. 2017
    Bachelor of Economics (Minor), Finances Feb. 2014 -- Jan. 2017
    East China University of Science and Technology, Shanghai, China
  4. Changzhou Senior High School of Jiangsu Province, Changzhou, China

Thesis

  1. Master Thesis
    Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation.
    [pdf] [git]
  2. Ph.D. Thesis
    Breaking Language Barriers: Enhancing Multilingual Representation for Sentence Alignment and Translation.

Experiences

  1. Research Internship (Apple AIML) Jun. 2023 -- Sep. 2023
    Apple, Tokyo, Japan
  2. Student Researcher (Google Research) Aug. 2022 -- Oct. 2022
    Google, Tokyo, Japan
  3. Teaching Assistant (Fundamentals of Artificial Intelligence) Apr. 2022 -- Aug. 2022
    Kyoto University, Kyoto, Japan
  4. JSPS Research Fellowship for Young Scientists, DC2 Apr. 2022 -- Mar. 2024
    Japan Society for the Promotion of Science (JSPS), Japan
  5. Research Internship Feb. 2022 -- Mar. 2022
    SenseTime, Kyoto, Japan
  6. Teaching Assistant (Fundamentals of Artificial Intelligence) Oct. 2021 -- Feb. 2022
    Kyoto University, Kyoto, Japan
  7. The University Fellowship in Informatics Apr. 2021 -- Mar. 2022
    Kyoto University, Kyoto, Japan
  8. Research Assistant Apr. 2021 -- May. 2021
    Kyoto University, Kyoto, Japan
  9. Teaching Assistant (Fundamentals of Artificial Intelligence) Apr. 2021 -- Aug. 2021
    Kyoto University, Kyoto, Japan
  10. Teaching Assistant (Fundamentals of Artificial Intelligence) Oct. 2020 -- Feb. 2021
    Kyoto University, Kyoto, Japan
  11. Student Exchange Program (MLO, Computer Science) Feb. 2020 -- Aug. 2020 Mar. 2020 (Mar. 2020 -- Aug. 2020 Remote)
    École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
  12. Japanese Course Oct. 2017 -- Mar. 2019
    Yamano Japanese Language School, Tokyo, Japan
  13. Bachelor Thesis (Statistics) Sep. 2016 -- Jun. 2017
    East China Normal University, Shanghai, China
  14. Internship, Investment Consultant Assistant Aug. 2016 -- Sep. 2016
    CreditEase, Shanghai, China

Publications [ACL] [dblp] [Google Scholar]

  1. Haiyue Song, Zhuoyuan Mao, Raj Dabre, Chenhui Chu and Sadao Kurohashi. 2024. DiverSeg: Leveraging Diverse Segmentations with Cross-granularity Alignment for Neural Machine Translation. Journal of Natural Language Processing, 31(1), pp. 155-188, Mar. 2024. [pdf]
  2. Zhen Wan, Fei Cheng, Zhuoyuan Mao, Qianying Liu, Haiyue Song, Jiwei Li and Sadao Kurohashi. 2023. GPT-RE: In-context Learning for Relation Extraction using Large Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), pp. 3534-3547, Singapore. [pdf]
  3. Zhuoyuan Mao, Raj Dabre, Qianying Liu, Haiyue Song, Chenhui Chu and Sadao Kurohashi. 2023. Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), pp. 1300-1316, Toronto, Canada. [pdf]
  4. Zhuoyuan Mao, Haiyue Song, Raj Dabre, Chenhui Chu and Sadao Kurohashi. 2023. Variable-length Neural Interlingua Representations for Zero-shot Neural Machine Translation. In Workshop on Multilingual, Multimodal and Multitask Language Generation (Multi3Generation), pp. 16-25, Tampere, Finland. [pdf]
  5. Zhuoyuan Mao and Tetsuji Nakagawa. 2023. LEALLA: Learning Lightweight Language-agnostic Sentence Embedding with Knowledge Distillation. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), pp. 1886-1894, Dubrovnik, Croatia. [pdf] [model]
  6. Zhen Wan, Fei Cheng, Qianying Liu, Zhuoyuan Mao, Haiyue Song and Sadao Kurohashi. 2023. Relation Extraction with Weighted Contrastive Pre-training on Distant Supervision. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023): Findings Volume, pp. 2580-2585, Dubrovnik, Croatia. [pdf]
  7. Zhuoyuan Mao, Chenhui Chu and Sadao Kurohashi. 2023. Efficiently Learning Multilingual Sentence Representation for Cross-lingual Sentence Classification. 言語処理学会 第29回年次大会 (NLP2023), pp. 2830-2835, 沖縄, 日本. [git] [pdf]
  8. Zhen Wan, Qianying Liu, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi and Jiwei Li. 2022. Rescue Implicit and Long-tail Cases: Nearest Neighbor Relation Extraction. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), pp. 1731-1738, Abu Dhabi, UAE. [pdf]
  9. Yibin Shen, Qianying Liu, Zhuoyuan Mao, Fei Cheng and Sadao Kurohashi. 2022. Textual Enhanced Contrastive Learning for Solving Math Word Problems. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022): Findings Volume, pp. 4297-4307, Abu Dhabi, UAE. [pdf]
  10. Haiyue Song, Raj Dabre, Zhuoyuan Mao, Chenhui Chu and Sadao Kurohashi. 2022. BERTSeg: BERT Based Unsupervised Subword Segmentation for Neural Machine Translation. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2022), pp. 85-94, Taipei, Taiwan Online. [pdf]
  11. Yibin Shen, Qianying Liu, Zhuoyuan Mao, Zhen Wan, Fei Cheng and Sadao Kurohashi. 2022. Seeking Diverse Reasoning Logic: Controlled Equation Expression Generation for Solving Math Word Problems. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2022), pp. 254-260, Taipei, Taiwan Online. [pdf]
  12. Chenhui Chu, Zhuoyuan Mao, Toshiaki Nakazawa, Daisuke Kawahara and Sadao Kurohashi. 2022. SCTB-V2: the 2nd Version of the Chinese Treebank in the Scientific Domain. Language Resources and Evaluation (LRE), pp. 1-15, Oct. 2022. [pdf]
  13. Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan and Sadao Kurohashi. 2022. When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?. In Proceedings of the 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL2022): Findings Volume, pp. 1766-1775, Seattle, USA. [pdf]
  14. Zhen Wan, Fei Cheng, Zhuoyuan Mao, Qianying Liu, Haiyue Song and Sadao Kurohashi. 2022. Improving Medical Relation Extraction with Distantly Supervised Pre-training. 言語処理学会 第28回年次大会 (NLP2022), pp. 610-614, 浜松, 日本 Online. [pdf]
  15. Haiyue Song, Raj Dabre, Zhuoyuan Mao, Chenhui Chu and Sadao Kurohashi. 2022. Representative Data Selection for Sequence-to-Sequence Pre-training. 言語処理学会 第28回年次大会 (NLP2022), pp. 1-5, 浜松, 日本 Online. [pdf]
  16. Zhuoyuan Mao, Chenhui Chu and Sadao Kurohashi. 2022. Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation. ACM Trans. Asian Low-Resour. Lang. Inf. Process. (TALLIP), 21, 4, Article 68 (July 2022), 29 pages. [pdf] [git]
  17. Zhuoyuan Mao, Prakhar Gupta, Pei Wang, Chenhui Chu, Martin Jaggi and Sadao Kurohashi. 2021. Lightweight Cross-Lingual Sentence Representation Learning. In Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP2021), pp. 2902-2913, Bangkok, Thailand Online. [pdf] [git]
  18. Zhuoyuan Mao, Prakhar Gupta, Chenhui Chu, Martin Jaggi and Sadao Kurohashi. 2021. Learning Cross-lingual Sentence Representations for Multilingual Document Classification with Token-level Reconstruction. 言語処理学会 第27回年次大会 (NLP2021), pp. 1049-1053, 北九州, 日本 Online. [pdf] [git]
  19. Zhuoyuan Mao, Yibin Shen, Chenhui Chu, Sadao Kurohashi and Cheqing Jin. 2020. Meta Ensemble for Japanese-Chinese Neural Machine Translation: Kyoto-U+ECNU Participation to WAT 2020. In Proceedings of the 7th Workshop on Asian Translation (WAT2020), pp. 64-71, Suzhou, China Online. [pdf]
  20. Haiyue Song, Raj Dabre, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi and Eiichiro Sumita. 2020. Pre-training via Leveraging Assisting Languages and Data Selection for Neural Machine Translation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop (ACL2020SRW), pp. 279-285, Seattle, USA Online. [pdf]
  21. Zhuoyuan Mao, Fabien Cromieres, Raj Dabre, Haiyue Song and Sadao Kurohashi. 2020. JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation. In Proceedings of the 12th Language Resources and Evaluation Conference (LREC2020), pp. 3683-3691, Marseille, France. [pdf] [git]
  22. Zhuoyuan Mao, Raj Dabre, Fabien Cromieres, Haiyue Song, 中尾亮太 and 黒橋禎夫. 2020. ニューラル機械翻訳のための言語知識に基づくマルチタスク事前学習. 言語処理学会 第26回年次大会 (NLP2020), pp. 1061-1064, 茨城, 日本 Online. [pdf] [git]

Miscellaneous

  1. Reviewing papers for AAAI, ACL, TALLIP, EMNLP, EACL, ARR