沈晓宇

助理教授

xyshen@eitech.edu.cn

背景介绍:

2015年本科毕业于南京大学软件学院,之后在德国马克斯普朗克信息研究所、萨尔大学研究生院读博,师从Gerhard Weikum和Dietrich Klakow。研究方向主要包括隐变量贝叶斯模型、对话生成、可控和可解释文本生成技术。2020年9月加入Amazon Alexa AI柏林研发中心担任机器学习科学家职位,领导了 Alexa 智能客服商品问答项目。


截止目前申报人已在自然语言处理顶级会议发表论文 40 余篇。发表论文被引用2100余次,h-index 20,i10-index 32。获得过南京大学优秀毕业生、优秀毕业设计、优秀海外自费留学生等奖项。其中为低资源文本生成而设计的标注框架获得了 COLING 2020 最佳 demo 论文奖,探究弱监督学习的有效性论文获得了ACL 2023 最佳主题论文奖。曾多次在东京大学、剑桥大学等作特邀报告。也是包括 ACL, EMNLP, NAACL, AAAI, TOIS 在内多个顶级会议和期刊的委员会成员、ACL 问答方向的领域主席和 High-Performance Computing for AI in Big Model Era 的主题编辑。


研究领域:

近年来,大语言模型的快速发展表明,随着模型规模和数据规模的不断增长,模型性能可以稳步提升。展望未来,我的研究兴趣主要集中在如下三个方面,探索如何构造可用、可信赖的大模型:

(1)可解释性:探索大模型从概率答案生成器到逻辑思维者的路径

(2)跨语言泛化:让大模型从以英语为中心的专家到多语言专家

(3)领域专业化:让大模型从通用领域迁移到专有领域;探索系统化的方法让大模型快速学习领域知识


教育背景:

2015-2021:博士(主修自然语言处理),萨尔大学计算机系-马克斯普朗克信息研究所

2011-2015:学士(主修软件工程),南京大学软件工程系


工作经历:

2020-2023:亚马逊Alexa AI,机器学习科学家


学术经历:

2018/5-2018/9:理化学研究所人工智能研究中心,访问学者

2016/9-2017/1:东京大学计算机系/国立情报所,访问学者


学术兼职(部分)

2022/11至今topic editor in "High-Performance Computing for AI in Big Model Era"

2023/1-2023/7area chair in question answering track of ACL2023


获奖情况及荣誉:

  • ACL 2023 Special Theme Paper Award

  • 2020年优秀自费留学生奖学金

  • COLING 2020 Best Demo Paper Award

  • 马克斯普朗克协会博士奖学金

  • 南京大学优秀毕业设计

  • 南京大学优秀毕业生

  • 国家奖学金


代表性论著:

总体情况

40余篇人工智能顶会论文


论著信息及引用数据

Google Scholar:

http://scholar.google.com/citations?hl=en&user=BWfPrE4AAAAJ


10篇代表作(*表示通讯作者)

  1. Shen, Xiaoyu*, Akari Asai, Bill Byrne, and Adria De Gispert. "xPQA: Cross-Lingual Product Question Answering in 12 Languages." In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), pp. 103-115. 2023

  2. Dawei Zhu, Xiaoyu Shen*, Marius Mosbach, Andreas Stephan, and Dietrich Klakow. 2023. Weaker Than You Think: A Critical Look at Weakly Supervised Learning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 14229–14253, Toronto, Canada. Association for Computational Linguistics.

  3. Tang, Ze, Xiaoyu Shen*, Chuanyi Li, Jidong Ge, Liguo Huang, Zhelin Zhu, and Bin Luo. "AST-trans: Code summarization with efficient tree-structured attention." In Proceedings of the 44th International Conference on Software Engineering, pp. 150-162. 2022.

  4. Su, Hui, Weiwei Shi, Xiaoyu Shen*, Zhou Xiao, Tuo Ji, Jiarui Fang, and Jie Zhou. "Rocbert: Robust chinese bert with multimodal contrastive pretraining." In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 921-931. 2022.

  5. Chang, Ernie, Xiaoyu Shen*, Hui-Syuan Yeh, and Vera Demberg. "On Training Instance Selection for Few-Shot Neural Text Generation." In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pp. 8-13. 2021.

  6. Su, Hui, Xiaoyu Shen*, Zhou Xiao, Zheng Zhang, Ernie Chang, Cheng Zhang, Cheng Niu, and Jie Zhou. "Moviechats: Chat like humans in a closed domain." In Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp. 6605-6619. 2020.

  7. Shen, Xiaoyu*, Ernie Chang, Hui Su, Cheng Niu, and Dietrich Klakow. "Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence." In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7155-7165. 2020.

  8. Shen, Xiaoyu*, Yang Zhao, Hui Su, and Dietrich Klakow. "Improving latent alignment in text summarization by generalizing the pointer generator." In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp. 3762-3773. 2019.

  9. Shen, Xiaoyu*, Jun Suzuki, Kentaro Inui, Hui Su, Dietrich Klakow, and Satoshi Sekine. "Select and Attend: Towards Controllable Content Selection in Text Generation." In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 579-590. 2019.

  10. Shen, Xiaoyu*, Hui Su, Wenjie Li, and Dietrich Klakow. "Nexus network: Connecting the preceding and the following in dialogue generation." In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 4316-4327. 2018.