Biography
- I am a Research Scientist at Microsoft AI. I earned my Master’s Degree from Peking University in 2020.
- I co-founded WizardLM project, which contributed the state-of-the-art LLMs WizardLM, WizardCoder and WizardMath, I also created widely adopted methods Evol-Instruct, RLEIF and Arena-Learning.
- My research interests include Natural Language Processing, Reinforcement Learning, and Multimodal LLM.
News
- 2 papers accepted by ICLR 2024!
- [Aug 2023] We release WizardMath.
- [Jun 2023] We release WizardCoder.
- [Apr 2023] We release WizardLM. Project link: https://github.com/nlpxucan/WizardLM
- 2 papers accepted by ACL 2023!
- 1 paper accepted by EMNLP 2022!
- 1 paper accepted by NAACL 2022 as an Oral paper!
- 2 papers accepted by ACL 2022!
- 1 paper accepted by EMNLP 2019!
Publications
( *: Equal contribution, #: The intern I mentored)
- Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
Haipeng Luo#*, Qingfeng Sun*, Can Xu, Pu Zhao, Qingwei Lin, Jianguang Lou, Shifeng Chen, Yansong Tang, Weizhu Chen
arXiv preprint arXiv:2407.10627 - AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
Mengkang Hu, Pu Zhao, Can Xu, Qingfeng Sun, Jianguang Lou, Qingwei Lin, Ping Luo, Saravan Rajmohan, Dongmei Zhang
arXiv preprint arXiv:2408.00764 - WizardLM: Empowering Large Language Models to Follow Complex Instructions
Can Xu*, Qingfeng Sun*, Kai Zheng*, Xiubo Geng, Pu Zhao, Jiazhan Feng, Chongyang Tao, Daxin Jiang
ICLR 2024 - WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Ziyang Luo, Can Xu, Pu Zhao, Qingfeng Sun, Xiubo Geng, Wenxiang Hu, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang
ICLR 2024 - WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Haipeng Luo#*, Qingfeng Sun*, Can Xu, Pu Zhao, Jianguang Lou, Chongyang Tao, Xiubo Geng, Qingwei Lin, Shifeng Chen, Dongmei Zhang
arXiv preprint arXiv:2308.09583 - MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation
Jiazhan Feng#, Qingfeng Sun, Can Xu, Pu Zhao, Yaming Yang, Chongyang Tao, Dongyan Zhao, Qingwei Lin
ACL 2023 - Adversarial Knowledge Stimulated Contrastive Prompting for Few-shot Language Learners
Kai Zheng, Qingfeng Sun, Yaming Yang, Tengchao Lv, Yeyong Pi, Changlin Zhao, Fei Xu, Qi Zhang
ACL 2023, Findings - Stylized Knowledge-Grounded Dialogue Generation via Disentangled Template Rewriting
Qingfeng Sun, Can Xu, Huang Hu, Yujing Wang, Jian Miao, Xiubo Geng, Yining Chen, Fei Xu, Daxin Jiang
NAACL 2022 [Oral Paper ] - Multimodal Dialogue Response Generation
Qingfeng Sun, Yujing Wang, Can Xu, Kai Zheng, Yaming Yang, Huang Hu, Fei Xu, Jessica Zhang, Xiubo Geng, Daxin Jiang
ACL 2022 - PromDA: Prompt-based Data Augmentation for Low-Resource NLU Tasks
Yufei Wang, Can Xu, Qingfeng Sun, Huang Hu, Chongyang Tao, Xiubo Geng, Daxin Jiang
ACL 2022 - Knowledge Stimulated Contrastive Prompting for Low-Resource Stance Detection
Kai Zheng, Qingfeng Sun, Yaming Yang, Fei Xu
EMNLP 2022, Findings - Hierarchical Attention Prototypical Networks for Few-Shot Text Classification
Shengli Sun*, Qingfeng Sun*, Kevin Zhou, Tengchao Lv
EMNLP 2019
Experiences
- July. 2020 - Now, Research Scientist, Microsoft AI.
- Sept. 2018 - June. 2020, Research Intern, Microsoft XiaoIce.
Academic Services
Program Committee for
- ICLR 2025
- NeurIPS 2024
- NAACL 2024
- ACL 2023
- EACL 2023
- KDD 2022, 2023
- EMNLP 2022
- COLING 2022