Xiaodong GU


Associate Professor

School of Software, Shanghai Jiao Tong University

Contact:

Room 1208, Software Building, No.800 Dongchuan Road, Shanghai, China
Email:

Research Interest:

My research focuses on large language models for natural and programming languages. I develop efficient machine learning methodologies for software code. My research topics are:

Github 

Selected Publications

[Full List] [Google Scholar]

Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers
Yuling Shi, Hongyu Zhang, Chengcheng Wan and Xiaodong Gu*
In Proceedings of the 47th International Conference on Software Engineering (ICSE 2025). Ottawa, Ontario, Canada, April 27 - May 3, 2025. (CCF-A)
[paper] [code] [bibtex]

On the Effectiveness of Large Language Models in Domain-Specific Code Generation
Yalan Lin, Meng Chen, Yuhan Hu, Hongyu Zhang, Chengcheng Wan, Zhao Wei, Yong Xu, Juhong Wang, Xiaodong Gu*
ACM Transactions on Software Engineering and Methodology (TOSEM 2024) (CCF-A)
[paper]

How Effectively Do Code Language Models Understand Poor-Readability Code?
Chao Hu, Yitian Chai, Hao Zhou, Fandong Meng, Jie Zhou and Xiaodong Gu*
In Proceedings of the 39th ACM/IEEE International Conference on Automated Software Engineering (ASE 2024). Sacramento, California, United States, Oct 27 - Nov 1, 2024. (CCF-A)
[paper] [code] [bibtex]

VarGAN: Adversarial Learning of Variable Semantic Representations
Yalan Lin, Chengcheng Wan, Shuwen Bai, Xiaodong Gu*
IEEE Transactions on Software Engineering (TSE 2024) (CCF-A)
[paper] [code]

On the Evaluation of Neural Code Translation: Taxonomy and Benchmark
Mingsheng Jiao, Tingrui Yu, Xuan Li, Guanjie Qiu, Xiaodong Gu*, Beijun Shen
In Proceedings of the 38th International Conference on Automated Software Engineering (ASE 2023), Kirchberg, Luxembourg, Sept. 11-15, 2023 (CCF-A)
[paper] [slides] [code]

InfeRE: Step-by-Step Regex Generation via Chain of Inference
Shuai Zhang, Xiaodong Gu, Yuting Chen, Beijun Shen
In Proceedings of the 38th International Conference on Automated Software Engineering (ASE 2023), Kirchberg, Luxembourg, Sept. 11-15, 2023 (CCF-A)
[paper] [slides] [code] [bibtex]

Self-Supervised Query Reformulation for Code Search
Yuetian Mao, Chengcheng Wan, Yuze Jiang, Xiaodong Gu*
In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023), San Francisco, California, United States, Dec 3-9, 2023 (CCF-A)
[paper] [slides] [code] [bibtex]

Diet Code Is Healthy: Simplifying Programs for Pre-Trained Models of Code
Zhaowei Zhang, Hongyu Zhang, Beijun Shen, Xiaodong Gu*
In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022), Singapore, Nov 14-18, 2022 (CCF-A)
[paper] [slides] [code] [bibtex]

Cross-Domain Deep Code Search with Meta Learning
Yitian Chai, Hongyu Zhang, Beijun Shen and Xiaodong Gu*
In Proceedings of the 44th International Conference on Software Engineering (ICSE 2022) (CCF-A)
[paper] [code] [slides] [bibtex]

Deep Code Search
Xiaodong Gu, Hongyu Zhang, Sunghun Kim
In Proceedings of the 40th International Conference on Software Engineering (ICSE 2018), Gothenburg, Sweden, May 27 - June 3, 2018 (CCF-A)
[paper] [code] [slides] [bibtex]

Deep API Learning
Xiaodong Gu, Hongyu Zhang, Dongmei Zhang, Sunghun Kim
In Proceedings of the 24th ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE 2016), Seattle, WA, USA, November 13–18, 2016 (CCF-A)
[project website] [paper] [slides] [code] [bibtex]

Teaching

Students

I am grateful to the wonderful students I have been collaborating with

Alumni

Grants

Services

Program Committee ACL (2023), EMNLP (2021, 2022, 2023), COLING (2020, 2022, 2024), IJCAI (2023), EACL (2023)
Reviewer Board Automated Software Engineering (AUSE), Empirical Software Engineering (EMSE)
Journal Reviewer TSE, TOSEM, EMSE, IST, JSS, FCS