About me

Bio

Hi, I am a third-year Ph.D. student in the LLM for Software Engineering Lab (LLMSE), affiliated with the School of Software at Shanghai Jiao Tong University in China. I’m grateful to be advised by Prof. Xiaodong Gu and Prof. Beijun Shen.

My main research interests are in Natural Language Processing and Software Engineering. Some of my recent projects can be found on my GitHub homepage here. Feel free to contact me if you are interested in my work or have any questions.

We have multiple potential projects available with abundant computing resources! If you are interested in a collaboration or an internship (remote is also welcome), please feel free to contact me.

Preprints

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Yuling Shi, Songsong Wang, Chengcheng Wan, Xiaodong Gu

TL;DR: A multi-level LLM debugger.

AttentionRAG: Attention-Guided Context Pruning in Retrieval-Augmented Generation

Yixiong Fang, Tianran Sun, Yuling Shi, Xiaodong Gu

TL;DR: A context compression method for long context scenarios.

Publications

Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers

Yuling Shi, Hongyu Zhang, Chengcheng Wan, Xiaodong Gu

In Proceedings of the 47th International Conference on Software Engineering (ICSE 2025)

TL;DR: A zero-shot LLM-generated code detector.

A Morley-Wang-Xu element method for a fourth order elliptic singular perturbation problem

Xuehai Huang, Yuling Shi, Wenqing Wang

Journal of Scientific Computing (Q1), 2021

TL;DR: A FEM solver for PDEs.

Experiences

  • Research Intern at Microsoft, 2023

    • I was grateful to be advised by Dr. Yufan Huang and Dr. Maoquan Wang while working on analyzing neural representations of code. Some of my work contributed to the following paper at EMNLP 2023. [pdf]

Awards

  • National Scholarship
  • Fifth place in the Shanghai Table Tennis Doubles Championship and third place in the team event, representing my university
  • First Prize in the National High School Physics Olympiad (Provincial Level)

Materials to share

  • 🔥 A collection of resources for repo-level code generation. [GitHub]
  • A simple script to detect word-by-word plagiarism for the Academic Writing course at SJTU. [GitHub]
  • A booklet on how to write math papers and related topics, created with Lin Dong. [pdf] (in Chinese)
  • A tool for automatically enrolling in courses at SUFE. [GitHub]

Thank you for visiting my homepage!