UTFix: Change Aware Unit Test Repairing using LLM
Software updates, including bug fixes and feature additions, are frequent in modern applications, but they often leave test suites outdated, resulting in undetected bugs and an increased risk of system failures. A recent study by Meta revealed that 14%-22% of software failures stem from outdated test cases that fail to reflect changes in the codebase. This highlights the need to keep test cases in sync with code changes to ensure software reliability. In this paper, we present UTFix, a novel approach for repairing unit tests when their corresponding focal methods undergo modifications. UTFix addresses two critical issues: assertion failures and reduced code coverage caused by changes in the focal method.
Our approach leverages large language models (LLMs) to repair test cases by providing contextual information such as static code slices, dynamic execution traces, and failure messages. We evaluate UTFix on both synthetic and real-world benchmarks. The synthetic benchmark comprises diverse changes drawn from popular open-source Python projects, on which UTFix successfully repaired 89.2% of assertion failures and achieved 100% code coverage for 96 tests. We also curated real-world examples, on which UTFix repaired a significant number of assertion failures and coverage issues. To the best of our knowledge, this is the first comprehensive study of test case repair in evolving Python projects. Our contributions include the development of UTFix, the creation of synthetic and real-world benchmarks, and a demonstration of the effectiveness of LLM-based methods in addressing test failures caused by software evolution.
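To make the contextual-prompting idea concrete, the sketch below shows one way a repair prompt could be assembled from the kinds of context listed above (the changed focal method, a static code slice, a dynamic execution trace, and the failure message). The names and prompt structure (`RepairContext`, `build_repair_prompt`) are illustrative assumptions for exposition, not UTFix's actual interface.

```python
# Minimal sketch of context-driven prompt assembly for test repair.
# All names and the prompt layout are assumptions, not UTFix's API.
from dataclasses import dataclass

@dataclass
class RepairContext:
    focal_method_diff: str   # old vs. new focal method source
    static_slice: str        # statements the failing assertion depends on
    execution_trace: str     # dynamic trace of the failing test run
    failure_message: str     # e.g. pytest assertion error or coverage report

def build_repair_prompt(test_source: str, ctx: RepairContext) -> str:
    """Pair the failing test with its change context for the LLM."""
    return "\n\n".join([
        "You are repairing a Python unit test after its focal method changed.",
        f"Changed focal method (diff):\n{ctx.focal_method_diff}",
        f"Relevant code slice:\n{ctx.static_slice}",
        f"Execution trace of the failing run:\n{ctx.execution_trace}",
        f"Failure message:\n{ctx.failure_message}",
        f"Failing test:\n{test_source}",
        "Rewrite the test so its assertions match the new behavior "
        "and it covers the changed code.",
    ])
```

In such a setup, the model's response would be a candidate patched test, which could then be re-executed to check whether the assertion failure is resolved and coverage of the changed focal method is restored.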