SPLASH 2025
Sun 12 - Sat 18 October 2025 Singapore
co-located with ICFP/SPLASH 2025
Thu 16 Oct 2025 14:45 - 15:00 at Orchid East - Compilation 1 Chair(s): Hidehiko Masuhara

Machine learning (ML) compilers rely on graph-level transformations to enhance the runtime performance of ML models. However, performing local transformations on individual operations can create effects far beyond the location of the rewrite. In particular, a local rewrite can change the profitability or legality of hard-to-predict downstream transformations, particularly regarding data layout, parallelization, fine-grained scheduling, and memory management. As a result, program transformations are often driven by manually-tuned compiler heuristics, which are quickly rendered obsolete by new hardware and model architectures.

Instead of hand-written local heuristics, we propose the use of equality saturation. We replace such heuristics with a more robust \textit{global} performance model, which accounts for downstream transformations. Equality saturation addresses the challenge of local optimizations inadvertently constraining or negating the benefits of subsequent transformations, thereby providing a solution that is inherently adaptable to newer workloads. While this approach still requires a global performance model to evaluate the profitability of transformations, it holds significant promise for increased automation and adaptability.

This paper addresses challenges in applying equality saturation on real-world ML compute graphs and state-of-the-art hardware. By doing so, we present an improved method for discovering effective compositions of graph optimizations. We study different cost modeling approaches to deal with fusion and layout optimization, and tackle scalability issues that arise from considering a very wide range of algebraic optimizations. We design an equality saturation pass for the XLA compiler, with an implementation in C++ and Rust. We demonstrate an average speedup of $3.45%$ over XLA’s optimization flow across our benchmark suite on various CPU and GPU platforms, with a maximum speedup of $56.26%$ for NasRNN on CPU.

Thu 16 Oct

Displayed time zone: Perth change

13:45 - 15:30
Compilation 1OOPSLA at Orchid East
Chair(s): Hidehiko Masuhara Institute of Science Tokyo
13:45
15m
Talk
Bridging the Gap between Real-World and Formal Binary Lifting through Filtered-Simulation
OOPSLA
Jihee Park KAIST, Insu Yun KAIST, Sukyoung Ryu KAIST
Link to publication DOI
14:00
15m
Talk
Compiling Classical Sequent Calculus to Stock Hardware: The Duality of Compilation
OOPSLA
Philipp Schuster University of Tübingen, Marius Müller University of Tübingen, Klaus Ostermann University of Tübingen, Jonathan Immanuel Brachthäuser University of Tübingen
14:15
15m
Talk
HybridPersist: A Compiler Support for User-Friendly and Efficient PM Programming
OOPSLA
Yiyu Zhang Nanjing University, Yongzhi Wang Xidian University, Yanfeng Gao Nanjing University, Xuandong Li Nanjing University, Zhiqiang Zuo Nanjing University
14:30
15m
Talk
JavART: a Lightweight Rule-Based JIT Compiler Using Translation Rules Extracted from a Learning Approach
OOPSLA
Hanzhang Wang Fudan University, China, Wei Peng Fudan University, Wenwen Wang University of Georgia, Yunping Lu Fudan University, Pen-Chung Yew University of Minnesota at Twin Cities, Weihua Zhang Fudan University
14:45
15m
Talk
Mind the Abstraction Gap: Bringing Equality Saturation to Real-World ML Compilers
OOPSLA
Arya Vohra University of Chicago, Leo Seojun Lee University of Oxford, Jakub Bachurski University of Cambridge, Oleksandr Zinenko Brium, Phitchaya Mangpo Phothilimthana OpenAI, Albert Cohen Google DeepMind, William S. Moses University of Illinois Urbana-Champaign
15:00
15m
Talk
Scaling Optimization Over Uncertainty via Compilation
OOPSLA
Minsung Cho , John Gouwar Northeastern University, Steven Holtzen Northeastern University
15:15
15m
Talk
Tracing Just-in-time Compilation for Effects and Handlers
OOPSLA
Marcial Gaißert University of Tübingen, CF Bolz-Tereick Heinrich-Heine-Universität Düsseldorf, Jonathan Immanuel Brachthäuser University of Tübingen
Pre-print