HieraSynth: A Parallel Framework for Complete Super-Optimization with Hierarchical Space Decomposition
Modern optimizing compilers generate efficient code but rarely achieve theoretical optimality, often necessitating manual fine-tuning for performance-critical applications. This challenge is amplified on modern processors with complex vector instruction sets such as the RISC-V Vector extension (RVV), where writing optimal code requires deep hardware-specific knowledge. Super-optimizers address this gap by automatically synthesizing high-performance code, but they face a fundamental scalability constraint: as the instruction set grows, the maximum synthesizable program length decreases inversely. We introduce HieraSynth, a parallel framework for complete super-optimization that overcomes this constraint through hierarchical decomposition on instruction selection rather than the conventional peephole-style approach of decomposing on program length. Unlike non-exhaustive approaches that cannot guarantee optimality, HieraSynth preserves completeness, ensuring that a solution matching the specification will be found if one exists. Our approach systematically partitions the program space into manageable subspaces, aggressively prunes unrealizable branches, and achieves near-linear speedup through independent parallel exploration of the subspaces. We implement HieraSynth as a library and demonstrate its effectiveness with an RVV super-optimizer that handles instruction sets of up to 700 instructions while synthesizing programs of 7-8 instructions, a significant advance over previous approaches limited to 1-3 instructions at similar instruction-set sizes. Compared to existing systems, HieraSynth can handle an instruction set up to 10.66× larger for a given program length, or synthesize programs up to 4.75× longer for a fixed instruction set. Evaluations show that HieraSynth discovers optimizations surpassing human-designed code and significantly reduces synthesis time, making super-optimization more practical for modern vector architectures.
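The abstract's core idea, splitting the candidate-program space by which instructions may appear rather than by program length, can be illustrated with a small, self-contained sketch. Everything below (the toy instruction set, the example spec, the enumeration strategy, and all names) is an illustrative assumption, not HieraSynth's API or implementation; pruning of unrealizable subspaces and parallel dispatch of subspaces are only indicated in comments.

# Minimal sketch (not the HieraSynth implementation): decompose the program
# space by instruction selection, then run a complete search inside each
# small subspace. The toy instruction set and spec are illustrative only.
from itertools import combinations, product

# Toy "instruction set": unary operations on a single integer accumulator.
INSTRUCTIONS = {
    "inc":    lambda x: x + 1,
    "dec":    lambda x: x - 1,
    "double": lambda x: x * 2,
    "square": lambda x: x * x,
    "negate": lambda x: -x,
}

def spec(x):
    # Target behaviour we want to synthesize: 2 * (x + 1).
    return 2 * (x + 1)

TEST_INPUTS = [0, 1, 3, 7]

def matches_spec(program):
    """A program is a tuple of instruction names applied left to right."""
    def run(x):
        for name in program:
            x = INSTRUCTIONS[name](x)
        return x
    return all(run(x) == spec(x) for x in TEST_INPUTS)

def search_subspace(allowed, max_len):
    """Complete enumeration restricted to the `allowed` instruction subset."""
    for length in range(1, max_len + 1):
        for program in product(sorted(allowed), repeat=length):
            if matches_spec(program):
                return program
    return None

def hierarchical_synthesize(max_len=3):
    # Decompose on instruction selection: a length-L program uses at most L
    # distinct instructions, so enumerating all subsets of size max_len still
    # covers every program of length <= max_len, preserving completeness.
    subspaces = [set(s) for s in combinations(INSTRUCTIONS, max_len)]
    # A real system would first prune subspaces that provably cannot realize
    # the spec, then dispatch the survivors to independent parallel workers;
    # here we simply search them one after another.
    for allowed in subspaces:
        program = search_subspace(allowed, max_len)
        if program is not None:
            return program
    return None  # no program of length <= max_len exists over this toy ISA

if __name__ == "__main__":
    print(hierarchical_synthesize())  # e.g. ('inc', 'double')

Because each subspace contains only a handful of instructions, its exhaustive search stays tractable even when the full instruction set is large, and the subspaces can be explored independently, which is where the near-linear parallel speedup in the abstract would come from.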
Thu 16 Oct, 16:00 - 17:30 (times shown in the Perth time zone)

16:00 (15m, Talk) Compressed and Parallelized Structured Tensor Algebra. OOPSLA.
Mahdi Ghorbani (University of Edinburgh), Emilien Bauer (University of Edinburgh), Tobias Grosser (University of Cambridge), Amir Shaikhha (University of Edinburgh)

16:15 (15m, Talk) Exploring the Theory and Practice of Concurrency in the Entity-Component-System Pattern. OOPSLA. Pre-print available.
Patrick Redmond (University of California, Santa Cruz), Jonathan Castello (University of California, Santa Cruz), Jose Calderon (Galois, Inc.), Lindsey Kuper (University of California, Santa Cruz)

16:30 (15m, Talk) HieraSynth: A Parallel Framework for Complete Super-Optimization with Hierarchical Space Decomposition. OOPSLA.

16:45 (15m, Talk) Lilo: A Higher-Order, Relational Concurrent Separation Logic for Liveness. OOPSLA.
Dongjae Lee (Massachusetts Institute of Technology), Janggun Lee (KAIST), Taeyoung Yoon (Seoul National University), Minki Cho (Seoul National University), Jeehoon Kang (FuriosaAI), Chung-Kil Hur (Seoul National University)

17:00 (15m, Talk) Opportunistically Parallel Lambda Calculus. OOPSLA.
Stephen Mell (University of Pennsylvania), Konstantinos Kallas (University of California, Los Angeles), Steve Zdancewic (University of Pennsylvania), Osbert Bastani (University of Pennsylvania)

17:15 (15m, Talk) Soundness of Predictive Concurrency Analyses. OOPSLA.
Shuyang Liu, Doug Lea (State University of New York (SUNY) Oswego), Jens Palsberg (University of California, Los Angeles (UCLA))