SPLASH 2025
Sun 12 - Sat 18 October 2025 Singapore
co-located with ICFP/SPLASH 2025
Sat 18 Oct 2025 17:00 - 17:25 at Peony NW - Afternoon Paper Session 2 Chair(s): Mikhail Barash

The need for scalable and personalized content in programming education has led to growing interest in the automation of programming exercise generation. To support this, a comprehensive understanding of existing exercises is crucial. This research contributes to the latter by classifying existing exercises based on educational topic and difficulty level. We leverage a lexicon-based analysis in combination with machine learning and advanced natural language processing techniques, providing a foundation for AI-driven content generation. In particular, we utilize BERTopic for topic modeling and five different machine learning models for predicting difficulty levels in programming exercises. Our dataset comprises 106 programming exercise descriptions used in three introductory programming courses in higher education, along with performance data from up to 189 learners. The results demonstrate that lexicon-based approaches significantly improve topic modeling accuracy and coherence compared to the baseline, with reduced variance and more consistent cluster stability. Although difficulty prediction remains challenging due to the complexity of defining ground truth, lexicon integration leads to modest yet consistent performance gains. This work lays an essential groundwork for scalable and resource-efficient solutions for the classification and generation of personalized programming exercises.
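As a rough illustration of the two dimensions described in the abstract, the sketch below tags an exercise description with a topic by counting lexicon hits and derives a coarse difficulty label from a learner pass rate. It is not the authors' pipeline: it uses neither BERTopic nor a trained model, and the lexicon `TOPIC_LEXICON`, the function names, and the thresholds are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical topic lexicon: illustrative terms only, not the authors' lexicon.
TOPIC_LEXICON = {
    "loops": {"loop", "for", "while", "iterate", "repeat"},
    "conditionals": {"if", "else", "condition", "branch"},
    "strings": {"string", "character", "substring", "concatenate"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def topic_scores(description):
    """Count how many tokens of the description fall in each topic's lexicon."""
    counts = Counter(tokenize(description))
    return {
        topic: sum(counts[term] for term in terms)
        for topic, terms in TOPIC_LEXICON.items()
    }


def assign_topic(description):
    """Return the topic with the most lexicon hits, or None if nothing matches."""
    scores = topic_scores(description)
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def difficulty_label(pass_rate, easy_at=0.8, hard_below=0.5):
    """Map a learner pass rate to a coarse label (thresholds are assumed)."""
    if pass_rate >= easy_at:
        return "easy"
    if pass_rate < hard_below:
        return "hard"
    return "medium"


if __name__ == "__main__":
    text = "If the string is empty, return an empty string"
    print(assign_topic(text), difficulty_label(0.6))
```

Exact token matching keeps the sketch short but misses inflected forms such as "repeats"; a realistic lexicon would need stemming or lemmatization, and a pass rate is only one possible proxy for the difficulty ground truth that the abstract describes as hard to define.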

Sat 18 Oct

Displayed time zone: Perth

16:00 - 17:30
Afternoon Paper Session 2 SPLASH-E at Peony NW
Chair(s): Mikhail Barash University of Bergen
15:45
25m
Full-paper
Derivation Visualization for Context-Free Grammar Design: Helping Students Understand Context-Free Grammars
SPLASH-E
Marco T Morazan Seton Hall University, Andrés M. Garced Seton Hall University, Tijana Minić
16:10
25m
Full-paper
Interactive Theorem Provers for Proof Education
SPLASH-E
Romina Mahinpei Princeton University, Manoel Horta Ribeiro, Mae Milano Princeton University
16:35
25m
Talk
Waddle: A Serious Game to Teach Writing, Reading, and Debugging Programs
SPLASH-E
Florian Sihler Ulm University, Naomi Panda, Simon Berlinger Ulm University, Germany, Matthias Tichy Ulm University
17:00
25m
Full-paper
Personalization of Programming Education: An NLP-based Bi-dimensional Classification of Programming Exercises
SPLASH-E
Tommie Lombarts Eindhoven University of Technology, Gijs Walravens Eindhoven University of Technology, Mazyar Seraj Eindhoven University of Technology, Lina Ochoa Eindhoven University of Technology, Mark van den Brand Eindhoven University of Technology