CGO 2024
2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)
Powered by
Conference Publishing Consulting

2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO), March 02–06, 2024, Edinburgh, United Kingdom

CGO 2024 – Proceedings

Contents - Abstracts - Authors

Frontmatter

Title Page
Welcome from the General Chairs
Welcome from the Program Chairs
CGO 2024 Organization
CGO 2024 Sponsors and Supporters

Compilers for Machine Learning

A Tensor Algebra Compiler for Sparse Differentiation
Amir Shaikhha ORCID logo, Mathieu Huot ORCID logo, and Shideh Hashemian ORCID logo
(University of Edinburgh, United Kingdom; University of Oxford, United Kingdom)
Energy-Aware Tile Size Selection for Affine Programs on GPUs
Malith Jayaweera ORCID logo, Martin Kong, Yanzhi Wang, and David Kaeli ORCID logo
(Northeastern University, USA; Ohio State University, USA)
Published Artifact Artifacts Available Artifacts Functional
PolyTOPS: Reconfigurable and Flexible Polyhedral Scheduler
Gianpietro Consolaro, Zhen Zhang, Harenome Razanajato, Nelson Lossing, Nassim Tchoulak, Adilla Susungi, Artur Cesar Araujo Alves, Renwei Zhang, Denis Barthou, Corinne Ancourt, and Cédric Bastoul
(Huawei Technologies, France; Mines Paris-PSL, France; Huawei Technologies, China)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced

Machine-Learning Guided Optimizations

AskIt: Unified Programming Interface for Programming with Large Language Models
Katsumi Okuda ORCID logo and Saman AmarasingheORCID logo
(Massachusetts Institute of Technology, USA; Mitsubishi Electric Corporation, Japan)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
Revealing Compiler Heuristics through Automated Discovery and Optimization
Volker Seeker, Chris Cummins, Murray Cole, Björn Franke ORCID logo, Kim Hazelwood ORCID logo, and Hugh Leather ORCID logo
(Meta AI Research, USA; University of Edinburgh, United Kingdom)
SLaDe: A Portable Small Language Model Decompiler for Optimized Assembly
Jordi Armengol-Estapé ORCID logo, Jackson Woodruff ORCID logo, Chris Cummins ORCID logo, and Michael F. P. O'Boyle ORCID logo
(University of Edinburgh, United Kingdom; Meta AI Research, USA)
Published Artifact Artifacts Available
TapeFlow: Streaming Gradient Tapes in Automatic Differentiation
Milad Hakimi and Arrvindh Shriraman
(Simon Fraser University, Canada)
Video

Compilers for GPUs

A Framework for Fine-Grained Synchronization of Dependent GPU Kernels
Abhinav Jangda ORCID logo, Saeed MalekiORCID logo, Maryam Mehri DehnaviORCID logo, Madan Musuvathi ORCID logo, and Olli Saarikivi ORCID logo
(Microsoft Research, USA; University of Toronto, Canada)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
Enhancing Performance through Control-Flow Unmerging and Loop Unrolling on GPUs
Alnis Murtovi, Giorgis Georgakoudis ORCID logo, Konstantinos Parasyris, Chunhua Liao ORCID logo, Ignacio Laguna, and Bernhard Steffen
(TU Dortmund, Germany; Lawrence Livermore National Laboratory, USA)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
Retargeting and Respecializing GPU Workloads for Performance Portability
Ivan R. Ivanov ORCID logo, Oleksandr Zinenko ORCID logo, Jens Domke ORCID logo, Toshio Endo ORCID logo, and William S. Moses ORCID logo
(Tokyo Institute of Technology, Japan; RIKEN R-CCS, Japan; Google DeepMind, France; University of Illinois at Urbana-Champaign, USA; Google DeepMind, USA)
Published Artifact Info Artifacts Available Artifacts Reusable Results Reproduced
Seer: Predictive Runtime Kernel Selection for Irregular Problems
Ryan Swann, Muhammad Osama ORCID logo, Karthik Sangaiah, and Jalal Mahmud
(AMD, USA)
Artifacts Reusable Results Reproduced

Custom Processors

AXI4MLIR: User-Driven Automatic Host Code Generation for Custom AXI-Based Accelerators
Nicolas Bohm Agostini ORCID logo, Jude Haris ORCID logo, Perry Gibson ORCID logo, Malith Jayaweera ORCID logo, Norm Rubin, Antonino Tumeo ORCID logo, José L. AbellánORCID logo, José Cano ORCID logo, and David Kaeli ORCID logo
(Northeastern University, USA; Pacific Northwest National Laboratory, USA; University of Glasgow, United Kingdom; University of Murcia, Spain)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
Ecmas: Efficient Circuit Mapping and Scheduling for Surface Code
Mingzheng Zhu, Hao Fu, Jun Wu ORCID logo, Chi Zhang, Wei Xie, and Xiang-Yang Li
(University of Science and Technology of China, China)
PresCount: Effective Register Allocation for Bank Conflict Reduction
Xiaofeng Guan ORCID logo, Hao Zhou ORCID logo, Guoqing Bao ORCID logo, Handong Li, Liang Zhu, and Jianguo Yao
(Shanghai Jiao Tong University, China; Shanghai Enflame Technology, China)
Tackling the Matrix Multiplication Micro-kernel Generation with Exo
Adrián Castelló ORCID logo, Julian Bellavita, Grace Dinh ORCID logo, Yuka Ikarashi ORCID logo, and Héctor Martínez
(Universitat Politècnica de València, Spain; Cornell University, USA; University of California at Berkeley, USA; Massachusetts Institute of Technology, USA; Universidad de Córdoba, Spain)
Published Artifact Artifacts Available Artifacts Reusable

Compiler Construction

One Automaton to Rule Them All: Beyond Multiple Regular Expressions Execution
Luisa Cicolini ORCID logo, Filippo Carloni ORCID logo, Marco D. Santambrogio ORCID logo, and Davide Conficconi ORCID logo
(Politecnico di Milano, Italy)
Published Artifact Video Info Artifacts Available Artifacts Reusable Results Reproduced
Whose Baseline Compiler Is It Anyway?
Ben L. Titzer ORCID logo
(Carnegie Mellon University, USA)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
Enabling Fine-Grained Incremental Builds by Making Compiler Stateful
Ruobing Han ORCID logo, Jisheng Zhao ORCID logo, and Hyesoon Kim ORCID logo
(Georgia Institute of Technology, USA)

Custom Environments

Compile-Time Analysis of Compiler Frameworks for Query Compilation
Alexis Engelke ORCID logo and Tobias Schwarz ORCID logo
(TU Munich, Germany)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
DrPy: Pinpointing Inefficient Memory Usage in Multi-Layer Python Applications
Jinku Cui, Qidong Zhao ORCID logo, Yueming Hao ORCID logo, and Xu Liu ORCID logo
(North Carolina State University, USA)
Artifacts Functional
SCHEMATIC: Compile-Time Checkpoint Placement and Memory Allocation for Intermittent Systems
Hugo Reymond ORCID logo, Jean-Luc Béchennec ORCID logo, Mikaël Briday ORCID logo, Sébastien Faucou ORCID logo, Isabelle Puaut ORCID logo, and Erven Rohou ORCID logo
(Université de Rennes - Inria - CNRS - IRISA, France; Nantes Université - École Centrale Nantes - CNRS - LS2N - UMR 6004, France)
Video
Latent Idiom Recognition for a Minimalist Functional Array Language using Equality Saturation
Jonathan Van der Cruysse and Christophe DubachORCID logo
(McGill University, Canada)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced

Static/Dynamic Analyses

BEC: Bit-Level Static Analysis for Reliability against Soft Errors
Yousun Ko ORCID logo and Bernd Burgstaller ORCID logo
(Yonsei University, South Korea)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
Boosting the Performance of Multi-solver IFDS Algorithms with Flow-Sensitivity Optimizations
Haofeng Li, Jie Lu, Haining Meng, Liqing Cao, Lian Li, and Lin Gao
(Institute of Computing Technology at Chinese Academy of Sciences, China; University of Chinese Academy of Sciences, China; Zhongguancun Laboratory, China; TianqiSoft, China)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
Representing Data Collections in an SSA Form
Tommy McMichenORCID logo, Nathan Greiner ORCID logo, Peter Zhong ORCID logo, Federico Sossai ORCID logo, Atmn Patel ORCID logo, and Simone Campanoni ORCID logo
(Northwestern University, USA)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
Revamping Sampling-Based PGO with Context-Sensitivity and Pseudo-instrumentation
Wenlei He, Hongtao Yu, Lei Wang, and Taewook Oh
(Meta, USA)

Supporting Tools

Compiler Testing with Relaxed Memory Models
Luke GeesonORCID logo and Lee Smith
(University College London, United Kingdom; Arm, United Kingdom)
Published Artifact Info Artifacts Available Artifacts Reusable Results Reproduced
High-Throughput, Formal-Methods-Assisted Fuzzing for LLVM
Yuyou Fan and John Regehr ORCID logo
(University of Utah, USA)
Published Artifact Info Artifacts Available
EasyTracker: A Python Library for Controlling and Inspecting Program Execution
Théo Barollet, Christophe Guillon, Manuel Selva, François Broquedis, Florent Bouchez-Tichadou, and Fabrice Rastello
(University Grenoble Alpes - Inria - CNRS - Grenoble INP - LIG, France)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced
OptiWISE: Combining Sampling and Instrumentation for Granular CPI Analysis
Yuxin Guo, Alex W. Chadwick, Márton Erdős, Utpal Bora, Ilias Vougioukas ORCID logo, Giacomo Gabrielli, and Timothy M. JonesORCID logo
(University of Cambridge, United Kingdom; Arm, USA; Arm, United Kingdom)
Published Artifact Video Artifacts Available Artifacts Reusable Results Reproduced

Practice and Experience

EasyView: Bringing Performance Profiles into Integrated Development Environments
Qidong Zhao ORCID logo, Milind Chabbi ORCID logo, and Xu Liu ORCID logo
(North Carolina State University, USA; Scalable Machines Research, USA)
Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Experiences Building an MLIR-Based SYCL Compiler
Ettore Tiotto, Víctor Pérez ORCID logo, Whitney Tsang, Lukas Sommer ORCID logo, Julian Oppermann ORCID logo, Victor Lomüller ORCID logo, Mehdi Goli ORCID logo, and James Brodman ORCID logo
(Intel Corporation, Canada; Codeplay Software, United Kingdom; Intel Corporation, USA)
Published Artifact Artifacts Available
Unveiling and Vanquishing Goroutine Leaks in Enterprise Microservices: A Dynamic Analysis Approach
Georgian-Vlad Saioc, Dmitriy Shirchenko, and Milind Chabbi ORCID logo
(Aarhus University, Denmark; Uber Technologies, Denmark; Uber Technologies, USA)
Video

Acceleration Techniques

A System-Level Dynamic Binary Translator using Automatically-Learned Translation Rules
Jinhu Jiang, Chaoyi Liang, Rongchao Dong, Zhaohui Yang, Zhongjun Zhou, Wenwen Wang, Pen-Chung Yew, and Weihua Zhang
(Fudan University, China; University of Georgia, USA; University of Minnesota at Twin Cities, USA)
Instruction Scheduling for the GPU on the GPU
Ghassan ShobakiORCID logo, Pınar Muyan-Özçelik ORCID logo, Josh Hutton, Bruce Linck, Vladislav Malyshenko, Austin Kerbow ORCID logo, Ronaldo Ramirez-Ortega, and Vahl Scott Gordon
(California State University, Sacramento, USA; Advanced Micro Devices, USA)
Published Artifact Artifacts Available Artifacts Functional
JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication
Qiang Fu, Thomas B. Rolinger, and H. Howie Huang
(Advanced Micro Devices, USA; NVIDIA, USA; George Washington University, USA)
oneDNN Graph Compiler: A Hybrid Approach for High-Performance Deep Learning Compilation
Jianhui Li, Zhennan Qin, Yijie Mei, Jingze Cui, Yunfei Song, Ciyong Chen, Yifei Zhang, Longsheng Du, Xianhang Cheng, Baihui Jin, Yan Zhang, Jason Ye, Eric Lin, and Dan Lavery
(Intel, USA; Intel, China)
Published Artifact Artifacts Available Artifacts Reusable Results Reproduced

proc time: 0.1