ASPLOS 2023
28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2 (ASPLOS 2023)
Powered by
Conference Publishing Consulting

28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2 (ASPLOS 2023), March 25–29, 2023, Vancouver, BC, Canada

ASPLOS 2023 – Proceedings

Contents - Abstracts - Authors

Frontmatter

Title Page
ASPLOS 2023 Volume II Program Chairs’ Message
Committees

Papers

Achieving Sub-second Pairwise Query over Evolving Graphs
Hongtao Chen, Mingxing Zhang, Ke Yang, Kang Chen, Albert Zomaya, Yongwei Wu, and Xuehai Qian
(Tsinghua University, China; Beijing HaiZhi XingTu Technology, China; University of Sydney, Australia; Purdue University, USA)
Publisher's Version
AfterImage: Leaking Control Flow Data and Tracking Load Operations via the Hardware Prefetcher
Yun Chen, Lingfeng Pei, and Trevor E. Carlson
(National University of Singapore, Singapore)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional
A Generic Service to Provide In-Network Aggregation for Key-Value Streams
Yongchao He, Wenfei Wu, Yanfang Le, Ming Liu, and ChonLam Lao
(Tsinghua University, China; Peking University, China; Intel, USA; University of Wisconsin-Madison, USA; Harvard University, USA)
Publisher's Version
A Prediction System Service
Zhizhou Zhang, Alvin Oliver Glova, Timothy Sherwood, and Jonathan Balkind
(University of California at Santa Barbara, USA)
Publisher's Version
AtoMig: Automatically Migrating Millions Lines of Code from TSO to WMM
Martin Beck, Koustubha Bhat, Lazar Stričević, Geng Chen, Diogo Behrens, Ming Fu, Viktor Vafeiadis, Haibo Chen, and Hermann Härtig
(Huawei Dresden Research Center, Dresden, Germany; Huawei Fundamental Software Innovation Lab, Shenzhen, China; MPI-SWS, Kaiserslautern, Germany; Huawei Central Software Institute, Shenzhen, China; Shanghai Jiao Tong University, Shanghai, China; TU Dresden, Dresden, Germany)
Publisher's Version
BeeHive: Sub-second Elasticity for Web Services with Semi-FaaS Execution
Ziming Zhao, Mingyu Wu, Jiawei Tang, Binyu Zang, Zhaoguo Wang, and Haibo Chen
(Shanghai Jiao Tong University, China; Shanghai AI Laboratory, China; Engineering Research Center for Domain-specific Operating Systems of the Ministry of Education of China, China)
Publisher's Version
Better Than Worst-Case Decoding for Quantum Error Correction
Gokul Subramanian Ravi, Jonathan M. Baker, Arash Fayyazi, Sophia Fuhui Lin, Ali Javadi-Abhari, Massoud Pedram, and Frederic T. Chong
(University of Chicago, USA; University of Southern California, USA; IBM, USA)
Publisher's Version
Betty: Enabling Large-Scale GNN Training with Batch-Level Graph Partitioning
Shuangyan Yang, Minjia Zhang, Wenqian Dong, and Dong Li
(University of California at Merced, USA; Microsoft Research, USA; Florida International University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Carbon Explorer: A Holistic Framework for Designing Carbon Aware Datacenters
Bilge Acun, Benjamin Lee, Fiodar Kazhamiaka, Kiwan Maeng, Udit Gupta, Manoj Chakkaravarthy, David Brooks, and Carole-Jean Wu
(Meta, USA; University of Pennsylvania, USA; Stanford University, USA; Harvard University, USA)
Publisher's Version
CommonGraph: Graph Analytics on Evolving Data
Mahbod Afarin, Chao Gao, Shafiur Rahman, Nael Abu-Ghazaleh, and Rajiv Gupta
(University of California at Riverside, USA)
Publisher's Version
Compilation Consistency Modulo Debug Information
Theodore Luo Wang, Yongqiang Tian, Yiwen Dong, Zhenyang Xu, and Chengnian Sun
(University of Waterloo, Canada)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional
Compiling Distributed System Models with PGo
Finn Hackett, Shayan Hosseini, Renato Costa, Matthew Do, and Ivan Beschastnikh
(University of British Columbia, Canada)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Copy-on-Pin: The Missing Piece for Correct Copy-on-Write
David Hildenbrand, Martin Schulz, and Nadav Amit
(TU Munich, Germany; Red Hat, Germany; VMware, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Decker: Attack Surface Reduction via On-Demand Code Mapping
Chris Porter, Sharjeel Khan, and Santosh Pande
(Georgia Institute of Technology, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
DeepUM: Tensor Migration and Prefetching in Unified Memory
Jaehoon Jung, Jinpyo Kim, and Jaejin Lee
(Moreh, South Korea; Seoul National University, South Korea)
Publisher's Version
Ditto: End-to-End Application Cloning for Networked Cloud Services
Mingyu Liang, Yu Gan, Yueying Li, Carlos Torres, Abhishek Dhanotia, Mahesh Ketkar, and Christina Delimitrou
(Cornell University, USA; Meta, USA; Intel, USA; Massachusetts Institute of Technology, USA)
Publisher's Version
DPACS: Hardware Accelerated Dynamic Neural Network Pruning through Algorithm-Architecture Co-design
Yizhao Gao, Baoheng Zhang, Xiaojuan Qi, and Hayden Kwok-Hay So
(University of Hong Kong, Hong Kong)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Ecovisor: A Virtual Energy System for Carbon-Efficient Applications
Abel Souza, Noman Bashir, Jorge Murillo, Walid Hanafy, Qianlin Liang, David Irwin, and Prashant Shenoy
(University of Massachusetts at Amherst, USA)
Publisher's Version
ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning
Diandian Gu, Yihao Zhao, Yinmin Zhong, Yifan Xiong, Zhenhua Han, Peng Cheng, Fan Yang, Gang Huang, Xin Jin, and Xuanzhe Liu
(Peking University, China; Microsoft Research, China)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
EVStore: Storage and Caching Capabilities for Scaling Embedding Tables in Deep Recommendation Systems
Daniar H. Kurniawan, Ruipu Wang, Kahfi S. Zulkifli, Fandi A. Wiranata, John Bent, Ymir Vigfusson, and Haryadi S. Gunawi
(University of Chicago, USA; Beijing University of Technology, China; Bandung Institute of Technology, Indonesia; Seagate Technology, USA; Emory University, USA)
Publisher's Version
FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
Sheng-Chun Kao, Suvinay Subramanian, Gaurav Agrawal, Amir Yazdanbakhsh, and Tushar Krishna
(Georgia Institute of Technology, USA; Google, USA; Microsoft, USA)
Publisher's Version
FrozenQubits: Boosting Fidelity of QAOA by Skipping Hotspot Nodes
Ramin Ayanzadeh, Narges Alavisamani, Poulami Das, and Moinuddin Qureshi
(Georgia Institute of Technology, USA)
Publisher's Version Info
GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture
Zaid Qureshi, Vikram Sharma Mailthody, Isaac Gelado, Seungwon Min, Amna Masood, Jeongmin Park, Jinjun Xiong, C. J. Newburn, Dmitri Vainbrand, I-Hsin Chung, Michael Garland, William Dally, and Wen-mei Hwu
(University of Illinois at Urbana-Champaign, USA; NVIDIA, USA; AMD, USA; University at Buffalo, USA; IBM Research, USA; Stanford University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional
GZKP: A GPU Accelerated Zero-Knowledge Proof System
Weiliang Ma, Qian Xiong, Xuanhua Shi, Xiaosong Ma, Hai Jin, Haozhao Kuang, Mingyu Gao, Ye Zhang, Haichen Shen, and Weifang Hu
(Huazhong University of Science and Technology, China; Hamad Bin Khalifa University, Qatar; Tsinghua University, China; Scroll Foundation, Seychelles)
Publisher's Version
Hacky Racers: Exploiting Instruction-Level Parallelism to Generate Stealthy Fine-Grained Timers
Haocheng Xiao and Sam Ainsworth
(University of Edinburgh, UK)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor Programs
Yaoyao Ding, Cody Hao Yu, Bojian Zheng, Yizhi Liu, Yida Wang, and Gennady Pekhimenko
(University of Toronto, Canada; Vector Institute, Canada; Amazon Web Services, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
HuffDuff: Stealing Pruned DNNs from Sparse Accelerators
Dingqing Yang, Prashant J. Nair, and Mieszko Lis
(University of British Columbia, Canada)
Publisher's Version
Junkyard Computing: Repurposing Discarded Smartphones to Minimize Carbon
Jennifer Switzer, Gabriel Marcano, Ryan Kastner, and Pat Pannuto
(University of California at San Diego, USA)
Publisher's Version
Khuzdul: Efficient and Scalable Distributed Graph Pattern Mining Engine
Jingji Chen and Xuehai Qian
(Purdue University, USA)
Publisher's Version
KIT: Testing OS-Level Virtualization for Functional Interference Bugs
Congyu Liu, Sishuai Gong, and Pedro Fonseca
(Purdue University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
LeaFTL: A Learning-Based Flash Translation Layer for Solid-State Drives
Jinghan Sun, Shaobo Li, Yunxin Sun, Chao Sun, Dejan Vucinic, and Jian Huang
(University of Illinois at Urbana-Champaign, USA; ETH Zurich, Switzerland; Western Digital Research, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Lucid: A Non-intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
Qinghao Hu, Meng Zhang, Peng Sun, Yonggang Wen, and Tianwei Zhang
(Nanyang Technological University, Singapore; Shanghai AI Laboratory, China; SenseTime, China)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
MC Mutants: Evaluating and Improving Testing for Memory Consistency Specifications
Reese Levine, Tianhao Guo, Mingun Cho, Alan Baker, Raph Levien, David Neto, Andrew Quinn, and Tyler Sorensen
(University of California at Santa Cruz, USA; New York University, USA; University of California at Davis, USA; Google, Canada; Google, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Mobius: Fine Tuning Large-Scale Models on Commodity GPU Servers
Yangyang Feng, Minhui Xie, Zijie Tian, Shuo Wang, Youyou Lu, and Jiwu Shu
(Tsinghua University, China)
Publisher's Version
MSCCLang: Microsoft Collective Communication Language
Meghan Cowan, Saeed Maleki, Madanlal Musuvathi, Olli Saarikivi, and Yifan Xiong
(Microsoft Research, USA; Microsoft Research, China)
Publisher's Version
Navigating the Dynamic Noise Landscape of Variational Quantum Algorithms with QISMET
Gokul Subramanian Ravi, Kaitlin Smith, Jonathan M. Baker, Tejas Kannan, Nathan Earnest, Ali Javadi-Abhari, Henry Hoffmann, and Frederic T. Chong
(University of Chicago, USA; IBM, USA)
Publisher's Version
NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers
Jiawei Liu, Jinkun Lin, Fabian Ruffy, Cheng Tan, Jinyang Li, Aurojit Panda, and Lingming Zhang
(University of Illinois at Urbana-Champaign, USA; New York University, USA; Northeastern University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
NUBA: Non-Uniform Bandwidth GPUs
Xia Zhao, Magnus Jahre, Yuhua Tang, Guangda Zhang, and Lieven Eeckhout
(Academy of Military Sciences, China; NTNU, Norway; National University of Defense Technology, China; Ghent University, Belgium)
Publisher's Version
Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression
Jaeyong Song, Jinkyu Yim, Jaewon Jung, Hongsun Jang, Hyung-Jin Kim, Youngsok Kim, and Jinho Lee
(Yonsei University, South Korea; Seoul National University, South Korea; Samsung Electronics, South Korea)
Publisher's Version Published Artifact Artifacts Available
Pond: CXL-Based Memory Pooling Systems for Cloud Platforms
Huaicheng Li, Daniel S. Berger, Lisa Hsu, Daniel Ernst, Pantea Zardoshti, Stanko Novakovic, Monish Shah, Samir Rajadnya, Scott Lee, Ishwar Agarwal, Mark D. Hill, Marcus Fontoura, and Ricardo Bianchini
(Virginia Tech, USA; Carnegie Mellon University, USA; Microsoft Azure, USA; University of Washington, USA; Unaffiliated, USA; Google, USA; Microsoft, USA; Intel, USA; University of Wisconsin-Madison, USA; Stone, USA)
Publisher's Version
Prism: Optimizing Key-Value Store for Modern Heterogeneous Storage Devices
Yongju Song, Wook-Hee Kim, Sumit Kumar Monga, Changwoo Min, and Young Ik Eom
(Sungkyunkwan University, South Korea; Konkuk University, South Korea; Virginia Tech, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Probabilistic Concurrency Testing for Weak Memory Programs
Mingyu Gao, Soham Chakraborty, and Burcu Kulahcioglu Ozkan
(Delft University of Technology, Netherlands)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Propeller: A Profile Guided, Relinking Optimizer for Warehouse-Scale Applications
Han Shen, Krzysztof Pszeniczny, Rahman Lavaee, Snehasish Kumar, Sriraman Tallam, and Xinliang David Li
(Google, USA; Google, Switzerland)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Protecting Data Integrity of Web Applications with Database Constraints Inferred from Application Code
Haochen Huang, Bingyu Shen, Li Zhong, and Yuanyuan Zhou
(University of California at San Diego, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Qompress: Efficient Compilation for Ququarts Exploiting Partial and Mixed Radix Operations for Communication Reduction
Andrew Litteken, Lennart Maximilian Seifert, Jason Chadwick, Natalia Nottingham, Frederic T. Chong, and Jonathan M. Baker
(University of Chicago, USA)
Publisher's Version
RAIZN: Redundant Array of Independent Zoned Namespaces
Thomas Kim, Jekyeom Jeon, Nikhil Arora, Huaicheng Li, Michael Kaminsky, David G. Andersen, Gregory R. Ganger, George Amvrosiadis, and Matias Bjørling
(Carnegie Mellon University, USA; Enriched Ag, USA; Western Digital, Denmark)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional
Revisiting Log-Structured Merging for KV Stores in Hybrid Memory Systems
Zhuohui Duan, Jiabo Yao, Haikun Liu, Xiaofei Liao, Hai Jin, and Yu Zhang
(Huazhong University of Science and Technology, China)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
Scoped Buffered Persistency Model for GPUs
Shweta Pandey, Aditya K Kamath, and Arkaprava Basu
(IISc Bangalore, India; University of Washington, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
ShakeFlow: Functional Hardware Description with Latency-Insensitive Interface Combinators
Sungsoo Han, Minseong Jang, and Jeehoon Kang
(KAIST, South Korea)
Publisher's Version Published Artifact Artifacts Available
Sigma: Compiling Einstein Summations to Locality-Aware Dataflow
Tian Zhao, Alexander Rucker, and Kunle Olukotun
(Stanford University, USA)
Publisher's Version
SMAPPIC: Scalable Multi-FPGA Architecture Prototype Platform in the Cloud
Grigory Chirkov and David Wentzlaff
(Princeton University, USA)
Publisher's Version
Spada: Accelerating Sparse Matrix Multiplication with Adaptive Dataflow
Zhiyao Li, Jiaxiang Li, Taijie Chen, Dimin Niu, Hongzhong Zheng, Yuan Xie, and Mingyu Gao
(Tsinghua University, China; Northwestern University, USA; Alibaba DAMO Academy, China; Shanghai Qi Zhi Institute, China)
Publisher's Version
SpecPMT: Speculative Logging for Resolving Crash Consistency Overhead of Persistent Memory
Chencheng Ye, Yuanchao Xu, Xipeng Shen, Yan Sha, Xiaofei Liao, Hai Jin, and Yan Solihin
(Huazhong University of Science and Technology, China; North Carolina State University, USA; University of Central Florida, USA)
Publisher's Version
Stepwise Debugging for Hardware Accelerators
Griffin Berlstein, Rachit Nigam, Christophe Gyurgyik, and Adrian Sampson
(Cornell University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
STI: Turbocharge NLP Inference at the Edge via Elastic Pipelining
Liwei Guo, Wonkyo Choe, and Felix Xiaozhu Lin
(University of Virginia, USA)
Publisher's Version
TensorIR: An Abstraction for Automatic Tensorized Program Optimization
Siyuan Feng, Bohan Hou, Hongyi Jin, Wuwei Lin, Junru Shao, Ruihang Lai, Zihao Ye, Lianmin Zheng, Cody Hao Yu, Yong Yu, and Tianqi Chen
(Shanghai Jiao Tong University, China; Carnegie Mellon University, USA; OctoML, USA; University of Washington, USA; University of California at Berkeley, USA; Amazon Web Services, USA)
Publisher's Version
TiLT: A Time-Centric Approach for Stream Query Optimization and Parallelization
Anand Jayarajan, Wei Zhao, Yudi Sun, and Gennady Pekhimenko
(University of Toronto, Canada)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
TLP: A Deep Learning-Based Cost Model for Tensor Program Tuning
Yi Zhai, Yu Zhang, Shuo Liu, Xiaomeng Chu, Jie Peng, Jianmin Ji, and Yanyong Zhang
(University of Science and Technology of China, China)
Publisher's Version
Towards a Machine Learning-Assisted Kernel with LAKE
Henrique Fingler, Isha Tarte, Hangchen Yu, Ariel Szekely, Bodun Hu, Aditya Akella, and Christopher J. Rossbach
(University of Texas at Austin, USA; Meta, USA; Massachusetts Institute of Technology, USA; Katana Graph, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
uBFT: Microsecond-Scale BFT using Disaggregated Memory
Marcos K. Aguilera, Naama Ben-David, Rachid Guerraoui, Antoine Murat, Athanasios Xygkis, and Igor Zablotchi
(VMware Research, USA; EPFL, Switzerland; Massachusetts Institute of Technology, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
uGrapher: High-Performance Graph Operator Computation via Unified Abstraction for Graph Neural Networks
Yangjie Zhou, Jingwen Leng, Yaoxu Song, Shuwen Lu, Mian Wang, Chao Li, Minyi Guo, Wenting Shen, Yong Li, Wei Lin, Xiangwen Liu, and Hanqing Wu
(Shanghai Jiao Tong University, China; Alibaba Group, China)
Publisher's Version
VClinic: A Portable and Efficient Framework for Fine-Grained Value Profilers
Xin You, Hailong Yang, Kelun Lei, Zhongzhi Luan, and Depei Qian
(Beihang University, China)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
VDom: Fast and Unlimited Virtual Domains on Multiple Architectures
Ziqi Yuan, Siyu Hong, Rui Chang, Yajin Zhou, Wenbo Shen, and Kui Ren
(Zhejiang University, China)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced
WACO: Learning Workload-Aware Co-optimization of the Format and Schedule of a Sparse Tensor Program
Jaeyeon Won, Charith Mendis, Joel S. Emer, and Saman Amarasinghe
(Massachusetts Institute of Technology, USA; University of Illinois at Urbana-Champaign, USA; NVIDIA, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional
Where Did My Variable Go? Poking Holes in Incomplete Debug Information
Cristian Assaiante, Daniele Cono D'Elia, Giuseppe Antonio Di Luna, and Leonardo Querzoni
(Sapienza University of Rome, Italy)
Publisher's Version

proc time: 17.01