ASPLOS 2021
26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2021)
Powered by
Conference Publishing Consulting

26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2021), April 19–23, 2021, Virtual, USA

ASPLOS 2021 – Proceedings

Contents - Abstracts - Authors

Frontmatter

Title Page
Article: asplos21foreword-fm000-p doi:
ASPLOS 2021 General Chair’s Message
Article: asplos21foreword-fm001-p doi:
ASPLOS 2021 Program Chairs’ Message
Article: asplos21foreword-fm004-p doi:
ASPLOS 2021 Organization
Article: asplos21foreword-fm002-p doi:

Session 1: Packet Up

PacketMill: Toward Per-Core 100-Gbps Networking
Alireza Farshin, Tom Barbette, Amir Roozbeh, Gerald Q. Maguire Jr., and Dejan Kostić
(KTH, Sweden; Ericsson Research, Sweden)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p393-p doi:10.1145/3445814.3446724
Autonomous NIC Offloads
Boris Pismenny, Haggai Eran, Aviad Yehezkel, Liran Liss, Adam Morrison, and Dan Tsafrir
(Technion, Israel; NVIDIA, Israel; Tel Aviv University, Israel; VMware Research, USA)
Publisher's Version Article: asplos21main-p499-p doi:10.1145/3445814.3446732
Dagger: Efficient and Fast RPCs in Cloud Microservices with Near-Memory Reconfigurable NICs
Nikita Lazarev, Shaojie Xiang, Neil Adit, Zhiru Zhang, and Christina Delimitrou
(Cornell University, USA)
Publisher's Version Article: asplos21main-p59-p doi:10.1145/3445814.3446696

Session 2: Memory Systems

BCD Deduplication: Effective Memory Compression using Partial Cache-Line Deduplication
Sungbo Park, Ingab Kang, Yaebin Moon, Jung Ho Ahn, and G. Edward Suh
(Intel Corporation, USA; University of Michigan, USA; Seoul National University, South Korea; Cornell University, USA)
Publisher's Version Article: asplos21main-p379-p doi:10.1145/3445814.3446722
KLOCs: Kernel-Level Object Contexts for Heterogeneous Memory Systems
Sudarsun Kannan, Yujie Ren, and Abhishek Bhattacharjee
(Rutgers University, USA; Yale University, USA)
Publisher's Version Article: asplos21main-p844-p doi:10.1145/3445814.3446745
Rethinking Software Runtimes for Disaggregated Memory
Irina Calciu, M. Talha Imran, Ivan Puddu, Sanidhya Kashyap, Hasan Al Maruf, Onur Mutlu, and Aasheesh Kolli
(VMware Research, USA; Pennsylvania State University, USA; ETH Zurich, Switzerland; EPFL, Switzerland; University of Michigan, USA; Google, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p210-p doi:10.1145/3445814.3446713

Session 3: Flow

DiAG: A Dataflow-Inspired Architecture for General-Purpose Processors
Dong Kai Wang and Nam Sung Kim
(University of Illinois at Urbana-Champaign, USA)
Publisher's Version Article: asplos21main-p101-p doi:10.1145/3445814.3446703
LifeStream: A High-Performance Stream Processing Engine for Periodic Streams
Anand Jayarajan, Kimberly Hau, Andrew Goodwin, and Gennady Pekhimenko
(University of Toronto, Canada; SickKids Hospital, Canada; University of Sydney, Australia)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p402-p doi:10.1145/3445814.3446725
When Application-Specific ISA Meets FPGAs: A Multi-layer Virtualization Framework for Heterogeneous Cloud FPGAs
Yue Zha and Jing Li
(University of Pennsylvania, USA)
Publisher's Version Article: asplos21main-p79-p doi:10.1145/3445814.3446699

Session 4: Microservices

Sage: Practical and Scalable ML-Driven Performance Debugging in Microservices
Yu Gan, Mingyu Liang, Sundar Dev, David Lo, and Christina Delimitrou
(Cornell University, USA; Google, USA)
Publisher's Version Article: asplos21main-p85-p doi:10.1145/3445814.3446700
Nightcore: Efficient and Scalable Serverless Computing for Latency-Sensitive, Interactive Microservices
Zhipeng Jia and Emmett Witchel
(University of Texas at Austin, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p89-p doi:10.1145/3445814.3446701
Sinan: ML-Based and QoS-Aware Resource Management for Cloud Microservices
Yanqi Zhang, Weizhe Hua, Zhuangzhuang Zhou, G. Edward Suh, and Christina Delimitrou
(Cornell University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p43-p doi:10.1145/3445814.3446693

Session 5: Pages and Machine Architecture

NOREBA: A Compiler-Informed Non-speculative Out-of-Order Commit Processor
Ali Hajiabadi, Andreas Diavastos, and Trevor E. Carlson
(National University of Singapore, Singapore; Universitat Politècnica de Catalunya, Spain)
Publisher's Version Article: asplos21main-p415-p doi:10.1145/3445814.3446726
Fast Local Page-Tables for Virtualized NUMA Servers with vMitosis
Ashish Panwar, Reto Achermann, Arkaprava Basu, Abhishek Bhattacharjee, K. Gopinath, and Jayneel Gandhi
(IISc Bangalore, India; University of British Columbia, Canada; Yale University, USA; VMware Research, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced NOLINKDECO Article: asplos21main-p158-p doi:10.1145/3445814.3446709
PTEMagnet: Fine-Grained Physical Memory Reservation for Faster Page Walks in Public Clouds
Artemiy Margaritov, Dmitrii Ustiugov, Amna Shahab, and Boris Grot
(University of Edinburgh, UK)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p111-p doi:10.1145/3445814.3446704

Session 6: Languages and Systems I

In-Fat Pointer: Hardware-Assisted Tagged-Pointer Spatial Memory Safety Defense with Subobject Granularity Protection
Shengjie Xu, Wei Huang, and David Lie
(University of Toronto, Canada)
Publisher's Version Artifacts Functional Results Reproduced Article: asplos21main-p1439-p doi:10.1145/3445814.3446761
Judging a Type by Its Pointer: Optimizing GPU Virtual Functions
Mengchi Zhang, Ahmad Alawneh, and Timothy G. Rogers
(Purdue University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p515-p doi:10.1145/3445814.3446734
Enclosure: Language-Based Restriction of Untrusted Libraries
Adrien Ghosn, Marios Kogias, Mathias Payer, James R. Larus, and Edouard Bugnion
(EPFL, Switzerland; Microsoft Research, UK)
Publisher's Version Article: asplos21main-p459-p doi:10.1145/3445814.3446728

Session 7: Towards Improved Throughputs

Switches for HIRE: Resource Scheduling for Data Center In-Network Computing
Marcel Blöcher, Lin Wang, Patrick Eugster, and Max Schmidt
(TU Darmstadt, Germany; Vrije Universiteit Amsterdam, Netherlands; USI Lugano, Switzerland; Purdue University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p1375-p doi:10.1145/3445814.3446760
Probabilistic Profiling of Stateful Data Planes for Adversarial Testing
Qiao Kang, Jiarong Xing, Yiming Qiu, and Ang Chen
(Rice University, USA)
Publisher's Version Article: asplos21main-p1594-p doi:10.1145/3445814.3446764
MERCI: Efficient Embedding Reduction on Commodity Hardware via Sub-query Memoization
Yejin Lee, Seong Hoon Seo, Hyunji Choi, Hyoung Uk Sul, Soosung Kim, Jae W. Lee, and Tae Jun Ham
(Seoul National University, South Korea)
Publisher's Version Article: asplos21main-p265-p doi:10.1145/3445814.3446717

Session 8: Tools and Frameworks

SherLock: Unsupervised Synchronization-Operation Inference
Guangpu Li, Dongjie Chen, Shan Lu, Madanlal Musuvathi, and Suman Nath
(University of Chicago, USA; Nanjing University, China; Microsoft Research, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p974-p doi:10.1145/3445814.3446754
SIMDRAM: A Framework for Bit-Serial SIMD Processing using DRAM
Nastaran Hajinazar, Geraldo F. Oliveira, Sven Gregorio, João Dinis Ferreira, Nika Mansouri Ghiasi, Minesh Patel, Mohammed Alser, Saugata Ghose, Juan Gómez-Luna, and Onur Mutlu
(ETH Zurich, Switzerland; Simon Fraser University, Canada; University of Illinois at Urbana-Champaign, USA)
Publisher's Version Article: asplos21main-p926-p doi:10.1145/3445814.3446749
Clobber-NVM: Log Less, Re-execute More
Yi Xu, Joseph Izraelevitz, and Steven Swanson
(University of California at San Diego, USA; University of Colorado at Boulder, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Article: asplos21main-p476-p doi:10.1145/3445814.3446730

Session 9: Mapping and Management of Quantum and Cloud

Time-Optimal Qubit Mapping
Chi Zhang, Ari B. Hayes, Longfei Qiu, Yuwei Jin, Yanhao Chen, and Eddy Z. Zhang
(University of Pittsburgh, USA; Rutgers University, USA)
Publisher's Version Article: asplos21main-p129-p doi:10.1145/3445814.3446706
Orchestrated Trios: Compiling for Efficient Communication in Quantum Programs with 3-Qubit Gates
Casey Duckering, Jonathan M. Baker, Andrew Litteken, and Frederic T. Chong
(University of Chicago, USA)
Publisher's Version Article: asplos21main-p276-p doi:10.1145/3445814.3446718
FaasCache: Keeping Serverless Computing Alive with Greedy-Dual Caching
Alexander Fuerst and Prateek Sharma
(Indiana University, USA)
Publisher's Version Published Artifact Artifacts Available Article: asplos21main-p1225-p doi:10.1145/3445814.3446757

Session 10: Persistence I

Hippocrates: Healing Persistent Memory Bugs without Doing Any Harm
Ian Neal, Andrew Quinn, and Baris Kasikci
(University of Michigan, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p44-p doi:10.1145/3445814.3446694
Jaaru: Efficiently Model Checking Persistent Memory Programs
Hamed Gorjiara, Guoqing Harry Xu, and Brian Demsky
(University of California at Irvine, USA; University of California at Los Angeles, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p544-p doi:10.1145/3445814.3446735
Corundum: Statically-Enforced Persistent Memory Safety
Morteza Hoseinzadeh and Steven Swanson
(University of California at San Diego, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p171-p doi:10.1145/3445814.3446710

Session 11: Quantum Abstractions

Qraft: Reverse Your Quantum Circuit and Know the Correct Program Output
Tirthak Patel and Devesh Tiwari
(Northeastern University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p772-p doi:10.1145/3445814.3446743
Logical Abstractions for Noisy Variational Quantum Algorithm Simulation
Yipeng Huang, Steven Holtzen, Todd Millstein, Guy Van den Broeck, and Margaret Martonosi
(Rutgers University, USA; University of California at Los Angeles, USA; Princeton University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p936-p doi:10.1145/3445814.3446750
CutQC: Using Small Quantum Computers for Large Quantum Circuit Evaluations
Wei Tang, Teague Tomesh, Martin Suchara, Jeffrey Larson, and Margaret Martonosi
(Princeton University, USA; Argonne National Laboratory, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p1241-p doi:10.1145/3445814.3446758

Session 12: Persistence II

PMFuzz: Test Case Generation for Persistent Memory Programs
Sihang Liu, Suyash Mahar, Baishakhi Ray, and Samira Khan
(University of Virginia, USA; University of California at San Diego, USA; Columbia University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p8-p doi:10.1145/3445814.3446691
Fast, Flexible, and Comprehensive Bug Detection for Persistent Memory Programs
Bang Di, Jiawen Liu, Hao Chen, and Dong Li
(Hunan University, China; University of California at Merced, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p827-p doi:10.1145/3445814.3446744
PMEM-Spec: Persistent Memory Speculation (Strict Persistency Can Trump Relaxed Persistency)
Jungi Jeong and Changhee Jung
(Purdue University, USA)
Publisher's Version Article: asplos21main-p75-p doi:10.1145/3445814.3446698

Session 13: Systems Software

VSync: Push-Button Verification and Optimization for Synchronization Primitives on Weak Memory Models
Jonas Oberhauser, Rafael Lourenco de Lima Chehab, Diogo Behrens, Ming Fu, Antonio Paolillo, Lilith Oberhauser, Koustubha Bhat, Yuzhong Wen, Haibo Chen, Jaeho Kim, and Viktor Vafeiadis
(Huawei, Germany; Huawei, China; Shanghai Jiao Tong University, China; MPI-SWS, Germany)
Publisher's Version Artifacts Functional Results Reproduced Article: asplos21main-p922-p doi:10.1145/3445814.3446748
CubicleOS: A Library OS with Software Componentisation for Practical Isolation
Vasily A. Sartakov, Lluís Vilanova, and Peter Pietzuch
(Imperial College London, UK)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p486-p doi:10.1145/3445814.3446731
Benchmarking, Analysis, and Optimization of Serverless Function Snapshots
Dmitrii Ustiugov, Plamen Petrov, Marios Kogias, Edouard Bugnion, and Boris Grot
(University of Edinburgh, UK; Microsoft Research, UK; EPFL, Switzerland)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p212-p doi:10.1145/3445814.3446714

Session 14: Beyond the Pixels

Rhythmic Pixel Regions: Multi-resolution Visual Sensing System towards High-Precision Visual Computing at Low Power
Venkatesh Kodukula, Alexander Shearer, Van Nguyen, Srinivas Lingutla, Yifei Liu, and Robert LiKamWa
(Arizona State University, USA)
Publisher's Version Article: asplos21main-p588-p doi:10.1145/3445814.3446737
Q-VR: System-Level Design for Future Mobile Collaborative Virtual Reality
Chenhao Xie, Xie Li, Yang Hu, Huwan Peng, Michael Taylor, and Shuaiwen Leon Song
(Pacific Northwest National Laboratory, USA; University of Sydney, Australia; University of Texas at Dallas, USA; University of Washington, USA)
Publisher's Version Article: asplos21main-p228-p doi:10.1145/3445814.3446715
Warehouse-Scale Video Acceleration: Co-design and Deployment in the Wild
Parthasarathy Ranganathan, Daniel Stodolsky, Jeff Calow, Jeremy Dorfman, Marisabel Guevara, Clinton Wills Smullen IV, Aki Kuusela, Raghu Balasubramanian, Sandeep Bhatia, Prakash Chauhan, Anna Cheung, In Suk Chong, Niranjani Dasharathi, Jia Feng, Brian Fosco, Samuel Foss, Ben Gelb, Sara J. Gwin, Yoshiaki Hase, Da-ke He, C. Richard Ho, Roy W. Huffman Jr., Elisha Indupalli, Indira Jayaram, Poonacha Kongetira, Cho Mon Kyaw, Aaron Laursen, Yuan Li, Fong Lou, Kyle A. Lucke, JP Maaninen, Ramon Macias, Maire Mahony, David Alexander Munday, Srikanth Muroor, Narayana Penukonda, Eric Perkins-Argueta, Devin Persaud, Alex Ramirez, Ville-Mikko Rautio, Yolanda Ripley, Amir Salek, Sathish Sekar, Sergey N. Sokolov, Rob Springer, Don Stark, Mercedes Tan, Mark S. Wachsler, Andrew C. Walton, David A. Wickeraad, Alvin Wijaya, and Hon Kwan Wu
(Google, USA)
Publisher's Version Article: asplos21main-p388-p doi:10.1145/3445814.3446723

Session 15: Races and Concurrency

Automatically Detecting and Fixing Concurrency Bugs in Go Software Systems
Ziheng Liu, Shuofei Zhu, Boqin Qin, Hao Chen, and Linhai Song
(Pennsylvania State University, USA; Beijing University of Posts and Telecommunications, China; University of California at Davis, USA)
Publisher's Version Article: asplos21main-p1170-p doi:10.1145/3445814.3446756
C11Tester: A Race Detector for C/C++ Atomics
Weiyu Luo and Brian Demsky
(University of California at Irvine, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p172-p doi:10.1145/3445814.3446711
Kard: Lightweight Data Race Detection with Per-Thread Memory Protection
Adil Ahmad, Sangho Lee, Pedro Fonseca, and Byoungyoung Lee
(Purdue University, USA; Microsoft Research, USA; Seoul National University, South Korea)
Publisher's Version Article: asplos21main-p442-p doi:10.1145/3445814.3446727

Session 16: Robots, Optimization, and Robo-Optimization

Quantifying the Design-Space Tradeoffs in Autonomous Drones
Ramyad Hadidi, Bahar Asgari, Sam Jijina, Adriana Amyette, Nima Shoghi, and Hyesoon Kim
(Georgia Institute of Technology, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p321-p doi:10.1145/3445814.3446721
Robomorphic Computing: A Design Methodology for Domain-Specific Accelerators Parameterized by Robot Morphology
Sabrina M. Neuman, Brian Plancher, Thomas Bourgeat, Thierry Tambe, Srinivas Devadas, and Vijay Janapa Reddi
(Harvard University, USA; Massachusetts Institute of Technology, USA)
Publisher's Version Article: asplos21main-p857-p doi:10.1145/3445814.3446746
Gamma: Leveraging Gustavson’s Algorithm to Accelerate Sparse Matrix Multiplication
Guowei Zhang, Nithya Attaluri, Joel S. Emer, and Daniel Sanchez
(Massachusetts Institute of Technology, USA)
Publisher's Version Article: asplos21main-p95-p doi:10.1145/3445814.3446702

Session 17: Solid State Drives

Reducing Solid-State Drive Read Latency by Optimizing Read-Retry
Jisung Park, Myungsuk Kim, Myoungjun Chun, Lois Orosa, Jihong Kim, and Onur Mutlu
(ETH Zurich, Switzerland; Seoul National University, South Korea; Kyungpook National University, South Korea)
Publisher's Version Article: asplos21main-p278-p doi:10.1145/3445814.3446719
RecSSD: Near Data Processing for Solid State Drive Based Recommendation Inference
Mark Wilkening, Udit Gupta, Samuel Hsia, Caroline Trippel, Carole-Jean Wu, David Brooks, and Gu-Yeon Wei
(Harvard University, USA; Facebook, USA)
Publisher's Version Published Artifact Artifacts Available Article: asplos21main-p1455-p doi:10.1145/3445814.3446763
Prolonging 3D NAND SSD Lifetime via Read Latency Relaxation
Chun-Yi Liu, Yunju Lee, Myoungsoo Jung, Mahmut Taylan Kandemir, and Wonil Choi
(Pennsylvania State University, USA; KAIST, South Korea)
Publisher's Version Article: asplos21main-p500-p doi:10.1145/3445814.3446733

Session 18: Security I

PIBE: Practical Kernel Control-Flow Hardening with Profile-Guided Indirect Branch Elimination
Victor Duta, Cristiano Giuffrida, Herbert Bos, and Erik van der Kouwe
(Vrije Universiteit Amsterdam, Netherlands)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p701-p doi:10.1145/3445814.3446740
Computing with Time: Microarchitectural Weird Machines
Dmitry Evtyushkin, Thomas Benjamin, Jesse Elwell, Jeffrey A. Eitel, Angelo Sapello, and Abhrajit Ghosh
(College of William & Mary, USA; Perspecta Labs, USA)
Publisher's Version Article: asplos21main-p460-p doi:10.1145/3445814.3446729
HerQules: Securing Programs via Hardware-Enforced Message Queues
Daming D. Chen, Wen Shih Lim, Mohammad Bakhshalipour, Phillip B. Gibbons, James C. Hoe, and Bryan Parno
(Carnegie Mellon University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p548-p doi:10.1145/3445814.3446736

Session 19: Better Hardware through Compilers

Effective Simulation and Debugging for a High-Level Hardware Language using Software Compilers
Clément Pit-Claudel, Thomas Bourgeat, Stella Lau, Arvind, and Adam Chlipala
(Massachusetts Institute of Technology, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p315-p doi:10.1145/3445814.3446720
A Compiler Infrastructure for Accelerator Generators
Rachit Nigam, Samuel Thomas, Zhijing Li, and Adrian Sampson
(Cornell University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p206-p doi:10.1145/3445814.3446712
Compiler-Driven FPGA Virtualization with SYNERGY
Joshua Landgraf, Tiffany Yang, Will Lin, Christopher J. Rossbach, and Eric Schkufza
(University of Texas at Austin, USA; VMware Research, USA; Katana Graph, USA; Amazon, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p990-p doi:10.1145/3445814.3446755

Session 20: Data Driven Optimization

BayesPerf: Minimizing Performance Monitoring Errors using Bayesian Statistics
Subho S. Banerjee, Saurabh Jha, Zbigniew Kalbarczyk, and Ravishankar K. Iyer
(University of Illinois at Urbana-Champaign, USA)
Publisher's Version Article: asplos21main-p648-p doi:10.1145/3445814.3446739
Training for Multi-resolution Inference using Reusable Quantization Terms
Sai Qian Zhang, Bradley McDanel, H. T. Kung, and Xin Dong
(Harvard University, USA; Franklin & Marshall College, USA)
Publisher's Version Article: asplos21main-p731-p doi:10.1145/3445814.3446741
A Hierarchical Neural Model of Data Prefetching
Zhan Shi, Akanksha Jain, Kevin Swersky, Milad Hashemi, Parthasarathy Ranganathan, and Calvin Lin
(University of Texas at Austin, USA; Google, USA)
Publisher's Version Article: asplos21main-p963-p doi:10.1145/3445814.3446752

Session 21: Supporting Hardware Parallelism

Vectorization for Digital Signal Processors via Equality Saturation
Alexa VanHattum, Rachit Nigam, Vincent T. Lee, James Bornholt, and Adrian Sampson
(Cornell University, USA; Facebook Reality Labs, USA; University of Texas at Austin, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p142-p doi:10.1145/3445814.3446707
Scalable FSM Parallelization via Path Fusion and Higher-Order Speculation
Junqiao Qiu, Xiaofan Sun, Amir Hossein Nodehi Sabet, and Zhijia Zhao
(Michigan Technological University, USA; University of California at Riverside, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p113-p doi:10.1145/3445814.3446705
VeGen: A Vectorizer Generator for SIMD and Beyond
Yishen Chen, Charith Mendis, Michael Carbin, and Saman Amarasinghe
(Massachusetts Institute of Technology, USA; University of Illinois at Urbana-Champaign, USA)
Publisher's Version Artifacts Functional Results Reproduced Article: asplos21main-p28-p doi:10.1145/3445814.3446692

Session 22: Neural Net Optimization

Neural Architecture Search as Program Transformation Exploration
Jack Turner, Elliot J. Crowley, and Michael F. P. O'Boyle
(University of Edinburgh, UK)
Publisher's Version Article: asplos21main-p969-p doi:10.1145/3445814.3446753
Analytical Characterization and Design Space Exploration for Optimization of CNNs
Rui Li, Yufan Xu, Aravind Sukumaran-Rajam, Atanas Rountev, and P. Sadayappan
(University of Utah, USA; Washington State University, USA; Ohio State University, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p1367-p doi:10.1145/3445814.3446759
Mind Mappings: Enabling Efficient Algorithm-Accelerator Mapping Space Search
Kartik Hegde, Po-An Tsai, Sitao Huang, Vikas Chandra, Angshuman Parashar, and Christopher W. Fletcher
(University of Illinois at Urbana-Champaign, USA; NVIDIA, USA; Facebook, USA)
Publisher's Version Article: asplos21main-p1450-p doi:10.1145/3445814.3446762

Session 23: Beyond Neural Nets

Statistical Robustness of Markov Chain Monte Carlo Accelerators
Xiangyu Zhang, Ramin Bashizade, Yicheng Wang, Sayan Mukherjee, and Alvin R. Lebeck
(Duke University, USA)
Publisher's Version Article: asplos21main-p62-p doi:10.1145/3445814.3446697
NeuroEngine: A Hardware-Based Event-Driven Simulation System for Advanced Brain-Inspired Computing
Hunjun Lee, Chanmyeong Kim, Yujin Chung, and Jangwoo Kim
(Seoul National University, South Korea)
Publisher's Version Article: asplos21main-p593-p doi:10.1145/3445814.3446738
Defensive Approximation: Securing CNNs using Approximate Computing
Amira Guesmi, Ihsen Alouani, Khaled N. Khasawneh, Mouna Baklouti, Tarek Frikha, Mohamed Abid, and Nael Abu-Ghazaleh
(University of Sfax, Tunisia; Polytechnic University of Hauts-de-France, France; George Mason University, USA; University of California at Riverside, USA)
Publisher's Version Article: asplos21main-p898-p doi:10.1145/3445814.3446747

Session 24: Languages and Systems II

Language-Parametric Compiler Validation with Application to LLVM
Theodoros Kasampalis, Daejun Park, Zhengyao Lin, Vikram S. Adve, and Grigore Roşu
(University of Illinois at Urbana-Champaign, USA; Runtime Verification, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p949-p doi:10.1145/3445814.3446751
Incremental CFG Patching for Binary Rewriting
Xiaozhu Meng and Weijie Liu
(Rice University, USA; Indiana University at Bloomington, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p1687-p doi:10.1145/3445814.3446765
Who’s Debugging the Debuggers? Exposing Debug Information Bugs in Optimized Binaries
Giuseppe Antonio Di Luna, Davide Italiano, Luca Massarelli, Sebastian Österlund, Cristiano Giuffrida, and Leonardo Querzoni
(Sapienza University of Rome, Italy; Apple, USA; Vrije Universiteit Amsterdam, Netherlands)
Publisher's Version Artifacts Functional Results Reproduced Article: asplos21main-p58-p doi:10.1145/3445814.3446695

Session 25: Security II

Speculative Interference Attacks: Breaking Invisible Speculation Schemes
Mohammad Behnia, Prateek Sahu, Riccardo Paccagnella, Jiyong Yu, Zirui Neil Zhao, Xiang Zou, Thomas Unterluggauer, Josep Torrellas, Carlos Rozas, Adam Morrison, Frank Mckeen, Fangfei Liu, Ron Gabor, Christopher W. Fletcher, Abhishek Basak, and Alaa Alameldeen
(University of Illinois at Urbana-Champaign, USA; University of Texas at Austin, USA; Intel Corporation, USA; Tel Aviv University, Israel; Toga Networks, Israel; Simon Fraser University, Canada)
Publisher's Version Artifacts Functional Results Reproduced Article: asplos21main-p148-p doi:10.1145/3445814.3446708
Jamais Vu: Thwarting Microarchitectural Replay Attacks
Dimitrios Skarlatos, Zirui Neil Zhao, Riccardo Paccagnella, Christopher W. Fletcher, and Josep Torrellas
(University of Illinois at Urbana-Champaign, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p255-p doi:10.1145/3445814.3446716
Streamline: A Fast, Flushless Cache Covert-Channel Attack by Enabling Asynchronous Collusion
Gururaj Saileshwar, Christopher W. Fletcher, and Moinuddin Qureshi
(Georgia Institute of Technology, USA; University of Illinois at Urbana-Champaign, USA)
Publisher's Version Published Artifact Artifacts Available Artifacts Functional Results Reproduced Article: asplos21main-p744-p doi:10.1145/3445814.3446742

proc time: 0.16