PPoPP 2026
31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP 2026)
Powered by
Conference Publishing Consulting

31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP 2026), January 31 – February 4, 2026, Sydney, NSW, Australia

PPoPP 2026 – Author Index

Contents - Abstracts - Authors

A B C D F G H J K L M N P Q R S T W X Y Z

Agrawal, Kunal PPoPP '26: "Waste-Efficient Work Stealing ..."
Arovi, Md Amit Hasan PPoPP '26: "Fixing Non-blocking Data Structures ..."
Beltran, Vicenç PPoPP '26: "Rethinking Thread Scheduling ..."
Bi, Jun PPoPP '26: "FlashAttention-T: Towards ..."
Bian, Haodong PPoPP '26: "PANA: A Fine-Grained Runtime-Adaptive ..."
Blelloch, Guy E. PPoPP '26: "PIM-zd-tree: A Fast Space-Partitioning ..."
Brown, Trevor PPoPP '26: "Multiverse: Transactional ..."
Cai, Yanxin PPoPP '26: "Pipelonk: Accelerating End-to-End ..."
Cao, Hongliang PPoPP '26: "ElasGNN: An Elastic Training ..."
Cao, Huanqi PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Cao, Qianwen PPoPP '26: "Accelerating Sparse Transformer ..."
Cao, Yuchao PPoPP '26: "Cacheman: A Comprehensive ..."
Casas, Marc PPoPP '26: "Characterizing Matrix Multiplication ..." PPoPP '26: "DiggerBees: Depth First Search ..."
Chen, Feiyang PPoPP '26: "MetaAttention: A Unified and ..."
Chen, Haibo PPoPP '26: "MetaAttention: A Unified and ..."
Chen, Jiayu PPoPP '26: "ASM-SpMM: Unleashing the Potential ..."
Chen, Tianshi PPoPP '26: "FlashAttention-T: Towards ..."
Chen, Xu PPoPP '26: "Laser: Unlocking Layer-Level ..."
Chen, Yidong PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Chen, Yifan PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Chen, Youmin PPoPP '26: "zBuffer: Zero-Copy and Metadata-Free ..."
Chen, YuAng PPoPP '26: "High-Throughput Non-uniformly ..."
Chen, Yuhan PPoPP '26: "ROME: Maximizing GPU Efficiency ..."
Chen, Yunji PPoPP '26: "FlashAttention-T: Towards ..."
Chen, Zhuohan PPoPP '26: "COCCL: A Collective Communication ..."
Chen, Zizhong PPoPP '26: "COCCL: A Collective Communication ..."
Cheng, Dazhao PPoPP '26: "JanusQuant: Accurate and Efficient ..."
Cheng, Sanchuan PPoPP '26: "Cacheman: A Comprehensive ..."
Cheng, Shenggan PPoPP '26: "HelixPipe: Efficient Distributed ..."
Cheng, Yu PPoPP '26: "MetaAttention: A Unified and ..."
Chi, Xuebin PPoPP '26: "TAC: Cache-Based System for ..."
Chu, Xiaowen PPoPP '26: "ROME: Maximizing GPU Efficiency ..."
Coccimiglio, Gaetano PPoPP '26: "Multiverse: Transactional ..."
Cui, Bin PPoPP '26: "Elastor: Elastic and Efficient ..."
Dai, Shengdong PPoPP '26: "Cacheman: A Comprehensive ..."
Dai, Wenhao PPoPP '26: "Accelerating Sparse Transformer ..."
De Man, Quinten PPoPP '26: "UFO Trees: Practical and Provably-Efficient ..."
Deng, Haodong PPoPP '26: "Accelerating Sparse Transformer ..."
Dhulipala, Laxman PPoPP '26: "UFO Trees: Practical and Provably-Efficient ..." PPoPP '26: "PIM-zd-tree: A Fast Space-Partitioning ..."
Di, Peng PPoPP '26: "TAC: Cache-Based System for ..."
Dice, Dave PPoPP '26: "Hapax Locks: Scalable Value-Based ..."
Dong, Dezun PPoPP '26: "A Diagonal Block Memory-Aware ..."
Dong, Dong PPoPP '26: "ChituDiffusion: A Data-Characteristic-Aware ..."
Dong, Quanxing PPoPP '26: "Laser: Unlocking Layer-Level ..."
Du, Yang PPoPP '26: "Trojan Horse: Aggregate-and-Batch ..."
Duan, Xiaohui PPoPP '26: "HierCut: Enabling 16-bit Format ..."
Fan, Ruibo PPoPP '26: "ROME: Maximizing GPU Efficiency ..."
Fatourou, Panagiota PPoPP '26: "Sharded Elimination and Combining ..." PPoPP '26: "Concurrent Balanced Augmented ..."
Fei, Xiang PPoPP '26: "PANA: A Fine-Grained Runtime-Adaptive ..."
Feng, Tianyu PPoPP '26: "Exploiting Efficient Mapping ..."
Fu, Fangcheng PPoPP '26: "Elastor: Elastic and Efficient ..."
Fu, Jianhao PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Fu, Jiayu PPoPP '26: "HierCut: Enabling 16-bit Format ..."
Gai, Shun PPoPP '26: "zBuffer: Zero-Copy and Metadata-Free ..."
Gan, Lin PPoPP '26: "HierCut: Enabling 16-bit Format ..."
Gao, Hongyu PPoPP '26: "TAC: Cache-Based System for ..."
Ge, Hao PPoPP '26: "Elastor: Elastic and Efficient ..."
Gibbons, Phillip B. PPoPP '26: "PIM-zd-tree: A Fast Space-Partitioning ..."
Gowda, Kishen N PPoPP '26: "UFO Trees: Practical and Provably-Efficient ..."
Gu, Junyu PPoPP '26: "TAC: Cache-Based System for ..."
Gu, Lin PPoPP '26: "DTMiner: A Data-Centric System ..."
Gu, Qiqi PPoPP '26: "SPIDER: Unleashing Sparse ..."
Gu, Yan PPoPP '26: "Parallel Dynamic Spatial Indexes ..." PPoPP '26: "PIM-zd-tree: A Fast Space-Partitioning ..."
Gu, Yida PPoPP '26: "PRISM: An Efficient GPU-Based ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Guan, Haibing PPoPP '26: "BEEMS: Boosting Machine Vision ..."
Guan, Naixuan PPoPP '26: "Cacheman: A Comprehensive ..."
Guo, Qi PPoPP '26: "FlashAttention-T: Towards ..."
Han, Ruobing PPoPP '26: "Scaling GPU-to-CPU Migration ..."
He, Ligang PPoPP '26: "DTMiner: A Data-Centric System ..."
Hou, Yinbo PPoPP '26: "DTMiner: A Data-Centric System ..."
Hu, Xiaokang PPoPP '26: "Cacheman: A Comprehensive ..."
Huang, Bo PPoPP '26: "Parallel Dynamic Spatial Indexes ..."
Huang, Dan PPoPP '26: "ASM-SpMM: Unleashing the Potential ..."
Huang, Jianqiang PPoPP '26: "PANA: A Fine-Grained Runtime-Adaptive ..."
Huang, Kezhao PPoPP '26: "ChituDiffusion: A Data-Characteristic-Aware ..."
Huang, Shuhong PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Huang, Wenjing PPoPP '26: "PRISM: An Efficient GPU-Based ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Jannesari, Ali PPoPP '26: "Dynamic Detection of Inefficient ..."
Jayanti, Siddhartha PPoPP '26: "Concurrent Balanced Augmented ..."
Ji, Zhuoran PPoPP '26: "Pipelonk: Accelerating End-to-End ..."
Jia, Haipeng PPoPP '26: "A Diagonal Block Memory-Aware ..."
Jiang, Chao PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Jiang, Jiazhi PPoPP '26: "ASM-SpMM: Unleashing the Potential ..."
Jiang, Li PPoPP '26: "BEEMS: Boosting Machine Vision ..."
Jiang, Wenbin PPoPP '26: "DTMiner: A Data-Centric System ..."
Jin, Hai PPoPP '26: "DTMiner: A Data-Centric System ..."
Jin, Zhou PPoPP '26: "Trojan Horse: Aggregate-and-Batch ..."
Ju, Lei PPoPP '26: "Pipelonk: Accelerating End-to-End ..."
Kang, Hongbo PPoPP '26: "PIM-zd-tree: A Fast Space-Partitioning ..."
Kim, Hyesoon PPoPP '26: "Scaling GPU-to-CPU Migration ..."
Kogan, Alex PPoPP '26: "Hapax Locks: Scalable Value-Based ..."
Kong, Haoran PPoPP '26: "COCCL: A Collective Communication ..."
Lei, Kinman PPoPP '26: "RoMeo: Mitigating Dual-dimensional ..."
Li, Guangzhao PPoPP '26: "HierCut: Enabling 16-bit Format ..."
Li, Haoxu PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Li, Haoyang PPoPP '26: "Elastor: Elastic and Efficient ..."
Li, Huiba PPoPP '26: "zBuffer: Zero-Copy and Metadata-Free ..."
Li, Ling PPoPP '26: "FlashAttention-T: Towards ..."
Li, Shengguo PPoPP '26: "A Diagonal Block Memory-Aware ..."
Li, Sian PPoPP '26: "TAC: Cache-Based System for ..."
Li, Wei PPoPP '26: "FlashAttention-T: Towards ..."
Li, Yang PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Li, Yewen PPoPP '26: "Faster and Cheaper: Pushing ..."
Li, Yida PPoPP '26: "Trojan Horse: Aggregate-and-Batch ..."
Li, Yuchen PPoPP '26: "VDHA: Vector-Driven Hash Aggregation ..."
Li, Zhengrui PPoPP '26: "HierCut: Enabling 16-bit Format ..."
Liang, Yunkai PPoPP '26: "Laser: Unlocking Layer-Level ..."
Liang, Zhiqiang PPoPP '26: "TAC: Cache-Based System for ..."
Liao, Jianxiong PPoPP '26: "Laser: Unlocking Layer-Level ..."
Liao, Xiaofei PPoPP '26: "DTMiner: A Data-Centric System ..."
Lin, Longlong PPoPP '26: "DTMiner: A Data-Centric System ..."
Lin, Sheng PPoPP '26: "Elastor: Elastic and Efficient ..."
Liu, Fang PPoPP '26: "TAC: Cache-Based System for ..."
Liu, Fangxin PPoPP '26: "BEEMS: Boosting Machine Vision ..." PPoPP '26: "Accelerating Sparse Transformer ..."
Liu, Hongyu PPoPP '26: "Accelerating Sparse Transformer ..."
Liu, Hongyuan PPoPP '26: "ROME: Maximizing GPU Efficiency ..."
Liu, Jian PPoPP '26: "BEEMS: Boosting Machine Vision ..."
Liu, Jie PPoPP '26: "A Diagonal Block Memory-Aware ..."
Liu, Jinyang PPoPP '26: "PRISM: An Efficient GPU-Based ..."
Liu, Man PPoPP '26: "COCCL: A Collective Communication ..."
Liu, Weifeng PPoPP '26: "Trojan Horse: Aggregate-and-Batch ..." PPoPP '26: "Characterizing Matrix Multiplication ..." PPoPP '26: "DiggerBees: Depth First Search ..."
Liu, Xiangyu PPoPP '26: "zBuffer: Zero-Copy and Metadata-Free ..."
Liu, Xingchen PPoPP '26: "COCCL: A Collective Communication ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Liu, Yi PPoPP '26: "Exploiting Efficient Mapping ..." PPoPP '26: "ElasGNN: An Elastic Training ..."
Liu, Zedong PPoPP '26: "PRISM: An Efficient GPU-Based ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Liu, Ziming PPoPP '26: "HelixPipe: Efficient Distributed ..."
Lu, Bing PPoPP '26: "PRISM: An Efficient GPU-Based ..."
Lu, Yuechen PPoPP '26: "Characterizing Matrix Multiplication ..." PPoPP '26: "DiggerBees: Depth First Search ..."
Lu, Yutong PPoPP '26: "ASM-SpMM: Unleashing the Potential ..."
Luan, Zhongzhi PPoPP '26: "Exploiting Efficient Mapping ..." PPoPP '26: "ElasGNN: An Elastic Training ..."
Luo, Ben PPoPP '26: "Cacheman: A Comprehensive ..."
Luo, Dejun PPoPP '26: "PRISM: An Efficient GPU-Based ..."
Luo, Weile PPoPP '26: "ROME: Maximizing GPU Efficiency ..."
Lyu, Shengkai PPoPP '26: "COCCL: A Collective Communication ..."
Ma, Kejie PPoPP '26: "APERTURE: Algorithm-System ..."
Ma, Lingxiao PPoPP '26: "MetaAttention: A Unified and ..."
Ma, Yuchen PPoPP '26: "A Distributed Matrix-Block-Vector ..."
Ma, Zixuan PPoPP '26: "ChituDiffusion: A Data-Characteristic-Aware ..."
Marzen, Luke PPoPP '26: "Dynamic Detection of Inefficient ..."
McGuffey, Charles PPoPP '26: "PIM-zd-tree: A Fast Space-Partitioning ..."
Men, Ziyang PPoPP '26: "Parallel Dynamic Spatial Indexes ..." PPoPP '26: "PIM-zd-tree: A Fast Space-Partitioning ..."
Meng, Ke PPoPP '26: "Faster and Cheaper: Pushing ..."
Metaxakis, Nikos PPoPP '26: "Sharded Elimination and Combining ..."
Miao, Ziming PPoPP '26: "MetaAttention: A Unified and ..."
Ni, Yuhui PPoPP '26: "A Diagonal Block Memory-Aware ..."
Nikolaev, Ruslan PPoPP '26: "Fixing Non-blocking Data Structures ..."
Niu, Jiawen PPoPP '26: "Elastor: Elastic and Efficient ..."
Niu, Yiduo PPoPP '26: "Trojan Horse: Aggregate-and-Batch ..."
Niu, Yuyao PPoPP '26: "DiggerBees: Depth First Search ..."
Pan, Zhe PPoPP '26: "Root-Down Exposure for Maximal ..." PPoPP '26: "VDHA: Vector-Driven Hash Aggregation ..."
Qi, Hao PPoPP '26: "DTMiner: A Data-Centric System ..."
Qian, Depei PPoPP '26: "Exploiting Efficient Mapping ..." PPoPP '26: "ElasGNN: An Elastic Training ..." PPoPP '26: "APERTURE: Algorithm-System ..."
Qiu, Fudong PPoPP '26: "Cacheman: A Comprehensive ..."
Qiu, Xishi PPoPP '26: "Cacheman: A Comprehensive ..."
Qu, Peng PPoPP '26: "Root-Down Exposure for Maximal ..." PPoPP '26: "VDHA: Vector-Driven Hash Aggregation ..."
Ravi, Srivatsan PPoPP '26: "Multiverse: Transactional ..."
Ren, Bin PPoPP '26: "A Distributed Matrix-Block-Vector ..."
Roca, Aleix PPoPP '26: "Rethinking Thread Scheduling ..."
Roh, Younghun PPoPP '26: "Concurrent Balanced Augmented ..."
Rong, Mengfei PPoPP '26: "Accelerating Sparse Transformer ..."
Ruppert, Eric PPoPP '26: "Concurrent Balanced Augmented ..."
Schardl, Tao B. PPoPP '26: "Waste-Efficient Work Stealing ..."
Sharma, Atharva PPoPP '26: "UFO Trees: Practical and Provably-Efficient ..."
Shen, Hanjing PPoPP '26: "BEEMS: Boosting Machine Vision ..."
Shen, Yibin PPoPP '26: "Cacheman: A Comprehensive ..."
Shi, Heng PPoPP '26: "SPIDER: Unleashing Sparse ..."
Shi, Lu PPoPP '26: "Towards Singular Value Decomposition ..."
Shi, Xingguo PPoPP '26: "TAC: Cache-Based System for ..."
Shim, Junhyung PPoPP '26: "Dynamic Detection of Inefficient ..."
Singer, Kyle PPoPP '26: "Waste-Efficient Work Stealing ..."
Singh, Ajay PPoPP '26: "Sharded Elimination and Combining ..." PPoPP '26: "Concurrent Balanced Augmented ..."
Song, Zeyu PPoPP '26: "HierCut: Enabling 16-bit Format ..."
Stathopoulos, Andreas PPoPP '26: "A Distributed Matrix-Block-Vector ..."
Sun, Chengyu PPoPP '26: "JanusQuant: Accurate and Efficient ..."
Sun, Desen PPoPP '26: "MixFusion: A Patch-Level Parallel ..."
Sun, Qingxiao PPoPP '26: "Trojan Horse: Aggregate-and-Batch ..." PPoPP '26: "APERTURE: Algorithm-System ..." PPoPP '26: "Accelerating Sparse Transformer ..."
Sun, Yihan PPoPP '26: "Parallel Dynamic Spatial Indexes ..."
Sun, Zhenhang PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Tan, Guangming PPoPP '26: "PRISM: An Efficient GPU-Based ..." PPoPP '26: "COCCL: A Collective Communication ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..." PPoPP '26: "Faster and Cheaper: Pushing ..."
Tang, Lei PPoPP '26: "TAC: Cache-Based System for ..."
Tang, MingLiang PPoPP '26: "RoMeo: Mitigating Dual-dimensional ..."
Tang, Ruibai PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Tang, Shizhi PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Tao, Dingwen PPoPP '26: "PRISM: An Efficient GPU-Based ..." PPoPP '26: "COCCL: A Collective Communication ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Tian, Xingjian PPoPP '26: "COCCL: A Collective Communication ..."
Tian, Yang PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Wang, Fakang PPoPP '26: "COCCL: A Collective Communication ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Wang, Haojie PPoPP '26: "ChituDiffusion: A Data-Characteristic-Aware ..."
Wang, Hulin PPoPP '26: "JanusQuant: Accurate and Efficient ..."
Wang, Jue PPoPP '26: "TAC: Cache-Based System for ..."
Wang, Lei PPoPP '26: "MetaAttention: A Unified and ..."
Wang, Pengbo PPoPP '26: "ElasGNN: An Elastic Training ..."
Wang, Qiang PPoPP '26: "ROME: Maximizing GPU Efficiency ..."
Wang, Siqi PPoPP '26: "ElasGNN: An Elastic Training ..."
Wang, Tao PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Wang, Xiaoying PPoPP '26: "PANA: A Fine-Grained Runtime-Adaptive ..."
Wang, Xuanyu PPoPP '26: "Elastor: Elastic and Efficient ..."
Wang, Xuezhu PPoPP '26: "ElasGNN: An Elastic Training ..."
Wang, Yangang PPoPP '26: "TAC: Cache-Based System for ..."
Wang, Yi PPoPP '26: "Pipelonk: Accelerating End-to-End ..."
Wang, Yinuo PPoPP '26: "HierCut: Enabling 16-bit Format ..."
Wang, Yiqing PPoPP '26: "APERTURE: Algorithm-System ..."
Wang, Yuke PPoPP '26: "MixFusion: A Patch-Level Parallel ..."
Wang, Zhan PPoPP '26: "COCCL: A Collective Communication ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Wang, Zhuo PPoPP '26: "Binary Compatible Critical ..."
Wei, Jinhui PPoPP '26: "ASM-SpMM: Unleashing the Potential ..."
Wei, Xingda PPoPP '26: "MetaAttention: A Unified and ..."
Wei, Yuanhao PPoPP '26: "Concurrent Balanced Augmented ..."
Wei, Zheng PPoPP '26: "COCCL: A Collective Communication ..."
Wen, Yuan PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Wen, Yuanbo PPoPP '26: "FlashAttention-T: Towards ..."
Wrench, Evan PPoPP '26: "Concurrent Balanced Augmented ..."
Wu, Chengzhang PPoPP '26: "ChituDiffusion: A Data-Characteristic-Aware ..."
Wu, Chenpeng PPoPP '26: "SPIDER: Unleashing Sparse ..."
Wu, Jiesheng PPoPP '26: "Cacheman: A Comprehensive ..."
Wu, Xueyu PPoPP '26: "Pipelonk: Accelerating End-to-End ..."
Wu, Yifan PPoPP '26: "Cacheman: A Comprehensive ..."
Xia, Yaqi PPoPP '26: "JanusQuant: Accurate and Efficient ..."
Xia, Yuqing PPoPP '26: "MetaAttention: A Unified and ..."
Xiao, Limin PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Xie, Chenhao PPoPP '26: "APERTURE: Algorithm-System ..."
Xu, Chuanfu PPoPP '26: "A Diagonal Block Memory-Aware ..."
Xu, Guanglin PPoPP '26: "FlashAttention-T: Towards ..."
Xu, Jianxing PPoPP '26: "FlashAttention-T: Towards ..."
Xu, Ruibai PPoPP '26: "FlashAttention-T: Towards ..."
Xu, WeiWei PPoPP '26: "Towards Singular Value Decomposition ..."
Xu, Yufan PPoPP '26: "Exploiting Efficient Mapping ..." PPoPP '26: "ElasGNN: An Elastic Training ..."
Xue, Jilong PPoPP '26: "MetaAttention: A Unified and ..."
Yang, Donglin PPoPP '26: "JanusQuant: Accurate and Efficient ..."
Yang, Fan PPoPP '26: "MetaAttention: A Unified and ..."
Yang, Guangwen PPoPP '26: "HierCut: Enabling 16-bit Format ..."
Yang, Hailong PPoPP '26: "Exploiting Efficient Mapping ..." PPoPP '26: "ElasGNN: An Elastic Training ..." PPoPP '26: "APERTURE: Algorithm-System ..." PPoPP '26: "Accelerating Sparse Transformer ..."
Yang, Jinwu PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Yang, Mao PPoPP '26: "MetaAttention: A Unified and ..."
Yang, Xiaojian PPoPP '26: "A Diagonal Block Memory-Aware ..."
Yang, Xinyu PPoPP '26: "Accelerating Sparse Transformer ..."
Yang, Zhi PPoPP '26: "MetaAttention: A Unified and ..."
Yao, Jianguo PPoPP '26: "SPIDER: Unleashing Sparse ..."
Yao, Xijia PPoPP '26: "ASM-SpMM: Unleashing the Potential ..."
Yin, Wenhao PPoPP '26: "Pipelonk: Accelerating End-to-End ..."
You, Xin PPoPP '26: "Exploiting Efficient Mapping ..."
You, Yang PPoPP '26: "HelixPipe: Efficient Distributed ..."
Yu, Enze PPoPP '26: "APERTURE: Algorithm-System ..."
Yu, Feng PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Yu, Hui PPoPP '26: "DTMiner: A Data-Centric System ..."
Yu, Jeffrey Xu PPoPP '26: "High-Throughput Non-uniformly ..."
Yu, Jiping PPoPP '26: "ParDiff: Efficiently Parallelizing ..."
Yu, Xiangrui PPoPP '26: "ROME: Maximizing GPU Efficiency ..."
Yuan, Fan PPoPP '26: "A Diagonal Block Memory-Aware ..."
Zeng, Hongwei PPoPP '26: "Characterizing Matrix Multiplication ..."
Zeng, Wenqi PPoPP '26: "High-Throughput Non-uniformly ..."
Zhai, Jidong PPoPP '26: "RoMeo: Mitigating Dual-dimensional ..." PPoPP '26: "ParDiff: Efficiently Parallelizing ..." PPoPP '26: "ChituDiffusion: A Data-Characteristic-Aware ..."
Zhai, Mingshu PPoPP '26: "RoMeo: Mitigating Dual-dimensional ..."
Zhang, Chunming PPoPP '26: "Faster and Cheaper: Pushing ..."
Zhang, Geng PPoPP '26: "HelixPipe: Efficient Distributed ..."
Zhang, Junyao PPoPP '26: "Binary Compatible Critical ..."
Zhang, Kaige PPoPP '26: "Exploiting Efficient Mapping ..." PPoPP '26: "APERTURE: Algorithm-System ..."
Zhang, Qianyu PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Zhang, Qihao PPoPP '26: "RoMeo: Mitigating Dual-dimensional ..."
Zhang, Rui PPoPP '26: "FlashAttention-T: Towards ..."
Zhang, Shaoshuai PPoPP '26: "Towards Singular Value Decomposition ..."
Zhang, Siwei PPoPP '26: "Trojan Horse: Aggregate-and-Batch ..."
Zhang, Yiming PPoPP '26: "zBuffer: Zero-Copy and Metadata-Free ..."
Zhang, Youhui PPoPP '26: "Root-Down Exposure for Maximal ..." PPoPP '26: "VDHA: Vector-Driven Hash Aggregation ..." PPoPP '26: "PANA: A Fine-Grained Runtime-Adaptive ..."
Zhang, Yu PPoPP '26: "DTMiner: A Data-Centric System ..."
Zhang, Zhiyuan PPoPP '26: "Pipelonk: Accelerating End-to-End ..."
Zhang, Zhonghai PPoPP '26: "Faster and Cheaper: Pushing ..."
Zhao, Hairui PPoPP '26: "PRISM: An Efficient GPU-Based ..." PPoPP '26: "COCCL: A Collective Communication ..." PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Zhao, Jin PPoPP '26: "DTMiner: A Data-Centric System ..."
Zhao, Lian PPoPP '26: "TAC: Cache-Based System for ..."
Zhao, Liyang PPoPP '26: "COCCL: A Collective Communication ..."
Zhao, Qian PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Zhao, Xuanlei PPoPP '26: "HelixPipe: Efficient Distributed ..."
Zhao, Yiwei PPoPP '26: "PIM-zd-tree: A Fast Space-Partitioning ..."
Zhao, Zepeng PPoPP '26: "MixFusion: A Patch-Level Parallel ..."
Zheng, Liyan PPoPP '26: "ChituDiffusion: A Data-Characteristic-Aware ..."
Zhou, Chunbao PPoPP '26: "TAC: Cache-Based System for ..."
Zhou, Xiaobo PPoPP '26: "JanusQuant: Accurate and Efficient ..."
Zhou, Yueyuan PPoPP '26: "CCL-D: A High-Precision Diagnostic ..."
Zhou, Zhe PPoPP '26: "Binary Compatible Critical ..."
Zhou, Zhi PPoPP '26: "Laser: Unlocking Layer-Level ..."

321 authors

proc time: 9.47