Publications
[USENIX ATC’2024] Zheng Wang, Yuke Wang, Boyuan Feng, Guyue Huang, Dheevatsa Mudigere, Bharath Muthiah, Ang Li, Yufei Ding. OPER: Optimality-Guided Embedding Table Parallelization for Large-scale Recommendation Model. [Link]
[ASPLOS’2024] Zheng Wang, Yuke Wang, Jiaqi Deng, Da Zheng, Ang Li, Yufei Ding. RAP: Resource-aware Automated GPU Sharing for Multi-GPU Recommendation Model Training and Input Preprocessing. [Link]
[ASPLOS’2024] Boyuan Feng, Zheng Wang, Yuke Wang, Shu Yang, Yufei Ding. ZENO: A Type-based Optimization Framework for Zero-Knowledge Neural Network Inference. [Link]
[USENIX ATC’2023] Yuke Wang, Boyuan Feng, Zheng Wang, Guyue Huang, Yufei Ding. TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs. [Link]
[OSDI’2023] Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Ang Li, Kevin Barker, Yufei Ding. MGG: Accelerating Graph Neural Networks with Fine-Grained Intra-Kernel Communication-Computation Pipelining on Multi-GPU Platforms. [Link]
[ISCA’2023] Siqi Li, Fengbin Tu, Liu Liu, Jilan Lin, Zheng Wang, Yangwook Kang, Yufei Ding, Yuan Xie. ECSSD: Hardware/Data Layout Co-Designed In-Storage-Computing Architecture for Extreme Classification. [Link]
[SC’2022] Zheng Wang, Yuke Wang, Boyuan Feng, Dheevatsa Mudigere, Bharath Muthiah, Yufei Ding. EL-Rec: Efficient Large-scale Recommendation Model Training via Tensor-train Embedding Table. [Link]
[Preprint] Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Ang Li, Yufei Ding. GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU Spatial Multiplexing. [Link]
[USENIX ATC’2022] Boyuan Feng, Tianqi Tang, Yuke Wang, Zhaodong Chen, Zheng Wang, Shu Yang, Yuan Xie, Yufei Ding. Faith: An Efficient Framework for Transformer Verification on GPUs. [Link]
[AAAI’2021] Boyuan Feng, Yuke Wang, Zheng Wang, Yufei Ding. UAG: Uncertainty-aware Atten- tion Graph Neural Network for Defending Adversarial Attacks. [Link]