About Me
I am a second-year CS Ph.D. at Hong Kong University of Science and Technology(Guang Zhou) under the instruction from Prof. Xiaowen Chu(primary) and Xinyu Chen. I received my B.S. at Peking University advised by Prof. Jie Zhang and Chuanfu Xiao.
I am interested in high performance computing(HPC), especially GPU computing and making machine learning system more efficient(MLSys). If you are interested in my research, please feel free to contact me! Email
MBTI: ESTJ or ENTJ
News
- [2025.5] Our paper “BurstGPT: A Real-World Workload Dataset to Optimize LLM Serving Systems” is accepted by KDD D&B track. Congratulations to Yuxin and Yuhan.
- [2025.4] Our paper “SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs” awards Eurosys 2025 Best Paper. Congratulations to Ruibo and all authors.
- [2025.3] Our paper “HOSCF: Efficient decoupling algorithms for finding the best rank-one approximation of higher-order tensors” is accepted by SIMAX. Congratulations to Chuanfu and all authors.
- [2024.3] We release the BurstGPT(Dataset, Paper). As far as we know, this is the first real trace of chatbot using GPT server. Feel free to try!
- [2023.12] Collect and release the Awesome_LLM_Accelerate-PaperList. It is a list of papers on accelerating LLMs, currently focusing mainly on inference acceleration, and related works will be gradually added in the future.
Publications
(Eurosys 2025) SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
Ruibo Fan, Xiangrui Yu, Peijie Dong, Zeyu Li, Gu Gong, Qiang Wang, Wei Wang, Xiaowen Chu(SIMAX) HOSCF: Efficient decoupling algorithms for finding the best rank-one approximation of higher-order tensors
Chuanfu Xiao, Zeyu Li, Chao Yang(IPDPS 2024)Benchmarking and Dissecting the Nvidia Hopper GPU Architecture
Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Qiang Wang, Xiaowen Chu(J. Chem. Phys. Aug 2023) DeePMD-kit v2: A software package for Deep Potential models
Jinzhe Zeng, Duo Zhang, Denghui Lu, Pinghui Mo, Zeyu Li, …, Weinan E, Roberto Car, Linfeng Zhang, Han Wang
Academic Services
2025: OSDI, ATC
2024: MLsys, Neurips, IPDPS, TPDS, TOCS, Computing survey
Experiences
Peking University
Undergrad Research (advisor: Prof. Jie Zhang)
sept. 2022 – Jun. 2023Peking University
Research Intern(advisor: Prof. Chao Yang and Dr. Chuanfu Xiao)
Jan. 2021 – Jun. 2023Microsoft research asis
System and Networking Intern (advisor: Dr. Yang Wang)
Sept. 2024 - Feb. 2025Taichi Graphics
Intern (advisor: Dr. Haidong Lan)
Mar. 2022 - Jun. 2023DP Technology
Intern (advisor: Mr. Denghui Lu)
Mar. 2021 - Aug. 2021
Selected Awards
- The First Prize of ASC 20~21 (Student Supercomputer Challenge) (Top 1%)
- The Second Prize of Peking University Scholarship (Top 5%), 2021
- Merit Student of Peking University (Top 15%), 2021
- Outstanding graduate of Yuanpei College (Top 15%), 2023
- Chinese Chemistry Olympiad (First Prize, Rank 6 in China), 2017
Update data: 2025/5/11(y/m/d)