About Me

I am a second-year CS Ph.D. student at the Hong Kong University of Science and Technology (Guangzhou), advised by Prof. Xiaowen Chu (primary) and Xinyu Chen. I received my B.S. from Peking University, where I was advised by Prof. Jie Zhang and Chuanfu Xiao.

I am interested in high-performance computing (HPC), especially GPU computing, and in making machine learning systems more efficient (MLSys). If you are interested in my research, please feel free to contact me! Email

MBTI: ESTJ or ENTJ

News

  • [2025.5] Our paper “BurstGPT: A Real-World Workload Dataset to Optimize LLM Serving Systems” was accepted to the KDD D&B track. Congratulations to Yuxin and Yuhan.
  • [2025.4] Our paper “SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs” won the EuroSys 2025 Best Paper Award. Congratulations to Ruibo and all authors.
  • [2025.3] Our paper “HOSCF: Efficient decoupling algorithms for finding the best rank-one approximation of higher-order tensors” was accepted by SIMAX. Congratulations to Chuanfu and all authors.
  • [2024.3] We released BurstGPT (Dataset, Paper). As far as we know, this is the first real trace of a chatbot using a GPT server. Feel free to try it!
  • [2023.12] We collected and released Awesome_LLM_Accelerate-PaperList, a list of papers on accelerating LLMs. It currently focuses mainly on inference acceleration, and related work will be added gradually.

Publications

  • (EuroSys 2025) SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
    Ruibo Fan, Xiangrui Yu, Peijie Dong, Zeyu Li, Gu Gong, Qiang Wang, Wei Wang, Xiaowen Chu

  • (SIMAX) HOSCF: Efficient decoupling algorithms for finding the best rank-one approximation of higher-order tensors
    Chuanfu Xiao, Zeyu Li, Chao Yang

  • (IPDPS 2024) Benchmarking and Dissecting the Nvidia Hopper GPU Architecture
    Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Qiang Wang, Xiaowen Chu

  • (J. Chem. Phys. Aug 2023) DeePMD-kit v2: A software package for Deep Potential models
    Jinzhe Zeng, Duo Zhang, Denghui Lu, Pinghui Mo, Zeyu Li, …, Weinan E, Roberto Car, Linfeng Zhang, Han Wang

Academic Services

2025: OSDI, ATC
2024: MLSys, NeurIPS, IPDPS, TPDS, TOCS, Computing Surveys

Experiences

  • Peking University
    Undergrad Research (advisor: Prof. Jie Zhang)
    Sept. 2022 – Jun. 2023

  • Peking University
    Research Intern (advisor: Prof. Chao Yang and Dr. Chuanfu Xiao)
    Jan. 2021 – Jun. 2023

  • Microsoft Research Asia
    System and Networking Intern (advisor: Dr. Yang Wang)
    Sept. 2024 – Feb. 2025

  • Taichi Graphics
    Intern (advisor: Dr. Haidong Lan)
    Mar. 2022 – Jun. 2023

  • DP Technology
    Intern (advisor: Mr. Denghui Lu)
    Mar. 2021 – Aug. 2021

Selected Awards

  • The First Prize of ASC 20~21 (Student Supercomputer Challenge) (Top 1%)
  • The Second Prize of Peking University Scholarship (Top 5%), 2021
  • Merit Student of Peking University (Top 15%), 2021
  • Outstanding Graduate of Yuanpei College (Top 15%), 2023
  • Chinese Chemistry Olympiad (First Prize, Rank 6 in China), 2017

Last updated: 2025/5/11 (y/m/d)