Zhongkai Yu

I am a second-year PhD student in the CSE department at UC San Diego (since 2024), working with Prof. Yufei Ding. Before coming to UCSD, I obtained my master’s degree from the Institute of Computing Technology, Chinese Academy of Sciences, under the supervision of Prof. Yunji Chen. Prior to that, I received my bachelor’s degree from Shanghai Jiao Tong University.

My research interests lie primarily in Computer Architecture, AI Accelerators, Machine Learning Systems, and AI for Chip Design.

Also, shout-out to my roommate Zaifeng, who is an expert in kernels, systems, and “cooking”. Check out his homepage and research.

Email: zhy055@ucsd.edu


Education

  • University of California, San Diego (UCSD), 2024 - Present
    Ph.D. student in Computer Science & Engineering
  • University of Chinese Academy of Sciences (UCAS), 2021 - 2024
    M.E. in Computer Technology
  • Shanghai Jiao Tong University (SJTU), 2017 - 2021
    B.S. in Physics (Zhiyuan Honors Program)


Experience

  • Samsung Semiconductor, 2024.6 - 2024.9
    Research Intern, AGI Lab
  • Cambricon Technologies, 2022.3 - 2023.11
    IC Design Intern, Architecture group


Selected Publications

  1. ArXiv
    Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting
    Zhongkai Yu, Yue Guan, Zihao Yu, Chenyang Zhou, Shuyi Pei, Yangwook Kang, Yufei Ding, and Po-An Tsai
    arXiv preprint arXiv:2510.05497, 2025
  2. OSDI’25
    KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads (To Appear)
    Yue Guan, Yuanwei Fang, Keren Zhou, Corbin Robeck, Manman Ren, Zhongkai Yu, and Yufei Ding
    In USENIX Symposium on Operating Systems Design and Implementation, 2025
  3. MICRO’24
    Cambricon-LLM: A Chiplet-Based Hybrid Architecture for On-Device Inference of 70B LLM
    Zhongkai Yu, Shengwen Liang, Tianyun Ma, Yunke Cai, Ziyuan Nan, Di Huang, Xinkai Song, Yifan Hao, Jie Zhang, Tian Zhi, Yongwei Zhao, Zidong Du, Xing Hu, Qi Guo, and Tianshi Chen
    In Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
  4. DAC’22
    E2SR: An End-to-End Video Codec Assisted System for Super Resolution Acceleration
    Zhuoran Song, Zhongkai Yu, Naifeng Jing, and Xiaoyao Liang
    In Proceedings of the 59th ACM/IEEE Design Automation Conference, 2022