About me

I am a second-year CS Ph.D. student at Stanford University, advised by Prof. Kunle Olukotun. My research interest is broadly in computer networks, distributed systems, and machine learning.

Previously, I obtained my bachelor’s degree from the University of Chicago, with a triple major in computer science, statistics, and mathematics. During my undergraduate years, I have been fortunate to work with Prof. Junchen Jiang and Prof. Ravi Netravali on video streaming and analytics. I have also interned at the Mathematics and Computer Science Division at Argonne National Laboratory.

The pronunciation of my first name (Qizheng) is very close to that of “keygen” in public key encryption. I also go by Alex.

Last updated: April 2023

You might be looking for…

For Spring 2024, the course webpage for CS 244 (Advanced Topics in Networking) is here. You might also want to check out Ed. Feel free to e-mail me or make an Ed post if you would like to chat.

  • For reading critiques, please send to cs244-spr2324-submit@lists.stanford.edu.
  • For project proposals and reports, please send to cs244-spr2324-staff@lists.stanford.edu. If you are looking for ideas on what paper to replicate for the course project, it could be very helpful if you check out this paper as it discussed what prior cohorts of CS 244 students did for their projects.

For Spring 2024, the Stanford systems reading group is Tuesday every week 3 - 4 pm at Gates 415. We read and discuss research papers in the general domain of systems. The webpage is here. Sign up for the mailing list here. We have free and high-quality boba for all participants, so please consider joining!

Publications

* indicates equivalent contribution

  • CacheGen: Fast Context Loading for Language Model Applications via KV Cache Streaming
    Yuhan Liu, Hanchen Li, Yihua Cheng, Siddhant Ray, Yuyang Huang, Qizheng Zhang, Kuntai Du, Jiayi Yao, Shan Lu, Ganesh Ananthanarayanan, Michael Maire, Henry Hoffmann, Ari Holtzman, Junchen Jiang
    Preprint [paper]

  • Caravan: Practical Online Learning of In-Network ML Models with Labeling Agents
    Qizheng Zhang, Ali Imran, Enkeleda Bardhi, Tushar Swamy, Muhammad Shahbaz, Kunle Olukotun
    OSDI 2024 (to appear)

  • The Dataflow Abstract Machine Simulator Framework
    Nathan Zhang, Rubens Lacouture, Gina Sohn, Paul Mure, Qizheng Zhang, Fredrik Kjolstad, Kunle Olukotun
    ISCA 2024 (to appear)

  • GRACE: Loss-Resilient Real-Time Video through Neural Codecs
    Yihua Cheng, Ziyi Zhang, Hanchen Li, Anton Arapin, Yue Zhang, Qizheng Zhang, Yuhan Liu, Kuntai Du, Xu Zhang, Francis Y. Yan, Amrita Mazumdar, Nick Feamster, Junchen Jiang
    NSDI 2024 (to appear) [paper]

  • OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation
    Kuntai Du, Yuhan Liu, Yitian Hao, Qizheng Zhang, Haodong Wang, Yuyang Huang, Ganesh Ananthanarayanan, Junchen Jiang
    SoCC 2023 [paper] [code]

  • Optimizing Real-Time Video Experience with Data Scalable Codec
    Hanchen Li*, Yihua Cheng*, Ziyi Zhang, Qizheng Zhang, Anton Arapin, Nick Feamster, Amrita Mazumdar
    SIGCOMM EMS Workshop 2023 [paper]

  • AccMPEG: Optimizing Video Encoding for Video Analytics
    Kuntai Du, Qizheng Zhang, Anton Arapin, Haodong Wang, Zhengxu Xia, Junchen Jiang
    MLSys 2022 [paper] [code]

  • Understanding the Potential of Server-Driven Edge Video Analytics
    Qizheng Zhang, Kuntai Du, Neil Agarwal, Ravi Netravali, Junchen Jiang
    HotMobile 2022 [paper] [code] [slides] [talk]

  • Server-Driven Video Streaming for Deep Learning Inference
    Kuntai Du*, Ahsan Pervaiz*, Xin Yuan, Aakanksha Chowdhery, Qizheng Zhang, Henry Hoffmann, Junchen Jiang
    SIGCOMM 2020 [paper] [code]