Home

Hello! I'm Yizhou Shan (单一舟), a Research Scientist at Huawei Cloud. I earned my PhD in CSE from the University of California San Diego, under the supervision of Prof. Yiying Zhang.

I now run the Serverless AI platform at Huawei Cloud, where I am responsible for cost-efficient Model Serving (LLM, LMM, T2I, T2V, etc.), Agent Serving, and Post-Training infrastructure. If you are interested in working with me (full-time or intern), we should talk.

Contact: syzwhat AT gmail DOT com. You can find my CV here.

Blogging

Latest

Hot

Research

My main research interests span machine learning systems, distributed systems, data center networking, OS, hardware (FPGA), disaggregated memory/storage systems, and their intersections.

Serving LLMs at Cloud Scale

Disaggregated Data Center Architecture

Disaggregated Memory

Networking Design

Publications

  1. CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference
    Suyi Li, Hanfeng Lu, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, Wei Wang
    [Preprint] [Code]
  2. Inference without Interference: Disaggregate LLM Inference for Mixed Downstream Workloads
    Cunchen Hu, Heyang Huang, Liangliang Xu, Xusheng Chen, Jiang Xu, Shuang Chen, Hao Feng, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan
    [Preprint] [Code]
  3. Optimizing Hardware-Based Network Computation DAGs for Multiple Tenants with SuperNIC
    Yizhou Shan, Will Lin, Ryan Kosta, Arvind Krishnamurthy, Yiying Zhang
    [Preprint] [Code]
  4. Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers
    Cunchen Hu, Chenxi Wang, Sa Wang, Ninghui Sun, Yungang Bao, Jieru Zhao, Sanidhya Kashyap, Pengfei Zuo, Xusheng Chen, Liangliang Xu, Qin Zhang, Hao Feng, Yizhou Shan
    HotOS 2023 [Paper]
  5. Core slicing: closing the gap between leaky confidential VMs and bare-metal cloud
    Ziqiao Zhou, Yizhou Shan, Weidong Cui, Xinyang Ge, Marcus Peinado, Andrew Baumann
    OSDI 2023 [Paper]
  6. MARB: Bridge the Semantic Gap between Operating System and Application Memory Access Behavior
    Haifeng Li, Ke Liu, Ting Liang, Zuojun Li, Tianyue Lu, Hui Yuan, Yinben Xia, Yungang Bao, Mingyu Chen, Yizhou Shan
    DATE 2023
  7. HoPP: Hardware-Software Co-Designed Page Prefetching for Disaggregated Memory
    Haifeng Li, Ke Liu, Ting Liang, Zuojun Li, Tianyue Lu, Hui Yuan, Yinben Xia, Yungang Bao, Mingyu Chen, Yizhou Shan
    HPCA 2023 [Paper]
  8. Towards a Fully Disaggregated and Programmable Data Center
    Yizhou Shan, Will Lin, Zhiyuan Guo, Yiying Zhang
    APSys 2022 [Paper]
  9. Distributing and Disaggregating Hardware Resources in Data Centers
    Yizhou Shan
    UCSD Dissertation 2022
  10. Clio: A Hardware-Software Co-Designed Disaggregated Memory System
    Yizhou Shan, Zhiyuan Guo (co-first authors), Xuhao Luo, Yutong Huang, Yiying Zhang
    ASPLOS 2022 [Paper] [Code] [Slide]
  11. Disaggregating Persistent Memory and Controlling Them Remotely: An Exploration of Passive Disaggregated Key-Value Stores
    Shin-Yeh Tsai, Yizhou Shan, Yiying Zhang
    ATC 2020 [Paper] [Code] [Slide] [Short-Talk] [Full-Talk] [Keynote]

  12. Storm: a fast transactional dataplane for remote data structures
    Stanko Novakovic, Yizhou Shan, Aasheesh Kolli, Michael Cui, Yiying Zhang, Haggai Eran, Liran Liss, Michael Wei, Dan Tsafrir, Marcos Aguilera
    SYSTOR 2019 (Best Paper Award) [Paper] [Slide] [Talk]

  13. LegoOS: A Disseminated, Distributed OS for Hardware Resource Disaggregation
    Yizhou Shan, Yutong Huang, Yilun Chen, Yiying Zhang
    OSDI 2018 (Best Paper Award) [Paper] [Code] [Slide] [Keynote-iCloud] [Talk]

  14. Distributed Shared Persistent Memory
    Yizhou Shan, Shin-Yeh Tsai, Yiying Zhang
    SoCC 2017 [Paper] [Code] [Slide] [Poster]

Workshops

  1. Disaggregating Persistent Memory and Controlling Them Remotely: An Exploration of Passive Disaggregated Key-Value Stores
    Shin-Yeh Tsai, Yizhou Shan, Yiying Zhang
    12th Annual Non-Volatile Memories Workshop (NVMW 2021) [Paper]

  2. Challenges in Building and Deploying Disaggregated Persistent Memory
    Yizhou Shan, Yutong Huang, Yiying Zhang
    10th Annual Non-Volatile Memories Workshop (NVMW 2019) [Paper]

  3. Disaggregating Memory with Software-Managed Virtual Cache
    Yizhou Shan, Yiying Zhang
    2018 Workshop on Warehouse-scale Memory Systems (WAMS 2018) (co-located with ASPLOS '18) [Paper]

  4. Distributed Shared Persistent Memory
    Yizhou Shan, Shin-Yeh Tsai, Yiying Zhang
    9th Annual Non-Volatile Memories Workshop (NVMW 2018) [Paper]

  5. Disaggregated Operating System
    Yiying Zhang, Yizhou Shan, Sumukh Hallymysore
    17th International Workshop on High Performance Transaction Systems (HPTS 2017) [Paper]

Professional Services

Program Committee

  • FAST (2026, 2025)
  • EuroSys (2025, 2024, 2023)
  • ATC (2025, 2024, 2023)
  • NSDI (2026, 2025, 2024)
  • SoCC (2023, 2022)

Shadow/External Program Committee

  • EuroSys (2022-shadow, 2021-shadow)
  • ASPLOS (2021-external)

Journal Reviewer

  • Journal of Systems Research: 2021 - Current
  • ACM Transactions on Architecture and Code Optimization (TACO): 2021
  • ACM Transactions on Storage (TOS): 2020
  • IEEE/ACM Transactions on Networking: 2020

Artifact Evaluation Committee

  • SOSP (2021)
  • OSDI (2020)

Social

🏄 🚣 🏀 🏈