Yunkai Liang

I am a Ph.D. candidate in the School of Computer Science and Engineering at Sun Yat-sen University, advised by Prof. Zhi Zhou. Before that, I received my B.Eng. degree in software engineering from Sun Yat-sen University. My research focuses on large language model (LLM) inference and cloud computing systems. During my Ph.D. program, I also gained industry experience as a research intern at Huawei Cloud, working with Dr. Pengfei Zuo on LLM inference systems.

Publications

2026

  1. PPoPP
    Laser: Unlocking Layer-Level Scheduling for Efficient Multi-SLO LLM Serving
    Jianxiong Liao, Quanxing Dong, Yunkai Liang, Zhi Zhou, and Xu Chen
    In Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026

2025

  1. arXiv
    Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation
    Yunkai Liang, Zhangyu Chen, Pengfei Zuo, Zhi Zhou, Xu Chen, and Zhou Yu
    arXiv preprint arXiv:2503.20552, 2025
  2. arXiv
    Prefill-Decode Aggregation or Disaggregation? Unifying Both for Goodput-Optimized LLM Serving
    Chao Wang, Pengfei Zuo, Zhangyu Chen, Yunkai Liang, Zhou Yu, and Ming-Chang Yang
    arXiv preprint arXiv:2508.01989, 2025

2024

  1. OSDI
    (Poster) PipeDecode: Efficient LLM Inference using Pipelines within Decoding
    Yunkai Liang, Bin Gao, Pengfei Zuo, Zhi Zhou, and Xu Chen
    In 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24), 2024

2023

  1. ICDCS
    EdgeOrcher: Predictive Function Orchestration for Serverless-Based Edge Native Applications
    Yunkai Liang, Zhi Zhou, and Xu Chen
    In Ph.D. student symposium, 2023 IEEE 43rd International Conference on Distributed Computing Systems (ICDCS), 2023

Service

  • Reviewer: DPCS’24
  • External Reviewer: ICML’26, EuroSys’26, NSDI’25, NAS’24