CV

Lihui Gu

lihuigu@zju.edu.cn
(+86) 15824346819
Beijing, CN

Summary

AI Infra Engineer at ByteDance. M.S. in Software Engineering from Zhejiang University. Focused on LLM/DiT inference optimization and speculative decoding.

Education

  • M.S. in Software Engineering
    2026-03
    Zhejiang University
  • Computer Science and Technology
    2023-06
    Zhejiang University of Technology
    GPA: 4.23/5

Work Experience

  • AI Search ML Infra Engineer
    2026-01 - Present
    ByteDance
    Speculative decoding training for multimodal LLMs and refactoring of the asynchronous RL post-training pipeline.
  • Large Model Inference Optimization Intern
    2025-06 - 2025-12
    Kuaishou - Keling Technology Department
    Optimized DiT model inference through feature cache reuse. The resulting work was accepted to ICLR 2026.
  • Navigation Group Algorithm Intern
    2024-09 - 2025-04
    Hangzhou Feibu Technology
    End-to-end predictive-interactive modeling and RL optimization.

Skills

Deep Learning & Inference Optimization

  • LLM Inference
  • DiT Inference
  • Speculative Decoding
  • EAGLE3
  • vLLM
  • xDiT
  • CUDA
  • FP8
  • Tensor Parallelism
  • CUDA Graph

Publications

  • SCALINGCACHE: Extreme Acceleration of DiTs Through Difference Scaling and Dynamic Interval Caching
    2026
    ICLR 2026
    Accelerates DiT models through feature caching, achieving a 2.5x-3.0x end-to-end speedup.

Languages

  • Chinese
    Native
  • English
    Professional