Academic Pages is a ready-to-fork GitHub Pages template for academic personal websites

Education

  • ​​M.S. in Computer Science​​, Zhejiang University, 2023 - 2026(Expected)

  • ​​B.S. in Computer Science and Technology​​, Zhejiang University of Technology, 2019 - 2023

    • Computer Aided Design and Graphics System National Key Laboratory, Zhejiang University
    • School of Computer Science and Technology, Zhejiang University of Technology
    • Double Major: Computer Science and Technology; Minor: Intelligent Science and Technology
    • GPA: 4.23/5, Major Ranking: 1/23

Work experience

  • Large Model Inference Optimization Intern​, Kuaishou - Keling Technology Department, Beijing 2025.06 - 2025.09:
    • Optimized feature cache reuse strategy for DiT models by leveraging the similarity of adjacent denoising time-step features. Achieved a 2.5x end-to-end acceleration on the Wan2.1/Hunyuan video generation model and 3.0x on the Flux image generation model, surpassing SOTA solutions like Taylorseer/Easycache under the same acceleration ratio. Nearly lossless 1.4x end-to-end acceleration verified on the un-distilled Keling i2v model.
  • Navigation Group Algorithm Intern​​, Hangzhou Feibu Technology, Zhejiang, 2024.09-2025.04
    • ​​Predictive-Interactive Integrated Modeling​​: Designed an end-to-end data-driven model to replace rule-based strategies, enhancing decision-making in complex scenarios.

    • ​​Reinforcement Learning Optimization​​: Implemented model iterations based on the PPO algorithm, continuously improving port intersection traffic efficiency and reducing collision risks.

    • ​​Engineering Implementation​​: Reduced average intersection passage time by 0.5s across 1000+ simulation cases, completed real vehicle deployment of the model, and received an S-level performance evaluation during the internship.

Skills

  • ​​Deep Learning & Inference Optimization​​:
    • Familiar with DIT/LLM inference optimization techniques, core functionalities of mainstream inference frameworks (vLLM, xDiT), and multi-card distributed inference technology.
    • Skilled in the underlying implementation of speculative decoding, combining FP8 quantization, communication optimization, tensor parallelism, and CUDA Graph techniques.
    • Knowledgeable in the implementation details of high-performance fusion operators (Megakernel), achieving extreme acceleration by fusing the entire LLM decoding process into one kernel.
  • ​​Industrial Project Experience​​:
    • Led deep learning/reinforcement learning projects in autonomous driving scenarios (prediction-interaction tasks), with comprehensive experience from algorithm to engineering.

Experience

Kling AI, Kuaishou
2025.06 - Present
Advisor: Menghan Xia, Xintao Wang
Interests: Controllable Video Generation
M.S. Zhejiang University
Sep. 2023.09 - Present
Advisor: Haoji Hu