publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- OSDI’25WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training (To Appear)In 19th USENIX Symposium on Operating Systems Design and Implementation , 2025
- USENIX ATC’25PluS: Highly Efficient and Expandable ML Compiler with Pluggable Graph Schedules (To Appear)In USENIX Annual Technical Conference , 2025
2024
2023
- ASPLOS’23RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding ColumnsIn Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4 , 2023