Publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2026
- [preprint] ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip Design. arXiv preprint arXiv:2601.21448, 2026.
- [preprint] Pancake: Hierarchical Memory System for Multi-Agent LLM Serving. arXiv preprint arXiv:2602.21477, 2026.
2025
2024
2023
- [ASPLOS’23] RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns. In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4, 2023. 🏆 Distinguished Artifact Award (presented at ASPLOS’24).