Selected Publications

Full Publications, indicates equal contribution, indicates members of the research group I lead.
[INFOCOM'25] Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices.

IEEE International Conference on Computer Communications (INFOCOM), 2025. CCF-A

Cite

[ArXiv'24] Edge Graph Intelligence: Reciprocally Empowering Edge Networks with Graph Intelligence.

Under review

Cite Paper Press

[ICPP'24] Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning.

International Conference on Parallel Processing (ICPP), 2024. CCF-B

Cite Paper Slides

[IEEE WCM'24] Implementation of Big AI Models for Wireless Networks with Collaborative Edge Computing.

IEEE Wireless Communications (IEEE WCM), 2024. 中科院一区(Top) IF=12.9

Cite Paper

[INFOCOM'24] Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference.

IEEE International Conference on Computer Communications (INFOCOM), 2024. CCF-A

Cite Paper Slides Videos Press 🏵️ IEEE ComSoc Student Travel Grant Award

[MobiCom'24] Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices.

Annual International Conference On Mobile Computing And Networking (MobiCom), 2024. CCF-A

Cite Paper Demo Slides Poster 🏵️ Top #1 Conference in Mobile Computing and Computer Networks

[DATE'24] Communication-Efficient Model Parallelism for Distributed In-situ Transformer Inference.

Design, Automation and Test in Europe Conference (DATE), 2024. CCF-B

Cite Paper

[ICPP'22] Eco-FL: Adaptive Federated Learning with Efficient Edge Collaborative Pipeline Training.

International Conference on Parallel Processing (ICPP), 2022. CCF-B

Cite Paper Code Slides Videos