Selected Publications

Full Publications, indicates equal contribution, indicates members of the research group I lead.
[ArXiv'25] Resource-Efficient Personal Large Language Models Fine-Tuning with Collaborative Edge Computing.

Under Review, 2025.

[ArXiv'25] Resource-Efficient Collaborative Edge Transformer Inference with Hybrid Model Parallelism.

Under Review, 2025.

[INFOCOM'25] Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices.

IEEE International Conference on Computer Communications (INFOCOM), 2025. CCF-A

Cite

[ICPP'24] Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning.

International Conference on Parallel Processing (ICPP), 2024. CCF-B

Cite Paper Slides

[INFOCOM'24] Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference.

IEEE International Conference on Computer Communications (INFOCOM), 2024. CCF-A

Cite Paper Slides Videos Press 🏵️ IEEE ComSoc Student Travel Grant Award

[MobiCom'24] Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices.

Annual International Conference On Mobile Computing And Networking (MobiCom), 2024. CCF-A

Cite Paper Demo Slides Poster 🏵️ Top #1 Conference in Mobile Computing and Computer Networks

[ICPP'22] Eco-FL: Adaptive Federated Learning with Efficient Edge Collaborative Pipeline Training.

International Conference on Parallel Processing (ICPP), 2022. CCF-B

Cite Paper Code Slides Videos