Cloud and Distributed Systems Lab
Cloud and Distributed Systems Lab
Overview
Research
News
Members
Publications
Contact
Chengzhong Xu
Latest
Embracing Imbalance: Dynamic Load Shifting among Microservice Containers in Shared Clusters
Grad: Intelligent Microservice Scaling by Harnessing Resource Fungibility
Multiplexing Dynamic Deep Learning Workloads with SLO-awareness in GPU Clusters
SMIless: Serving DAG-based Inference with Dynamic Invocations under Serverless Computing
Derm: SLA-aware Resource Management for Highly Dynamic Microservices
Optimizing Dynamic Data Center Provisioning through Speed Scaling: A Primal-Dual Perspective
Heet: Accelerating Elastic Training in Heterogeneous Deep Learning Clusters
Optimizing Resource Management for Shared Microservices: A Scalable System Design
Interference-aware Multiplexing for Deep Learning in GPU Clusters A Middleware Approach
Understanding and Optimizing Workloads for Unified Resource Management in Large Cloud Platforms
Erms: Efficient Resource Management for Shared Microservices with SLA Guarantees
The Power of Prediction Microservice Auto Scaling via Workload Learning
An In-Depth Study of Microservice Call Graph and Runtime Performance
Characterizing Microservice Dependency and Performance: Alibaba Trace Analysis
Cite
×