题目:Autonomic Cloud Systems Management: Challenge and Opportunities
报告人:须诚忠 教授 美国韦恩州立大学计算机工程工程系
地点:东五楼二楼210学术报告厅
时间:6月8日上午9:00
报告摘要:
Cloud computing, unlocked by virtualization, is emerging as an increasingly important service-oriented computing paradigm. Management is key to providing accurate service availability and performance data and to enabling on-demand real-time capacity planning to meet service demands dynamically. This is because virtualization does not reduce the complexity of a system. In fact, having multiple virtual machines (VMs) running on top of a shared physical computing infrastructure increases the overall system complexity and poses new challenges in systems management. Optimizing one component may compromise the others, leading to overall performance degradation. Frequent component failures here and there would even cause low system productivity.
This talk starts with a review of challenge issues in the management of large scale cloud computing systems. A machine learning approach is introduced for tackling the performance and reliability problems. Two case studies will be presented. One is anomaly detection, bottleneck identification, and VM autoconfiguration. The other is proactive failure management that deals with failures before they occur in cloud systems. Empirical models built from statistical learning exhibit great potential to help overcome the challenges of scale and complexity in current and future networked computer systems.
报告人简介:
Dr. Cheng-Zhong Xu is a Professor in the Department of Electrical and Computer Engineering of Wayne State University, the Director of the Laboratory for Cloud and Internet Computing, and the Director of Sun's Center of Excellence in Open Source Computing and Applications (SUN OSCA). He received his BS and MS degrees from Nanjing University, and Ph.D. from the University of Hong Kong in 1986, 1989, and 1993, respectively, all in Computer Science. He was a Guest Professorof the Paderborn Center of Parallel Computing in the Paderborn Univeristy, Germany before he joined the faculty of Wayne State University in 1995.
Dr. Xu's current research interest includes resource management in distributed and parallel systems, high performance cluster computing, and scalable and secure Internet services. He has published extensively in these areas. He was a main inventor and promoter of diffusive load balancing algorithms in parallel computing, and a key innovator of feedback control and machine learning-based systems management approaches for the provisioning of high service avaiability of
Internet servers and datacenters. Dr. Xu was the leading author of the book "Load Balancing in Parallel Computers" (Kluwer Academic/Springer Verlag, 1995). It was arguably the first that addressed the load balancing issue systematically.
Dr. Xu’s recent book "Scalable and Secure Internet Services and Architecture" (Chapman & Hall/CRC Press, 2005) provided an in-depth analysis of the Internet services in a unified framework from the performannce perspective. Dr. Xu's research was supported by US NSF, NASA, and industries like Cray Research and Sun Microsystems. He is theDirector of the Sun OSCA Center, established by Sun Microsystems in partnership with Wayne State University.
Dr. Xu is an editor of a number of leading journals in his areas, including IEEE Transactions on Parallel and Distributed Systems and Journal of Parallel and Distributed Computing. He is a recipient of "Faculty Research Award" of Wayne State University in 2000, President's Award for Excellence in Teaching in 2002, and Career Development Chair Award in 2003.