题目:Towards MapReduce for Desktop Grid Computing
报告人:贺海武
地点:东五楼二楼210学术报告厅
时间:11月2日上午9:30
报告摘要:
MapReduce is an emerging programming model for dataintensive application proposed by Google, which has attracted a lot of attention recently. MapReduce borrows ideas from functional programming, where programmer defines Map and Reduce tasks to process large set of distributed data. In this paper we propose an implementation of the MapReduce programming model. We present the architecture of the prototype based on BitDew, a middleware for large scale data management on Desktop Grid. We describe the set of features which makes our approach suitable for large scale and loosely connected Internet Desktop Grid: massive fault tolerance, replica management, barriers-free execution, latency-hiding optimisation as well as distributed result checking. We also present performance evaluation of the prototype both against micro-benchmarks and real MapReduce application. The scalability test shows that we achieve linear speedup on the classic WordCount benchmark. Several scenarios involving lagger hosts and host crashes demonstrate that the prototype is able to cope with an experimental context similar to real-world Internet.
报告人简介:
Dr. Haiwu HE (贺海武)received his M. Sc. and the Ph. D. degrees in computing science from University of Sciences and Technologies of Lille (Lille I),France, respectively in 2002 and 2005. He was a postdoctoral researcher at INRIA Saclay, France in 2007.Currently, he is a research engineer at ENS-Lyon in Lyon, France. He has published about 20 refereed journal and conference papers.His research and development interest covers HPC, Cloud computing, Scientific computing, Desktop Grid, etc.