Title：Scalable Big Data Management on clouds and HPC systems
We live in the era of Big Data, where the size of data generated by digital media, social networks and scientific instruments is increasing at an extreme rate. To realize the potential benefits of Big Data in practice (e.g., harnessing scientific discoveries, accelerating big and small business, and so on), we need to deal with several challenging issues, among which, data management is an important one. As the current data management techniques and infrastructure strain under the added weight of Big Data, there is a call for a paradigm shift in large scale data management and processing mechanisms (e.g., MapReduce, key-value stores) and computing infrastructure (e.g., Clouds, Exascale systems). In this talk, I will present recent work on addressing the main challenges when managing (storing, processing) big data at large-scale including I/O interference, performance variability, fault-tolerance, consistency, and so on.
Shadi Ibrahim is an Inria Research Scientist within the Ascola/STACK team at Inria Rennes Bretagne Atlantique research center. He obtained his Ph.D. in Computer Science from Huazhong University of Science and Technology (HUST) in Wuhan of China in 2011. From November 2011 to September 2013, he was a Post-Doc researcher within the KerData team working on scalable Big Data managements on clouds. Since October 2013, he was hired as a permanent research scientist at Inria. He has several years of experience with academic research at HUST, Inria research center and Microsoft research center in Asia. He has served as a PC member for international conferences in cloud, HPC and parallel and distributed systems (ACM/IEEE SC, IEEE ICPP, IEEE Cluster, ACM/IEEE CCGrid), and as Program co-chair for the 2018 IEEE International Congress on Big Data, the CEBDA 2018 Workshop, the 2017 ICA3PP conference, the 2017 EBDMA Workshop; and the PhD consortium of the 2014 CloudCom conference. His current research interests are in Big data management, Data-Intensive computing, HPC, cloud and Edge computing, virtualization technology, and file and storage systems. His papers are published in prestigious international journals (such as ACM ToPC, IEEE TPDS and FGCS) and proceedings (such as ACM/IEEE SuperComputing, IEEE ICDCS, IEEE IPDPS, and IEEE ICPP). He is currently leading the ANR KerStream project and is a member the DISCOVERY Inria Project Lab and JLESC: Joint Laboratory on Extreme-Scale Computing.