Time: August 26 – 27, 2010
Place: Aalborg University
Description: In this course, we will review two main aspects of data management – persistent data storage and retrieval systems for application development (OLTP) and data-analysis platforms and solutions for data-centric decision making (OLAP). We will start with a broad overview of the key features of OLTP and OLAP systems to underscore the main principles that guided the overall design of these system in the enterprise context. We will then present the emerging computing infrastructures that have emerged as a compelling paradigm for scalable and reliable computing substrate for building large-scale applications as well as for large-scale data analysis. We will then explore the emerging approaches for building Internet scale persistent data stores (Google’s BigTable, PNUTS from Yahoo!, and Amazon’s Dynamo) as well as massively distributed and parallel frameworks (MapReduce and Hadoop) for large-scale data analysis. During this course we will identify with the key research and development challenges for managing internet scale data in cloud computing infrastructures built over geographically dispersed data centers.
Instructor Biography:
Dr. Divyakant Agrawal is a Professor of Computer Science at the University of California at Santa Barbara. His research expertise is in the areas of database systems, distributed computing, data warehousing, and large-scale information systems. From January 2006 through December 2007, Dr. Agrawal served as VP of Data Solutions and Advertising Systems at the Internet Search Company ASK.com. While at ASK.com, Dr. Agrawal was the Chief Architect for building the next-generation Business Intelligence and Data Warehousing system at ASK.com. In addition, he developed revenue-sensitive products at ASK.com by applying data-mining and machine-learning technologies over ASK.com’s historical data. Dr. Agrawal also serves as a Visiting Senior Research Scientist at the NEC Laboratories of America in Cupertino, CA. During his professional career he has served on the Program Committees of International Conferences, Symposia, and Workshops and served as an editor of the journal of Distributed and Parallel Databases from 1993 to 2008 and the VLDB Journal from 2003-2009. Dr. Agrawal currently serves on the editorial board of the Proceedings of the VLDB. He is serving as the Program Chair for 2010 ACM SIGMOD Conference on Management of Data (SIGMOD’2010), as the General Chairs of 2010 ACM SIGSPATIAL Conference on geographical Information Systems (GIS’2010), and 2010 IEEE Workshp on Information and Software as a Service (WISS’2009).