Enabling Large Scale Data Analytics: From Theoretical Foundations to Practice

IIT Hyderabad is organising a course on "Enabling Large Scale Data Analytics: From Theoretical Foundations to Practice" in Summer 2016. The course will be held at IIT Hyderabad Campus from 13th June to 17th June 2016. The course instructor is Dr. Barna Saha, Assistant Professor in the College of Information and Computer Sciences at the University of Massachusetts Amherst.

The course is being offered under the GIAN scheme launched by the Govenment of India.

Live webcast link of the course


News and Updates

The course lectures are over. The examiniation will be held today from 10:00 am.

Webcast link for the course is here.

Course schedule updated. See here.

For those who have opted for accommodation, accommodation status has been sent on 8th June 2016. Accommodation is arranged in Hostels in OLD/Temporary Campus of IIT Hyderabad. Route map to reach the old campus is here. Free transport via institute buses is available between the two campuses.

Route map to drive to IIT Hyderabad Permanent Campus is here. This document contains the same details, with information about public transport.

Clarification: The exam on the last day is optional. It is for those who are interested to write the exam. People who write the exam would also get a grade certificate at the end of the course.

Registration is now closed. Registration confirmation/tentative confirmation email has been sent to the participants. If you have registered but have not yet received any confirmation email, please drop an email to the course coordinator immediately.


The amount of data in our world has been exploding at an unforeseeable rate. The increasing volume and detail of information captured by enterprises, and the rise of multimedia, social media, and the Internet of Things are contributing to this exponential growth. Healthcare industries are promising to transform the world through "big" data. The ongoing data deluge is bringing in new opportunities in businesses, finances, and education. As we walk through this digitized age of exploded data, there is an increasing demand to develop unified toolkits for data processing and analysis. In this course our main goal is to lay the mathematical foundation of large scale data processing, develop algorithms and learn how to analyze them.


The primary objectives of the course are as follows:

  1. Exposing participants to the theories behind large scale data processing algorithms
  2. Providing exposure to practical problems and their solutions, and understanding why the solutions work
  3. Enhancing the capability of the participants to perform theoretical analysis with the goal of developing practical algorithms for variety of applications.
  4. To develop an appreciation for current and future challenges of large scale data analytics methods in both theory and practice.


Since good amount of time in the course will be devoted to theoretical aspects of large scale data analytics, the participants are expected to have understanding of basic probability, and general notion of algorithms.