Data sets are growing rapidly with the presence of new information generation from business enterprises, scientific and engineering disciplines, social networks, mobile phones, and other sensors. With a 4x growth rate of data each year, it is expected that we will have produced more than 35 zettabytes of data by 2020. Traditional databases and learning methods do not work for this massive amount of mainly unstructured data. This course provides the knowledge to use new Big Data tools and learn ways of storing information that will allow for efficient processing and analysis. In addition, you learn to store, manage, process and analyze massive amounts of unstructured data. This course will introduce tools such as Hadoop, Hive, Pig, Mahout, BigTable, and MongoDB for operations on massive data. Since all operations on massive data run on the cloud, we will introduce the cloud and its services for massive data. We will look at Amazon Elastic Cloud, Microsoft’s Azure, Google App Engine, etc.