Let the fun begin ...
Over the next seven weeks, we will present many of the important tools for extracting information from very large datasets. Each week there will be a number of videos to watch and one or more homework assignments to do. The materials are backed by a free online textbook, "Mining of Massive Datasets," which is also published by Cambridge University Press. You can download the book at http://www.mmds.org
The first week is devoted to two topics:
- MapReduce: A programming system for easily implementing parallel algorithms on commodity clusters. This material is in the first four videos available for the week.
- Link Analysis: The remaining seven videos discuss the PageRank algorithm that made Google more effective than previous search engines.