Sketching Algorithms for Big Data

Harvard CS 226/MIT 6.889 - Fall 2017 (Syllabus)

Big data is data so large that it does not fit in the main memory of a single machine. The need to process big data by space-efficient algorithms arises in Internet search, machine learning, network traffic monitoring, scientific computing, signal processing, and other areas.

This course will cover mathematically rigorous models for developing such algorithms, as well as some provable limitations of algorithms operating in those models. Some topics covered include:

This course is intended for both graduate students and advanced undergraduate students satisfying the following prerequisites: mathematical maturity and comfort with algorithms (e.g. CS 124 at Harvard, or 6.046 at MIT), discrete probability, and linear algebra.