Information theory is an area of applied probability that was developed to model and analyze engineering systems for storing and transmitting data. Since then, information-theoretic ideas have also played an important role in several topics within statistics, most notably in establishing fundamental limits on the performance of statistical procedures. However, the role of information theory is not limited to proving such impossibility results. In this course, we will develop the tools to study a selection of modern and classical topics involving the interplay of information theory and statistics.
We will begin the course by introducing the main information measures (entropy, relative entropy, and mutual information) and rigorously establishing their key properties. Next, we will study the fundamental task of (lossless) data compression and, in particular, see how these information measures naturally arise as quantities with a specific operational meaning. We will then examine the (perhaps surprising) links between compression and the optimal growth rate of wealth in gambling, and show how this connection can be exploited to design powerful methods for sequential inference. Finally, we will introduce the notion of information projection and study its connections to error exponents in hypothesis testing.
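To make the three information measures above concrete, here is a minimal Python sketch (the function names and the choice of base-2 logarithms, i.e. bits, are illustrative assumptions, not conventions fixed by the course):

```python
import math

def entropy(p):
    """Shannon entropy H(p) in bits of a pmf given as a list of probabilities."""
    return -sum(x * math.log2(x) for x in p if x > 0)

def relative_entropy(p, q):
    """Relative entropy (KL divergence) D(p || q) in bits.

    Assumes q[i] > 0 wherever p[i] > 0 (otherwise the divergence is infinite).
    """
    return sum(x * math.log2(x / y) for x, y in zip(p, q) if x > 0)

def mutual_information(joint):
    """I(X;Y) = D(P_XY || P_X * P_Y) for a joint pmf given as a 2-D list."""
    px = [sum(row) for row in joint]                 # marginal of X
    py = [sum(col) for col in zip(*joint)]           # marginal of Y
    return sum(
        pxy * math.log2(pxy / (px[i] * py[j]))
        for i, row in enumerate(joint)
        for j, pxy in enumerate(row)
        if pxy > 0
    )

# A fair coin has exactly 1 bit of entropy.
print(entropy([0.5, 0.5]))                                 # 1.0
# A distribution carries no information about itself beyond its own shape:
print(relative_entropy([0.5, 0.5], [0.5, 0.5]))            # 0.0
# Independent X and Y share no information.
print(mutual_information([[0.25, 0.25], [0.25, 0.25]]))    # 0.0
```

Note that mutual information is itself a relative entropy, between the joint distribution and the product of its marginals; this identity underlies many of the properties established in the first part of the course.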
Lecture notes will be posted here.
There will be two homework assignments.