Course Information
- Time: MW 9:30-10:45
- Location: Rice Hall 340
- Instructor: Chen-Yu Wei
- TA: Braham Snyder
- Office Hours (Instructor): TBD
- Office Hours (TA): TBD
Overview
Reinforcement learning (RL) is a powerful learning paradigm through which machines learn to make (sequential) decisions. It has been playing a pivotal role in advancing artificial intelligence, with notable successes including mastering the game of Go and enhancing large language models.
This course focuses on the design principles of RL algorithms. Similar to statistical learning, a central challenge in RL is to generalize learned capabilities to unseen environments. However, RL also faces additional challenges such as exploration-exploitation tradeoff, credit assignment, and distribution mismatch between behavior and target policies. Throughout the course, we will delve into various solutions to these challenges and provide theoretical justifications.
Prerequisites
This course is mathematically demanding. Students are expected to have strong foundations in probability, linear algebra, and calculus. A basic understanding of machine learning and convex optimization will be beneficial. Proficiency in python programming is required.
Topics
Bandits, online learning, dynamic programming, Q-learning, policy evaluation, policy gradient.
Grading
- (40%) Written assignments
- (30%) Programming assignments
- (30%) Final project
Schedule
Date | Topics | Materials | Notes |
---|---|---|---|
1/13 | Introduction | ||
1/15 | |||
1/20 | MLK Holiday | ||
1/22 | |||
1/27 | |||
1/29 | |||
2/3 | |||
2/5 | |||
2/10 | |||
2/12 | |||
2/17 | |||
2/19 | |||
2/24 | |||
2/26 | |||
3/3 | |||
3/5 | |||
3/10 | Spring recess | ||
3/12 | Spring recess | ||
3/17 | |||
3/19 | |||
3/24 | |||
3/26 | |||
3/31 | |||
4/2 | |||
4/7 | |||
4/9 | |||
4/14 | |||
4/16 | |||
4/21 | |||
4/23 | |||
4/28 |
Resources
- Bandit Algorithms by Tor Lattimore and Csaba Szepesvari
- Reinforcement Learning: An Introduction by Richard Sutton and Andrew Barto
- Reinforcement Learning: Theory and Algorithms by Alekh Agarwal, Nan Jiang, Sham Kakade, and Wen Sun
- Statistical Reinforcement Learning and Decision Making: Course Notes by Dylan Foster and Sasha Rakhlin
Previous Offerings
- CS 6501 Reinforcement Learning (Spring 2024)
- CS 6501 Topics in Reinforcement Learning (Fall 2022) by Prof. Shangtong Zhang