(PhD) Dynamic Programming

View All Courses

This course offers an advanced introduction Markov Decision Processes (MDPs)a formalization of the problem of optimal sequential decision making under uncertaintyand Reinforcement Learning (RL)a paradigm for learning from data to make near optimal sequential decisions. The first part of the course will cover foundational material on MDPs. We'll then look at the problem of estimating long run value from data, including popular RL algorithms like temporal difference learning and Q-learning. The final part of the course looks at the design and analysis of efficient exploration algorithms, i.e. those that intelligently probe the environment to collect data that improves decision quality. This a doctoral level course. Students should have experience with mathematical proofs, coding for numerical computation, and the basics of statistics, optimization, and stochastic processes.

Division: Decision, Risk and Operations

Fall 2023

B9120 - 001

Faculty

Daniel Russo

Part of Term

PhD - Full Term

Section Syllabus

No Syllabus

Section Notes

Day(s)

Date(s)

Start/End Time

Room

Thursday 09/05/2023 - 12/08/2023 9:00AM - 12:15PM Kravis 430

Log In to View Evaluations

Fall 2023

B9120 - 001

B9120 Course Evaluation

Accessibility Panel

Language Settings