MATTHIEU JONCKHEERE: RL CS 2021

Introduction to Reinforcement learning

1- [SLIDES]
Introduction to RL, Short Introduction to Markov chains.

Markov framework for RL.

Markov decision processes
Bellman equations
Dynamic programming

Markov + Bellman

Notebook

3- [SLIDES]

Bandits problems

4- [SLIDES]
Model free tabular problems.
Monte-Carlo and TD

5- [SLIDES]
Bigger state spaces: approximations of value functions
Policy gradient

Actor critic

BIBLIOGRAPHY:

Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues (Texts in Applied Mathematics),
by Pierre Bremaud (2001-02-01), Springer.

Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto

Second Edition, in progress MIT Press, Cambridge, MA, 2017

Dynamic programing and optimal control, D. Bertsekas,

SA, 2012

Páginas

RL CS 2021