Introduction to Reinforcement learning
Markov framework for RL.
Markov decision processesBellman equations
Dynamic programming
TP
Markov + Bellman
NotebookBandits problems
TP
Actor critic
BIBLIOGRAPHY:
Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues (Texts in Applied Mathematics),
by Pierre Bremaud (2001-02-01), Springer.
Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues (Texts in Applied Mathematics),
by Pierre Bremaud (2001-02-01), Springer.
Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto
Second Edition, in progress MIT Press, Cambridge, MA, 2017
Dynamic programing and optimal control, D. Bertsekas,
SA, 2012