by Zeqian Li

Markov Decision Process and Reinforcement Learning

How to beat human in 49 Atari games.

Presentation Summary

In these slides, I present:

Basic concepts of Markov Decision Process, Bellman equation
Statistical mechanics of Markov Decision Process
Soft Q-learning in reinforcement learning

Examples

A well-documented code repo from OpenAI.

References

Most contents here are from lectures in ICTP Spring College on Physics of Complex Systems, 2018. Full video lectures are available on ICTP Youtube channel and ICTP website.
A good video lecture from Stanford CS231n course. I also strongly recommend the full course.
Mnih, Volodymyr, et al. “Human-level control through deep reinforcement learning.” Nature 518.7540 (2015): 529.
Sutton, Richard S., and Andrew G. Barto. Reinforcement learning: An introduction. MIT press, 2018.
Deep reinforcement learning resources on OpenAI

All Machine Learning

Yubo "Paul" Yang 2019-02-28 ALGORITHM
machine learning