AI Saturdays: Reinforcement Learning Track

Code and notes from the AI Saturdays (Madrid, 3rd ed.) Reinforcement Learning Track.

Session 1 - 2020-02-08

For this first session we had to watch the following lectures from CS 285 at UC Berkeley (Deep Reinforcement Learning):

We also had to take a general look al Pytorch, since this would be the DL framework used throughout the course.

During the session we had a look at some basic Reinforcement Learning concepts:

Overview of environments, statuses, observations, policies and actions.
Commonly used notation
Markov chains
V and Q functions
Model Free and Model Based Reinforcement Learning
Imitation Learning

Afterward, we started to code a couple of small examples in Gym. This included:

A script that executes a series of episodes of a game with a random agent, which samples actions uniformly from the action space of the environment.
A script that allows the user to play the game and record the gameplay (observations, actions and rewards).

Session 2 - 2020-02-15

For the second session we decided to review the following pages from OpenAI Spinning Up:

We also decided to watch the following lecture from David Silver:

Video
Slides

Finally, we read the 4th chapter from Deep Reinforcement Learning Hands On, by Maxim Lapan, (Chapter4: The Cross-Entropy Method).

Sadly, I could not attend the session personally, but it consisted on reviewing the Cross-Entropy method, analyzing in which environments it would be most applicable, and implementing them for a few environments.

Session 3 - 2020-02-22

The third session focused on Tabular Learning, the Value Iteration Method, Deep Q-learning and the Deep Q-Networks.

We read the Chapters 5 (Tabular Learning and the Bellman Equation) and 6 (Deep Q-Networks) and watched to following lecture by David Silver:

Video
Slides

During the class, we reviewed the Value Iteration and Q-learning methods, and applied the first one to the Frozen Lake environment from OpenAI Gym, and the second one to the ATARI Pong game, with great success.

Session 4 - 2020-02-29

I

Name	Name	Last commit message	Last commit date
Latest commit miguel-bm Updated render function for tanks Mar 14, 2020 cd751b7 · Mar 14, 2020 History 31 Commits
01_Intro	01_Intro	Reorganize folder structure	Feb 21, 2020
02_CE_method	02_CE_method	Reorganize folder structure	Feb 21, 2020
03_DeepQ	03_DeepQ	Finished 2048 custom environment	Mar 6, 2020
04_Policy_Grad/notebooks	04_Policy_Grad/notebooks	Added a few WIP custom environments	Mar 6, 2020
05_Environments	05_Environments	Updated render function for tanks	Mar 14, 2020
.gitignore	.gitignore	Initial commit	Feb 9, 2020
LICENSE	LICENSE	Initial commit	Feb 9, 2020
README.md	README.md	Some features for tank game	Mar 7, 2020
requirements.txt	requirements.txt	Some features for tank game	Mar 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Saturdays: Reinforcement Learning Track

Session 1 - 2020-02-08

Session 2 - 2020-02-15

Session 3 - 2020-02-22

Session 4 - 2020-02-29

About

Releases

Packages

Languages

License

miguel-bm/ai6rl

Folders and files

Latest commit

History

Repository files navigation

AI Saturdays: Reinforcement Learning Track

Session 1 - 2020-02-08

Session 2 - 2020-02-15

Session 3 - 2020-02-22

Session 4 - 2020-02-29

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages