Name	Name	Last commit message	Last commit date
Latest commit ichko Add assets Jul 11, 2020 6e5adaa · Jul 11, 2020 History 155 Commits
assets	assets	Add assets	Jul 11, 2020
src	src	Remove unnecessary imports	Jul 11, 2020
.env	.env	init project and add model	May 4, 2020
.gitignore	.gitignore	Implement train eval all pipeline	Jul 6, 2020
README.md	README.md	Update	Jul 11, 2020
readme.md	readme.md	Readme update	Jul 11, 2020
requirements.txt	requirements.txt	Add requirements file	Jul 11, 2020

Name

Last commit message

Last commit date

ichko

Add assets

Jul 11, 2020

6e5adaa · Jul 11, 2020

Jul 11, 2020

Remove unnecessary imports

Jul 11, 2020

.env

init project and add model

May 4, 2020

.gitignore

Implement train eval all pipeline

Jul 6, 2020

Jul 11, 2020

Jul 11, 2020

Add requirements file

Jul 11, 2020

Forward model

Previous notebooks and experiments can be found here.

Experiments and models for my masters thesis on learning environment dynamics from observations.

Notes and tasks

Profiling code
- pip install profiling
- profiling live-profile -m src.pipeline.train -- --debug
General stuff
- Mask out empty (padded) frames after rollout has finished. See here.
- [~] Label smoothing. Do I actually want that?
Models
- RNN Deconvolution Baseline
- Learn frame transformations
  - Instead of compressing the state like the RNN does
  - Action + Precondition (last few frames) -> transformation vector T
  - Use T to transform the current frame to the future frame
  - Play rollout of frame transformations - results in wandb look promising
Notes
- 12.06.2020
  - Update implementation of RNN Deconv
  - Focus on making RNN deconv work on PONG
    - WHY RNN Deconv - it is the only model that can model PONG with the current setup of the data pipeline.
    - Frame transforming models need two frames as context
    - TODO: [ ] Train and save working RNN Deconv model [ ] Write playing script [ ] Write script for manipulating the latent RNN state and viewing the result?
- 06.06.2020
  - Implement pong agent class + action mappings ([3, 3] => 9)
  - Make RNN Playable (interface like a gym)
- 04.06.2020
  - [BUGFIX] Found major bug in RNN models - the pred frames and true frames were not aligned, the model was trying to predict the present from the present
  - [BUGFIX] TimeDistributed (decorator) module was not holding the wrapped module in it's state resulting in the parameters of the wrapped module not being part of the overall model, resulting in the model not being able to be trined. (Took quite some time)
  - [FEATURE] Implemented generic multiprocessing function spawner and random agent rollout generator that leads to newer rollouts in the training buffer faster. Hopefully this can reduce over-fitting.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Forward model

Notes and tasks

About

Contributors 2

Languages

ichko/forward-model

Folders and files

Latest commit

History

Repository files navigation

Forward model

Notes and tasks

About

Topics

Resources

Stars

Watchers

Forks

Contributors 2

Languages