paper-reading-group

Accelerating Reinforcement Learning with Learned Skill Priors (SPiRL)

Authors: Karl Pertsch, Youngwoon Lee, Joseph J. Lim
Code: https://github.com/clvrai/spirl
Website: https://clvrai.github.io/spirl/

Motivation

Previous Work

Contribution

Leveraging skills embeddings from large offline datasets and learning a prior over them for efficient downstream learning tasks

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%201.png

Methodology

Learning the Embedding Space and the Prior over the embeddings

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%202.png

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%203.png

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%204.png

Using the prior over the skills for downstream reinforcement learning tasks

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%205.png

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%206.png

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%207.png

Experiments

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%208.png

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%209.png

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%2010.png

Accelerating%20Reinforcement%20Learning%20with%20Learned%20S%2055f74821b841411e9b7695dd6cab9440/Untitled%2011.png

Further work

  1. Experimenting with learning skill priors conditioned on different things; the paper reports results with the priors conditioned on the current state
  2. How does this approach scale with multi-task learning (continual learning agents)?