paper-reading-group

DeepSynth: Automata Synthesis for Automatic Task Segmentation in RL

https://arxiv.org/pdf/1911.10244.pdf

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate, Tom Melham, Daniel Kroening (CS Department, University of Oxford)

Background

Motivation

Previous Work

Automata

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled.png

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%201.png

What does DeepSynth do?

DeepSynth Architecture

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%202.png

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%203.png

Tracing

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%204.png

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%205.png

Automaton Synthesis

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%206.png

RL

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%207.png

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%208.png

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%209.png

Experiments

DeepSynth%20Automata%20Synthesis%20for%20Automatic%20Task%20Se%20956bdaa7e08c423aa97f319b469790d3/Untitled%2010.png

Further Discussion

References

Logically Constrained RL - https://arxiv.org/abs/2002.12156

Reward Machines - https://arxiv.org/abs/2010.03950