Jacob Beck and Risto Vuorio
Jacob Beck and Risto Vuorio discuss their recent Survey of Meta-Reinforcement Learning. Jacob and Risto are Ph.D. students at the Whiteson Research Lab at the University of Oxford.
Featured Reference
A Survey of Meta-Reinforcement Learning
Jacob Beck, Risto Vuorio, Evan Zheran Liu, Zheng Xiong, Luisa Zintgraf, Chelsea Finn, Shimon Whiteson
Additional References
- VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning, Luisa Zintgraf et al
- Mastering Diverse Domains through World Models (DreamerV3), Hafner et al
- Unsupervised Meta-Learning for Reinforcement Learning (MAML), Gupta et al
- Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices (DREAM), Liu et al
- RL2: Fast Reinforcement Learning via Slow Reinforcement Learning, Duan et al
- Learning to reinforcement learn, Wang et al