Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning

Published: Apr 30, 2020
Abstract
Off-policy reinforcement learning algorithms promise to be applicable in settings where only a fixed data-set (batch) of environment interactions is available and no new experience can be acquired. This property makes these algorithms appealing for real world problems such as robot control. In practice, however, standard off-policy algorithms fail in the batch setting for continuous control. In this paper, we propose a simple solution to this...
Paper Details
Title
Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning
Published Date
Apr 30, 2020
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.