This is an audio transcript of the Rachman Review podcast episode: ‘South Korea’s real-life political drama’ Gideon Rachman Hello and welcome to the Rachman Review. I’m Gideon Rachman, chief foreign a ...
We study offline reinforcement learning (RL), which seeks to learn a good policy based on a fixed, pre-collected dataset. A fundamental challenge behind this task is the distributional shift due to th ...