We study offline reinforcement learning (RL), which seeks to learn a good policy based on a fixed, pre-collected dataset. A fundamental challenge behind this task is the distributional shift due to th ...
A staff member demonstrates his interaction with a virtual human during the Global Digital Economy Conference 2024 (GDEC 2024) in Beijing, capital of China, July 2, 2024. (Xinhua/Ren Chao) BEIJING, Ja ...