Deep Coherent Exploration for Continuous Control
Yijie Zhang 1 Herke van Hoof 2
Abstract strategies and undirected strategies (Thrun, 1992; Plappert
et al., 2018). While directed strategies aim to extract use-
In policy search methods for reinforcement learn-
ful information from existing experiences for better explo-
ing (RL), exploration is often performed by in-
...


雷达卡




京公网安备 11010802022788号







