Decoupling Value and Policy for Generalization in Reinforcement Learning
Roberta Raileanu 1 Rob Fergus 1
Abstract ization (Farebrother et al., 2018; Zhang et al., 2018a; Cobbe
et al., 2018; Igl et al., 2019), data augmentation (Cobbe
Standard deep reinforcement learning algorithms et al., 2018; Lee et al., 2020; Ye et al., 2020; Kostrikov et al.,
use a shared representation for the policy and ...


雷达卡




京公网安备 11010802022788号







