PEBBLE: Feedback-Efficient Interactive Reinforcement Learning
via Relabeling Experience and Unsupervised Pre-training
Kimin Lee * 1 Laura Smith * 1 Pieter Abbeel 1
Abstract Kober & Peters, 2011; Kober et al., 2013; Silver et al.,
2017; Andrychowicz et al., 2020; Kalashnikov et al., 2018;
Conveying complex objectives to reinforcement
Vinyals et al., 2019). Scaling ...


雷达卡




京公网安备 11010802022788号







