Randomized Exploration for Reinforcement Learning with General Value
Function Approximation
Haque Ishfaq * 1 2 Qiwen Cui * 3 Viet Nguyen 1 2 Alex Ayoub 4 Zhuoran Yang 5 Zhaoran Wang 6
Doina Precup 1 2 7 Lin F. Yang 8
Abstract when general function approximation is used to estimate
We propose a model-free reinforcement learn- the value function, i.e., the expectation of long-term return.
ing algorithm inspired by ...


雷达卡




京公网安备 11010802022788号







