Towards Minimax Optimal Reinforcement Learning
in Factored Markov Decision Processes
Yi Tian*, Jian Qian, Suvrit Sra
Department of EECS, MIT, Cambridge, MA 02139
yitian@mit.edu, jianqian@mit.edu, suvrit@mit.edu
Abstract
We study minimax optimal reinforcement learni ...









