Variational Policy Gradient Method for
Reinforcement Learning with General Utilities
Junyu Zhang
Department of Electrical Engineering
Center for Statistics and Machine Learning
Princeton University, Princeton, NJ 08544
junyuz@princeton.edu

Alec Koppel
CISD
US Army Research Laboratory
Adelphi, MD 20783
alec.e.koppel.civ@mail.mil

Amrit Singh Bedi Csaba Szepesvári
CISD ...









