Off-Policy Evaluation via Off-Policy Classification
Alex Irpan¹, Kanishka Rao¹, Konstantinos Bousmalis²,
Chris Harris¹, Julian Ibarz¹, Sergey Levine¹,³
¹ Google Brain, Mountain View, USA
² DeepMind, London, UK
³ University of California Berkeley, Berkeley, USA
{alexirpan,kanishkarao,konstantinos,ckharris,julianibarz,slevine}@google.com
Abstrac ...









