Towards Optimal Off-Policy Evaluation for
Reinforcement Learning with Marginalized
Importance Sampling
Tengyang Xie Yifei Ma Yu-Xiang Wang
Dept. of Computer Science AWS AI Labs Dept. of Computer Science,
UIUC Amazon.com Services, Inc. UC Santa Barbara
Urbana, IL 61801 East Palo Alto, CA 94303 Santa Barbara, CA 93106
tx10@illinois.edu yifeim@amazon.com yuxiangw@cs.ucsb.edu
...


雷达卡



京公网安备 11010802022788号







