On the Correctness and Sample Complexity of
Inverse Reinforcement Learning
Abi Komanduru Jean Honorio
Purdue University Purdue University
West Lafayette IN 47906 West Lafayette IN 47906
akomandu@purdue.edu jhonorio@purdue.edu
Abstract
Inverse reinforcement learning (IRL) is the problem of finding a reward function
that generates a given optimal policy for a given M ...


雷达卡


京公网安备 11010802022788号







