Reward-rational (implicit) choice:
A unifying formalism for reward learning
Hong Jun Jeon1 , Smitha Milli2 , Anca Dragan2
hjjeon@stanford.edu, smilli@berkeley.edu, anca@berkeley.edu
Equal contribution,
1
Stanford University,
2
University of California, Berkeley
Abstract
It is often difficult to hand-specify what the co ...


雷达卡


京公网安备 11010802022788号







