Using a Logarithmic Mapping to Enable Lower
Discount Factors in Reinforcement Learning
Harm van Seijen Mehdi Fatemi
Microsoft Research Montréal Microsoft Research Montréal
harm.vanseijen@microsoft.com mehdi.fatemi@microsoft.com
Arash Tavakoli
Imperial College London
a.tavakoli@imperial.ac.uk
Abstract
In an effort to better understand the different w ...


雷达卡



京公网安备 11010802022788号







