Planning in entropy-regularized
Markov decision processes and games
Jean-Bastien Grill Omar D. Domingues
DeepMind Paris SequeL team, Inria Lille
jbgrill@google.com omar.darwiche-domingues@inria.fr
Pierre Ménard Rémi Munos Michal Valko
SequeL team, Inria Lille DeepMind Paris DeepMind Paris
pierre.menard@inria.fr munos@google.com valkom@deepmind.com
...


雷达卡


京公网安备 11010802022788号







