Muesli: Combining Improvements in Policy Optimization
Matteo Hessel * 1 Ivo Danihelka * 1 2 Fabio Viola 1 Arthur Guez 1 Simon Schmitt 1 Laurent Sifre 1
Theophane Weber 1 David Silver 1 2 Hado van Hasselt 1
Abstract
[Figure: atari57 median, comparing Muesli and MuZero[2021]]
We propose a novel policy update that combines ...









