Why are Adaptive Methods Good
for Attention Models?
Jingzhao Zhang Sai Praneeth Karimireddy Andreas Veit
MIT EPFL Google Research
jzhzhang@mit.edu sai.karimireddy@epfl.ch aveit@google.com
Seungyeon Kim Sashank Reddi Sanjiv Kumar
Google Research Google Research Google Research
seungyeonk@google.com sashank@google.com sanjivk@google.com
...


雷达卡


京公网安备 11010802022788号







