Pythia: A Suite for Analyzing Large Language Models
Across Training and Scaling
Stella Biderman * 1 2 Hailey Schoelkopf * 1 3 Quentin Anthony 1 Herbie Bradley 1 4 Kyle O’Brien 1
Eric Hallahan 1 Mohammad Aflah Khan 5 Shivanshu Purohit 6 1 USVSN Sai Prashanth 1 Edward Raff 2
Aviya Skowron 1 Lintang Sutawika 1 7 Oskar van der Wal 8
arXiv:2304.01373v2 [cs.CL] 31 May 2023
...


雷达卡


京公网安备 11010802022788号







