Learning to summarize from human feedback
Nisan Stiennon Long Ouyang Jeff Wu Daniel M. Ziegler Ryan Lowe
Chelsea Voss Alec Radford Dario Amodei Paul Christiano
arXiv:2009.01325v3 [cs.CL] 15 Feb 2022
OpenAI
Abstract
As language models become more powerful, training and evaluation are increas-
...


雷达卡


京公网安备 11010802022788号







