Evolving Attention with Residual Convolutions
Yujing Wang 1 * Yaming Yang 2 * Jiangang Bai 1 2 Mingliang Zhang 1 2
Jing Bai 2 Jing Yu 3 Ce Zhang 4 Gao Huang 5 Yunhai Tong 1
Abstract 80 79.63
79.29
Transformer is a ubiquitous model for natural lan- 79.1
...


雷达卡




京公网安备 11010802022788号







