Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Li Yuan1 *, Yunpeng Chen2 , Tao Wang1,3 , Weihao Yu1 , Yujun Shi1 ,
Zihang Jiang1 , Francis E.H. Tay1 , Jiashi Feng1 , Shuicheng Yan1
1 2 3
National University of Singapore YITU Technology Institute of Data Science, National University of Singapore
yuanli@u.nus.edu, yunpeng.chen@yitu-inc.com, shuicheng.yan@gmail.com
Abstract
...


雷达卡




京公网安备 11010802022788号







