Scalable Vision Transformers with Hierarchical Pooling
Zizheng Pan Bohan Zhuang Jing Liu Haoyu He Jianfei Cai
Dept of Data Science and AI, Monash University
Abstract
DeiT-B
The recently proposed Visual image Transformers (ViT) DeiT-S
with pure attention have achieved promising performance 80
...


雷达卡




京公网安备 11010802022788号







