TRAR: Routing the Attention Spans in Transformer
for Visual Question Answering
Yiyi Zhou12 , Tianhe Ren12 , Chaoyang Zhu12 , Xiaoshuai Sun12 *, Jianzhuang Liu3 ,
Xinghao Ding2 , Mingliang Xu4 , Rongrong Ji12
1
Media Analytics and Computing Lab, School of Informatics, Xiamen University, China
2
School of Informatics, Xiamen University, China
3
Noah’s Ark Lab, Huawei Technologies 4 Zhengzhou Univers ...


雷达卡




京公网安备 11010802022788号







