Multi-modality Associative Bridging through Memory:
Speech Sound Recollected from Face Video
Minsu Kim* Joanna Hong* Se Jin Park Yong Man Ro
Image and Video Systems Lab, KAIST, South Korea
{ms.k, joanna2587, jinny960812, ymro}@kaist.ac.kr
Abstract Visual
modality Downstream
(a) Fusi ...


雷达卡




京公网安备 11010802022788号







