Telling the What while Pointing to the Where:
Multimodal Queries for Image Retrieval
Soravit Changpinyo Jordi Pont-Tuset Vittorio Ferrari Radu Soricut
Google Research
{schangpi,jponttuset,vittoferrari,rsoricut}@google.com
Abstract A horse in a city,
a occluding a bike
and a car.
Most ...


雷达卡




京公网安备 11010802022788号







