Abstract: To establish semantic associations between images and texts, existing Image-Text Retrieval (ITR) methods primarily focus on fixed-scale fragments, which only identify explicit semantic ...
Leigang Qu, Feng Cheng, Ziyan Yang, Qi Zhao, Shanchuan Lin, Yichun Shi, Yicong Li, Wenjie Wang, Tat-Seng Chua, Lu Jiang In-context image editing aims to modify images based on a contextual sequence ...
Abstract: Text-to-image person retrieval aims to match target pedestrian images based on a text query. Existing methods mainly learn feature alignment between texts and pedestrian images from global ...