Abstract: In this paper, we study the composed query image retrieval, which aims at retrieving the target image similar to the composed query, i.e., a reference image and the desired modification text ...
Abstract: Document understanding is a critical task in extracting structured information from documents such as forms, receipts, and reports. Visually Rich Documents (VRDs) present unique challenges ...