Abstract: Image-text matching is a vital task in multi-modal intelligence. Recently, researchers have moved beyond simply aligning fragments between image regions and text words at a low level. They ...
Abstract: In this paper, we introduce OpenCIR, a fully functional Conditional Image Repainting (CIR) model designed for local image editing. Given an image and a combination of conditions related to ...