Bootstrap Modal Tutorial

MCPL: Multi-Modal Collaborative Prompt Learning for Medical Vision-Language Model

Abstract: Multi-modal prompt learning is a high-performance and cost-effective learning paradigm, which learns text as well as image prompts to tune pre-trained vision-language (V-L) models like CLIP ...

IEEE

Cross-Modal Semantic Relations Enhancement With Graph Attention Network for Image-Text Matching

Abstract: Image-text matching is a vital task in multi-modal intelligence. Recently, researchers have moved beyond simply aligning fragments between image regions and text words at a low level. They ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

MCPL: Multi-Modal Collaborative Prompt Learning for Medical Vision-Language Model

Cross-Modal Semantic Relations Enhancement With Graph Attention Network for Image-Text Matching

Trending now