This paper introduces MCTrack, a new 3D multi-object tracking method that achieves state-of-the-art (SOTA) performance across the KITTI, nuScenes, and Waymo datasets. Addressing the gap in existing ...
This project aims to demonstrate how to configure visible and infrared datasets to accommodate multimodal object detection tasks based on YOLOv11. With three different configuration methods (directory ...
Abstract: Underwater image captioning bridges the gap between visual perception and semantic understanding of underwater scenes, playing a crucial role in applications such as ocean geoscience and ...
Abstract: As the cornerstone of computer vision (CV), image recognition is of great importance. It not only profoundly affects everyday applications such as facial recognition, intelligent security, and ...