Abstract: In wireless communication, localization plays a key role in different applications such as asset tracking, navigation and emergency response. This paper explores a deep learning framework ...
Previous research has investigated the application of Multimodal Large Language Models (MLLMs) in understanding 3D scenes by interpreting them as videos. These approaches generally depend on ...
Abstract: In real-world physiological and psychological scenarios, there often exists a robust complementary correlation between audio and visual signals. Audio-Visual Event Localization (AVEL) aims ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results