Abstract: Cross-view geolocalization determines the location of a query image, captured by a drone or ground-based camera, by matching it to a georeferenced satellite image. While traditional ...
Abstract: We aim for an open-vocabulary sound event localization and detection (SELD) system that detects and localizes sound events in any category described by prompt texts. An open-vocabulary SELD ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results