Abstract: Recent transformer-based methods achieve notable gains in the Human-object Interaction Detection (HOID) task by leveraging the detection of DETR and the prior knowledge of Vision-Language ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results
Feedback