Decoupling Object Detection from Human-Object Interaction Recognition

Ying Jin; Yinpeng Chen; Lijuan Wang; Jianfeng Wang; Pei Yu; Lin Liang; Jenq-Neng Hwang; Zicheng Liu

Journal article

12/12/2021

Abstract

Computer Science - Computer Vision and Pattern Recognition

We propose DEFR, a DEtection-FRee method to recognize Human-Object Interactions (HOI) at image level without using object location or human pose. This is challenging because the detector is an integral part of existing methods. In this paper, we present two findings that boost the performance of the detection-free approach, which significantly outperforms the detection-assisted state of the art. First, we find it crucial to effectively leverage the semantic correlations among HOI classes. A remarkable gain can be achieved by using language embeddings of HOI labels to initialize the linear classifier, which encodes the structure of HOIs to guide training. Second, we propose the Log-Sum-Exp Sign (LSE-Sign) loss to facilitate multi-label learning on a long-tailed dataset by balancing gradients over all classes in a softmax format. Our detection-free approach achieves 65.6 mAP in HOI classification on HICO, outperforming the detection-assisted state of the art (SOTA) by 18.5 mAP, and 52.7 mAP on one-shot classes, surpassing the SOTA by 27.3 mAP. Unlike previous work, our classification model (DEFR) can be used directly in HOI detection without any additional training, by connecting it to an off-the-shelf object detector whose bounding box output is converted to binary masks for DEFR. Surprisingly, such a simple connection of two decoupled models achieves SOTA performance (32.35 mAP).
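The abstract names the LSE-Sign loss but does not give its formula. The sketch below assumes one plausible form suggested by the name, log(1 + sum_i exp(-y_i * s_i)) with y_i = +1 for classes present in the image and -1 otherwise. The function names `lse_sign_loss` and `lse_sign_grad` are hypothetical, and the paper's exact definition (for example, any score scaling) may differ.

```python
import math

def lse_sign_loss(scores, labels):
    """Assumed form: log(1 + sum_i exp(-y_i * s_i)); labels are 1 (present) or 0 (absent)."""
    # Signed terms z_i = -y_i * s_i: a large z_i means class i is scored badly.
    z = [-(1.0 if y else -1.0) * s for s, y in zip(scores, labels)]
    # Numerically stable log-sum-exp that includes the constant term exp(0) = 1.
    m = max(0.0, max(z))
    total = math.exp(-m) + sum(math.exp(v - m) for v in z)
    return m + math.log(total)

def lse_sign_grad(scores, labels):
    """Gradient of the assumed loss with respect to each class score."""
    signs = [1.0 if y else -1.0 for y in labels]
    z = [-y * s for s, y in zip(scores, signs)]
    m = max(0.0, max(z))
    denom = math.exp(-m) + sum(math.exp(v - m) for v in z)
    # Each magnitude is exp(z_i) / (1 + sum_j exp(z_j)): a softmax-style weight
    # shared across all classes, so the magnitudes sum to less than 1.
    return [-y * math.exp(v - m) / denom for y, v in zip(signs, z)]
```

Under this assumed form, the gradient magnitudes are normalized jointly over all classes, which is one way to read "balancing gradients over all classes in a softmax format": no single class can dominate the update regardless of how many labels an image carries.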

Details

Title: Decoupling Object Detection from Human-Object Interaction Recognition
Creators: Ying Jin; Yinpeng Chen; Lijuan Wang; Jianfeng Wang; Pei Yu; Lin Liang; Jenq-Neng Hwang; Zicheng Liu
Academic Unit: Computer Science Department
Publication Details: 12/12/2021
Identifiers: 99257914763901671
Language: English
