Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information

QPIC

Repository for the CVPR 2021 paper “QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information”

by Masato Tamura, Hiroki Ohashi, and Tomoaki Yoshinaga.

QPIC extends DETR, a recently proposed object detector. It leverages the transformer's query-based detection and attention mechanisms and, as a result, achieves high human-object interaction (HOI) detection performance with simple detection heads.
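To make the idea of query-based pairwise detection more concrete, here is a minimal, dependency-free sketch. It assumes that each query yields one human-object pair (a human box, an object box, object-class probabilities, and per-verb probabilities) and that an interaction is scored as the verb probability multiplied by the probability of the most likely object class. The names `QueryOutput` and `to_triplets`, the threshold, and the scoring rule are illustrative assumptions, not the repository's API.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


@dataclass
class QueryOutput:
    """Hypothetical per-query prediction: one human-object pair."""
    human_box: Box
    object_box: Box
    object_probs: Dict[str, float]  # object class -> probability
    verb_probs: Dict[str, float]    # verb class -> probability (multi-label)


def to_triplets(outputs: List[QueryOutput], threshold: float = 0.5):
    """Turn per-query outputs into scored (verb, object) interactions.

    Assumption: a triplet's score is the verb probability multiplied by
    the probability of the most likely object class.
    """
    triplets = []
    for out in outputs:
        # Pick the most likely object class for this query.
        obj, obj_p = max(out.object_probs.items(), key=lambda kv: kv[1])
        # Verbs are multi-label, so every verb is scored independently.
        for verb, verb_p in out.verb_probs.items():
            score = verb_p * obj_p
            if score >= threshold:
                triplets.append((verb, obj, score, out.human_box, out.object_box))
    # Highest-scoring interactions first.
    return sorted(triplets, key=lambda t: t[2], reverse=True)
```

In this sketch a single query can produce several interactions for the same pair (for example, both "ride" and "hold" for one person and one bicycle), because every verb is thresholded on its own.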

Example attention maps.

Preparation

Dependencies

Our implementation uses external libraries such as NumPy and PyTorch. You can resolve the dependencies with the following command.

pip install numpy
pip install -r requirements.txt

Note that these commands may print errors while installing pycocotools, but the errors can be ignored.
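Because installation errors may be ignored, it can be useful to confirm afterwards which packages are actually importable. The following is a small sketch using only the Python standard library. The helper name `missing_modules` is hypothetical, and the module list is an assumption based on the libraries named above, so adjust it to match the entries in requirements.txt.

```python
import importlib.util
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed import names; adjust to match the entries in requirements.txt.
    required = ["numpy", "torch", "pycocotools"]
    missing = missing_modules(required)
    print("Missing modules:", ", ".join(missing) if missing else "none")
```

The check only looks up whether each module can be located and does not import it, so it runs quickly even for large packages.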

Dataset

HICO-DET

The HICO-DET dataset can be downloaded here. After the download finishes, unpack

 

 

 
