Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information
QPIC
Repo for CVPR2021 paper “QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information”
by Masato Tamura, Hiroki Ohashi, and Tomoaki Yoshinaga.
QPIC is implemented by extending the recently proposed object detector, DETR. QPIC leverages the query-based detection and attention mechanism in the transformer, and as a result, achieves high HOI detection performance with simple detection heads.
Example attention maps.
Preparation
Dependencies
Our implementation uses external libraries such as NumPy and PyTorch. You can resolve the dependencies with the following command.
pip install numpy
pip install -r requirements.txt
Note that this command may dump errors during installing pycocotools, but the errors can be ignored.
Dataset
HICO-DET
HICO-DET dataset can be downloaded here. After finishing downloading, unpack