Paper
6 August 2021 Learning human-object interactions by attention aggregation
Author Affiliations +
Proceedings Volume 11913, Sixth International Workshop on Pattern Recognition; 119130H (2021) https://doi.org/10.1117/12.2604708
Event: Sixth International Workshop on Pattern Recognition, 2021, Chengdu, China
Abstract
Recent years, deep neural networks have achieved impressive progress in object detection. However, detecting the interactions between objects is still challenging. Many researchers pay attention to human-object interaction (HOI) detection as a basic task in detailed scene understanding. Most conventional HOI detectors are in a two-stage manner and usually slow in inference. One-stage methods for direct parallel detection of HOI triples breaks through the limitation of object detection, but the extracted features are still insufficient. To overcome these drawbacks above, we propose an improved one-stage HOI detection approach, in which attention aggregation module and dynamic point matching strategy play key roles. The attention aggregation enhances the semantic expression ability of interaction points explicitly by aggregating contextually important information, while the matching strategy can filter the negative HOI pairs effectively in the inference stage. Extensive experiments on two challenging HOI detection benchmarks: VCOCO and HICO-DET show that our method achieves considerable performance compared to state-of-the-art performance without any additional human pose and language features.
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Dongzhou Gu, Shuang Cai, and Shiwei Ma "Learning human-object interactions by attention aggregation", Proc. SPIE 11913, Sixth International Workshop on Pattern Recognition, 119130H (6 August 2021); https://doi.org/10.1117/12.2604708
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Convolution

Feature extraction

Network architectures

Detection and tracking algorithms

Mechatronics

Neural networks

RELATED CONTENT


Back to Top