1 September 2021 SiamPAT: Siamese point attention networks for robust visual tracking
Hang Chen, Weiguo Zhang, Danghui Yan
Author Affiliations +
Funded by: National Natural Science Foundation of China (NSFC)
Abstract

Attention mechanism originates from the study of human visual behavior, which has been widely used in various fields of artificial intelligence in recent years and has become an important part of neural network structure. Many attention mechanism-based trackers have gained improved performance in both accuracy and robustness. However, these trackers cannot suppress the influence of background information and distractors accurately and do not enhance the target object information, which limits the performance of these trackers. We propose new Siamese point attention (SPA) networks for robust visual tracking. SPA networks learn position attention and channel attention jointly on two branch information. To construct point attention, each point on the template feature is used to calculate the similarity on the search feature. The similarity calculation is based on the local information of the target object, which can reduce the influence of background, deformation, and rotation factors. We can obtain the region of interest by calculating the position attention from point attention. Position attention is integrated into the calculation of channel attention to reduce the influence of irrelevant areas. In addition, we also propose the object attention, and integrate it into the classification and regression module to further enhance the semantic information of the target object and improve the tracking accuracy. Extensive experiments are also conducted on five benchmark datasets. The experiment results show that our method achieves state-of-the-art performance.

© 2021 SPIE and IS&T 1017-9909/2021/$28.00© 2021 SPIE and IS&T
Hang Chen, Weiguo Zhang, and Danghui Yan "SiamPAT: Siamese point attention networks for robust visual tracking," Journal of Electronic Imaging 30(5), 053001 (1 September 2021). https://doi.org/10.1117/1.JEI.30.5.053001
Received: 8 February 2021; Accepted: 19 August 2021; Published: 1 September 2021
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical tracking

Convolution

Visualization

Video

Chemical species

Data modeling

Feature extraction

Back to Top