Poster + Paper
3 April 2023
A study of attention information from transformer layers in hybrid medical image segmentation networks
Conference Poster
Abstract
Transformer models have recently gained popularity in computer vision tasks. In medical image segmentation, models such as TransUNet incorporate transformer blocks alongside convolutional blocks while remaining faithful to the popular U-Net architecture. The present work uses attention maps to examine information flow within the transformer blocks of three such segmentation models: (i) TransUNet, (ii) 2D CATS, and (iii) 2D UNETR. Based on the attention maps, compressed versions of these models are proposed that retain only as many transformer layers as are necessary for the model to achieve a global receptive field. The compressed models save more than 60% of the parameters, while the Dice metric drops by no more than 5% compared to the original (uncompressed) model.
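The idea of keeping only as many transformer layers as are needed for a global receptive field can be illustrated with a small sketch. This is not the authors' procedure: it is a simplified reachability check on toy, head-averaged attention maps, and the function names, the `threshold` parameter, and the banded toy attention pattern are all assumptions made for illustration.

```python
def layers_for_global_receptive_field(attn_per_layer, threshold=0.0):
    """Return the 1-based depth at which every token has received
    information from every other token, or None if that never happens.

    attn_per_layer: list of N x N attention matrices (lists of lists),
    one per transformer layer, assumed to be averaged over heads.
    """
    n = len(attn_per_layer[0])
    # reach[i][j] is True when token i has received information from token j.
    reach = [[i == j for j in range(n)] for i in range(n)]
    for depth, attn in enumerate(attn_per_layer, start=1):
        # Token i attends to token k when the weight exceeds the threshold;
        # the residual connection always lets a token keep its own content.
        attends = [[attn[i][k] > threshold or i == k for k in range(n)]
                   for i in range(n)]
        # Compose this layer's attention with what was reachable so far.
        reach = [
            [any(attends[i][k] and reach[k][j] for k in range(n))
             for j in range(n)]
            for i in range(n)
        ]
        if all(all(row) for row in reach):
            return depth
    return None


def local_attention(n, radius):
    """Hypothetical toy attention map: uniform weights within `radius` tokens."""
    rows = []
    for i in range(n):
        window = [j for j in range(n) if abs(i - j) <= radius]
        rows.append([1.0 / len(window) if j in window else 0.0
                     for j in range(n)])
    return rows


# Twelve toy layers over 8 tokens, each attending only 2 tokens to either side.
toy_layers = [local_attention(8, 2) for _ in range(12)]
needed = layers_for_global_receptive_field(toy_layers)
print(needed)  # 4: the reachable span grows by 2 tokens per layer
```

In this toy setting, layers beyond the fourth add no new token-to-token reachability, which is the kind of observation that would motivate dropping them. Real attention maps are dense and soft, so a practical analysis would need a meaningful threshold or a weighted measure rather than this binary check.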
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Syed Nouman Hasany, Caroline Petitjean, and Fabrice Meriaudeau "A study of attention information from transformer layers in hybrid medical image segmentation networks", Proc. SPIE 12464, Medical Imaging 2023: Image Processing, 124641O (3 April 2023); https://doi.org/10.1117/12.2652215
KEYWORDS
Transformers
Image segmentation
Data modeling
Education and training
Visual process modeling
Image processing
Performance modeling