Batch size is a key hyperparameter in training deep learning models. Conventional wisdom suggests that larger batches produce improved model performance. Here we present evidence to the contrary, particularly when using autoencoders to derive meaningful latent spaces from data characterized by spatially global similarities and local differences, such as electronic health records (EHR) and medical imaging. We investigate batch size effects in both EHR data from the Baltimore Longitudinal Study of Aging and medical imaging data from the multimodal brain tumor segmentation (BraTS) challenge. We train fully connected and convolutional autoencoders to compress the EHR and imaging input spaces, respectively, into 32-dimensional latent spaces via reconstruction losses, at batch sizes ranging from 1 to 100. Under otherwise identical hyperparameter configurations, smaller batches yield lower reconstruction loss on both datasets. Additionally, latent spaces derived by autoencoders trained with smaller batches capture more biologically meaningful information. Qualitatively, we visualize 2-dimensional projections of the latent spaces and find that with smaller batches the EHR network better separates the sex of the individuals, and the imaging network better captures the right-left laterality of tumors. Quantitatively, the corresponding sex classification and laterality regression tasks using the latent spaces demonstrate statistically significant performance improvements at smaller batch sizes. Finally, visualizations of representative data reconstructions show improved local individual variation at lower batch sizes. Taken together, these results suggest that smaller batch sizes should be considered when designing autoencoders to extract meaningful latent spaces from EHR and medical imaging data driven by global similarities and local variation.
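To make the experimental setup concrete, the following is a minimal PyTorch sketch, not the authors' implementation, of the fully connected case: an autoencoder compressing inputs to a 32-dimensional latent space under a reconstruction (MSE) loss, trained at several batch sizes in the 1 to 100 range studied. The input dimensionality, layer widths, learning rate, epoch count, and synthetic data are illustrative assumptions.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Hypothetical input dimensionality for tabular EHR-style data;
# the study's exact features and architecture are not specified here.
INPUT_DIM = 256
LATENT_DIM = 32  # latent dimensionality used in the study


class FullyConnectedAutoencoder(nn.Module):
    def __init__(self, input_dim: int, latent_dim: int):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(input_dim, 128), nn.ReLU(),
            nn.Linear(128, latent_dim),
        )
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 128), nn.ReLU(),
            nn.Linear(128, input_dim),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))


def train_autoencoder(data: torch.Tensor, batch_size: int, epochs: int = 20) -> float:
    """Train on reconstruction (MSE) loss at a given batch size; return final loss."""
    loader = DataLoader(TensorDataset(data), batch_size=batch_size, shuffle=True)
    model = FullyConnectedAutoencoder(INPUT_DIM, LATENT_DIM)
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    criterion = nn.MSELoss()
    for _ in range(epochs):
        for (batch,) in loader:
            optimizer.zero_grad()
            loss = criterion(model(batch), batch)
            loss.backward()
            optimizer.step()
    with torch.no_grad():
        return criterion(model(data), data).item()


if __name__ == "__main__":
    torch.manual_seed(0)
    fake_data = torch.randn(1000, INPUT_DIM)  # placeholder for real EHR features
    # Sweep batch sizes spanning the range examined in the paper.
    for bs in (1, 10, 100):
        print(f"batch size {bs:>3}: final reconstruction loss "
              f"{train_autoencoder(fake_data, bs):.4f}")
```

The convolutional (imaging) case follows the same pattern with `nn.Conv2d`/`nn.ConvTranspose2d` layers in place of the linear ones; only the batch size is varied across runs while all other hyperparameters are held fixed.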