We investigated the feasibility of classifying pathologic invasive nodules and pre-invasive or benign nodules by quantitative analysis of the CT attenuation distribution patterns and other radiomic features of lung nodule components. We developed a new 3D adaptive multi-component Expectation-Maximization (EM) analysis method to segment the solid and non-solid nodule components and the surrounding lung parenchymal region. Features were extracted to characterize the size, shape, and the CT attenuation distribution of the entire nodule as well as the individual regions. With permission of the National Lung Screening Trial (NLST) project, a data set containing the baseline low dose CT scans of 53 cases with known pathologic tumor type categorization was obtained. The 53 cases contain 45 invasive nodules (group 1) and 42 pre-invasive nodules (group 2). A logistic regression model (LRM) was built using leave-one-case-out resampling and receiver operating characteristic (ROC) analysis for classification of group 1 and group 2, using the pathologic categorization as ground truth. With 4 selected features, the LRM achieved a test area under the curve (AUC) value of 0.877±0.036. The results demonstrated that the pathologic invasiveness of lung adenocarcinomas could be categorized according to the CT attenuation distribution patterns of the nodule components manifested on LDCT images.
|