Open Access
17 October 2017 Feature analysis of cell nuclear chromatin distribution in support of cervical cytology
Author Affiliations +
Abstract
Cytology, a method of estimating cancer or cellular atypia from microscopic images of scraped specimens, is used according to the pathologist’s experience to diagnose cases based on the degree of structural changes and atypia. Several methods of cell feature quantification, including nuclear size, nuclear shape, cytoplasm size, and chromatin texture, have been studied. We focus on chromatin distribution in the cell nucleus and propose new feature values that indicate the chromatin complexity, spreading, and bias, including convex hull ratio on multiple binary images, intensity distribution from the gravity center, and tangential component intensity and texture biases. The characteristics and cellular classification accuracies of the proposed features were verified through experiments using cervical smear samples, for which clear nuclear morphologic diagnostic criteria are available. In this experiment, we also used a stepwise support vector machine to create a machine learning model and a cross-validation algorithm with which to derive identification accuracy. Our results demonstrate the effectiveness of our proposed feature values.

1.

Introduction

Despite recent improvements in our understanding of molecular changes in cancer cells, it remains difficult to diagnose cancer using biologic methods. Some biologic methods, such as fluorescent in situ hybridization (FISH) for detecting chromosomal translocation and polymerase chain reaction (PCR) for detecting cell clonality, are sometimes used to assist the cancer diagnosis; however, cell clonality and chromosomal translocation are not limited features of the cancer cells.14 Cancer is always diagnosed by pathologists via light microscopic evaluations of histological or cytological samples. These diagnoses are based on the degrees of structural and cellular atypia.5 Among the many morphological changes occurring in cancer cells, nuclear atypia is one of the most important. Nuclear atypia refers to an abnormal cell nuclear appearance and includes changes in the nuclear size and shape, numbers and sizes of nucleoli, and chromatin texture. However, pathological evaluations of nuclear atypia may display a lack of consistency owing to variability depending on the cytologists.6 In fact, cytological and histological diagnostic reproducibility and accuracy are problematic for some cell types (e.g., erythroblasts).7 Therefore, quantitative feature analysis of nuclear atypia can enhance the cytologist’s assessment accuracy.

Conversely, to prevent overlook, cell diagnostic support systems that continuously process the extraction of cell regions (segmentation), feature extraction, and cell-type prediction (classification by a machine learning method) have also been studied.812 Because improvement in the accuracy of the system is required, addition of new features is effective. With respect to the feature extraction aspect of the system, several methods have been proposed, including quantifications of nuclear size, shape, and brightness;13 Haralick14 or run-length15 analysis of chromatin texture;10,1618 cell nuclear contour complexity (CC);19 and radial distribution (RD) value.20 Although the CC value quantified the complexity of chromatin distribution in large areas, it did not consider chromatin distribution in small areas. In addition, the RD value focused on the deviation of the chromatin distribution only in the radial direction.

In this study, we aim to propose useful new features to facilitate judgments by cytologists and increase the accuracy of cell diagnostic support systems. Specifically, we propose three kinds of new feature values quantified by a complexity value considering chromatin distribution with small areas, a spreading value for chromatin distribution, and a tangential bias (TB) value for chromatin distribution. The proposed feature values include convex hull ratios on multiple binary images, intensity distribution from the gravity center, and tangential component intensity and texture bias. The characteristics of these proposed feature values are verified through experiments using cervical smear samples. In particular, the nuclear morphology-based diagnostic criteria for cervical cytology are clear, and interobserver differences in assessments are small.7 In these experiments, we compare our proposed feature values with the annotations classified by pathologists according to the Bethesda system.21

Thereafter, we examine the effectiveness of our proposed feature values using an analysis of variance (ANOVA) and a cross validation (CV)22 of models generated via a machine learning method. For machine learning, we use a support vector machine (SVM)23,24 that is extended to a multiclass classification using a one-versus-one method25 and perform variable selection using a stepwise (floating) method.26

2.

Feature Extraction of Cervical Cytology Image

2.1.

Extraction of Cell Nuclear Area

We used cervical smear samples collected at the Department of Gynecologic Oncology, Saitama Medical University International Medical Center. These samples were applied to slides, fixed with 95% alcohol, and subjected to Papanicolaou staining. Squamous cells in these samples were observed and imaged at 1000× (10× eyepiece and 100× objective lenses) magnification with an optical microscope (AXIO imager A1; Carl Zeiss Ltd., Oberkochen, Germany) attached to a cooled charge-coupled device camera (256 shades of gray) and three transmission filters of red, green, and blue. In each shooting, exposure time and white balance were fixed. In this study, we targeted squamous cells only.

Cells were estimated from these images by a pathologist and two cytotechnologists according to the Bethesda system.21 The classifications are as follows: negative for intraepithelial lesion or malignancy (NILM), atypical squamous cells of undetermined significance (ASC-US), low-grade squamous intraepithelial lesion (LSIL), high-grade squamous intraepithelial lesion (HSIL), atypical squamous cells but cannot exclude HSIL (ASC-H), and squamous cell carcinoma (SCC). ASC-US and ASC-H, respectively, represent intermediate classifications between NILM and LSIL and between LSIL and HSIL. In this paper, we avoided these intermediate classifications and used only typical cells (NILM, LSIL, HSIL, and SCC). In addition, we divided NILM cases into three types: normal cell (NOR), metaplastic cell (MET), and regenerative cell (REG). MET and REG include reactive nuclear atypia, which is occasionally difficult to discriminate from neoplastic nuclear atypia [LSIL, HSIL, carcinoma in situ (CIS), and SCC]. Therefore, it is important to classify these cell types using image-based characterization system. Although several detail studies included the image-based cell classification system, few were investigated using the detailed NILM classification.16,20,27 We also divide SCC cases as CIS and SCC because cytologists usually distinguish these two categories. We, therefore, evaluated seven types of cells: NOR, MET, REG, LSIL, HSIL, CIS, and SCC. Figure 1(a) shows representative images including CIS cells.

Fig. 1

Examples of cervical cytological images. (a) Original image and (b)–(f) mask extraction image of each cell nuclei: NOR, MET, REG, LSIL, HSIL, CIS, and SCC, respectively.

JMI_4_4_047501_f001.png

Subsequently, we manually extracted cell nuclear regions from images to generate masking images and transformed these from RGB color to gray-scale according to the Y value in the YCbCr color system. Figures 1(b)1(h), respectively, show examples of the gray-scale masking images of NOR, MET, REG, LSIL, HSIL, CIS, and SCC. Although automatic methods for extracting cell and cell nuclear regions have been proposed,8 these are not completely accurate. We, therefore, manually extracted the studied cell nuclear regions.

2.2.

Conventional Feature Values

Previous studies of the feature quantification of cervical cytology images have used feature values related to the cell nuclear size and shape.8,13,20,28 We also use eight feature values (nuclear area f1, nuclear perimeter f2, nuclear longest diameter f3, nuclear shortest diameter f4, nuclear convex hull area f5, nuclear convex hull perimeter f6, nuclear circularity f7, and nuclear extension f8). In this paper, convex hull refers to a convex line (i.e., shape of a rubber band) surrounding the nuclear outline. f7 and f8 are represented by the following equations:

Eq. (1)

f7=4πf1f22,

Eq. (2)

f8=f3f4.

Murata et al.16 used other nuclear shape values to evaluate images of thyroid tumor cytology specimens. These values can be expressed using the following equations:

Eq. (3)

f9=πf324f1,

Eq. (4)

f10=f2f6,
where f9 represents the roundness (with numerical value decreasing with rounded) and f10 represents the convex hull ratio of the outer shape of the nucleus. These values are calculated for the cell nuclear regions extracted manually as in Sec. 2.1.

Feature values have also been proposed for chromatin distribution in the cell nucleus, including the average value, the number of maximum value, and the number of minimum value of the image pixel intensities in nuclear regions.8,13,28 In addition, the RD value20 represents the difference in average intensity between the center and periphery of the cell nucleus and has been suggested. Regarding the gray-scale masking images described in Sec. 2.1, we also use these four chromatin distribution feature values, represented as f11, f12, f13, and f14, respectively. Murata et al. also used skewness, kurtosis, the coefficient of variation, and the upper 20 percentile ratio of the intensity histogram. The coefficient of variation is defined as the ratio between the average value and standard deviation. We also use these values and denote as f15, f16, f17, and f18, respectively.

Murata et al.10,1618 used 15 texture feature values, in which 10 are Haralick feature values14 calculated using cooccurrence matrices and 5 are run-length feature values15 calculated using run-length matrices. Both matrices are calculated from gray-scale intensities within cell nuclei. The 10 Haralick feature values are contrast (f19, contrast of intensity), energy (f20, uniformity of intensity and texture), correlation (f21, correlation of intensity and texture), variance (f22, variance of intensity), entropy (f23, diversity of intensity and texture), sum variance (f24, contrast of intensity and texture), sum entropy (f25, diversity of intensity), difference variance (f26, variance of texture), difference entropy (f27, diversity of texture), and inverse difference moment (f28, homogeneity). The five run-length feature values are gray level nonuniformity (f29, ununiformity of intensity), run percentage (f30, ununiformity of intensity and texture), short run emphasis (f31, magnitude of high frequency), long run emphasis (f32, magnitude of low frequency), and run-length nonuniformity (f33, nonuniformity of texture).

The cooccurrence matrix represents the appearance frequency P [=P(i,j)] of pixel intensities on a gray-scale image, where i is the intensity of a pixel of interest A and j is the intensity of a pixel B near A. We used gray-scale images of 256 gradations, and the matrix size became 256×256. Multiple cooccurrence matrices can be generated using the differences in distance values (r) and argument values (θ) between A and B. We used four types each of r (r=1, 2, 4, and 8 pixels) and four types each of θ (θ=0  deg, 45 deg, 90 deg, and 135 deg), generated 16 cooccurrence matrices, and used the averages of Haralick feature values calculated by their 16 matrix as f19 to f28.

The run-length matrix represents the appearance frequency R [=R(i,l)] of run l in the pixel of interest i. Run indicates the number of consecutive identical intensity values in the scanning direction θ. The intensity gradient is frequently subjected to quantization before creating a run-length matrix. We, therefore, used four types each of θ (θ=0  deg, 45 deg, 90 deg, and 135 deg) and four type each of quantization values (gradations 256, 16, 4, and 2), generated 16 run-length matrix, and used the averages of 5 run-length features calculated by their 16 run-length matrix as f29 to f33.

Furthermore, Kiyuna et al.19 previously quantified the complexity of chromatin distribution on a nuclear image from mammary gland cells as a CC value and a fractal feature. These features are represented as f34 and f35 in the following equations:

Eq. (5)

f34=i{L(i)f21[L(i)>f2]},

Eq. (6)

f35=max{D(i)}.
These values were calculated using a binarization of the cytology image, where i represents the intensity threshold for binarizing, and L(i) and D(i) represent the contour perimeter and a fractal dimension of the image binarized by threshold i, respectively. The values f34 and f35 increase as the chromatin distribution complexity increases. We used the box counting method to calculate the fractal dimension.

Another four feature values had been proposed by Haralick: sum average (f36, average of intensity), information measures of correlation 1 (f37, uniformity of texture), information measures of correlation 2 (f38, diversity of texture), and maximal correlation coefficient (f39, uniformity of intensity and texture). These features can be expressed using the following equations:

Eq. (7)

f36=k=0510{ki=0255j=0255P(i,j)(k=i+j)},

Eq. (8)

f37=MImax{H(PJ),H(PJ)},

Eq. (9)

f38=1exp{2[H^(PJJP)H^(P)]},

Eq. (10)

f39=S(Q)where  Q(i,j)=kP(i,k)P(j,k)PJ(i)·PJ(k),
where MI is the mutual information of P, J is a vector for which all elements =1, H is an entropy function, H^ is a joint entropy function, and S is a function used to calculate a second eigenvalue.

We designed feature values f139 as conventional feature (Cf) values.

2.3.

Proposed Feature Values

2.3.1.

Convex hull contour complexity values (Pf.1)

Kiyuna et al.19 quantified the complexity of chromatin distribution using feature f34. However, this feature is not counted if the perimeter L(i) is less than f2; in other words, f34 does not consider the chromatin complexities in small regions. Therefore, we previously proposed the following feature value F29

Eq. (11)

F=i{1[L(i)LC(i)1.2]},
where LC(i) and L(i)LC(i) represent the convex hull perimeter and convex hull ratio, respectively, of an image binarized using threshold i, and the expression in Σ represents the indicator function. F is a variable that counts the number of binarization threshold values i, in which the convex hull ratio is 1.2 or more. The value of F increases with chromatin distribution complexity such that F is counted even in small chromatin regions with sufficient complexity. However, F had a large correlation with f10, which is the convex hull ratio of the outer shape of the nucleus.

We, therefore, use the following Lr(i), which is a chromatin distribution complexity divided by f10, and propose following new feature values f40 to f43

Eq. (12)

Lr(i)=L(i)LC(i)·f10,

Eq. (13)

f40=i{Lr(i)1[Lr(i)1]},

Eq. (14)

f41=i{1[Lr(i)1.1]},

Eq. (15)

f42=i{1[Lr(i)1.2]},

Eq. (16)

f43=i{1[Lr(i)1.3]}.
f40 is shown as the fill area in Fig. 2 for which the horizontal axis is the intensity threshold for binarization and the vertical axis is Lr(i). f41, f42, and f43 are shown as intensity widths of the graph when Lr(i)1.1, 1.2, and 1.3 in Fig. 2. For a more detailed representation of the shape of the graph shown in Fig. 2, multiple feature values are used. We name values of f41, f42, and f43 as convex hull (CH) CC, convex hull intensity-width 1.1 (CW1.1), CW1.2, and CW1.3, respectively. We designed these four features f4043 as proposal feature values 1 (Pf.1).

Fig. 2

Convex hull contour complexity values.

JMI_4_4_047501_f002.png

2.3.2.

Chromatin distribution spreading value (Pf.2)

We also propose a method for quantifying the chromatin distribution spreading (CDS) value. First, we create an intensity histogram using the gray-scale intensity set from the cell nucleus on the input image and obtain a threshold Th via a linear discriminant analysis30 of the histogram. Th is the threshold used to distinguish dark-stained (i.e., assumed chromatin) and light-stained regions (i.e., nonchromatin). Here, a coordinate on the input image is designed as Z (Z =[x,y]), and the 256-level gray-scale intensity of Z is designated as I(Z). Next, a chromatin image is generated by replacing the I(Z) of all pixels in the image with I(Z), calculated using the following equation:

Eq. (17)

I(Z)={255I(Z)Th,[I(Z)+Th<255]0,(otherwise).
I(Z) is a value obtained by inverting the image negative and positive [255I(Z)] and subtracting the bias value Th. As a result, high-density stained pixels such as nucleoli and chromatin appear as high values. Figures 3(a) and 3(b) show representative chromatin images based on those in Figs. 1(c) and 1(h), respectively, and show at fivefold intensity to enhance visualization.

Fig. 3

Chromatin image. (a) and (b) Image calculated from Fig. 1(c) and 1(h), respectively.

JMI_4_4_047501_f003.png

Next, the center of a gravity of the chromatin region G is obtained using the following equation:

Eq. (18)

G=(xyxI(Z)xyI(Z),xyyI(Z)xyI(Z)).
Finally, we used G to obtain the CDS value, denoted as f44 in the following equation:

Eq. (19)

f44=1f1xy{|GZf3|2I(Z)}.
We designed this feature f44 as Pf.2.

2.3.3.

Tangential bias values of chromatin distribution (Pf.3)

Cytologically, a biased chromatin texture distribution is an important atypical cell nuclear feature. Jingu et al.20 proposed the RD value, which represents radial bias in the intensity of chromatin distribution but did not consider the tangential direction. Therefore, we previously proposed a feature value E calculated by the following process and Eq. (20):31

  • i. Extraction of the outer shape of the cell nucleus and fitting to an ellipse.

  • ii. Calculation of the center, short axis, and long axis of the ellipse.

  • iii. Euclidean transformation of the input nuclear image such that the short axis, long axis, and center of the ellipse become the new X-axis, Y-axis, and origin, respectively.

  • iv. Creation of four images by cutting the transformed image at the X- and Y-axes.

  • v. Calculation of some chromatin distribution feature values ft(t{11,12,and18to44}) on each of the four images to yield ft,1,ft,2,ft,3, and, ft,4

Eq. (20)

E=SD(ft,1,ft,2,ft,3,ft,4),
where SD is a function used to calculate the standard deviation. However, since the standard deviation is easily influenced by the magnitude of each feature values, we use coefficient of variation instead of the standard deviation as shown in the following equation:

Eq. (21)

gt=|SD(ft,1,ft,2,ft,3,ft,4)mean(ft,1,ft,2,ft,3,ft,4)|,
where mean is a function used to calculate the mean.

In this paper, we propose TB values f45 and f46 of chromatin distribution using gt. TB values, denoted as f45 and f46, are determined experimentally in Sec. 3.

3.

Evaluation of the Tangential Bias Values for (Pf.3)

To experimentally examine TB in the chromatin distribution, we created 633 masking images from 633 cervical smear samples according to the method described in Sec. 2.1. Table 1 shows the numbers of cell nuclei, slides, and patients for each cell classification. We prepared one slide per patient and took images of a single cell type for each slide. These samples included 164, 86, 74, 36, 155, 84, and 34 cases of NOR, MET, REG, LSIL, HSIL, CIS, and SCC, respectively. We then calculated the gt of each masking image as described in Sec. 2.3.3. Thereafter, the gt values were linearly normalized as gt, thus converting the maximum and minimum values of each gt to 1 and 0. Accordingly, gt can be represented as follows:

Eq. (22)

gt=gtmax(gt)min(gt).
Figure 4 shows an experimental result from a calculation using 75% tiles, median values, and 25% tiles of the gt for each cell type. The horizontal axis indicates the feature number, and colors indicate the types of cells annotated by pathologists. The circles in Fig. 4 indicate the feature values related to intensity, which had notably high values (g11, g18, g24, g36) in SCC. The squares in Fig. 4 indicate the run-length feature values, which are included among the texture features; here, g29 to g32 were explicitly high for both CIS and SCC. These values could, therefore, be useful for cellular classification.

Table 1

Numbers of cell nuclei, slides, and patients.

NORMETREGLSILHSILCISSCCTotal
Cell nuclei1648674361558434633
Slides666335332
Patients666335332

Fig. 4

Feature bias values gt.

JMI_4_4_047501_f004.png

We, therefore, propose two TB values f45 and f46, as shown in the following equations:

Eq. (23)

f45=g11+g18+g24+g364,

Eq. (24)

f46=g29+g30+g31+g324,
where f45 and f46 represent the biases of intensity distribution and texture distribution, respectively. We designed these two features f45 and f46 as Pf.3. In addition, we designed the entire set of features f146 as proposal feature values of all (Pf.A).

To minimize the variability of staining, we used the same staining machine and protocol and omitted poor samples (such as dried samples). To minimize the influence of intensity fluctuation during scanning, we photographed each slide with fixed exposure time and white balance. However, small variations in staining due to the different conditions of the samples cannot be excluded. Feature values related to intensity (f11,18,24,36) may be influenced by these effects to a substantially greater degree than feature values related to shape and texture. Note that f45 is a feature value derived from other intensity-related feature values; however, the effects of sample condition are smaller than on other intensity-related feature values (f11,18,24,36), because f45 uses the coefficient of variation.

4.

Verification of the Characteristics of the Proposed Feature Values

4.1.

Comparative Experimental Results and Discussion Between Cell Types

To verify the characteristics of our proposed feature values, we calculated feature values f146 from 633 masking images of cervical smear samples described in Sec. 3. Thereafter, features f146 were linearly normalized to yield f146 such that the maximum and minimum values of each feature became 1 and 0. Figure 5 shows an experimental result calculated using the 75% tiles, median values, and 25% tiles of features f146 for each cell types. However, we note that some Cf values were omitted.

Fig. 5

Experimental results from a feature value analysis of seven cell types.

JMI_4_4_047501_f005.png

Figure 5 shows many differences in feature values among NOR, NET, and REG, which we classified as NILM. In particular, REG had a large area (f1 was high), and MET exhibited high texture homogeneity (f28 and f39 were high, whereas f23 and f27 were low). All proposed value f4046 were small for NOR and large for SCC. In particular, f44 and f46 were also large for CIS. Cancer cells possess a chromatin structure that differs from the normal structure.5 These results suggest that our proposed method represents the features of this chromatin structure.

In addition, we calculated the absolute |ρi,j| values of the correlation coefficients between fi and fj (i{1,2,,46}, j{1,2,,46}) using SPSS software (IBM Corporation, Armonk, New York). ρi,j is expressed by the following equation:

Eq. (25)

ρi,j=cov(fi,fj)var(fi)var(fj),
where cov and var represent functions for calculating the sample covariance and variance, respectively.

Table 2 shows a list of feature numbers (k) with high correlation coefficients (|ρi,j|0.95 or 0.95>|ρi,j|0.90) with other features. There were strong correlations between size and shape features f16 and the run-length features f30,33. There were also strong correlations among intensity-related features f11,18,24,36, all of which are Cfs. In contrast, proposed features f4046 did not show strong correlations with any other features; therefore, our proposed features are highly original. These various feature values are useful for improving the accuracy of machine-learning-based cellular classification, which we will discuss further in Sec. 5.

Table 2

List of feature numbers i and j with high correlation coefficient.

ij when |ρi,j|≥0.95j when 0.95>|ρi,j|≥0.90ij when |ρi,j|≥0.95j when 0.95>|ρi,j|≥0.90
12, 4, 5, 6, 30, 333, 12, 292220
21, 3, 4, 5, 6, 30, 332325
32, 61, 5, 30, 332411, 18, 36
41, 2, 5, 630, 332523
51, 2, 4, 6, 30, 333, 12, 292620
61, 2, 3, 4, 5, 30332728
892827
98291, 5, 33
1118, 24, 36301, 2, 5, 6, 333, 4
12131, 5331, 2, 5, 303, 4, 6, 29
13123611, 18, 24
1811, 24, 363738
2026223837

Furthermore, we used SPSS software to perform a one-dimensional ANOVA of the cellular classification corresponding to each feature value. Accordingly, all values (f4046) differed significantly (significance level: 1.0%) among the cellular classifications, and therefore, any proposed or Cf values could potentially improve the accuracy of cellular classification accuracy.

Next, we used a t-test to evaluate whether each feature value differed significantly with respect to reactive (MET and REG) and neoplastic nuclear atypia (LSIL, HSIL, CIS, and SCC). The results are shown in Fig. 5(upper): here, the * and ** symbols indicate that the corresponding feature values had significant differences at respective significance levels of 5.0% and 1.0%. This test revealed significant differences in many feature values related to chromatin distribution, including the proposed values f4446. These could, therefore, be considered useful for distinguishing between reactive and neoplastic nuclear atypia.

4.2.

Verification of Experimental Results and Discussion of Representative Images

We next calculated some of the normalized feature values f1,34,35,4046 corresponding to the representative images in Figs. 1(b)1(f). Figure 6 shows the results, with feature numbers indicated on the horizontal axis.

Fig. 6

Feature values of the images in Fig. 1.

JMI_4_4_047501_f006.png

In Fig. 6, many values in area f1, the conventional complexity values f3435, and the proposed complexity values f4044 exhibited similar tendencies; however, in the representative image of SCC, f3435 were moderate, whereas f4044 were high. In addition, the proposed value f45 was also high. Although these findings are subjective, the chromatin distribution in Fig. 1(h) appears to be complex and widely spread. We consider that our proposed method reflects this trend.

5.

Machine Learning Validation of Proposed Methods

5.1.

Validation Method

Next, we verified the cellular classification accuracy using machine learning and a CV method.22 These verifications were compared among eight different models, a conventional model (Cf) and seven models combining Cf with models including our proposed values: Cf + Pf.1, Cf + Pf.2, Cf + Pf.3, Cf + Pf.1 + Pf.2, Cf + Pf.1 + Pf.3, Cf + Pf.2 + Pf.3, and Pf.A (= Cf + Pf.1 + Pf.2 + Pf.3).

For machine learning, we used the SVM;23,24 however, we note that this method is intended for two-class classification. We, therefore, implemented a one-versus-one method25 to expand the classification from two-class to multiclass using a round-robin method of classes. In addition, we selected variables for SVM using a stepwise (floating) method.26 In Sec. 4.1, some of the Cfs showed high correlation coefficients. If we use all of these features directly to create a model of the SVM, the accuracy of identification may decrease due to over-learning.32 The stepwise method we use can mitigate the reduced classification accuracy caused by over-learning, because the possibility of simultaneously selecting features exhibiting high correlation coefficients in the method is low.

Two methods could be used to combine multiclass classification and variable selection. The first involves selecting the same type of features for each comparison, and the second involves selecting different types of features for each comparison. In this paper, we used the second method, which is capable of more detailed feature selection.

A machine learning protocol based on these methods is depicted in Fig. 7(a) as stepwise SVM (SSVM). Before performing the procedure described in Fig. 7(a), we calculated the normalized feature values of f1 to fm for all 633 cervical smear samples described in Sec. 4 (NOR for 164, MET for 86, REG for 74, LSIL for 36, HSIL for 155, CIS for 84, and SCC for 34). They were a number of imbalanced samples, which can cause incorrect answer rates. We, therefore, virtually matched the sample number of each class using a oversampling method “adaptive synthetic sampling approach for imbalanced learning”33 to increase the number of each class up to 200 (for a total of 1400 samples) and assumed the value sets to be the feature vectors f1m, where m represents the number of features types used for calculation. For example, m becomes 39 when calculating the machine learning model Cf.

Fig. 7

Flowchart of the (a) SSVM and (b) accuracy evaluation.

JMI_4_4_047501_f007.png

In (1) of Fig. 7(a), the order of the normalized feature values f1m changed randomly with the initialization of some variables. We assumed the changed values to be feature vectors F1m. Here, C is a set representing the round-robin selection of seven classes, W is a set representing the kernel functions and cost parameters among the SVM parameters, Ej={Ej,1,,Ej,i,} is a set of feature vectors F1m selected for model Cj, Vj is the SVM parameter selected for model Cj, CV(Ej,Cj,Vj) is a function used to calculate the accuracy rate from the CV of the SVM, AM is the maximum accuracy rate calculated by the CV, and u is an updated flag of the maximum accuracy rate. In addition, LIN, RBF, and the numeric values of the elements of W in Fig. 7(a), respectively, represent a linear function of the kernel, a radial basis function of the kernel, and the cost parameters included among the SVM parameters. In this paper, we used 10-fold as the number of CV divisions.

In Fig. 7(a), (2), (3), and (4), respectively, represent procedures involving forward feature selection, backward feature selection, and SVM parameter optimization. These values were calculated based on the CV of the SVM. Here, AC is an accuracy rate calculated by the CV, E is a power set of Ej, and V is a power set of Vj. Feature selection was implemented by calculating these procedures until the maximum accuracy rate was no longer updated, and SSVM was implemented by calculating procedures using the round-robin selection of seven classes.

Figure 7(b) shows an accuracy evaluation procedure based on the SSVM, where Uk is a set of E and V, SSVM is a function used to calculate the procedure in Fig. 7(a), MCV is a function used to calculate the accuracy rate from the multiclass CV according to the one-versus-one method, Bk,h is an accuracy rate obtained from the MCV, Mk is a mean accuracy rate obtained by repeatedly (50×) calculating the MCV, M is a set of Mk obtained by repeating these procedures, and d is an index number of the maximum value of M. SSVM is likely to fall into a local solution, and Uk is not necessarily the optimum value when obtained from the calculation of a single SSVM. In other words, the selected features and accuracy may be affected by the order of the initial data set. We, therefore, randomly exchanged data sets and extracted the optimum value by repeating the SSVM from k=1 to 40 to eliminate the fall into a local solution as much as possible.

Finally, the optimum accuracy rate set Bd={Bd,1,,Bd,k,,Bd,50} and parameter set Ud were outputted, and the results of eight classification models (Cf, Cf + Pf.1, Cf + Pf.2, Cf + Pf.3, Cf + Pf.1 + Pf.2, Cf + Pf.1 + Pf.3, Cf + Pf.2 + Pf.3, and Pf.A) were compared.

5.2.

Validation Results and Discussion

We calculated the averages (Ave.) and standard deviations (SD.) of the accuracy rate set Bd of the eight classification models, using the validation method shown in Sec. 5.1. Table 3 presents the results of a comparison of these values, as well as the Dunnett’s test (D-test) results for each model. Here, D-test 1 represents the D-test results of comparisons between each models and Cf, D-test 2 represents the D-test results of comparisons between each model and Pf.A, and ** indicates a significant difference (significance level = 5.0%). D-test is a multiple comparison, many-to-one procedure (i.e., compares each of many treatment groups with one control group) and is used to verify differences between the average values from each group.34,35 We used SPSS software to perform this procedure.

Table 3

Experimental accuracy rates for each model.

CfCf + Pf.1Cf + Pf.2Cf + Pf.3Cf + Pf.1+ Pf.2Cf + Pf.1+ Pf.3Cf + Pf.2+ Pf.3Pf.A
Avg. (%)86.8087.5286.8187.2587.7488.2287.4688.44
SD. (%)0.880.961.021.020.890.880.871.05
D-test 1************
D-test 2**************
Ave. and SD., respectively, represent the average and standard deviation of the accuracy rate set Bd for each of eight classifications. D-tests 1 and 2 represent comparisons with Cf and Pf.A, respectively. ** represents a significant difference at a level of 5.0%.

The average accuracy rates of all proposed models except Cf + Pf.2 were higher than the conventional model (Cf) and exhibited statistically significant differences from Cf by the D-test. Therefore, our proposed models Pf.1 and Pf.3 (features f4043,4546) are useful features for cervical cell classification by machine learning. Although there was no significant difference between Cf + Pf.2 and Cf, there was a significant difference between Pf.A and Cf + Pf.1 + Pf.3. Therefore, our Pf.2 (feature f44) is also a useful feature for cervical cell classification. These results show the usefulness of incorporating our features into the diagnostic support system of the cytology. In addition, these results indicate that our features are different from the Cfs; therefore, our features have the possibility to be useful features in cell diagnosis by the cytologist.

As shown in 5.1, to extend the SVM to a multiclass classification of seven classes, we performed the SVM 21 times in the round-robin selection format; in other words, we obtained 21 selected feature sets E (EUd) to create a single machine learning model. We, therefore, extracted the 21 selected feature sets E of Pf.A, which had the highest accuracy rate and calculated the frequencies as shown in Fig. 8 (cumulative bar chart). Red, magenta, blue, cyan, green, brown, and black colors indicate selected features from comparisons related to NOR, MET, REG, LSIL, HSIL, CIS, and SCC, respectively. Based on Fig. 8, the selection of all proposal features indicates that all contributed to improve the classification accuracy.

Fig. 8

Frequency of selected features in the Pf.A model.

JMI_4_4_047501_f008.png

6.

Conclusion

Although cytology is a useful diagnostic tool for cervical and other conditions, it is generally used empirically. In this paper, we aimed to quantify the cell nuclear morphologies often used in cytologic analyses and proposed three new types of feature values: Pf.1, Pf.2, and Pf.3. Pf.1 includes CH CC values that represent the complexity of chromatin distribution within the cell nucleus. Pf.2 is the CDS, which represent intensity spreading from the gravity center in the chromatin region. Pf.3 is the TB values of chromatin distribution, which were calculated using the coefficient of variation for the intensities and run-length texture values of nuclear images that had been divided into four images based on the center of a fitted ellipse.

We used three methods to verify these proposal feature values. All methods used 633 images of nuclei obtained from the cervical cytology specimens of 32 patients and cell type information (NOR, MET, REG, LSIL, HSIL, CIS, or SCC) that had been annotated by a pathologist and two cytotechnologists.

The first method used an ANOVA to determine whether the proposal feature values differed among the seven classes. We found that all proposal values differed significantly at a 1.0% significance level, indicating the usefulness of these proposed feature values for cervical cytology. The second method used the t-test to determine differences in our proposed feature values between reactive (MET and REG) and neoplastic nuclear atypia (LSIL, HSIL, CIS, and SCC). We found that our proposed values CH, CW1.1, and CW1.2 differed at a 5.0% significance level, indicating their usefulness as distinguishing factors.

The third method determined whether the classification accuracy among the seven classes improved when multiple sets of feature values were combined through SSVM and a machine learning technique with a variable selection function. We calculated the accuracy of these finding using the CV method, calculated accuracy distribution using several repeats, and verified effectiveness using D-tests. We used eight different models, the conventional model (Cf) and seven models combining Cf with proposed models: Cf + Pf.1, Cf + Pf.2, Cf + Pf.3, Cf + Pf.1 + Pf.2, Cf + Pf.1 + Pf.3, Cf + Pf.2 + Pf.3, and Pf.A (= Cf + Pf.1 + Pf.2 + Pf.3). Accordingly, average accuracy rates of all proposed models except Cf + Pf.2 were higher than the conventional model (Cf) and exhibited statistically significant differences from Cf by D-test. This indicates that Pf.1 and Pf.3 are useful for cervical cell classification by machine learning. Although there was no significant difference between Cf + Pf.2 and Cf, there was a significant difference between Pf.A and Cf + Pf.1 + Pf.3; therefore, Pf.2 is also useful for cervical cell classification. The model created via SSVM selected all proposed feature values, and the results indicated that all proposed features contributed to the improved classification accuracy.

We proposed features reflecting the complexity, spreading, and bias of the chromatin distribution and showed that classification accuracy rates were increased by combining our features with Cfs. These results show the usefulness of incorporating our features into a diagnostic support system for cytology. In addition, these results indicate that our features are different from the Cfs; therefore, our features have the possibility to be useful features in cell diagnosis by the cytologist.

Meanwhile, since the evaluation of the usefulness of individual feature values in actual clinical diagnosis was not conducted, continuing studies are necessary to evaluate the usefulness in clinical practice. In addition, although we focused on the cell nucleus, the cytoplasm is also an important indicator. In the future, we aim to quantify the features of the cell cytoplasm and continue studies to evaluate the usefulness in clinical practice.

Disclosures

This study was financially supported by the Saitama Medical University, Faculty of Health and Medical Care (SMU-FHMC) Grants 14-013 and 15-005. The authors state no conflict of interest. This study was conducted after receiving approval from the Saitama Medical University International Medical Center Institutional Review Board (IRB) (Application number 15-018).

Acknowledgments

We are very grateful to the two cytotechnologists who classified the cells used in this study. We are also very grateful to the Saitama Medical University for financial support with SMU-FHMC grants.

References

1. 

S. Janz, M. Potter and C. S. Rabkin, “Lymphoma- and leukemia-associated chromosomal translocations in healthy individuals,” Genes Chromosomes Cancer, 36 (3), 211 –223 (2003). http://dx.doi.org/10.1002/(ISSN)1098-2264 Google Scholar

2. 

M. Nambiar and S. C. Raghavan, “Chromosomal translocations among the healthy human population: implications in oncogenesis,” Cell. Mol. Life Sci., 70 (8), 1381 –1392 (2013). http://dx.doi.org/10.1007/s00018-012-1135-x CMLSFI 1420-9071 Google Scholar

3. 

D. N. Posnett et al., “Clonal populations of T cells in normal elderly humans: the T cell equivalent to ‘benign monoclonal gammapathy’,” J. Exp. Med., 179 (2), 609 –618 (1994). http://dx.doi.org/10.1084/jem.179.2.609 JEMEAV 0022-1007 Google Scholar

4. 

T. Iijima, Y. Inadome and M. Noguchi, “Clonal proliferation of B lymphocytes in the germinal centers of human reactive lymph nodes: possibility of overdiagnosis of B cell clonal proliferation,” Diagn. Mol. Pathol., 9 (3), 132 –136 (2000). http://dx.doi.org/10.1097/00019606-200009000-00002 DMPAES 1052-9551 Google Scholar

5. 

D. Zink, A. H. Fischer and J. A. Nickerson, “Nuclear structure in cancer cells,” Nat. Rev. Cancer, 4 677 –687 (2004). http://dx.doi.org/10.1038/nrc1430 NRCAC4 1474-175X Google Scholar

6. 

R. M. DeMay, “Common problems in papanicolaou smear interpretation,” Arch. Pathol. Lab. Med., 121 (3), 229 –238 (1997). Google Scholar

7. 

S. Parmentier et al., “Assessment of dysplastic hematopoiesis: lessons from healthy bone marrow donors,” Haematologica, 97 (5), 723 –730 (2012). http://dx.doi.org/10.3324/haematol.2011.056879 HAEMAX 0390-6078 Google Scholar

8. 

C. Duanggate, B. Uyyanonvara and T. Koanantakul, “A review of image analysis and pattern classification techniques for automatic pap smear screening process,” in Int. Conf. on Embedded Systems and Intelligent Technology, 212 –217 (2008). Google Scholar

9. 

N. A. M. Isa, “Automated edge detection technique for pap smear images using moving k-means clustering and modified seed based region growing algorithm,” Int. J. Comput. Internet Manage., 13 (3), 45 –59 (2005). Google Scholar

10. 

Y. F. Chen et al., “Semi-automatic segmentation and classification of pap smear cells,” IEEE J. Biomed. Health Inform., 18 (1), 94 –108 (2014). http://dx.doi.org/10.1109/JBHI.2013.2250984 Google Scholar

11. 

S. Watanabe and T. C. Group, “An automated apparatus for cancer prescreening: CYBEST,” Comput. Graphics Image Process., 3 (4), 350 –358 (1974). http://dx.doi.org/10.1016/0146-664X(74)90029-X CGIPBG 0146-664X Google Scholar

12. 

J. Holmquist et al., “Computer analysis of cervical cells automatic feature extraction and classification,” J. Histochem. Cytochem., 26 (11), 1000 –1017 (1978). http://dx.doi.org/10.1177/26.11.569164 Google Scholar

13. 

J. Jantzen and G. Dounias, “Analysis of pap-smear image data,” in Proc. of Nature-Inspired Smart Information Systems 2nd Annual Symp., (2006). Google Scholar

14. 

R. M. Haralick, K. Shanmugam and I. Dinstein, “Texture feature for image classification,” IEEE Trans. Syst., Man, Cybern., SMC-3 (6), 610 –621 (1973). http://dx.doi.org/10.1109/TSMC.1973.4309314 ISYMAW 0018-9472 Google Scholar

15. 

M. M. Galloway, “Texture analysis using gray level run lengths,” Comput. Graphics Image Process., 4 (2), 172 –179 (1975). http://dx.doi.org/10.1016/S0146-664X(75)80008-6 CGIPBG 0146-664X Google Scholar

16. 

S. Murata et al., “Morphological abstraction of thyroid tumor cell nuclei using morphometry with factor analysis,” Microsc. Res. Tech., 61 (5), 457 –462 (2003). http://dx.doi.org/10.1002/(ISSN)1097-0029 MRTEEO 1059-910X Google Scholar

17. 

S. I. Niwas, P. Palanisamy and K. Sujathan, “Complex wavelet based texture features of cancer cytology images,” in 5th Int. Conf. on Industrial and Information Systems, 348 –353 (2010). http://dx.doi.org/10.1109/ICIINFS.2010.5578679 Google Scholar

18. 

M. Kowal and P. Filipczuk, “Nuclei segmentation for computer-aided diagnosis of breast cancer,” Int. J. Appl. Math. Comput. Sci., 24 (1), 19 –31 (2014). http://dx.doi.org/10.2478/amcs-2014-0002 Google Scholar

19. 

T. Kiyuna et al., “Characterization of chromatin texture by contour complexity for cancer cell classification,” in 8th IEEE Int. Conf. on BioInformatics and BioEngineering (BIBE ’08), 1 –6 (2008). http://dx.doi.org/10.1109/BIBE.2008.4696831 Google Scholar

20. 

R. Jingu et al., “Quantitative image analysis of nuclear chromatin distribution for cytological diagnosis,” Acta Cytol., 55 (5), 455 –459 (2011). http://dx.doi.org/10.1159/000330672 ACYTAN 0001-5547 Google Scholar

21. 

B. S. Apgar, L. Zoschnick and T. C. Wright, “The 2001 Bethesda system terminology,” Am. Fam. Physician, 68 (10), 1992 –1998 (2003). Google Scholar

22. 

D. A. Forsyth and J. Ponce, Computer Vision: A Modern Approach, Prentice Hall(2002). Google Scholar

23. 

C. Cortes and V. Vapnik, “Support-vector networks,” Mach. Learn., 20 273 –297 (1995). MALEEZ 0885-6125 Google Scholar

24. 

C. C. Chang and C. J. Lin, “LIBSVM: a library for support vector machines,” ACM Trans. Intell. Syst. Technol., 2 (3), 1 –27 (2011). http://dx.doi.org/10.1145/1961189 Google Scholar

25. 

C. W. Hsu and C. J. Lin, “A comparison of methods for multi-class support vector machines,” IEEE Trans. Neural Networks, 13 415 –425 (2002). http://dx.doi.org/10.1109/72.991427 ITNNEP 1045-9227 Google Scholar

26. 

L. Wang et al., “A novel stepwise support vector machine (SVM) method based on optimal feature combination for predicting miRNA precursors,” Afr. J. Biotechnol., 10 (74), 16720 –16731 (2011). Google Scholar

27. 

M. Zhao et al., “Feature quantification and abnormal detection on cervical squamous epithelial cells,” Comput. Math. Methods Med., 2015 941680 (2015). http://dx.doi.org/10.1155/2015/941680 Google Scholar

28. 

J. Jantzen et al., “Pap-smear benchmark data for pattern classification,” in Proc. of Nature-Inspired Smart Information Systems (NISIS), 1 –9 (2005). Google Scholar

29. 

Y. Ohnuki et al., “A study of a quantitative evaluation method by contour complexity of the nucleus image for cancer cell diagnosis,” in Forum on Information Technology (FIT ’13), 401 –402 (2013). Google Scholar

30. 

N. Otsu, “A threshold selection method from gray-level histograms,” IEEE Trans. Syst., Man, Cybern., 9 (1), 62 –66 (1979). http://dx.doi.org/10.1109/TSMC.1979.4310076 Google Scholar

31. 

H. Komagata et al., “A study of eccentric quantitation approach for chromatin distribution in cytodiagnosis,” in Media Computing Conf. 2014, (2014). Google Scholar

32. 

G. F. Hughes, “On the mean accuracy of statistical pattern recognizers,” IEEE Trans. Inf. Theory, 14 55 –63 (1968). http://dx.doi.org/10.1109/TIT.1968.1054102 IETTAW 0018-9448 Google Scholar

33. 

H. He et al., “ADASYN: adaptive synthetic sampling approach for imbalanced learning,” in IEEE Int. Joint Conf. on Neural Networks, (2008). http://dx.doi.org/10.1109/IJCNN.2008.4633969 Google Scholar

34. 

C. Dunnett, “A multiple comparison procedure for comparing several treatments with a control,” J. Am. Stat. Assoc., 50 1096 –1121 (1955). http://dx.doi.org/10.1080/01621459.1955.10501294 Google Scholar

35. 

C. Dunnett, “New tables for multiple comparisons with a control,” Biometrics, 20 482 –491 (1964). http://dx.doi.org/10.2307/2528490 BIOMB6 0006-341X Google Scholar

Biography

Hideki Komagata received his BE, ME, and PhD degrees in information engineering from Niigata University, Niigata, Japan, in 2003, 2005, and 2010, respectively. He is currently an assistant professor at the School of Biomedical Engineering, Saitama Medical University, Saitama, Japan. His research interests include computer vision and medical imaging.

Takaya Ichimura received his PhD from Kumamoto University, Kumamoto, Japan, in 2005. He is an assistant professor of Saitama Medical University. His current research interests include nuclear atypia and molecular nature of the chromatin.

Yasuka Matsuta received her BE and ME degrees from Saitama Medical University, Saitama, Japan, in 2012 and 2014, respectively. She currently works on blood purification at the Japanese Red Cross Saitama Hospital, Saitama, Japan. She is a member of the Japan Association for Clinical Engineers.

Masahiro Ishikawa received his PhD from Niigata University, Niigata, Japan, in 2006. He is currently an assistant professor at Saitama Medical University. His current research interests include image processing and computer aided diagnosis.

Kazuma Shinoda received his BE and ME degrees from Niigata University, Niigata, Japan, in 2005 and 2007, respectively, and his PhD from Tokyo Institute of Technology, Yokohama, Japan, in 2011. He is currently an assistant professor at the Graduate School of Engineering, Utsunomiya University, Utsunomiya, Japan. His research interests include digital image processing, image compression, and multispectral imaging.

Naoki Kobayashi received his BSc and ME degrees from Tokyo Institute of Technology, Tokyo, Japan, in 1979 and 1981, respectively, and his PhD from Niigata University, Niigata, Japan, in 2000. He worked for Cyber Communication Labs and R&D Sections of Nippon Telegraph and Telephone Corporation from 1981 to 2008. He has been a professor at the School of Biomedical Engineering, Faculty of Health and Medical Care of Saitama Medical University since 2008. His research interest is medical image processing, image compression, and biosignal processing.

Atsushi Sasaki received his MD degree in 1980 from the School of Medicine, Gunma University, his PhD in 1984 from Gunma University, and his postgraduate course of basic medical science in pathology. In 2009, he moved to Saitama Medical University as the professor of the Department of Pathology. He is currently engaged in diagnostic pathology and neuropathology. His research interests include microglia/brain macrophages and brain tumor pathology.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 Unported License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.
Hideki Komagata, Takaya Ichimura, Yasuka Matsuta, Masahiro Ishikawa, Kazuma Shinoda, Naoki Kobayashi, and Atsushi Sasaki M.D. "Feature analysis of cell nuclear chromatin distribution in support of cervical cytology," Journal of Medical Imaging 4(4), 047501 (17 October 2017). https://doi.org/10.1117/1.JMI.4.4.047501
Received: 2 May 2017; Accepted: 26 September 2017; Published: 17 October 2017
Lens.org Logo
CITATIONS
Cited by 8 scholarly publications.
Advertisement
Advertisement
KEYWORDS
Californium

Cell biology

Machine learning

Diagnostics

Cancer

Feature extraction

Feature selection

Back to Top