论文标题
白细胞分类
White blood cell classification
论文作者
论文摘要
本文提出了一个新型的自动分类框架,以识别五种白细胞。从血液涂片图像中分割完整的白细胞并从中提取有利特征仍然是白细胞分类的具有挑战性的任务。因此,我们提出了一种自适应阈值分割方法,以处理具有不均匀颜色和不均匀照明的血液涂片图像,该图像是基于色彩空间信息和阈值分割设计的。随后,在成功将白细胞与血液涂片图像分开后,提取了许多非线性特征,包括几何,颜色和纹理特征。然而,冗余功能会影响分类速度和效率,并且考虑到这一点,设计了基于分类和回归树(CART)的特征选择算法。通过对特征之间非线性关系的深入分析,从最初的非线性特征成功删除了无关的和冗余特征。之后,选定的突出特征被馈入粒子群优化支持矢量机(PSO-SVM)分类器,以识别白细胞的类型。最后,为了评估所提出的白细胞分类方法的性能,我们构建了一个白细胞数据集,其中包含500张血液涂片图像用于实验。通过与手动获得的地面真相进行比较,提出的分割方法平均达到了分段核和细胞区域的平均骰子相似性97.98%和97.57%的骰子相似性。此外,提出的方法可以达到99.76%的分类准确性,这很好地证明了其有效性。
This paper proposes a novel automatic classification framework for the recognition of five types of white blood cells. Segmenting complete white blood cells from blood smears images and extracting advantageous features from them remain challenging tasks in the classification of white blood cells. Therefore, we present an adaptive threshold segmentation method to deal with blood smears images with non-uniform color and uneven illumination, which is designed based on color space information and threshold segmentation. Subsequently, after successfully separating the white blood cell from the blood smear image, a large number of nonlinear features including geometrical, color and texture features are extracted. Nevertheless, redundant features can affect the classification speed and efficiency, and in view of that, a feature selection algorithm based on classification and regression trees (CART) is designed. Through in-depth analysis of the nonlinear relationship between features, the irrelevant and redundant features are successfully removed from the initial nonlinear features. Afterwards, the selected prominent features are fed into particle swarm optimization support vector machine (PSO-SVM) classifier to recognize the types of the white blood cells. Finally, to evaluate the performance of the proposed white blood cell classification methodology, we build a white blood cell data set containing 500 blood smear images for experiments. By comparing with the ground truth obtained manually, the proposed segmentation method achieves an average of 95.98% and 97.57% dice similarity for segmented nucleus and cell regions respectively. Furthermore, the proposed methodology achieves 99.76% classification accuracy, which well demonstrates its effectiveness.