为表征乳腺肿瘤的特性,设计基于乳腺影像报告与数据系统(Breast Imaging Reporting and Data System,BI-RADS)的高通量特征、尺度不变特征转换(scale-invariant feature transform,SIFT)的特征和卷积神经网络(convolutional neural network,CNN)的特征,并结合3个重要临床信息,共获得10 703个特征,构成相应的特征体系。利用十折法随机重复采样100次,先用单因素t检验筛选出
P
<0.05的特征,再使用最小化绝对值收缩和选择算子(least absolute shrinkage and selection operator,LASSO)回归模型进一步筛选,将出现次数最多的100个特征作为最优特征集,输入一个使用十倍交叉验证法的线性核支持向量机(support vector machine,SVM)分类器中进行分类。
结果:
对380例女性浸润性乳腺癌患者进行研究,分类器受试者工作特征(receiver operating characteristic,ROC)曲线的曲线下面积(area under curve,AUC)为0.903,准确率为82.6%,灵敏度为90.6%,特异度为69.9%。
结论:
该研究提取的乳腺超声图像特征能较好地预测淋巴结转移,可为医师进行乳腺淋巴结检查提供参考。
Abstract
Objective:
To investigate the ipsilateral axillary lymph node metastases based on high-throughput sonographic features of invasive breast carcinoma.
Methods:
In order to represent the characteristics of breast tumors
the high-throughput features based on Breast Imaging Reporting and Data System (BI-RA
DS)
features based on scale-invariant feature transform (SIFT) and features based on convolutional neural network (CNN) were extracted. Combined with three important clinical features
a total of 10 703 features were obtained to constitute a feature system. A 10-fold feature selection method using t-test and least absolute shrinkage and selection operator (LASSO) regression model was bootstrapped for 100 times. The 100 most frequent features were input into a linear-kernel support vector machine (SVM) classifier in 10-fold cross validation experiment.
Results:
For the included 380 breast cancer cases
the area under curve (AUC) of the receiver operating characteristic (ROC) curve
accuracy
sensitivity
specificity were 0.903
82.6%
90.6%
69.9%
respectively.
Conclusion:
The sonographic features extracted from breast ultrasound images can predict lymph node metastases and provide the possibility of lymph node metastases for a sonographer.