Abstract
Breast cancer is one of the rapid spreading diseases resulting in the death of younger age group of women. Unfortunately, as the detection of cancer is at later stage, the lifetime of the patient is decreased. If the detection is at early stage, then their lifetime could have been improved. Hence, the proposal aims at predicting the presence of breast cancer at early stage through deep learning. To identify suitable model for deep learning, initially machine learning algorithm with Logistic Regression, K Nearest Neighbors, Support Vector Machine (linear), Support Vector Machine, Gaussian, Decision Tree and Random Forest along with ensemble learning algorithms such as Bagged Trees, Subspace discriminant and RUSBoosted Trees are implemented with 30 attributes. Comparison of performance metrices indicates that random forest performs better. Then, feature selection of 14 attributes is attained through heat map. With minimal features, the above set of algorithms is implemented and their corresponding performance indices such as accuracy, misclassification cost, prediction speed, training time, predicted class, true class, positive predict value, sensitivity, specificity, precision, F1 score, Area Under the Curve and Receiver Operating Characteristic Curve are obtained. In this, random forest performs better and in addition, the performance of 14 attributes is almost exhibiting closer performance as that of 30. However, feature selection is mandate and can be eliminated if the algorithm is implemented through deep learning model. The model consists of many hidden layers which performs binary classification on the given dataset to predict whether a person is malignant or benign. The performance indices of the proposed model are validated and the results exhibit its supremacy.
Similar content being viewed by others
References
Malhotra, B. D., Kumar, S., & Pandey, C. M. (2016). Nanomaterials based biosensors for cancer biomarker detection. Journal of Physics: Conference Series, 704, 012011. https://doi.org/10.1088/1742-6596/704/1/012011
Mittal, S., Kaur, H., Gautam, N., & Mantha, A. K. (2017). Biosensors for breast cancer diagnosis: A review of bioreceptors, biotransducers and signal amplification strategies. Biosensors and Bioelectronics, 88, 217–231. https://doi.org/10.1016/j.bios.2016.08.028
Parker, S. I., Kenney, C. J., Peterson, V. Z., Ikeda, D. M., Backus, F. A., Snoeys, W. J., & Aw, C. H. (1994). Breast cancer calcification measurements using direct X-ray detection in a monolithic silicon pixel detector. IEEE Transactions on Nuclear Science, 41(6), 2862–2873. https://doi.org/10.1109/23.340659
Hossein N. M. (2016). Early detection of breast cancer using electrochemical and nano biosensor, Cancer Diagnostics Conference & Expo, Italy, 8(5), June, p. 88.
Ryu, J., Heo, M. S., & Kim, H. C. (2010). Development of portable breast self-examination device using enhanced tactile feedback. Electronics letters, 46(25), 1651–1653. https://doi.org/10.1049/el.2010.2711
Bahramiabarghouei, H., Porter, E., Santorelli, A., Gosselin, B., Popović, M., & Rusch, L. A. (2015). Flexible 16 antenna array for microwave breast cancer detection. IEEE Transactions on Biomedical Engineering, 62(10), 2516–2525. https://doi.org/10.1109/TBME.2015.2434956
Santorelli, A., Porter, E., Kang, E., Piske, T., Popović, M., & Schwartz, J. D. (2015). A time-domain microwave system for breast cancer detection using a flexible circuit board. IEEE Transactions on Instrumentation and Measurement, 64(11), 2986–2994. https://doi.org/10.1109/TIM.2015.2440565
Gao, S., Mondal, S., Zhu, N., Liang, R., Achilefu, S., & Gruev, V. (2015). A compact NIR fluorescence imaging system with goggle display for intraoperative guidance. In 2015 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE. 1622–1625, https://doi.org/10.1109/ISCAS.2015.7168960.
Gao, S., Modal, S., Zhu, N., Liang, R., Achilefu, S., & Gruev,’ V. (2015). Live demonstration: A compact NIR fluorescence imaging system design with goggle display for intraoperative guidance’, IEEE International Symposium on Circuits and Systems (ISCAS), Lisbon, 2015, pp. 1910–1910. https://doi.org/10.1109/ISCAS.2015.7169043
Porter, E., Bahrami, H., Santorelli, A., Gosselin, B., Rusch, L. A., & Popović, M. (2016). A wearable microwave antenna array for time-domain breast tumour screening. IEEE Transactions on Medical Imaging, 35(6), 1501–1509. https://doi.org/10.1109/TMI.2016.2518489
.Fadhillah, U. D. L., Afikah, Z. A. N., Safiee, N. E. N., Asnida, A. W., Rafiq, A. K. M., & Ramlee, M. H. (2018). Development of a low-cost wearable breast cancer detection device. In 2018 2nd International Conference on Bio Signal Analysis, Processing and Systems (ICBAPS) IEEE. 41–46. https://doi.org/10.1109/ICBAPS.2018.8527419.
Farag, O., Mohamed, M., Abd El Ghany, M., & Hofmann, K. (2018). Integrated sensors for early breast cancer diagnostics. In 2018 IEEE 21st International Symposium on Design and Diagnostics of Electronic Circuits & Systems (DDECS). IEEE. 153–157https://doi.org/10.1109/DDECS.2018.00034.
Hemavathi, N., & Sudha, S. (2016). A novel regression based clustering technique for wireless sensor networks. Wireless Personal Communications, 88(4), 985–1013. https://doi.org/10.1007/s11277-016-3226-8
Arulmozhi, P., Hemavathi, N., Rayappan, J. B. B., & Raj, P. (2020). ALRC: A novel adaptive linear regression based classification for grade based student learning using radio frequency identification. Wireless Personal Communications. https://doi.org/10.1007/s11277-020-07141-4
Gubern-Merida, A., Kallenberg, M., Mann, R. M., Marti, R., & Karssemeijer, N. (2014). Breast segmentation and density estimation in breast MRI: A fully automatic framework. IEEE Journal of Biomedical and Health Informatics, 19(1), 349–357. https://doi.org/10.1109/JBHI.2014.2311163
Halim, E., Halim, P. P., & Hebrard, M. (2018). Artificial intelligent models for breast cancer early detection. In 2018 International Conference on Information Management and Technology (ICIMTech). IEEE. 517–521 https://doi.org/10.1109/ICIMTech.2018.8528140.
Mishra, V., Singh, Y., & Rath, S. K. (2019). Breast cancer detection from thermograms using feature extraction and machine learning techniques. In 2019 IEEE 5th International Conference for Convergence in Technology (I2CT). IEEE. 1–5. https://doi.org/10.1109/I2CT45611.2019.9033713.
Polat, K., & Sentürk, U. (2018). A Novel ML Approach to Prediction of Breast Cancer: Combining of mad normalization, KMC based feature weighting and AdaBoostM1 classifier. In 2018 2nd International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT). IEEE. 1–4. https://doi.org/10.1109/ISMSIT.2018.8567245.
Aaqib, M., Tufail, M., & Anwar, S. (2019). A novel deep learning based approach for breast cancer detection. In 2019 13th International Conference on Mathematics, Actuarial Science, Computer Science and Statistics (MACS). IEEE. 1–6. https://doi.org/10.1109/MACS48846.2019.9024761
Shen, L., Margolies, L. R., Rothstein, J. H., Fluder, E., McBride, R., & Sieh, W. (2019). Deep learning to improve breast cancer detection on screening mammography. Scientific Reports, 9(1), 1–12. https://doi.org/10.1038/s41598-019-48995-4
Xie, J., Liu, R., Luttrell, J., IV., & Zhang, C. (2019). Deep learning based analysis of histopathological images of breast cancer. Frontiers in Genetics, 10, 80. https://doi.org/10.3389/fgene.2019.00080
Ragab, D. A., Sharkas, M., Marshall, S., & Ren, J. (2019). Breast cancer detection using deep convolutional neural networks and support vector machines. PeerJ, 7, e6201. https://doi.org/10.7717/peerj.6201
https://www.mathworks.com/help/stats/machine-learning-in-matlab.html.
Mangasarian, O. L., Street, W. N., & Wolberg, W. H. (1995). Breast cancer diagnosis and prognosis via linear programming. Operations Research, 43(4), 570–577.
Funding
There is no funding for this research work.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix 1
Appendix 1
Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass [24]. They describe characteristics of the cell nuclei present in the image.
1.1 Attribute Information
(1) ID number
(2) Diagnosis (M = malignant, B = benign)
(3-32)
Ten real-valued features are computed for each cell nucleus:
-
(a) Radius (mean of distances from center to points on the perimeter)
-
(b) Texture (standard deviation of gray-scale values)
-
(c) Perimeter
-
(d) Area
-
(e) Smoothness (local variation in radius lengths)
-
(f) Compactness (perimeter^2 / area—1.0)
-
(g) Concavity (severity of concave portions of the contour)
-
(h) Concave points (number of concave portions of the contour)
-
(i) Symmetry
-
(j) Fractal dimension ("coastline approximation"—1)
Radius mean | Mean of distances from center to points on the perimeter |
Radius largest worst | Largest mean value for mean of distances from center to points on the perimeter |
Perimeter mean | Mean size of the core tumor |
Perimeter largest worst | Largest mean of perimeter |
Smoothness mean | Mean of local variation in radius lengths |
Compactness mean | Mean of perimeter^2 / area—1.0 |
Concavity mean | Mean of severity of concave portions of the contour |
Concave points mean | Mean for number of concave portions of the contour |
Symmetry mean | Symmetry |
Fractal dimension mean | Standard error for "coastline approximation" – 1 |
Radius se | Standard error for the mean of distances from center to points on the perimeter |
Texture se | Standard error for standard deviation of gray-scale values |
Perimeter se | Standard error for perimeter |
Area se | Standard error for area |
Smoothness se | Standard error for local variation in radius lengths |
Compactness se | Standard error for perimeter^2 / area—1.0 |
Concavity se | Standard error for severity of concave portions of the contour |
Concave points se | Mean for number of concave portions of the contour |
Symmetry se | Standard error for symmetry |
Fractal dimension se | Mean for "coastline approximation" – 1 |
Smoothness largest | "Worst" or largest mean value for local variation in radius lengths |
Texture largest worst | "Worst" or largest mean value for standard deviation of gray-scale values |
Concavity largest | "Worst" or largest mean value for severity of concave portions of the contour |
Area largest worst | Largest area |
Symmetry largest | Largest symmetry |
Compactness largest worst | "Worst" or largest mean value for perimeter^2 / area—1.0 |
Texture mean | Standard deviation of gray-scale values |
Concave points largest worst | "Worst" or largest mean value for number of concave portions of the contour |
Area mean | Mean of the area |
Fractal dimension largest worst | "Worst" or largest mean value for "coastline approximation"—1 |
Rights and permissions
About this article
Cite this article
Hemavathi, N., Sriranjani, R., Arulmozhi, P. et al. Deep Learning based Early Prediction Scheme for Breast Cancer. Wireless Pers Commun 122, 931–946 (2022). https://doi.org/10.1007/s11277-021-08933-y
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-021-08933-y