Machine learning for the classification of breast cancer tumor: a comparative analysis
DOI:
https://doi.org/10.18006/2022.10(2).440.450Keywords:
Breast cancer, Multilayer perceptron, K-NN, MLP, Random forestAbstract
The detection and diagnosis of Breast cancer at an early stage is a challenging task. With the increase in emerging technologies such as data mining tools, along with machine learning algorithms, new prospects in the medical field for automatic diagnosis have been developed, with which the prediction of a disease at an early stage is possible. Early detection of the disease may increase the survival rate of patients. The main purpose of the study was to predict breast cancer disease as benign or malignant by using supervised machine learning algorithms such as the K-nearest neighbor (K-NN), multilayer perceptron (MLP), and random forest (RF) and to compare their performance in terms of the accuracy, precision, F1 score, support, and AUC. The experimental results demonstrated that the MLP achieved a high prediction accuracy of 99.4%, followed by random forest (96.4%) and K-NN (76.3%). The diagnosis rates of the MLP, random forest and K-NN were 99.9%, 99.6%, and 73%, respectively. The study provides a clear idea of the accomplishments of classification algorithms in terms of their prediction ability, which can aid healthcare professionals in diagnosing chronic breast cancer efficiently.
References
Asri, H., Mousannif, H., Moatassime, H.A., & Noel, T. (2016). Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis. The 6th International Symposium on Frontiers in Ambient and Mobile Systems (FAMS 2016). Procedia Computer Science, 83, 1064 – 1069. DOI: https://doi.org/10.1016/j.procs.2016.04.224
Banu, A.B., & Subramanian, P.T. (2018). Comparison of Bayes classifers for breast cancer classifcation. Asian Pacific Journal of Cancer Prevention, 19(10), 2917–20.
Chaurasia, V., Pal, S., & Tiwari, B. (2018). Prediction of benign and malignant breast cancer using data mining techniques. Journal of Algorithms and Computational Technology, 12(2), 119–26. DOI: https://doi.org/10.1177/1748301818756225
Costa, K., Ribeiro, P., Carmargo, A., Rossi, V., et al. (2013). Comparison of the techniques decision tee and MLP for data mining in SPAMs detection in computer networks. Proceedings of the 3rd international conference on innovative computing Technology, 344–348. DOI: https://doi.org/10.1109/INTECH.2013.6653725
Forsyth, A.W., Barzilay, R., Hughes, K.S., Lui, D., et al. (2018). Machine Learning Methods to Extract Documentation of Breast Cancer Symptoms from Electronic Health Records. Journal of Pain and Symptom Management, 55(6), 1492-1499. DOI: https://doi.org/10.1016/j.jpainsymman.2018.02.016
García-Laencina, P.J., Abreu, P.H., Abreu, M.H., & Afonoso, N. (2015). Missing data imputation on the 5-year survival prediction of breast cancer patients with unknown discrete values. Computers in Biology and Medicine, 59, 125–133. DOI: https://doi.org/10.1016/j.compbiomed.2015.02.006
Islam, M., Haque, R., Iqbal, H., Hasan, M., et al. (2020). Breast Cancer Prediction: A Comparative Study Using MachineLearning Techniques. SN Computer Science, 1, 290. DOI: https://doi.org/10.1007/s42979-020-00305-w
Islam, M.M., Iqbal, H., Haque, M.R., & Hasan, M.K. (2017). Prediction of Breast Cancer Using Support Vector Machine and K-Nearest Neighbours 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC), Dhaka, Bangladesh. DOI: https://doi.org/10.1109/R10-HTC.2017.8288944
Jiang, T., Gradus, J.L., & Rosellini, A.J. (2020). Supervised machine learning: A brief primer. Behavior Therapy, 51(5), 675-687. DOI: https://doi.org/10.1016/j.beth.2020.05.002
Kaur, P., Kumar, R., & Kumar, M. (2019). A healthcare monitoring system using random forest and internet of things (IoT). Multimedia Tools and Applications, 78, 19905–19916. DOI: https://doi.org/10.1007/s11042-019-7327-8
Latchoumi, T.P., & Parthiban, L. (2017). Abnormality detection using weighed particle swarm optimization and smooth support vector machine. Biomedical Research, 28, 4749–51.
Muktevi, S. (2020). Prediction of Breast Cancer Disease using Machine Learning Algorithms. International Journal of Innovative Technology and Exploring Engineering, 9(4), 2868-2878. DOI: https://doi.org/10.35940/ijitee.D1866.029420
Rana, M., Chandorkar, P., & Dsouza, A. (2015). Breast cancer diagnosis and recurrence prediction using machine learning
techniques. International Journal of Research in Engineering and Technology, 4(4), 372-376.
Sakri, S.B., Rashid, N.B.A., & Zain, Z.M. (2018). Particle swarm optimization feature selection for breast cancer recurrence prediction. IEEE Access, 6, 29637–29647. DOI: https://doi.org/10.1109/ACCESS.2018.2843443
Singh, B.K. (2019). Determining relevant biomarkers for breast cancer using anthropometric and clinical features: A comparative investigation in machine learning paradigm. Biocybernetics and Biomedical Engineering, 39, 393-409. DOI: https://doi.org/10.1016/j.bbe.2019.03.001
Sun, Y.S., Zhao, Z., Yang, Z.N., Xu, F., et al. (2017). Risk Factors and Preventions of Breast Cancer. International Journal of Biological Sciences, 13(11):1387–1397. DOI: https://doi.org/10.7150/ijbs.21635
Verikas, A., Gelzinis, A., & Bacauskiene, M. (2011). Mining data with random forest: a survey and results of new tests. Pattern Recognition, 44(2), 330–349. DOI: https://doi.org/10.1016/j.patcog.2010.08.011
Wu, M., Zhong, X., Peng, Q., Xu, M., et al. (2019). Prediction of Molecular Subtypes of Breast Cancer using BI-RADS Features Based on a “White Box” Machine Learning Approach in a Multi-modal Imaging Setting. European Journal of Radiology, 114, 175-184. DOI: https://doi.org/10.1016/j.ejrad.2019.03.015
Downloads
Published
How to Cite
License
Copyright (c) 2022 Journal of Experimental Biology and Agricultural Sciences
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.