Machine learning for the classification of breast cancer tumor: a comparative analysis

Authors

  • Ranjan K. Mohapatra Department of Chemistry, Government College of Engineering, Keonjhar, Odisha 758002, India https://orcid.org/0000-0001-7623-3343
  • Madhumita Pal Electronics and communication Engineering, C. V. Raman Global University, Bidyanagar, Mahura, Janla, Bhubaneswar, Odisha 752054, India
  • Smita Parija Electronics and communication Engineering, C. V. Raman Global University, Bidyanagar, Mahura, Janla, Bhubaneswar, Odisha 752054, India
  • Ganapati Panda Electronics and communication Engineering, C. V. Raman Global University, Bidyanagar, Mahura, Janla, Bhubaneswar, Odisha 752054, India
  • Kuldeep Dhama Division of Pathology, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly-243122, Uttar Pradesh, India https://orcid.org/0000-0001-7469-4752

DOI:

https://doi.org/10.18006/2022.10(2).440.450

Keywords:

Breast cancer, Multilayer perceptron, K-NN, MLP, Random forest

Abstract

The detection and diagnosis of Breast cancer at an early stage is a challenging task. With the increase in emerging technologies such as data mining tools, along with machine learning algorithms, new prospects in the medical field for automatic diagnosis have been developed, with which the prediction of a disease at an early stage is possible. Early detection of the disease may increase the survival rate of patients. The main purpose of the study was to predict breast cancer disease as benign or malignant by using supervised machine learning algorithms such as the K-nearest neighbor (K-NN), multilayer perceptron (MLP), and random forest (RF) and to compare their performance in terms of the accuracy, precision, F1 score, support, and AUC. The experimental results demonstrated that the MLP achieved a high prediction accuracy of 99.4%, followed by random forest (96.4%) and K-NN (76.3%). The diagnosis rates of the MLP, random forest and K-NN were 99.9%, 99.6%, and 73%, respectively. The study provides a clear idea of the accomplishments of classification algorithms in terms of their prediction ability, which can aid healthcare professionals in diagnosing chronic breast cancer efficiently.

References

Asri, H., Mousannif, H., Moatassime, H.A., & Noel, T. (2016). Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis. The 6th International Symposium on Frontiers in Ambient and Mobile Systems (FAMS 2016). Procedia Computer Science, 83, 1064 – 1069. DOI: https://doi.org/10.1016/j.procs.2016.04.224

Banu, A.B., & Subramanian, P.T. (2018). Comparison of Bayes classifers for breast cancer classifcation. Asian Pacific Journal of Cancer Prevention, 19(10), 2917–20.

Chaurasia, V., Pal, S., & Tiwari, B. (2018). Prediction of benign and malignant breast cancer using data mining techniques. Journal of Algorithms and Computational Technology, 12(2), 119–26. DOI: https://doi.org/10.1177/1748301818756225

Costa, K., Ribeiro, P., Carmargo, A., Rossi, V., et al. (2013). Comparison of the techniques decision tee and MLP for data mining in SPAMs detection in computer networks. Proceedings of the 3rd international conference on innovative computing Technology, 344–348. DOI: https://doi.org/10.1109/INTECH.2013.6653725

Forsyth, A.W., Barzilay, R., Hughes, K.S., Lui, D., et al. (2018). Machine Learning Methods to Extract Documentation of Breast Cancer Symptoms from Electronic Health Records. Journal of Pain and Symptom Management, 55(6), 1492-1499. DOI: https://doi.org/10.1016/j.jpainsymman.2018.02.016

García-Laencina, P.J., Abreu, P.H., Abreu, M.H., & Afonoso, N. (2015). Missing data imputation on the 5-year survival prediction of breast cancer patients with unknown discrete values. Computers in Biology and Medicine, 59, 125–133. DOI: https://doi.org/10.1016/j.compbiomed.2015.02.006

Islam, M., Haque, R., Iqbal, H., Hasan, M., et al. (2020). Breast Cancer Prediction: A Comparative Study Using MachineLearning Techniques. SN Computer Science, 1, 290. DOI: https://doi.org/10.1007/s42979-020-00305-w

Islam, M.M., Iqbal, H., Haque, M.R., & Hasan, M.K. (2017). Prediction of Breast Cancer Using Support Vector Machine and K-Nearest Neighbours 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC), Dhaka, Bangladesh. DOI: https://doi.org/10.1109/R10-HTC.2017.8288944

Jiang, T., Gradus, J.L., & Rosellini, A.J. (2020). Supervised machine learning: A brief primer. Behavior Therapy, 51(5), 675-687. DOI: https://doi.org/10.1016/j.beth.2020.05.002

Kaur, P., Kumar, R., & Kumar, M. (2019). A healthcare monitoring system using random forest and internet of things (IoT). Multimedia Tools and Applications, 78, 19905–19916. DOI: https://doi.org/10.1007/s11042-019-7327-8

Latchoumi, T.P., & Parthiban, L. (2017). Abnormality detection using weighed particle swarm optimization and smooth support vector machine. Biomedical Research, 28, 4749–51.

Muktevi, S. (2020). Prediction of Breast Cancer Disease using Machine Learning Algorithms. International Journal of Innovative Technology and Exploring Engineering, 9(4), 2868-2878. DOI: https://doi.org/10.35940/ijitee.D1866.029420

Rana, M., Chandorkar, P., & Dsouza, A. (2015). Breast cancer diagnosis and recurrence prediction using machine learning

techniques. International Journal of Research in Engineering and Technology, 4(4), 372-376.

Sakri, S.B., Rashid, N.B.A., & Zain, Z.M. (2018). Particle swarm optimization feature selection for breast cancer recurrence prediction. IEEE Access, 6, 29637–29647. DOI: https://doi.org/10.1109/ACCESS.2018.2843443

Singh, B.K. (2019). Determining relevant biomarkers for breast cancer using anthropometric and clinical features: A comparative investigation in machine learning paradigm. Biocybernetics and Biomedical Engineering, 39, 393-409. DOI: https://doi.org/10.1016/j.bbe.2019.03.001

Sun, Y.S., Zhao, Z., Yang, Z.N., Xu, F., et al. (2017). Risk Factors and Preventions of Breast Cancer. International Journal of Biological Sciences, 13(11):1387–1397. DOI: https://doi.org/10.7150/ijbs.21635

Verikas, A., Gelzinis, A., & Bacauskiene, M. (2011). Mining data with random forest: a survey and results of new tests. Pattern Recognition, 44(2), 330–349. DOI: https://doi.org/10.1016/j.patcog.2010.08.011

Wu, M., Zhong, X., Peng, Q., Xu, M., et al. (2019). Prediction of Molecular Subtypes of Breast Cancer using BI-RADS Features Based on a “White Box” Machine Learning Approach in a Multi-modal Imaging Setting. European Journal of Radiology, 114, 175-184. DOI: https://doi.org/10.1016/j.ejrad.2019.03.015

Downloads

Published

2022-04-30

How to Cite

Mohapatra, R. K., Pal, M., Parija, S., Panda, G., & Dhama, K. (2022). Machine learning for the classification of breast cancer tumor: a comparative analysis. Journal of Experimental Biology and Agricultural Sciences, 10(2), 440–450. https://doi.org/10.18006/2022.10(2).440.450

Issue

Section

RESEARCH ARTICLES

Categories