Comparative Analysis of Machine Learning Techniques for Early Detection of Breast Cancer

Prachi Rawat; Rashmi Saini; Anuj Kumar

doi:10.63503/j.ijcma.2025.89

Authors

Prachi Rawat
Rashmi Saini
Anuj Kumar Doon University Dehradun

DOI:

https://doi.org/10.63503/j.ijcma.2025.89

Keywords:

Machine Learning, Medical Diagnosis, Breast Cancer, Classification, Early detection

Abstract

Abstract: Breast cancer is the most frequently encountered form of cancer among the populace, and women are more likely than males to develop it. Catching it early increases the likelihood of survival but due to the complex nature of masses and microcalcification, radiologists oftentimes fail to diagnose breast cancer properly. Radiologists use Computer aided diagnostic (CAD) systems to detect abnormalities, however, several uncertainties in breast cancer detection using mammograms makes it challenging. The employment of Machine Learning (ML) in the medical field for diagnosis and its accuracy is an inevitable futuristic step. ML techniques in breast cancer detection greatly help in early and accurate detection thereby increasing the patient’s survival rate. This paper compares the different popular Machine Learning techniques such as Support Vector Machine (SVM), Random Forest (RF), k Nearest Neighbor, and Decision Tree on Wisconsin Breast Cancer dataset. Various metrics for performance evaluation such as Accuracy, Precision, Recall, F1 score, Specificity, False Positive Rate, and False Negative Rate are used for model evaluation. Random Forest yielded the highest accuracy while SVM fared better than other algorithms by achieving the highest precision.

References

[1] World Health Organization (WHO), “Breast cancer,” World Health Organization (WHO). Accessed: Dec. 21, 2022. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/breast-cancer

[2] O. Ubrurhe, N. Houlden, and P. S. Excell, “A Review of Energy Efficiency in Wireless Body Area/Sensor Networks, With Emphasis on MAC Protocol,” Annals of Emerging Technologies in Computing, vol. 4, no. 1, pp. 1–7, Jan. 2020, doi: 10.33166/AETiC.2020.01.001.

[3] M. Piñeros, A. Znaor, L. Mery, and F. Bray, “A Global Cancer Surveillance Framework Within Noncommunicable Disease Surveillance: Making the Case for Population-Based Cancer Registries,” Epidemiol Rev, vol. 39, no. 1, pp. 161–169, Jan. 2017, doi: 10.1093/epirev/mxx003.

[4] S. J. Nechuta et al., “The After Breast Cancer Pooling Project: rationale, methodology, and breast cancer survivor characteristics,” Cancer Causes & Control, vol. 22, no. 9, pp. 1319–1331, Sep. 2011, doi: 10.1007/s10552-011-9805-9.

[5] M. F. Ak, “A Comparative Analysis of Breast Cancer Detection and Diagnosis Using Data Visualization and Machine Learning Applications,” Healthcare, vol. 8, no. 2, p. 111, Apr. 2020, doi: 10.3390/healthcare8020111.

[6] A. Jemal et al., “Cancer Statistics, 2008,” CA Cancer J Clin, vol. 58, no. 2, pp. 71–96, Jan. 2008, doi: 10.3322/CA.2007.0010.

[7] A. A. Ardakani, A. Gharbali, and A. Mohammadi, “Classification of Breast Tumors Using Sonographic Texture Analysis,” Journal of Ultrasound in Medicine, vol. 34, no. 2, pp. 225–231, Feb. 2015, doi: 10.7863/ultra.34.2.225.

[8] B. L. Sprague et al., “Variation in Mammographic Breast Density Assessments Among Radiologists in Clinical Practice,” Ann Intern Med, vol. 165, no. 7, p. 457, Oct. 2016, doi: 10.7326/M15-2934.

[9] P. E. Freer, “Mammographic Breast Density: Impact on Breast Cancer Risk and Implications for Screening,” RadioGraphics, vol. 35, no. 2, pp. 302–315, Mar. 2015, doi: 10.1148/rg.352140106.

[10] T. M. Kolb, J. Lichy, and J. H. Newhouse, “Comparison of the Performance of Screening Mammography, Physical Examination, and Breast US and Evaluation of Factors that Influence Them: An Analysis of 27,825 Patient Evaluations,” Radiology, vol. 225, no. 1, pp. 165–175, Oct. 2002, doi: 10.1148/radiol.2251011667.

[11] H. D. Cheng, X. J. Shi, R. Min, L. M. Hu, X. P. Cai, and H. N. Du, “Approaches for automated detection and classification of masses in mammograms,” Pattern Recognit, vol. 39, no. 4, pp. 646–668, Apr. 2006, doi: 10.1016/j.patcog.2005.07.006.

[12] F. Bray, P. McCarron, and D. M. Parkin, “The changing global patterns of female breast cancer incidence and mortality,” Breast Cancer Research, vol. 6, no. 6, p. 229, Dec. 2004, doi: 10.1186/bcr932.

[13] R. L. Birdwell, D. M. Ikeda, K. F. O’Shaughnessy, and E. A. Sickles, “Mammographic Characteristics of 115 Missed Cancers Later Detected with Screening Mammography and the Potential Utility of Computer-aided Detection,” Radiology, vol. 219, no. 1, pp. 192–202, Apr. 2001, doi: 10.1148/radiology.219.1.r01ap16192.

[14] P. Skaane and K. Engedal, “Analysis of sonographic features in the differentiation of fibroadenoma and invasive ductal carcinoma.,” American Journal of Roentgenology, vol. 170, no. 1, pp. 109–114, Jan. 1998, doi: 10.2214/ajr.170.1.9423610.

[15] K. Doi, “Computer-aided diagnosis: potential usefulness in diagnostic radiology and telemedicine,” in Proceedings of the National Forum: Military Telemedicine On-Line Today Research, Practice, and Opportunities, IEEE Comput. Soc. Press, pp. 9–13. doi: 10.1109/MTOL.1995.504521.

[16] N. GULER, E. UBEYLI, and I. GULER, “Recurrent neural networks employing Lyapunov exponents for EEG signals classification,” Expert Syst Appl, vol. 29, no. 3, pp. 506–514, Oct. 2005, doi: 10.1016/j.eswa.2005.04.011.

[17] A. Osareh and B. Shadgar, “Machine learning techniques to diagnose breast cancer,” in 2010 5th International Symposium on Health Informatics and Bioinformatics, IEEE, 2010, pp. 114–120. doi: 10.1109/HIBIT.2010.5478895.

[18] D. Michie, D. J. Spiegelhalter, and C. C. Taylor, “Machine Learning, Neural, and Statistical Classification,” 1994.

[19] Shen-Chuan Tai, Zih-Siou Chen, and Wei-Ting Tsai, “An Automatic Mass Detection System in Mammograms Based on Complex Texture Features,” IEEE J Biomed Health Inform, vol. 18, no. 2, pp. 618–627, Mar. 2014, doi: 10.1109/JBHI.2013.2279097.

[20] J. Nagi, S. Abdul Kareem, F. Nagi, and S. Khaleel Ahmed, “Automated breast profile segmentation for ROI detection using digital mammograms,” in 2010 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES), IEEE, Nov. 2010, pp. 87–92. doi: 10.1109/IECBES.2010.5742205.

[21] S. M. Butler, G. I. Webb, and R. A. Lewis, “A Case Study in Feature Invention for Breast Cancer Diagnosis Using X-Ray Scatter Images,” in Lecture Notes in Computer Science AI 2003: Advances in Artificial Intelligence, vol. 2903, 2003, pp. 677–685. doi: 10.1007/978-3-540-24581-0_58.

[22] A. Qayyum and A. Basit, “Automatic breast segmentation and cancer detection via SVM in mammograms,” in 2016 International Conference on Emerging Technologies (ICET), IEEE, Oct. 2016, pp. 1–6. doi: 10.1109/ICET.2016.7813261.

[23] T. T. Htay and S. S. Maung, “Early Stage Breast Cancer Detection System using GLCM feature extraction and K-Nearest Neighbor (k-NN) on Mammography image,” in 2018 18th International Symposium on Communications and Information Technologies (ISCIT), IEEE, Sep. 2018, pp. 171–175. doi: 10.1109/ISCIT.2018.8587920.

[24] B. Dai, R.-C. Chen, S.-Z. Zhu, and W.-W. Zhang, “Using Random Forest Algorithm for Breast Cancer Diagnosis,” in 2018 International Symposium on Computer, Consumer and Control (IS3C), IEEE, Dec. 2018, pp. 449–452. doi: 10.1109/IS3C.2018.00119.

[25] S. Hamed, A. Mesleh, and A. Arabiyyat, “Breast Cancer Detection Using Machine Learning Algorithms,” International Journal of Computer Science and Mobile Computing, vol. 10, no. 11, pp. 4–11, Nov. 2021, doi: 10.47760/ijcsmc.2021.v10i11.002.

[26] G. Williams, “Descriptive and Predictive Analytics,” in Data Mining with Rattle and R, New York, NY: Springer New York, 2011, pp. 171–177. doi: 10.1007/978-1-4419-9890-3_8.

[27] K. Kourou, T. P. Exarchos, K. P. Exarchos, M. V. Karamouzis, and D. I. Fotiadis, “Machine learning applications in cancer prognosis and prediction,” Comput Struct Biotechnol J, vol. 13, pp. 8–17, 2015, doi: 10.1016/j.csbj.2014.11.005.

[28] T. J. Cleophas and A. H. Zwinderman, Machine Learning in Medicine. Dordrecht: Springer Netherlands, 2013. doi: 10.1007/978-94-007-5824-7.

[29] D. Bazazeh and R. Shubair, “Comparative study of machine learning algorithms for breast cancer detection and diagnosis,” in 2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA), IEEE, Dec. 2016, pp. 1–4. doi: 10.1109/ICEDSA.2016.7818560.

[30] I. Kononenko, “Machine learning for medical diagnosis: history, state of the art and perspective,” Artif Intell Med, vol. 23, no. 1, pp. 89–109, Aug. 2001, doi: 10.1016/S0933-3657(01)00077-X.

[31] Y. Yasui and X. Wang, “Statistical Learning from a Regression Perspective by BERK, R. A.,” Biometrics, vol. 65, no. 4, pp. 1309–1310, Dec. 2009, doi: 10.1111/j.1541-0420.2009.01343_5.x.

[32] Breastcancer.org, “Breast cancer Facts and Statistics,” Breastcancer.org. Accessed: Dec. 12, 2022. [Online]. Available: https://www.breastcancer.org/symptoms/understand_bc/statistics

Comparative Analysis of Machine Learning Techniques for Early Detection of Breast Cancer

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

Make a Submission

issn-online

frequency

Language

format-of-publication

publishing-mode-and-access-fees

Submission Process:

Post Acceptance:

Email ID

Editor-in-Chief

Publisher