Predicting heart disease using machine learning algorithms

Authors

  • A. Berdaly Al-Farabi Kazakh National University
  • Z. Abdiakhmetova Al-Farabi Kazakh National University

DOI:

https://doi.org/10.26577/JMMCS.2022.v115.i3.10

Keywords:

Classification, Standardization, Training Selection, Metrics, Busting, Confusion matrix

Abstract

Increasing the accuracy of detecting heart disease is widely studied in the field of machine learning. Such a study is intended to prevent large costs in the field of healthcare and is the reason for the misdiagnosis. As a result, various methods of analyzing disease factors were proposed, aimed at reducing differences in the practice of doctors and reducing medical costs and errors. In this study, 6 classification learning algorithms were used, including machine learning methods such as classification Tree, Close neighborhood method, Naive Bayes, random forest tree, and busting methods.  These methods were collected by the University of Cleveland.using csv datacet, they were trained to make an effective and accurate prediction of heart disease. In order to increase the predictive capabilities of algorithms, all methods were trained primarily on non-standardized data.  A study was conducted on how much data standardization affects the result using the Standard Scaler method. In the course of the study, this method helped algorithms such as KNN and SVC improve the result by about 25\%.

References

[1] Gusev A .V., Novitskiy R.E., Ivshin.A.A."Machine learning based on laboratory data for disease prediction" ,
FARMAKOEKONOMIKA. Modern Pharmacoeconomics and Pharmacoepidemiology(2021): 8-9 p.
[2] R. Bharti [et al.],"Prediction of Heart Disease Using a Combination of Machine Learning and Deep Learning" ,
Computational Intelligence and Neuroscience(2021): 3-6 p.
[3] Dimitris Bertsimas, Luca Mingardi, Bartolomeo Stellato "Machine Learning for Real-Time Heart Disease Prediction" , IEEE Journal of Biomedical and Health Informatics (2021): 12-14 p.
[4] Mensah G.A., Roth G.A., Valentin Fuster, "The Global Burden of Cardiovascular Diseases and Risk Factors: 2020 and Beyond " , Journal of the American College of Cardiology, (2019): 2-9 p.
[5] Amini M., Zayeri F., Salehi M., "Trend analysis of cardiovascular disease mortality, incidence, and mortality-to-incidence ratio: results from global burden of disease study 2017" , BMC Public Health, (2021): 12 p.
[6] Md Mamun Ali [et al.], "Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison" , Computers in Biology and Medicine. (2021): 6-7 p.
[7] Amin Ul Haq [et al.], "A hybrid intelligent system framework for the prediction of heart disease using machine learning algorithms" , Mobile Information System (2018): 11-13 p.
[8] Fajr Ibrahem Alarsan, Mamoon Younes, "Analysis and classification of heart diseases using heartbeat features and machine learning algorithms" , Journal of Big Data (2019): 8 p.
[9] Yunxing Jiang [et al.], "Cardiovascular disease prediction by machine learning algorithms based on cytokines in kazakhs of china" , Clinical Epidemiology (2021): 5 p.
[10] Yan Yan Song, Ying Lu., "Decision tree methods: applications for classification and prediction" , Shanghai Archives of Psychiatry (2015): 1-7 p.
[11] Pбdraig Cunningham, "K-Nearest Neighbour Classifiers-A Tutorial" , ACM Computing Surveys (2021): 6-9 p.
[12] Sandro Sperandei, "Understanding logistic regression analysis" , Biochemia Medica (2014): 5-9 p.
[13] Candice Bentejac, Anna Csurgo, Gonzalo Martinez-Munoz, "Data classification using support vector machine" , Artificial Intelligence Review (2021): 2-7 p.
[14] Leo Breima, "Random forests" , Machine Learning 45,5-32 (2001)
[15] Durgesh K Srivastava, Lekha Bhambhu, "Data classification using support vector machine" , Journal of Theoretical and Applied Information Technology (2010): 3-9 p

Downloads

Published

2022-09-27