PROCESSING OF ISCHEMIC HEART DISEASE DATA USING ENSEMBLE CLASSIFICATION METHODS OF MACHINE LEARNING

Abstract

The WHO 2019 statistics provide evidence that cardiovascular diseases are among the prevailing causes of death globally [1]. In this study, a combined dataset of coronary artery disease (CAD), also known as ischemic heart disease, was used as the dataset for analysis. To influence the outcome of the occurrence of cardiovascular diseases, it is important to find significant features that contribute to the presence of this disease. This article demonstrated that important features can be obtained through classification and their visualization in Tableau. Three classification models were built, and important features were identified for each model. Then, the top 10 important features were selected from each model, and through comparison, the 5 most important features were identified that may influence the disease outcome. The classification models achieved the following f1-score results: LGBM (93.2%), XGB (92.0%), and RF (89.1%).

Author Biographies

Rustem Imanbek, Al-Farabi Kazakh National University, Almaty, Kazakhstan
Zholdas Buribayev, Al-Farabi Kazakh National University, Almaty, Kazakhstan
Ainur Yerkos, Al-Farabi Kazakh National University, Almaty, Kazakhstan
Published
2023-07-03
How to Cite
IMANBEK, Rustem; BURIBAYEV, Zholdas; YERKOS, Ainur. PROCESSING OF ISCHEMIC HEART DISEASE DATA USING ENSEMBLE CLASSIFICATION METHODS OF MACHINE LEARNING. Journal of problems in computer science and information technologies, [S.l.], v. 1, n. 2, july 2023. ISSN 2958-0846. Available at: <https://dslib.kaznu.kz/index.php/kaznu/article/view/68>. Date accessed: 22 nov. 2024. doi: https://doi.org/10.26577/JPCSIT.2023.v1.i2.06.