Early-Stage Cardiovascular Disease Prediction Using a Sigmoidtropy-Based Decision Tree

Authors

  • Anurag Bhatt Uttarakhand Open University, Haldwani, India
  • Ashutosh Kumar Bhatt Uttarakhand Open University, Haldwani, India

DOI:

https://doi.org/10.6000/1929-6029.2025.14.77

Keywords:

Cardiovascular Disease, Sigmoidtropy-Based Decision Tree (SDT), K-Means Clustering, Naive Bayes, Random Forest, Machine Learning, Risk Prediction, Data Mining

Abstract

Heart disease (HD) is a significant health issue in the world, and its early and proper prediction is essential to minimize mortality and the development of the disease. Cardiovascular disease (CVD) is one of the diseases that need effective and stable predictive models to assist clinical decision-making. This paper gives a Sigmoidtropy-Based Decision Tree (SDT) model of cardiovascular disease prediction, which improves the traditional decision tree by adding a sigmoid-based formulation of entropy. The heart disease data are first grouped by the K-means clustering method in order to enhance the data representation. The suggested SDT model is tested on the Cleveland heart disease dataset of the UCI repository and compared to the traditional classifiers, such as Naive bayes, random forest, and the traditional Decision Tree models. Experimental findings indicate that the SDT has an accuracy of 99.67 which is better than the performance of Random Forest (76.89%), Decision Tree (76.56%), and Naive Bayes (81.84%) with a lower execution time. Despite the promising performance shown by the results, it needs further validation with more datasets and strong evaluation plans to determine the generalizability.

References

Taneja A, Heart disease prediction system using data mining techniques. Oriental Journal of Computer science and Technology 2013; 6(4): 457-66.

Gawali M, Shirwalkar N, Kalshetti A. Heart disease prediction system using data mining techniques. International Journal of Pure and Applied Mathematics 2018; 120(6): 499-506.

Iliyas MM, Shaikh MI, Student MC. Prediction of Heart Disease Using Decision Tree. Allana Inst of Management Sciences, Pune 2019; 9: 1-5.

Yang L, Wu H, Jin X, Zheng P, Hu S, Xu X, Yu W, Yan J. Study of cardiovascular disease prediction model based on random forest in eastern China. Scientific reports 2020; 10(1): 1-8. DOI: https://doi.org/10.1038/s41598-020-62133-5

Chauhan YJ. Cardiovascular Disease Prediction using Classification Algorithms of Machine Learning. International Journal of Science and Research (IJSR) 2020; 9(5): 194-200.

Hossain ME, Uddin S, Khan A. Network analytics and machine learning for predictive risk modelling of cardiovascular disease in patients with type 2 diabetes. Expert Systems with Applications 2021; 164: 113918. DOI: https://doi.org/10.1016/j.eswa.2020.113918

Singh R. A Review on Heart Disease Prediction using Unsupervised and Supervised Learning. Neural Networks 99: 100.

Amin MS, Chiam YK, Varathan KD. Identification of significant features and data mining techniques in predicting heart disease. Telematics and Informatics 2019; 36: 82-93. DOI: https://doi.org/10.1016/j.tele.2018.11.007

Mohan S, Thirumalai C, Srivastava G. Effective heart disease prediction using hybrid machine learning techniques. IEEE Access 2019; 7: 81542-54. DOI: https://doi.org/10.1109/ACCESS.2019.2923707

Al’Aref SJ, Anchouche K, Singh G, Slomka PJ, Kolli KK, Kumar A, Pandey M, Maliakal G, Van Rosendael AR, Beecy AN, Berman DS. Clinical applications of machine learning in cardiovascular disease and its relevance to cardiac imaging. European heart journal 2019; 40(24): 1975-86. DOI: https://doi.org/10.1093/eurheartj/ehy404

Maji S, Arora S. Decision tree algorithms for prediction of heart disease. In Information and communication technology for competitive strategies. Springer, Singapore 2019; pp. 447-454. DOI: https://doi.org/10.1007/978-981-13-0586-3_45

Bashir S, Khan ZS, Khan FH, Anjum A, Bashir K. Improving heart disease prediction using feature selection approaches. In2019 16th international bhurban conference on applied sciences and technology (IBCAST). IEEE 2019; pp. 619-623. DOI: https://doi.org/10.1109/IBCAST.2019.8667106

Mathan K, Kumar PM, Panchatcharam P, Manogaran G, Varadharajan R. A novel Gini index decision tree data mining method with neural network classifiers for prediction of heart disease. Design automation for embedded systems 2018; 22(3): 225-42. DOI: https://doi.org/10.1007/s10617-018-9205-4

Li R, Shen S, Zhang X, Li R, Wang S, Zhou B, Wang Z. Cardiovascular Disease Risk Prediction Based on Random Forest. In The International Conference on Healthcare Science and Engineering. Springer, Singapore 2018; pp. 31-43. DOI: https://doi.org/10.1007/978-981-13-6837-0_3

Esfahani HA, Ghazanfari M. Cardiovascular disease detection using a new ensemble classifier. In 2017 IEEE 4th International Conference on Knowledge-Based Engineering and Innovation (KBEI). IEEE 2017; pp. 1011-1014. DOI: https://doi.org/10.1109/KBEI.2017.8324946

Pahwa K, Kumar R. Prediction of heart disease using hybrid technique for selecting features. In2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON). IEEE 2017; pp. 500-504. DOI: https://doi.org/10.1109/UPCON.2017.8251100

Pouriyeh S, Vahid S, Sannino G, De Pietro G, Arabnia H, Gutierrez J. A comprehensive investigation and comparison of machine learning techniques in the domain of heart disease. In 2017 IEEE symposium on computers and communications (ISCC). IEEE 2017; pp 204-207.

Cleveland Heart Disease Dataset, [online] Available: http://archive.ics.uci.edu/ml/datasets/Heart+Disease.

Pouriyeh S, Vahid S, Sannino G, De Pietro G, Arabnia H, Gutierrez J. A comprehensive investigation and comparison of machine learning techniques in the domain of heart disease.. In 2017 IEEE Symposium on Computers and Communications (ISCC). IEEE 2017; pp 205-206. DOI: https://doi.org/10.1109/ISCC.2017.8024530

Bouali H, Akaichi J. Comparative study of different classification techniques: heart disease use case. In2014 13th International Conference on Machine Learning and Applications. IEEE 2014; pp. 482-486. DOI: https://doi.org/10.1109/ICMLA.2014.84

Ekız S, Erdoğmuş P. Comparative study of heart disease classification. In2017 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT). IEEE 2017; pp. 1-4. DOI: https://doi.org/10.1109/EBBT.2017.7956761

Chauhan R, Bajaj P, Choudhary K, Gigras Y. Framework to predict health diseases using attribute selection mechanism. In2015 2nd International Conference on Computing for Sustainable Global Development (INDIACom). IEEE 2015; pp. 1880-1884.

Jabbar MA, Deekshatulu BL, Chndra P. Alternating decision trees for early diagnosis of heart disease. InInternational Conference on Circuits, Communication, Control and Computing. IEEE 2014; pp. 322-328. DOI: https://doi.org/10.1109/CIMCA.2014.7057816

Farooq K, Karasek J, Atassi H, Hussain A, Yang P, MacRae C, Mahmud M, Luo B, Slack W. A novel cardiovascular decision support framework for effective clinical risk assessment. In2014 IEEE Symposium on Computational Intelligence in Healthcare and e-health (CICARE). IEEE 2014; pp. 117-124. DOI: https://doi.org/10.1109/CICARE.2014.7007843

Xu S, Zhang Z, Wang D, Hu J, Duan X, Zhu T. Cardiovascular risk prediction method based on CFS subset evaluation and random forest classification framework. In2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA). IEEE 2017; pp. 228-232. DOI: https://doi.org/10.1109/ICBDA.2017.8078813

Rahman QA, Tereshchenko LG, Kongkatong M, Abraham T, Abraham MR, Shatkay H. Utilizing ECG-based heartbeat classification for hypertrophic cardiomyopathy identification. IEEE transactions on nanobioscience 2015; 14(5): 505-12. DOI: https://doi.org/10.1109/TNB.2015.2426213

Shahin A, Moudani W, Chakik F, Khalil M. Data mining in healthcare information systems: case studies in Northern Lebanon. InThe Third International Conference on e-Technologies and Networks for Development (ICeND2014). IEEE 2014; pp. 151-155. DOI: https://doi.org/10.1109/ICeND.2014.6991370

Mittal M, Kaur I, Pandey SC, Verma A, Goyal LM. Opinion Mining for the Tweets in Healthcare Sector using Fuzzy Association Rule. EAI Endorsed Transactions on Pervasive Health and Technology 2019; 4(16). DOI: https://doi.org/10.4108/eai.13-7-2018.159861

Mittal M, Arora M, Pandey T, Goyal LM. Image segmentation using deep learning techniques in medical images. In Advancement of Machine Intelligence in Interactive Medical Image Analysis. Springer, Singapore 2020; pp. 41-63. DOI: https://doi.org/10.1007/978-981-15-1100-4_3

Banjoko AW, Abdulazeez KO. Efficient data-mining algorithm for predicting heart disease based on an angiographic test. Malays J Med Sci 2021; 28(5): 118-129. DOI: https://doi.org/10.21315/mjms2021.28.5.12

Absar N, Das EK, Shoma SN, Uddin M, et al. The efficacy of machine-learning-supported smart system for heart disease prediction. Healthcare (Basel) 2022; 10(6): 1137. DOI: https://doi.org/10.3390/healthcare10061137

Biswas N, Ali MM, Rahaman MA, Islam M, Mia MR, Azam S, Ahmed K, Bui FM, Al-Zahrani FA, Moni MA. Machine learning-based model to predict heart disease in early stage employing different feature selection techniques. BioMed Res Int 2023; 2023: 6864343. DOI: https://doi.org/10.1155/2023/6864343

Sadr H, Salari A, Ashoobi MT, Nazari M, et al. Cardiovascular Disease Diagnosis: a holistic approach using the integration of machine learning and deep learning models. Eur J Med Res 2024; 29: 455. DOI: https://doi.org/10.1186/s40001-024-02044-7

Rehman MU, Naseem S, Butt AR, Mahmood T, Khan AR, Khan I, Khan J, Jung YH, et al. Predicting coronary heart disease with advanced machine learning classifiers for improved cardiovascular risk assessment. Sci Rep 2025; 15: 13361. DOI: https://doi.org/10.1038/s41598-025-96437-1

Vu T, Kokubo Y, Inoue M, Yamamoto M, Mohsen A, Martin-Morales A, et al. Machine learning model for predicting coronary heart disease risk: development and validation using insights from a Japanese population-based study. JMIR Cardio 2025; 9: e68066. DOI: https://doi.org/10.2196/68066

Ganie SM, Pramanik PKD, Zhao Z. Ensemble learning with explainable AI for improved heart disease prediction based on multiple datasets. Sci Rep 2025; 15(1): 13912. DOI: https://doi.org/10.1038/s41598-025-97547-6

Teja MD, Rayalu GM. Optimizing heart disease diagnosis with advanced machine learning models: a comparison of predictive performance. BMC Cardiovasc Disord 2025; 25: 212. DOI: https://doi.org/10.1186/s12872-025-04627-6

Kailasanathan N, et al. Heart disease prediction with a feature-sensitized interpretable framework for Internet of Medical Things sensors. Front Digit Health 2025. DOI: https://doi.org/10.3389/fdgth.2025.1612915

Bhatt A, Dubey S, Bhatt A. Sudden cardiac arrest prediction using predictive analytics. Int J Intell Eng Sys 2017; 10(3): 184-191. DOI: https://doi.org/10.22266/ijies2017.0630.20

Bhatt A, Dubey SK, Bhatt AK. Early prediction of cardiovascular disease among young adults through coronary artery calcium score technique. In: International Conference on Advances in Computing and Data Sciences; . Cham: Springer International Publishing 2021; pp. 303-312. DOI: https://doi.org/10.1007/978-3-030-88244-0_29

Bhatt A, Bhatt AK. Leveraging data mining for predictive insights into cardiovascular health risks. Journal of Applied Bioanalysis 2025; 11(S2): 137-146. DOI: https://doi.org/10.53555/jab.v11si2.516

Downloads

Published

2025-12-30

How to Cite

Bhatt, A. ., & Bhatt, A. K. . (2025). Early-Stage Cardiovascular Disease Prediction Using a Sigmoidtropy-Based Decision Tree. International Journal of Statistics in Medical Research, 14, 855–866. https://doi.org/10.6000/1929-6029.2025.14.77

Issue

Section

Specia Issue: New Advances in Multiple Statistical Comparison and Its Applications in Medicine