Chronic kidney Disease Classification through Hybrid Feature Selection and Ensemble Deep Learning
DOI:
https://doi.org/10.6000/1929-6029.2025.14.11Keywords:
Deep learning, Feature selection, Recursive Feature EliminationAbstract
Diagnosing and treating at-risk patients for chronic kidney disease (CKD) relies heavily on accurately classifying the disease. The use of deep learning models in healthcare research is receiving much interest due to recent developments in the field. CKD has many features; however, only some features contribute weightage for the classification task. Therefore, it is required to eliminate the irrelevant feature before applying the classification task. This paper proposed a hybrid feature selection method by combining the two feature selection techniques: the Boruta and the Recursive Feature Elimination (RFE) method. The features are ranked according to their importance for CKD classification using the Boruta algorithm and refined feature set using the RFE, which recursively eliminates the least important features. The hybrid feature selection method removes the feature with a low recursive score. Later, selected features are given input to the proposed ensemble deep learning method for classification. The experimental ensemble deep learning model with feature selection is compared to Support Vector Machine (SVM), Logistic Regression (LR), and Random Forest (RF) models with and without feature selection. When feature selection is used, the ensemble model improves accuracy by 2%. Experimental results found that these features, age, pus cell clumps, bacteria, and coronary artery disease, do not contribute much to accurate classification tasks. Accuracy, precision, and recall are used to evaluate the ensemble deep learning model.
References
Shakhshir MH, Vanoh D, Hassan M, Zyoud SH. Mapping the global research landscape on malnutrition for patients with chronic kidney disease: a visualization analysis. Journal of Health Population and Nutrition 2023; 42(1): 101. DOI: https://doi.org/10.1186/s41043-023-00445-8
Islam MA, Majumder MZH, Hussein MA. Chronic kidney disease prediction based on machine learning algorithms. Journal of Pathology Informatics 2023; 14: 100189. DOI: https://doi.org/10.1016/j.jpi.2023.100189
Levey A, Atkins R, Coresh J, Cohen E, Collins A, Eckardt K-U, et al. Chronic kidney disease as a global public health problem: approaches and initiatives-a position statement from kidney disease improving global outcomes. Kidney International 2007; 72(3): 247-259. DOI: https://doi.org/10.1038/sj.ki.5002343
Lv J-C, Zhang L-X. Prevalence and disease burden of chronic kidney disease. Renal fibrosis: mechanisms and therapies 2019; 3-15. DOI: https://doi.org/10.1007/978-981-13-8871-2_1
Tannor EK, Sarfo FS, Mobula LM, Sarfo-Kantanka O, Adu-Gyamfi R, Plange-Rhule J. Prevalence and predictors of chronic kidney disease among ghanaian patients with hypertension and diabetes mellitus: A multicenter cross-sectional study. The Journal of Clinical Hypertension 2019; 21(10): 1542-1550. DOI: https://doi.org/10.1111/jch.13672
Bai Q, Su C, Tang W, Li Y. Machine learning to predict end stage kidney disease in chronic kidney disease. Scientific Reports 2022; 12(1): 8377. DOI: https://doi.org/10.1038/s41598-022-12316-z
Sawhney R, Malik A, Sharma S, Narayan V. A comparative assessment of artificial intelligence models used for early prediction and evaluation of chronic kidney disease. Decision Analytics Journal 2023; 6: 100169. DOI: https://doi.org/10.1016/j.dajour.2023.100169
Harimoorthy K, Thangavelu M. Multi-disease prediction model using improved svm-radial bias technique in healthcare monitoring system. Journal of Ambient Intelligence and Humanized Computing 2021; 12: 3715-3723. DOI: https://doi.org/10.1007/s12652-019-01652-0
Fahimifar S, Mousavi K, Mozaffari F, Ausloos M. Identification of the most important external features of highly cited scholarly papers through 3 (ie, ridge, lasso, and boruta) feature selection data mining methods: Identification of the most important external features of highly cited scholarly papers through 3 (ie, ridge, lasso, and boruta) feature selection data mining methods. Quality & Quantity 2023; 57(4): 3685-3712. DOI: https://doi.org/10.1007/s11135-022-01480-z
Habibi A, Delavar MR, Sadeghian MS, Nazari B, Pirasteh S. A hybrid of ensemble machine learning models with rfe and boruta wrapper-based algorithms for flash flood susceptibility assessment. International Journal of Applied Earth Observation and Geoinformation 2023; 122: 103401. DOI: https://doi.org/10.1016/j.jag.2023.103401
Tang R, Zhang X. Cart decision tree combined with boruta feature selection for medical data classification. in 2020 5th IEEE International Conference on Big Data Analytics (ICBDA) IEEE 2020; 80-84. DOI: https://doi.org/10.1109/ICBDA49040.2020.9101199
Zhang B, Dong X, Hu Y, Jiang X, Li G. Classification and prediction of spinal disease based on the smote-rfe-xgboost model. Peer J Computer Science 2023; 9: e1280. DOI: https://doi.org/10.7717/peerj-cs.1280
Ebiaredoh-Mienye SA, Swart TG, Esenogho E, Mienye ID. A machine learning method with filter-based feature selection for improved prediction of chronic kidney disease. Bioengineering 2022; 9(8): 350. DOI: https://doi.org/10.3390/bioengineering9080350
Ferguson T, Ravani P, Sood MM, Clarke A, Komenda P, Rigatto C, Tangri N. Development and external validation of a machine learning model for progression of ckd. Kidney International Reports 2022; 7(8): 1772-1781. DOI: https://doi.org/10.1016/j.ekir.2022.05.004
Lee J, Warner E, Shaikhouni S, Bitzer M, Kretzler M, Gipson D, et al. Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease. Scientific Reports 2022; 12(1): 4832. DOI: https://doi.org/10.1038/s41598-022-08974-8
Cui R, Liu M, Initiative ADN, et al. Rnn-based longitudinal analysis for diagnosis of alzheimer’s disease. Computerized Medical Imaging and Graphics 2019; 73: 1-10. DOI: https://doi.org/10.1016/j.compmedimag.2019.01.005
Shi X, Wang T, Wang L, Liu H, Yan N. Hybrid convolutional recurrent neural networks outperform cnn and rnn in task-state eeg detection for parkinson’s disease. in 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), IEEE 2019; 939-944. DOI: https://doi.org/10.1109/APSIPAASC47483.2019.9023190
Ghosh P, Shamrat FJM, Shultana S, Afrin S, Anjum AA, Khan AA. Optimization of prediction method of chronic kidney disease using machine learning algorithm. in 2020 15th international joint symposium on artificial intelligence and natural language processing (iSAI-NLP), IEEE 2020; 1-6. DOI: https://doi.org/10.1109/iSAI-NLP51646.2020.9376787
Aljaaf AJ, Al-Jumeily D, Haglan HM, Alloghani M, Baker T, Hussain AJ, Mustafina J. Early prediction of chronic kidney disease using machine learning supported by predictive analytics. in 2018 IEEE congress on evolutionary computation (CEC), IEEE 2018; 1-9. DOI: https://doi.org/10.1109/CEC.2018.8477876
Alassaf RA, Alsulaim KA, Alroomi NY, Alsharif NS, Aljubeir MF, Olatunji SO, et al. Preemptive diagnosis of chronic kidney disease using machine learning techniques. in 2018 international conference on innovations in information technology (IIT), IEEE 2018; 99-104. DOI: https://doi.org/10.1109/INNOVATIONS.2018.8606040
Mezzatesta S, Torino C, De Meo P, Fiumara G, Vilasi A. A machine learning-based approach for predicting the outbreak of cardiovascular diseases in patients on dialysis. Computer Methods and Programs in Biomedicine 2019; 177: 9-15. DOI: https://doi.org/10.1016/j.cmpb.2019.05.005
Schucany WR, Tony Ng H. Preliminary goodness-of-fit tests for normality do not validate the one-sample student t. Communications in Statistics-Theory and Methods 2006; 35(12): 2275-2286. DOI: https://doi.org/10.1080/03610920600853308
Pacheco F, de Oliveira JV, Sanchez R-V, Cerrada M, Cabrera D, Li C, Zurita G, Artes M. A statistical comparison of neuroclassifiers and feature selection methods for gearbox fault diagnosis under realistic conditions. Neurocomputing 2016; 194: 192-206. DOI: https://doi.org/10.1016/j.neucom.2016.02.028
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Policy for Journals/Articles with Open Access
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are permitted and encouraged to post links to their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work
Policy for Journals / Manuscript with Paid Access
Authors who publish with this journal agree to the following terms:
- Publisher retain copyright .
- Authors are permitted and encouraged to post links to their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work .