International Journal of Innovative Research in Computer and Communication Engineering
ISSN Approved Journal | Impact factor: 8.771 | ESTD: 2013 | Follows UGC CARE Journal Norms and Guidelines
| Monthly, Peer-Reviewed, Refereed, Scholarly, Multidisciplinary and Open Access Journal | High Impact Factor 8.771 (Calculated by Google Scholar and Semantic Scholar | AI-Powered Research Tool | Indexing in all Major Database & Metadata, Citation Generator | Digital Object Identifier (DOI) |
| TITLE | An Explainable Machine Learning Framework for Liver Disease Prediction Using Dual Statistical Feature Optimization and Ensemble Learning |
|---|---|
| ABSTRACT | Liver disease continues to be a major global health concern due to its silent progression and delayed diagnosis. This study proposes an efficient and interpretable machine learning framework for early liver disease prediction by integrating advanced preprocessing, Dual Statistical Feature Optimization (DSFO), and ensemble learning. Missing values are handled using iterative imputation techniques, while class imbalance is addressed using SMOTE to ensure balanced data distribution. The proposed DSFO method combines Chi-Square dependency analysis and ANOVA-based variance evaluation to identify the most significant clinical features. Unlike conventional stacking approaches, the model employs a simplified ensemble strategy using LightGBM and Extra Trees Classifier combined through soft voting, improving generalization while reducing model complexity. The framework achieved an accuracy of 97.96% and ROC-AUC of 0.9894, demonstrating strong predictive capability. To enhance interpretability, SHAP is used to provide insights into feature contributions. Results reveal that biochemical attributes such as bilirubin and liver enzyme levels play a dominant role in prediction. The proposed system offers a reliable and transparent solution suitable for real-world clinical applications. |
| AUTHOR | POLEMREDDY VENKATA ALEKHYA, DR. DEEPAK NEDUNURI M. Tech Scholar, Department of CSE, Sir C R Reddy College of Engineering, Eluru, India Professor, Department of CSE, Sir C R Reddy College of Engineering, Eluru, India |
| VOLUME | 183 |
| DOI | DOI: 10.15680/IJIRCCE.2026.1404144 |
| pdf/144_An Explainable Machine Learning Framework for Liver Disease Prediction Using Dual Statistical Feature Optimization and Ensemble Learning.pdf | |
| KEYWORDS | |
| References | [1] S. Hashem et al., “Machine Learning Prediction Models for Diagnosing Hepatocellular Carcinoma with HCV-related Chronic Liver Disease,” Computer Methods and Programs in Biomedicine, vol. 196, p. 105551, Nov. 2020, doi: 10.1016/j.cmpb.2020.105551. [2] E. Dritsas and M. Trigka, “Supervised Machine Learning Models for Liver Disease Risk Prediction,” Computers, vol. 12, no. 1, p. 19, Jan. 2023, doi: 10.3390/computers12010019. [3] S. Hashem et al., "Comparison of Machine Learning Approaches for Prediction of Advanced Liver Fibrosis in Chronic Hepatitis C Patients," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 15, no. 3, pp. 861-868, 1 May-June 2018, doi: 10.1109/TCBB.2017.2690848. [4] W. El Atifi, O. El Rhazouani, F. M. Khan, and H. Sekkat, “Optimizing ensemble machine learning models for accurate liver disease prediction in healthcare,” PLoS One, vol. 20, no. 8, p. e0330899, Aug. 2025, doi: 10.1371/journal.pone.0330899. [5] S. M. Ganie and P. K. Dutta Pramanik, “A comparative analysis of boosting algorithms for chronic liver disease prediction,” Healthcare Analytics, vol. 5, p. 100313, Jun. 2024, doi: 10.1016/j.health.2024.100313. [6] F. Mostafa, E. Hasan, M. Williamson, and H. Khan, “Statistical Machine Learning Approaches to Liver Disease Prediction,” Livers, vol. 1, no. 4, pp. 294–312, Dec. 2021, doi: 10.3390/livers1040023. [7] G. S. Harshpreet Kaur, “The Diagnosis of Chronic Liver Disease using Machine Learning Techniques,” ITII, vol. 9, no. 2, pp. 554–564, Mar. 2021, doi: 10.17762/itii.v9i2.382. [8] R. Amin, R. Yasmin, S. Ruhi, M. H. Rahman, and M. S. Reza, “Prediction of chronic liver disease patients using integrated projection based statistical feature extraction with machine learning algorithms,” Informatics in Medicine Unlocked, vol. 36, p. 101155, 2023, doi: 10.1016/j.imu.2022.101155. |