Show simple item record

dc.contributor.authorKiarie, Nahashon
dc.date.accessioned2026-04-29T07:58:55Z
dc.date.available2026-04-29T07:58:55Z
dc.date.issued2025
dc.identifier.citationA Thesis Submitted in Partial Fulfillment of the Requirement for Conferment of the Degree of Master in Information Technology of Meru University of Science and Technologyen_US
dc.identifier.urihttp://repository.must.ac.ke/handle/123456789/1617
dc.description.abstractBlood donor retention is critical for maintaining a stable and reliable blood supply, yet predicting donor retention remains a complex challenge. Previous attempts to develop blood donor retention models relied on single algorithms and achieved relatively low prediction accuracy limiting their practical application for donor retention. The Light Gradient Boosting Machine (Light GBM) algorithm employs leaf-wise growth strategy, excels in loss reduction and hence improves accuracy. However, this may lead to potential overfitting, on the other hand, the Extreme Gradient Boosting(XGBoost) algorithm incorporates a robust mechanism for combating overfitting, such as the regularization parameter, column sampling, and weight reduction on new trees but employs a level-wise growth strategy, which is sometimes computationally intensive. This study developed a hybrid ensemble gradient boosting model based on XGBoost and Light GBM. The ensemble model leverages on the high accuracy of Light GBM while mitigating overfitting through and the overfitting prevention strategies of XGBoost. The data was obtained from the Kenya blood banks with 5000 records and nine features. The base models were trained in parallel, a weighted ensemble model was created by assigning weights to the respective prediction results of each model, the ensemble model was then evaluated and the accuracy compared with the accuracy achieved by the individual algorithms. Bayesian hyperparameter optimization was implemented on the base learners in order to find the best combination of hyperparameters and further improve the performance of the model. The ensemble model achieved a performance accuracy of 99.00% and F1 score of 99.00%. This study enables blood agencies to accurately predict blood donor retention, thereby reducing the need for constant donor recruitment efforts and saving both time and costs. Additionally, it will provide insights for targeted retention strategies, ensuring a steady blood supply, ultimately saving lives and improving healthcare systems.en_US
dc.language.isoenen_US
dc.publisherMeru University of Science and Technologyen_US
dc.subjectBlood donor retentionen_US
dc.subjectEnsemble learningen_US
dc.subjectGradient boosting (XGBoost & LightGBM)en_US
dc.titleA Hybrid Ensemble Boosting Model for Enhanced Blood Donor Retentionen_US
dc.typeThesisen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record