Reliability of Predictions Using Hybrid Models: The Case of Malaria Incidence Rates in Uganda

Authors

  • Francis Bbosa School of Statistics and Planning, Makerere University, Kampala, Uganda https://orcid.org/
  • Ronald Wesonga Department of Statistics, College of Science, Sultan Qaboos University, Muscat, Oman
  • Josephine Nabukenya School of Computing and Informatics Technology, Makerere University, Kampala, Uganda
  • Peter Nabende School of Computing and Informatics Technology, Makerere University, Kampala, Uganda

DOI:

https://doi.org/10.12856/JHIA-2020-v7-i2-289

Abstract

Background and purpose: Reliability of estimates emanating from predictive independent data mining techniques is a complex problem. This could be attributed to cross-cutting weaknesses of individual techniques such as collinearity due to high dimensionality of attributes in a dataset, biasedness due to under fitting and over fitting of data as well as noise accumulation due to outliers and thus affecting the reliability of predictions emanating from these models. This study thus aimed at developing a hybrid data mining technique for predicting reliable malaria incidence rate thresholds.

Methods: The decision tree and naïve Bayes classifiers were used to build a hybrid prediction model. Results of the developed hybrid model were compared with independent data mining models using 10-fold cross-validation on a previously unlearned data set. Accuracy, F-measure and the area under the receiver operating characteristics curve (AUC) were the key performance metrics used to evaluate the generalizability of the hybrid model in comparison to the independent models.

Results: Findings revealed that the hybrid classifier attained an accuracy of 79.3% and an F-measure score of 84.2%, the naïve Bayes classifier achieved accuracy and F-measure value of 69% while the decision tree classifier registered an accuracy of 72.4% and an F-measure score of 80%.

Conclusions: The developed hybrid model outperformed both independent decision tree and naïve Bayes models. Notably, the hybrid model outperformed the independent decision tree and naïve Bayes classifiers in terms of accuracy by 6.9% and 10.3% respectively.

Downloads

Download data is not yet available.

Downloads

Published

2021-06-12

Issue

Section

Research Article

How to Cite

Reliability of Predictions Using Hybrid Models: The Case of Malaria Incidence Rates in Uganda. (2021). Journal of Health Informatics in Africa, 7(2), 29-46. https://doi.org/10.12856/JHIA-2020-v7-i2-289

Most read articles by the same author(s)