The Diagnosis of Dengue in Patients Presenting With Acute Febrile Illness Using Supervised Machine Learning and Impact of Seasonality

Ming, Damien K. and Tuan, Nguyen M. and Hernandez, Bernard and Sangkaew, Sorawat and Vuong, Nguyen L. and Chanh, Ho Q. and Chau, Nguyen V. V. and Simmons, Cameron P. and Wills, Bridget and Georgiou, Pantelis and Holmes, Alison H. and Yacoub, Sophie (2022) The Diagnosis of Dengue in Patients Presenting With Acute Febrile Illness Using Supervised Machine Learning and Impact of Seasonality. Frontiers in Digital Health, 4. ISSN 2673-253X

[thumbnail of pubmed-zip/versions/1/package-entries/fdgth-04-849641/fdgth-04-849641.pdf] Text
pubmed-zip/versions/1/package-entries/fdgth-04-849641/fdgth-04-849641.pdf - Published Version

Download (395kB)

Abstract

Background: Symptomatic dengue infection can result in a life-threatening shock syndrome and timely diagnosis is essential. Point-of-care tests for non-structural protein 1 and IgM are used widely but performance can be limited. We developed a supervised machine learning model to predict whether patients with acute febrile illnesses had a diagnosis of dengue or other febrile illnesses (OFI). The impact of seasonality on model performance over time was examined.

Methods: We analysed data from a prospective observational clinical study in Vietnam. Enrolled patients presented with an acute febrile illness of <72 h duration. A gradient boosting model (XGBoost) was used to predict final diagnosis using age, sex, haematocrit, platelet, white cell, and lymphocyte count collected on enrolment. Data was randomly split 80/20% into a training and hold-out set, respectively, with the latter not used in model development. Cross-validation and hold out set testing was used, with performance over time evaluated through a rolling window approach.

Results: We included 8,100 patients recruited between 16th October 2010 and 10th December 2014. In total 2,240 (27.7%) patients were diagnosed with dengue infection. The optimised model from training data had an overall median area under the receiver operator curve (AUROC) of 0.86 (interquartile range 0.84–0.86), specificity of 0.92, sensitivity of 0.56, positive predictive value of 0.73, negative predictive value (NPV) of 0.84, and Brier score of 0.13 in predicting the final diagnosis, with similar performances in hold-out set testing (AUROC of 0.86). Model performances varied significantly over time as a function of seasonality and other factors. Incorporation of a dynamic threshold which continuously learns from recent cases resulted in a more consistent performance throughout the year (NPV >90%).

Conclusion: Supervised machine learning models are able to discriminate between dengue and OFI diagnoses in patients presenting with an early undifferentiated febrile illness. These models could be of clinical utility in supporting healthcare decision-making and provide passive surveillance across dengue endemic regions. Effects of seasonality and changing disease prevalence must however be taken into account—this is of significant importance given unpredictable effects of human-induced climate change and the impact on health.

Item Type: Article
Subjects: Apsci Archives > Multidisciplinary
Depositing User: Unnamed user with email support@apsciarchives.com
Date Deposited: 02 Jan 2023 12:16
Last Modified: 13 Oct 2023 04:15
URI: http://eprints.go2submission.com/id/eprint/20

Actions (login required)

View Item
View Item