Predicting Pulmonary Function From the Analysis of Voice: A Machine Learning Approach

Alam, Md. Zahangir and Simonetti, Albino and Brillantino, Raffaele and Tayler, Nick and Grainge, Chris and Siribaddana, Pandula and Nouraei, S. A. Reza and Batchelor, James and Rahman, M. Sohel and Mancuzo, Eliane V. and Holloway, John W. and Holloway, Judith A. and Rezwan, Faisal I. (2022) Predicting Pulmonary Function From the Analysis of Voice: A Machine Learning Approach. Frontiers in Digital Health, 4. ISSN 2673-253X

[thumbnail of pubmed-zip/versions/1/package-entries/fdgth-04-750226/fdgth-04-750226.pdf] Text
pubmed-zip/versions/1/package-entries/fdgth-04-750226/fdgth-04-750226.pdf - Published Version

Download (792kB)

Abstract

Introduction: To self-monitor asthma symptoms, existing methods (e.g. peak flow metre, smart spirometer) require special equipment and are not always used by the patients. Voice recording has the potential to generate surrogate measures of lung function and this study aims to apply machine learning approaches to predict lung function and severity of abnormal lung function from recorded voice for asthma patients.

Methods: A threshold-based mechanism was designed to separate speech and breathing from 323 recordings. Features extracted from these were combined with biological factors to predict lung function. Three predictive models were developed using Random Forest (RF), Support Vector Machine (SVM), and linear regression algorithms: (a) regression models to predict lung function, (b) multi-class classification models to predict severity of lung function abnormality, and (c) binary classification models to predict lung function abnormality. Training and test samples were separated (70%:30%, using balanced portioning), features were normalised, 10-fold cross-validation was used and model performances were evaluated on the test samples.

Results: The RF-based regression model performed better with the lowest root mean square error of 10·86. To predict severity of lung function impairment, the SVM-based model performed best in multi-class classification (accuracy = 73.20%), whereas the RF-based model performed best in binary classification models for predicting abnormal lung function (accuracy = 85%).

Conclusion: Our machine learning approaches can predict lung function, from recorded voice files, better than published approaches. This technique could be used to develop future telehealth solutions including smartphone-based applications which have potential to aid decision making and self-monitoring in asthma.

Item Type: Article
Subjects: Apsci Archives > Multidisciplinary
Depositing User: Unnamed user with email support@apsciarchives.com
Date Deposited: 31 Dec 2022 07:21
Last Modified: 19 Sep 2023 07:37
URI: http://eprints.go2submission.com/id/eprint/65

Actions (login required)

View Item
View Item