Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015
dc.contributor.author | Wong, Kingsley | |
dc.contributor.author | Tessema, Gizachew | |
dc.contributor.author | Chai, Kevin | |
dc.contributor.author | Pereira, Gavin | |
dc.date.accessioned | 2023-09-07T01:34:12Z | |
dc.date.available | 2023-09-07T01:34:12Z | |
dc.date.issued | 2022 | |
dc.identifier.citation | Wong, K. and Tessema, G.A. and Chai, K. and Pereira, G. 2022. Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015. Scientific Reports. 12 (1): ARTN 19153. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/93230 | |
dc.identifier.doi | 10.1038/s41598-022-23782-w | |
dc.description.abstract |
Preterm birth is a global public health problem with a significant burden on the individuals affected. The study aimed to extend current research on preterm birth prognostic model development by developing and internally validating models using machine learning classification algorithms and population-based routinely collected data in Western Australia. The longitudinal retrospective cohort study involved all births in Western Australia between 1980 and 2015, and the analytic sample contains 81,974 (8.6%) preterm births (< 37 weeks of gestation). Prediction models for preterm birth were developed using regularised logistic regression, decision trees, Random Forests, extreme gradient boosting, and multi-layer perceptron (MLP). Predictors included maternal socio-demographics and medical conditions, current and past pregnancy complications, and family history. Class weight was applied to handle imbalanced outcomes and stratified tenfold cross-validation was used to reduce overfitting. Close to half of the preterm births (49.1% at 5% FPR, 95% CI 48.9%,49.5%) were correctly classified by the best performing classifier (MLP) for all women when current pregnancy information was available. The sensitivity was boosted to 52.7% (95% CI 52.1%,53.3%) after including past obstetric history in a sub-population of births from multiparous women. Around half of the preterm birth can be identified antenatally at high specificity using population-based routinely collected maternal and pregnancy data. The performance of the prediction models depends on the available predictor pool that is individual and time specific. | |
dc.language | English | |
dc.publisher | NATURE PORTFOLIO | |
dc.relation.sponsoredby | http://purl.org/au-research/grants/nhmrc/1099655 | |
dc.relation.sponsoredby | http://purl.org/au-research/grants/nhmrc/1173991 | |
dc.relation.sponsoredby | http://purl.org/au-research/grants/nhmrc/1195716 | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Science & Technology | |
dc.subject | Multidisciplinary Sciences | |
dc.subject | Science & Technology - Other Topics | |
dc.subject | HIGH-RISK | |
dc.subject | OUTCOMES | |
dc.subject | Pregnancy | |
dc.subject | Infant, Newborn | |
dc.subject | Humans | |
dc.subject | Female | |
dc.subject | Premature Birth | |
dc.subject | Western Australia | |
dc.subject | Retrospective Studies | |
dc.subject | Prognosis | |
dc.subject | Risk Factors | |
dc.subject | Cohort Studies | |
dc.subject | Machine Learning | |
dc.subject | Humans | |
dc.subject | Premature Birth | |
dc.subject | Prognosis | |
dc.subject | Risk Factors | |
dc.subject | Retrospective Studies | |
dc.subject | Cohort Studies | |
dc.subject | Pregnancy | |
dc.subject | Infant, Newborn | |
dc.subject | Western Australia | |
dc.subject | Female | |
dc.subject | Machine Learning | |
dc.title | Development of prognostic model for preterm birth using machine learning in a population-based cohort of Western Australia births between 1980 and 2015 | |
dc.type | Journal Article | |
dcterms.source.volume | 12 | |
dcterms.source.number | 1 | |
dcterms.source.issn | 2045-2322 | |
dcterms.source.title | Scientific Reports | |
dc.date.updated | 2023-09-07T01:34:12Z | |
curtin.department | Curtin School of Population Health | |
curtin.department | Office of the Pro Vice Chancellor Health Sciences | |
curtin.accessStatus | Open access | |
curtin.faculty | Faculty of Health Sciences | |
curtin.contributor.orcid | Pereira, Gavin [0000-0003-3740-8117] | |
curtin.contributor.orcid | Tessema, Gizachew [0000-0002-4784-8151] | |
curtin.contributor.orcid | Chai, Kevin [0000-0003-1645-0922] | |
curtin.contributor.researcherid | Pereira, Gavin [D-7136-2014] | |
curtin.contributor.researcherid | Tessema, Gizachew [J-9235-2018] | |
curtin.contributor.researcherid | Chai, Kevin [F-1015-2013] | |
curtin.identifier.article-number | ARTN 19153 | |
dcterms.source.eissn | 2045-2322 | |
curtin.contributor.scopusauthorid | Pereira, Gavin [35091486200] | |
curtin.contributor.scopusauthorid | Chai, Kevin [23396028100] | |
curtin.repositoryagreement | V3 |