Machine learning and natural language processing to identify falls in electronic patient care records from ambulance attendances
dc.contributor.author | Tohira, Hideo | |
dc.contributor.author | Finn, Judith | |
dc.contributor.author | Ball, Stephen | |
dc.contributor.author | Brink, D. | |
dc.contributor.author | Buzzacott, Peter | |
dc.date.accessioned | 2022-02-20T02:41:46Z | |
dc.date.available | 2022-02-20T02:41:46Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Tohira, H. and Finn, J. and Ball, S. and Brink, D. and Buzzacott, P. 2021. Machine learning and natural language processing to identify falls in electronic patient care records from ambulance attendances. Informatics for Health and Social Care. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/87826 | |
dc.identifier.doi | 10.1080/17538157.2021.2019038 | |
dc.description.abstract |
We derived machine learning models utilizing features generated by natural language processing (NLP) of free-text data from an ambulance services provider to identify fall cases. The data comprised samples of electronic patient care records care records (ePCRs) from St John Western Australia (WA), the sole ambulance services provider in most of WA. We manually labeled fall cases by reviewing the free-text summary. The models used features including case characteristics (e.g., age) and text frequency-inverse document frequency (tf-idf) of each word of the free-text generated by NLP. Support vector machine (SVM) and random forest were used as classifiers. We compared the performance of the models against the manual identification of falls by recall, precision, and F-measure. A total of 9,447 cases (1%) were randomly sampled, of which 1,648 (17%) were labeled as fall. The best model was an SVM model using case characteristics and tf-idf’s of the first 100 words of free-text, with recall of 0.84, precision of 0.86, and F-measure of 0.85. This performance was better than an SVM model with only case characteristics. Machine-learning models incorporated with features generated by NLP improved the performance of classifying fall cases compared with models without such features. Scope remains for further improvement. | |
dc.language | eng | |
dc.subject | Emergency medical services | |
dc.subject | random forest | |
dc.subject | support vector machine | |
dc.subject | text frequency-inverse document frequency | |
dc.title | Machine learning and natural language processing to identify falls in electronic patient care records from ambulance attendances | |
dc.type | Journal Article | |
dcterms.source.startPage | 1 | |
dcterms.source.endPage | 11 | |
dcterms.source.issn | 1753-8157 | |
dcterms.source.title | Informatics for Health and Social Care | |
dc.date.updated | 2022-02-20T02:41:46Z | |
curtin.department | Curtin School of Nursing | |
curtin.accessStatus | Fulltext not available | |
curtin.faculty | Faculty of Health Sciences | |
curtin.contributor.orcid | Ball, Stephen [0000-0002-9457-3381] | |
curtin.contributor.orcid | Tohira, Hideo [0000-0002-2244-8004] | |
curtin.contributor.orcid | Finn, Judith [0000-0002-7307-7944] | |
curtin.contributor.orcid | Buzzacott, Peter [0000-0002-5926-1374] | |
curtin.contributor.researcherid | Tohira, Hideo [E-5431-2012] | |
curtin.contributor.researcherid | Finn, Judith [B-2678-2010] | |
dcterms.source.eissn | 1753-8165 | |
curtin.contributor.scopusauthorid | Ball, Stephen [55676853700] | |
curtin.contributor.scopusauthorid | Tohira, Hideo [6506836786] | |
curtin.contributor.scopusauthorid | Finn, Judith [57200768752] [7202432925] | |
curtin.contributor.scopusauthorid | Buzzacott, Peter [6506509899] |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |