Intelligent multi-document summarisation for extracting insights on racial inequalities from maternity incident investigation reports
In healthcare, thousands of safety incidents occur every year, but learning from these incidents is not effectively aggregated. Analysing incident reports using AI could uncover critical insights to prevent harm by identifying recurring patterns and contributing factors. To aggregate and extract valuable information, natural language processing (NLP) and machine learning techniques can be employed to summarise and mine unstructured data, potentially surfacing systemic issues and priority areas for improvement. This paper presents I-SIRch:CS, a framework designed to facilitate the aggregation and analysis of safety incident reports while ensuring traceability throughout the process. The framework integrates concept annotation using the Safety Intelligence Research (SIRch) taxonomy with clustering, summarisation, and analysis capabilities. Utilising a dataset of 188 anonymised maternity investigation reports annotated with 27 SIRch human factors concepts, I-SIRch:CS groups the annotated sentences into clusters using sentence embeddings and k-means clustering, maintaining traceability via file and sentence IDs. Summaries are generated for each cluster using offline state-of-the-art abstractive summarisation models (BART, DistilBART, T5), which are evaluated and compared using metrics assessing summary quality attributes. The generated summaries are linked back to the original file and sentence IDs, ensuring traceability and allowing for verification of the summarised information. Results demonstrate BART’s strengths in creating informative and concise summaries.
Funding
I-SIRch - Using Artificial Intelligence to Improve the Investigation of Factors Contributing to Adverse Maternity Incidents involving Black Mothers and Families : AI_H1200006
History
School
- Science
Department
- Computer Science
Published in
Artificial Intelligence in Healthcare. First International Conference, AIiH 2024, Swansea, UK, September 4–6, 2024, ProceedingsVolume
2Pages
316-329Publisher
Springer Nature Switzerland AGVersion
- AM (Accepted Manuscript)
Rights holder
© The Author(s), under exclusive license to Springer Nature Switzerland AGPublisher statement
This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/978-3-031-67285-9_23Publication date
2024-08-15Copyright date
2024ISBN
9783031672842; 978303167289ISSN
0302-9743eISSN
1611-3349Publisher version
Book series
Lecture Notes in Computer Science: 14976Language
- en