Article Text
Abstract
Aims and Objectives A non-contrast CT head scan (NCCTH) is the most common cross-sectional imaging investigation requested in the Emergency Department (ED), and several Artificial Intelligence (AI) tools have been developed to detect abnormalities on NCCTH, however there is currently little real-world evidence to support adoption. The AI-REACT study (IRAS 310995, NCT05427838) evaluated the impact of AI algorithm on the diagnostic performance of ED clinicians, radiologists and radiographers.
Method and Design A retrospective dataset of 150 NCCTH was compiled, including 63 normal control cases and 97 abnormal cases containing intracranial haemorrhage (ICH), infarct, midline shift, mass effect, or skull fracture. 30 readers of varying experience were recruited across four NHS trusts including 10 general radiologists, 15 Emergency Medicine clinicians, and five CT radiographers. Readers interpreted each scan first without, then with, the assistance of the qER EU 2.0 AI tool, with an intervening 2-week washout period. Using an arbitrated consensus opinion of 2 neuroradiologists as ground truth, the stand-alone performance of qER was assessed, and its impact on the readers’ diagnostic performance analysed.
Results and Conclusion Pooled analyses demonstrated a significant increase in reader sensitivity for abnormal scans (0.828 to 0.897, +0.069, 95%CI +0.106 to +0.0136, p >0.001) and ICH (0.846 to 0.916, +0.07 95%CI 0.108 to 0.0321 p = >0.001). ED clinicians with AI assistance demonstrated a sensitivity of 0.879 (abnormality) and 0.948 (ICH) compared to unaided radiologist sensitivity 0.890 (abnormality) and 0.939 (ICH), with no statistically significant changes in specificity.
Use of AI-assisted image interpretation led to a significant increase in the ability of ED clinicians to accurately identify abnormality and ICH on CT Head scans, to a level comparable to that of radiologists. These important findings should be fully explored prospectively. Further analysis of the effects on pathology and reader subgroups will help to identify potential strengths and use cases for this application.