Article Text

Clinical predictors of fracture in patients with shoulder dislocation: systematic review of diagnostic test accuracy studies
  1. Ilaria Oldrini1,
  2. Laura Coventry1,
  3. Alex Novak2,
  4. Steve Gwilym3,4,
  5. David Metcalfe1,2,3
  1. 1Warwick Medical School, University of Warwick, Coventry, UK
  2. 2Emergency Department, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
  3. 3Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
  4. 4Department of Trauma Surgery, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
  1. Correspondence to Dr David Metcalfe, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, Oxfordshire, UK; david.metcalfe{at}ndorms.ox.ac.uk

Abstract

Background Prereduction radiographs are conventionally used to exclude fracture before attempts to reduce a dislocated shoulder in the ED. However, this step increases cost, exposes patients to ionising radiation and may delay closed reduction. Some studies have suggested that prereduction imaging may be omitted for a subgroup of patients with shoulder dislocations.

Objectives To determine whether clinical predictors can identify patients who may safely undergo closed reduction of a dislocated shoulder without prereduction radiographs.

Methods A systematic review and meta-analysis of diagnostic test accuracy studies that have evaluated the ability of clinical features to identify concomitant fractures in patients with shoulder dislocation. The search was updated to 23 June 2022 and language limits were not applied. All fractures were included except for Hill-Sachs lesions. Quality assessment was undertaken using the Quality Assessment of Diagnostic Accuracy Studies 2 tool. Data were pooled and meta-analysed by fitting univariate random effects and multilevel mixed effects logistic regression models.

Results Eight studies reported data on 2087 shoulder dislocations and 343 concomitant fractures. The most important potential sources of bias were unclear blinding of those undertaking the clinical (6/8 studies) and radiographic (3/8 studies) assessment. The prevalence of concomitant fracture was 17.5%. The most accurate clinical predictors were age >40 (positive likelihood ratio (LR+) 1.8, 95% CI 1.5 to 2.1; negative likelihood ratio (LR−) 0.4, 95% CI 0.2 to 0.6), female sex (LR+ 2.0, 95% CI 1.6 to 2.4; LR− 0.7, 95% CI 0.6 to 0.8), first-time dislocation (LR+ 1.7, 95% CI 1.4 to 2.0; LR− 0.2, 95% CI 0.1 to 0.5) and presence of humeral ecchymosis (LR+ 3.0–5.7, LR− 0.8–1.1). The most important mechanisms of injury were high-energy mechanism fall (LR+ 2.0–9.8, LR− 0.4–0.8), fall >1 flight of stairs (LR+ 3.8, 95% CI 0.6 to 13.1; LR− 1.0, 95% CI 0.9 to 1.0) and motor vehicle collision (LR+ 2.3, 95% CI 0.5 to 4.0; LR− 0.9, 95% CI 0.9 to 1.0). The Quebec Rule had a sensitivity of 92.2% (95% CI 54.6% to 99.2%) and a specificity of 33.3% (95% CI 23.1% to 45.3%), but the Fresno-Quebec rule identified all clinically important fractures across two studies: sensitivity of 100% (95% CI 89% to 100%) in the derivation dataset and 100% (95% CI 90% to 100%) in the validation study. The specificity of the Fresno-Quebec rule ranged from 34% (95% CI 28% to 41%) in the derivation dataset to 24% (95% CI 16% to 33%) in the validation study.

Conclusion Clinical prediction rules may have a role in supporting shared decision making after shoulder dislocation, particularly in the prehospital and remote environments when delay to imaging is anticipated.

  • Systematic Review
  • clinical assessment
  • fractures and dislocations

Data availability statement

Data are available upon reasonable request. Not applicable.

https://creativecommons.org/licenses/by/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

What is already known on this topic

  • Shoulder dislocations are commonly associated with fractures of the proximal humerus, which may preclude closed reduction in the ED.

  • Plain radiographs are used to identify fractures but expose the patient to ionising radiation, delay definitive treatment and may inhibit appropriate prehospital attempts at reduction.

What this study adds

  • Clinical predictors can help risk stratify patients with a shoulder dislocation based on the likelihood of a concomitant clinically significant fracture.

  • Clinical prediction rules can help identify patients that could safely forgo radiography before closed reduction of a dislocated shoulder.

How this study might affect research, practice or policy

  • Clinical prediction rules can support shared decision making about early closed reduction of dislocated shoulders, particularly when there is an anticipated delay to imaging.

Background

Shoulder dislocation is a commonly encountered injury in the ED.1 Isolated shoulder dislocations should undergo prompt closed reduction to alleviate pain, limit tension on soft tissue structures and restore anatomical alignment.2 However, up to 25% of shoulder dislocations are associated with fracture of the proximal humerus.3–5 Concomitant fractures are important to identify as fracture of the humeral neck contraindicates closed reduction in the ED.6 7 Although tuberosity fractures are not an absolute contraindication,6 7 they are important to identify as they increase the risk of iatrogenic humeral neck fracture during attempts at reduction.8 It is therefore conventional to obtain plain radiographs before attempting closed reduction of a clinically dislocated shoulder.2 However, the need for plain radiographs increases cost, delays reduction and exposes the patient to ionising radiation,9–11 which is a particular concern for young patients with recurrent dislocations. The perceived need for imaging might also inhibit prehospital attempts at closed reduction,12 which may be particularly important in remote environments such as mountainous ski resorts.13

A number of studies have therefore tried to identify whether a there is a subgroup of patients for whom it is safe to omit prereduction radiographs despite a clinical diagnosis of shoulder dislocation.9 10

This systematic review sought to determine whether clinical predictors can safely identify a subgroup of patients who might safely undergo closed reduction of a dislocated shoulder without prereduction radiographs.

Methods

This systematic review was conducted according to the Preferred Reporting Items for Systematic Review and Meta-Analysis of Diagnostic Test Accuracy studies guidelines.14

Search strategy

Literature searches were designed with and performed by a specialist information librarian using PubMed, Ovid MEDLINE (from 1946 to November 2021), EBSCO CINAHL, Embase (from 1947 to 2021) and Web of Science. The search strategies are available in online supplemental file 1. The reference lists of included studies were screened for further items and a forward citation search undertaken for new studies using Google Scholar on 23 June 2022. Duplicates were removed using EndNote (Clarivate, Pennsylvania, USA) and unique items then imported into the specialist systematic review package Rayyan (Qatar Computing Research Institute, Doha, Qatar) for abstract screening.

Study selection

Studies were included if they reported the diagnostic characteristics of clinical findings (individually or in combination) for fracture in patients with a clinically dislocated shoulder. All glenohumeral joint dislocations (ie, anterior, posterior or inferior) were included. The reference standard was plain radiography, and a positive outcome was defined as the identification of a clinically significant fracture, which we defined pragmatically as any fracture with the exception of Hill-Sachs lesions. Studies were excluded if they did not include sufficient data to construct a 2×2 table. Two authors (IO and LC) independently screened abstracts and then full-texts with a third author (DM) available to arbitrate if necessary.

Data extraction and quality assessment

Data were extracted independently by two authors (IO and LC) and then compared for errors. Any inconsistencies were resolved through discussion with a third author (DM). If further data or clarification was required to construct 2×2 tables, the corresponding authors of included studies were contacted. Risk of bias was assessed using the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) tool.15 This assessment was performed independently by two authors (IO and LC) and then checked by a third (DM).

Statistical analysis

Sensitivity, specificity and likelihood ratios were calculated for each feature together with their 95% CIs. The unit of analysis was individual shoulders. Clinical predictors reported by only one study were presented as individual data points and two studies as a range. Data about the same clinical predictor reported by three studies were pooled using univariate random effects models and data reported by four or more studies using multilevel mixed effects logistic regression models. Studies pooled using multilevel mixed effects logistic regression were also summarised using hierarchical summary receiver operating characteristic curve (HSROC) plots. Heterogeneity was evaluated using the I2 test and by visualising HSROC plots before data were pooled. However, we anticipated significant heterogeneity and so planned to pool data using mixed effects models. Publication bias was assessed by visually inspecting scatter plots and by regressing the log diagnostic OR against 1/ESS1/22 weighted by ESS (where ESS represents the effective sample size) with p<0.05 for the slope coefficient used as the threshold for significant asymmetry, as recommended by Deeks et al.16 Meta-analysis was undertaken using the diagti, midas, metan and metandi modules in Stata SE V.15. We planned to specifically highlight clinical features with a positive likelihood ratio (LR+) of >2.0 or a negative likelihood ratio (LR−) of <0.5.

Results

There were 4688 unique items retrieved from the search process that included eight eligible3–5 17–19 studies reporting data on 2087 individual shoulder dislocations and 343 concomitant fractures (figure 1). There was no evidence of publication bias.

Figure 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow diagram showing inclusion of studies.

Study and participant characteristics

The included studies are described in table 1. There were five retrospective3–5 17 20 and three prospective studies18 19 21 from EDs in Australia,20 Canada,4 18 19 France,17 Iran21 and Turkey.3 5 Three were published by a single research team4 18 19 but over non-overlapping time periods. Most studies included all adults aged >184 5 17–19 21 or ≥163, but one study limited its population to patients aged <40 years.20

Table 1

Characteristics of studies included within the review

Risk of bias

Table 2 shows the risk of bias assessment for each study using the QUADAS-2 tool. Six studies were at low risk of bias in the patient selection category because they reported data from consecutive patients.4 5 17–19 21 However, two studies were at high risk of patient selection bias.3 20 This is because one only included younger patients (aged <40 years), which may have distorted the diagnostic characteristics of predictors for fracture20 and the other because only a minority (30%) of patients with a dislocated shoulder had complete medical records available.3 These issues also gave rise to the only applicability concerns across all of the QUADAS-2 domains. Six studies were at unclear risk of bias in the index test category because they were retrospective studies and did not explicitly state that the assessor recording clinical features was blinded to the results of the radiograph.4 5 17 18 20 21 Similarly, three studies were at unclear risk of bias in the reference standard category because they did not explicitly confirm that the assessor was blinded to the clinical features.4 20 21 All eight studies were judged to be at low risk of bias in the flow and timing category because a fracture was unlikely to have developed or resolved between clinical assessment and imaging.

Table 2

Risk of bias assessments for included studies using the Quality Assessment of Diagnostic Accuracy Studies 2 tool

Prevalence of concomitant fracture

Seven of the eight studies3–5 17–19 21 were used to estimate the prevalence of clinically significant fractures among the ED population with shoulder dislocations because Ong et al only included younger patients (aged <40).20 There were therefore 331 clinically significant fractures among the group of 1891 shoulder dislocations (prevalence 17.5%). Four studies described the prevalence of different fracture types. The most common fractures affected one or more tuberosities in isolation (121/183, 66.1%; four studies).3 4 18 19 The prevalence of fractures that would preclude closed reduction in the ED (eg, fractures of the humeral head or neck) was up to 15%. Other fractures included bony Bankart lesions (5/125, 4.0%; two studies)4 19 as well as isolated fractures of the glenoid and coracoid.

Accuracy of clinical assessment

The most useful clinical features were age >40 (LR+ 1.8, 95% CI 1.5 to 2.1; LR− 0.4, 95% CI 0.2 to 0.6; 1406 dislocations, four studies), female sex (LR+ 2.0, 95% CI 1.6 to 2.4; LR− 0.7, 95% CI 0.6 to 0.8; 1497 dislocations, five studies), first-time dislocation (LR+ 1.7, 95% CI 1.4 to 2.0; LR− 0.2, 95% CI 0.1 to 0.5; 1745 dislocations, six studies) and presence of humeral ecchymosis (LR+ 3.0–5.7, LR− 0.8–1.1; 426 dislocations, two studies). The most important mechanisms of injury were high-energy mechanism fall (LR+ 2.0–9.8, LR− 0.4–0.8; 381 dislocations, two studies), fall >1 flight of stairs (LR+ 3.8, 95% CI 0.6 to 13.1; LR− 1.0, 95% CI 0.9 to 1.0; 760 dislocations, three studies) and motor vehicle collision (LR+ 2.3, 95% CI 0.5 to 4.0; LR− 0.9, 95% CI 0.9 to 1.0; 1362 dislocations, four studies).

Five studies (1410 dislocations and 216 fractures) reported data on the accuracy of the Quebec rule, which mandates prereduction radiography for injuries satisfying any one of the following criteria: age >40 (one study used a threshold of >35 years3), first shoulder dislocation and high-risk mechanism.3 4 17 20 21 High-risk mechanisms were falling down more than one flight of stairs, fight/assault and motor vehicle collision.4 Four of the five studies reported overall favourable diagnostic characteristics of the Quebec rule, but a single study from Australia (limited to younger patients aged <40 years) was less encouraging.20 This latter study reported a sensitivity of only 42% (95% CI 16% to 71%) and a specificity of 40% (95% CI 33% to 47%). When all five studies were pooled using a multilevel mixed effects logistic regression model, the Quebec rule had a sensitivity of 92.2% (95% CI 54.6% to 99.2%) and a specificity of 33.3% (95% CI 23.1% to 45.3%) (figure 2).

Figure 2

HSROC plot summarising accuracy of the Quebec rule for fracture. HSROC, hierarchical summary receiver operating characteristic curve.

One study presented an algorithm (the Fresno-Quebec Rule (FQR)) to determine which patients with a clinically dislocated shoulder should undergo prereduction radiographs.18 The factors incorporated into this algorithm were atraumatic recurrent episode, age >35 and high-risk mechanism (figure 3). High-risk mechanisms in the FQR are defined as motor vehicle collision, assault, sports-related or fall of >10 ft. According to the derivation study (207 dislocations and 24 fractures), the algorithm achieved a sensitivity of 100% (95% CI 89% to 100%) and a specificity of 34% (95% CI 28% to 41%) for identifying fractures (LR+ 1.9, 95% CI 1.6 to 2.3; LR− 0.1, 95% CI 0.0 to 0.6).18 This finding has been reproduced by a single retrospective study in Turkey (135 dislocations and 34 fractures), which also reported a sensitivity of 100% (95% CI 90% to 100%) and a specificity of 24% (95% CI 16% to 33%; LR+ 1.3, 95% CI 1.1 to 1.5; LR− 0.1, 95% CI 0.0 to 0.9).3

Figure 3

Fresno-Quebec rule reproduced from Émond et al.18

Discussion

This systematic review suggests there is a subgroup of patients with shoulder dislocations for whom radiographs may be safely omitted before closed reduction. Identifying this group could reduce costs, exposure to ionising radiation and time to joint reduction.9 10

All eight studies defined clinically significant fractures as including all fracture patterns except for Hill-Sachs lesions. However, the most common bony injury (isolated tuberosity fractures) does not preclude closed reduction in the ED,6 7 and so identifying this injury may not affect the immediate management. Nevertheless, there are a number of reasons why it might be helpful to identify this fracture before reduction. First, tuberosity fracture may herald an undisplaced fracture across the humeral neck and has been associated with iatrogenic humeral neck fracture during closed reduction.8 The identification of a tuberosity fracture should therefore lead to careful examination of the radiographs (ie, two orthogonal views) for evidence of an undisplaced fracture across the humeral neck.15 Second, tuberosity fractures may be difficult to appreciate on postreduction radiographs but often require orthopaedic follow-up and interval radiographs to exclude further displacement of the fragment.22 Finally, identification of a tuberosity fracture may reassure both the patient and emergency physician that this injury preceded attempts at closed reduction.

Studies across a range of settings have been consistent about the clinical features that are most useful for identifying patients with fractures. These features have been combined into a clinical decision rule—the Quebec rule—which recommends prereduction radiography in the case of a patient aged >40 years, a first episode of dislocation or a high-risk mechanism.4 The derivation study identified high-risk mechanisms as falling down more than one flight of stairs, fight/assault and motor vehicle collision.4 A number of validation papers supported this rule,3 4 17 20 21 but it performed much less well in a case–control study of younger patients from Australia.20 In that study, the rule was only 42% sensitive for fracture. A subgroup analysis by Bolvardi et al similarly found that the rule was only 50% sensitive for fracture in patients aged <40 years.21 Although promising in some studies, the Quebec rule requires further validation in younger populations before it can be recommended for use in clinical practice.

The Fresno-Quebec Rule has been externally validated3 and maintained a sensitivity of 100% across two studies consisting of 342 shoulder dislocations and 58 clinically significant fractures.3 18 19 The rule is also supported by other diagnostic test accuracy studies that previously supported the accuracy of these features for identifying fractures.3 4 17 21 Nevertheless, although Ong et al did not evaluate the FQR directly, their data suggest the underlying predictors may not be sufficiently sensitive to safely exclude fracture among younger patients.20 The FQR currently restricts its definition of ‘high-risk mechanism’ to falling from >10 ft, sports related, assault or motor vehicle collision.19 Only one of the four tuberosity fractures reported by Ong et al was atraumatic: two occurred after a fall from standing height and one during a seizure.20 Clinicians should make their own judgement about the likelihood of fracture given each mechanism of injury and think more broadly than the specific high-risk mechanisms described by the FQR. However, reassuringly, all of the ‘missed’ injuries reported among younger patients by Ong et al20 and Bolvardi et al21 were greater tuberosity fractures20 and so would not have precluded closed reduction in the ED.6

Limitations

This review identified eight diagnostic test accuracy studies that were broadly homogenous in terms of inclusion criteria, predictors, definitions, and choice of reference test. They included derivation and validation studies from EDs in a range of clinical settings. Reassuringly, the data from these studies were consistent, which lends considerable certainty to the findings of this evidence synthesis. However, the review suffers from a number of limitations. First, five of the eight studies used retrospective case control designs3–5 17 20 and suffered from missing data, which may have introduced bias. Second, continuous variables (such as age and height of fall) were dichotomised in the derivation studies, which makes predictors easier to use in clinical practice but discards valuable predictive information.23 For example, the FQR uses 35 years as its age threshold, but the difference in fracture risk between a 34-year-old and 36-year-old is likely to be much smaller than between a 36-year-old and a 90-year-old. Finally, rare populations (eg, patients with shoulder dislocation caused by seizure) were not represented in sufficient numbers, and so caution should be exercised before applying the FQR to all subgroups.

Conclusion

The existing evidence suggests that clinical prediction rules may have a role in supporting shared decision making after shoulder dislocation, particularly in the pre-hospital and remote environments when delay to imaging is anticipated. However, further external validation studies across different settings would be helpful.

Data availability statement

Data are available upon reasonable request. Not applicable.

Ethics statements

Patient consent for publication

Ethics approval

Not applicable.

Acknowledgments

We are grateful to Ms S Johnson (University of Warwick) for assistance with literature searching.

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • Handling editor Richard Body

  • Twitter @i_oldrini, @AlexTNovak, @SteveGwilym, @TraumaDataDoc

  • Correction notice This article was resupplied on June 8th 2023 to include a CC-BY licence.

  • Contributors IO led and undertook the initial systematic review. LC independently screened abstracts, extracted data and performed quality assessments. SG and AN contributed to the study design, data interpretation and critical review of the manuscript. DM designed and supervised the study then prepared the manuscript for publication and is the guarantor.

  • Funding DM is supported by a NIHR Advanced Fellowship and the NIHR Oxford Biomedical Research Centre.

  • Competing interests None declared.

  • Patient and public involvement Patients and/or the public were not involved in the design, conduct, reporting or dissemination plans of this research.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.