Skip to main content
Log in

A readers' guide to the interpretation of diagnostic test properties: clinical example of sepsis

  • Review
  • Published:
Intensive Care Medicine Aims and scope Submit manuscript

Abstract

Background

One of the most challenging practical and daily problems in intensive care medicine is the interpretation of the results from diagnostic tests. In neonatology and pediatric intensive care the early diagnosis of potentially life-threatening infections is a particularly important issue.

Focus

A plethora of tests have been suggested to improve diagnostic decision making in the clinical setting of infection which is a clinical example used in this article. Several criteria that are critical to evidence-based appraisal of published data are often not adhered to during the study or in reporting. To enhance the critical appraisal on articles on diagnostic tests we discuss various measures of test accuracy: sensitivity, specificity, receiver operating characteristic curves, positive and negative predictive values, likelihood ratios, pretest probability, posttest probability, and diagnostic odds ratio.

Conclusions

We suggest the following minimal requirements for reporting on the diagnostic accuracy of tests: a plot of the raw data, multilevel likelihood ratios, the area under the receiver operating characteristic curve, and the cutoff yielding the highest discriminative ability. For critical appraisal it is mandatory to report confidence intervals for each of these measures. Moreover, to allow comparison to the readers' patient population authors should provide data on study population characteristics, in particular on the spectrum of diseases and illness severity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4. Fig. 4.
Fig. 6.
Fig. 7.

Similar content being viewed by others

References

  1. Escobar GJ (1999) The neonatal "sepsis work-up": personal reflections on the development of an evidence-based approach toward newborn infections in a managed care organization. Pediatrics 103:360–373

    CAS  PubMed  Google Scholar 

  2. Stoll BJ, Gordon T, Korones SB, Shankaran S, Tyson JE, Bauer CR, Fanaroff AA, Lemons JA, Donovan EF, Oh W, Stevenson DK, Ehrenkranz RA, Papile LA, Verter J, Wright LL (1996) Early-onset sepsis in very low birth weight neonates: a report from the National Institute of Child Health and Human Development Neonatal Research Network. J Pediatr 129:72–80

    CAS  PubMed  Google Scholar 

  3. Benitz WE, Gould JB, Druzin ML (1999) Preventing early-onset group B streptococcal sepsis: strategy development using decision analysis. Pediatrics 103:e76

    CAS  PubMed  Google Scholar 

  4. Hammerschlag MR, Klein JO, Herschel M, Chen FC, Fermin R (1977) Patterns of use of antibiotics in two newborn nurseries. N Engl J Med 296:1268–1269

    Google Scholar 

  5. Berner R, Niemeyer C, Leititis J, al. e (1998) Plasma levels and gene expression of granulocyte colony-stimulating factor, tumor necrosis factor-a, interleukin (IL) 1-b, IL-6, IL-8, and soluble intercellular adhesion molecule-1 in neonatal early onset sepsis. Pediatr Res 44:469–477

    CAS  PubMed  Google Scholar 

  6. Kuster H, Weiss M, Willeitner AE, Detlefsen S, Jeremias I, Zbojan J, Geiger R, Lipowsky G, Simbruner G (1998) Interleukin-1 receptor antagonist and interleukin-6 for early diagnosis of neonatal sepsis 2 days before clinical manifestation. Lancet 352:1271–1277

    CAS  PubMed  Google Scholar 

  7. Chiesa C, Panero A, Rossi N, Stegagno M, De Giusti M, Osborn JF, Pacifico L (1998) Reliability of procalcitonin concentrations for the diagnosis of sepsis in critically ill neonates. Clin Infect Dis 26:664–672

    CAS  PubMed  Google Scholar 

  8. Isaacman DJ, Karasic RB, Reynolds EA, Kost SI (1996) Effect of number of blood cultures and volume of blood on detection of bacteremia in children. J Pediatr 128:190–195

    CAS  PubMed  Google Scholar 

  9. Jaeschke R, Guyatt GH, Sackett DL (1994) Users' guides to the medical literature. III. How to use an article about a diagnostic test. B. What are the results and will they help me in caring for my patients? The Evidence-Based Medicine Working Group. JAMA 271:703–707

    CAS  PubMed  Google Scholar 

  10. Jaeschke R, Guyatt G, Sackett DL (1994) Users' guides to the medical literature. III. How to use an article about a diagnostic test. A. Are the results of the study valid? Evidence-Based Medicine Working Group. JAMA 271:389–391

    Article  CAS  PubMed  Google Scholar 

  11. Lijmer JG, Mol BW, Heisterkamp S, Bonsel GJ, Prins MH, van der Meulen JH, Bossuyt PM (1999) Empirical evidence of design-related bias in studies of diagnostic tests. JAMA 282:1061–1066

    CAS  PubMed  Google Scholar 

  12. Smith JE, Winkler RL, Fryback DG (2000) The first positive: computing positive predictive value at the extremes. Ann Intern Med 132:804–809

    CAS  PubMed  Google Scholar 

  13. Dujardin B, Van den Ende J, Van Gompel A, Unger JP, Van der Stuyft P (1994) Likelihood ratios: a real improvement for clinical decision making? Eur J Epidemiol 10:29–36

    CAS  PubMed  Google Scholar 

  14. Pauker SG, Kassirer JP (1980) The threshold approach to clinical decision making. N Engl J Med 302:1109–1117

    Google Scholar 

  15. Weinstein MC, Fineberg HV (1980) Clinical decision analysis. Saunders, Philadelphia

  16. Pauker SG, Kopelman RI (1992) Interpreting hoofbeats: can Bayes help clear the haze? N Engl J Med 327:1009–1013

    Google Scholar 

  17. Fagan TJ (1975) Nomogram for Bayes theorem. N Engl J Med 293:257

    CAS  PubMed  Google Scholar 

  18. Hanley JA, McNeil BJ (1983) A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 148:839–843

    CAS  PubMed  Google Scholar 

  19. McNeil BJ, Hanley JA, Funkenstein HH, Wallman J (1983) Paired receiver operating characteristic curves and the effect of history on radiographic interpretation. CT of the head as a case study. Radiology 149:75–77

    CAS  PubMed  Google Scholar 

  20. Swets JA (1988) Measuring the accuracy of diagnostic systems. Science 240:1285–1293

    CAS  PubMed  Google Scholar 

  21. Mahieu LM, De Muynck AO, De Dooy JJ, Laroche SM, Van Acker KJ (2000) Prediction of nosocomial sepsis in neonates by means of a computer-weighted bedside scoring system (NOSEP score). Crit Care Med 28:2026–2033

    CAS  PubMed  Google Scholar 

  22. Tosteson AN, Weinstein MC, Wittenberg J, Begg CB (1994) ROC curve regression analysis: the use of ordinal regression models for diagnostic test assessment. Environ Health Perspect 102 [Suppl 8]:73–78

  23. Bagley SC, White H, Golomb BA (2001) Logistic regression in the medical literature: standards for use and reporting, with particular attention to one medical domain. J Clin Epidemiol 54:979–985

    Article  CAS  PubMed  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joachim E. Fischer.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fischer, J.E., Bachmann, L.M. & Jaeschke, R. A readers' guide to the interpretation of diagnostic test properties: clinical example of sepsis. Intensive Care Med 29, 1043–1051 (2003). https://doi.org/10.1007/s00134-003-1761-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00134-003-1761-8

Keywords

Navigation