Reliability of the PEDro scale for rating quality of randomized controlled trials.

TitleReliability of the PEDro scale for rating quality of randomized controlled trials.
Publication TypeJournal Article
Year of Publication2003
AuthorsMaher CG, Sherrington C, Herbert RD, Moseley AM, Elkins M
JournalPhysical therapy
Volume83
Issue8
Pagination713-21
Date Published2003 Aug
ISSN0031-9023
KeywordsDatabases, Bibliographic; Evidence-Based Medicine; Humans; Physical Therapy Modalities; Quality Control; Randomized Controlled Trials as Topic; Reproducibility of Results
Abstract

BACKGROUND AND PURPOSE: Assessment of the quality of randomized controlled trials (RCTs) is common practice in systematic reviews. However, the reliability of data obtained with most quality assessment scales has not been established. This report describes 2 studies designed to investigate the reliability of data obtained with the Physiotherapy Evidence Database (PEDro) scale developed to rate the quality of RCTs evaluating physical therapist interventions.

METHOD: In the first study, 11 raters independently rated 25 RCTs randomly selected from the PEDro database. In the second study, 2 raters rated 120 RCTs randomly selected from the PEDro database, and disagreements were resolved by a third rater; this generated a set of individual rater and consensus ratings. The process was repeated by independent raters to create a second set of individual and consensus ratings. Reliability of ratings of PEDro scale items was calculated using multirater kappas, and reliability of the total (summed) score was calculated using intraclass correlation coefficients (ICC [1,1]).

RESULTS: The kappa value for each of the 11 items ranged from.36 to.80 for individual assessors and from.50 to.79 for consensus ratings generated by groups of 2 or 3 raters. The ICC for the total score was.56 (95% confidence interval=.47-.65) for ratings by individuals, and the ICC for consensus ratings was.68 (95% confidence interval=.57-.76).

DISCUSSION AND CONCLUSION: The reliability of ratings of PEDro scale items varied from "fair" to "substantial," and the reliability of the total PEDro score was "fair" to "good."

Alternate JournalPhys Ther