Appraising the quality of randomized controlled trials: inter-rater reliability for the OTseeker evidence database.

Title: Appraising the quality of randomized controlled trials: inter-rater reliability for the OTseeker evidence database.
Publication Type: Journal Article
Year of Publication: 2005
Authors: Tooth L, Bennett S, McCluskey A, Hoffmann T, McKenna K, Lovarini M
Journal: Journal of evaluation in clinical practice
Volume: 11
Issue: 6
Pagination: 547-55
Date Published: 2005 Dec
ISSN: 1356-1294
Keywords: Australia; Humans; Internet; National Health Programs; Occupational Therapy; Randomized Controlled Trials as Topic
Abstract

RATIONALE AND AIMS: 'OTseeker' is an online database of randomized controlled trials (RCTs) and systematic reviews relevant to occupational therapy. RCTs are critically appraised and rated for quality using the 'PEDro' scale. We aimed to investigate the inter-rater reliability of the PEDro scale before and after revising rating guidelines.

METHODS: In study 1, five raters scored 100 RCTs using the original PEDro scale guidelines. In study 2, two raters scored 40 different RCTs using revised guidelines. All RCTs were randomly selected from the OTseeker database. Reliability was calculated using Kappa and intraclass correlation coefficients [ICC (model 2,1)].
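To make the two reliability statistics named above concrete, the following is a minimal Python sketch of Cohen's kappa for two raters and of ICC (model 2,1) in the Shrout and Fleiss two-way random-effects, single-rater form. It is not the authors' analysis code: the function names and the ratings are hypothetical, and the abstract does not say which kappa variant was used for the five raters in study 1.

```python
def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters scoring the same items (categorical)."""
    n = len(rater_a)
    # Observed proportion of items on which the raters agree.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's marginal proportions.
    categories = set(rater_a) | set(rater_b)
    p_expected = sum(
        (rater_a.count(c) / n) * (rater_b.count(c) / n) for c in categories
    )
    # Undefined (division by zero) if both raters use a single category.
    return (p_observed - p_expected) / (1 - p_expected)


def icc_2_1(scores):
    """ICC(2,1): rows are rated items (trials), columns are raters."""
    n = len(scores)       # number of rated items
    k = len(scores[0])    # number of raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    bms = ss_rows / (n - 1)               # between-items mean square
    jms = ss_cols / (k - 1)               # between-raters mean square
    ems = ss_error / ((n - 1) * (k - 1))  # residual mean square
    return (bms - ems) / (bms + (k - 1) * ems + k * (jms - ems) / n)


# Hypothetical data: one yes/no PEDro item rated by two raters on eight trials.
item_rater_1 = [1, 1, 0, 1, 0, 0, 1, 1]
item_rater_2 = [1, 0, 0, 1, 0, 1, 1, 1]
print("kappa:", round(cohen_kappa(item_rater_1, item_rater_2), 2))

# Hypothetical data: total PEDro scores for five trials from two raters.
total_scores = [[6, 6], [4, 5], [8, 7], [3, 3], [7, 7]]
print("ICC(2,1):", round(icc_2_1(total_scores), 2))
```

In this sketch, kappa applies to the individual yes/no scale items and the ICC to the total score. ICC (2,1) treats raters as a random sample and penalizes systematic differences between them, not only inconsistent ranking of trials.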

RESULTS: Inter-rater reliability was 'good to excellent' in the first study (Kappas ≥ 0.53; ICCs ≥ 0.71). After revising the rating guidelines, the reliability levels were equivalent to or higher than those previously obtained (Kappas ≥ 0.53; ICCs ≥ 0.89), except for the item 'groups similar at baseline', which still had moderate reliability (Kappa = 0.53). In study 2, two PEDro scale items whose definitions had been revised, 'less than 15% dropout' and 'point measures and variability', showed higher reliability. In both studies, the PEDro items with the lowest reliability were 'groups similar at baseline' (Kappas = 0.53), 'less than 15% dropout' (Kappas

CONCLUSION: The PEDro scale is a reliable instrument for rating the quality of RCTs. Revised rating guidelines are provided for scale items that are difficult to rate; these guidelines helped to improve inter-rater reliability.

DOI: 10.1111/j.1365-2753.2005.00574.x
Alternate Journal: J Eval Clin Pract