Measuring inter-rater reliability for nominal data - which coefficients and confidence intervals are appropriate?
dc.contributor.author | Zapf, Antonia | |
dc.contributor.author | Castell, Stefanie | |
dc.contributor.author | Morawietz, Lars | |
dc.contributor.author | Karch, André | |
dc.date.accessioned | 2016-10-07T10:52:37Z | |
dc.date.available | 2016-10-07T10:52:37Z | |
dc.date.issued | 2016 | |
dc.identifier.citation | Measuring inter-rater reliability for nominal data - which coefficients and confidence intervals are appropriate? 2016, 16:93 BMC Med Res Methodol | en |
dc.identifier.issn | 1471-2288 | |
dc.identifier.pmid | 27495131 | |
dc.identifier.doi | 10.1186/s12874-016-0200-9 | |
dc.identifier.uri | http://hdl.handle.net/10033/620542 | |
dc.description.abstract | Reliability of measurements is a prerequisite of medical research. For nominal data, Fleiss' kappa (in the following labelled as Fleiss' K) and Krippendorff's alpha provide the highest flexibility of the available reliability measures with respect to number of raters and categories. Our aim was to investigate which measures and which confidence intervals provide the best statistical properties for the assessment of inter-rater reliability in different situations. | |
dc.language.iso | en | en |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | * |
dc.title | Measuring inter-rater reliability for nominal data - which coefficients and confidence intervals are appropriate? | en |
dc.type | Article | en |
dc.contributor.department | Helmholtz Centre for Infection Research, Inhoffenstr. 7, 38124 Braunschweig, Germany. | en |
dc.identifier.journal | BMC Medical Research Methodology | en |
refterms.dateFOA | 2018-06-13T00:17:35Z | |