Many analysts, one data set: Making transparent how variations in analytic choices affect results

Open Access
Authors
  • R. Silberzahn
  • E.L. Uhlmann
  • D.P. Martin
  • P. Anselmi
  • F. Aust
  • E. Awtrey
  • Š. Bahník
  • F. Bai
  • C. Bannard
  • E. Bonnier
  • R. Carlsson
  • F. Cheung
  • G. Christensen
  • R. Clay
  • M.A. Craig
  • A. Dalla Rosa
  • L. Dam
  • M.H. Evans
  • I. Flores Cervantes
  • N. Fong
  • M. Gamez-Djokic
  • A. Glenz
  • S. Gordon-McKeon
  • T.J. Heaton
  • K. Hederos
  • M. Heene
  • A.J. Hofelich Mohr
  • F. Högden
  • K. Hui
  • M. Johannesson
  • J. Kalodimos
  • E. Kaszubowski
  • D.M. Kennedy
  • R. Lei
  • T.A. Lindsay
  • S. Liverani
  • C.R. Madan
  • D. Molden
  • E. Molleman
  • R.D. Morey
  • L.B. Mulder
  • B.R. Nijstad
  • N.G. Pope
  • B. Pope
  • J.M. Prenoveau
  • F. Rink
  • E. Robusto
  • H. Roderique
  • A. Sandberg
  • E. Schlüter
  • F.D. Schönbrodt
  • M.F. Sherman
  • S.A. Sommer
  • K. Sotak
  • S. Spain
  • C. Spörlein
  • T. Stafford
  • L. Stefanutti
  • S. Tauber
  • J. Ullrich
  • M. Vianello
  • E.-J. Wagenmakers
  • M. Witkowiak
  • S. Yoon
  • B.A. Nosek
Publication date 09-2018
Journal Advances in Methods and Practices in Psychological Science
Volume | Issue number 1 | 3
Pages (from-to) 337-356
Number of pages 20
Organisations
  • Faculty of Social and Behavioural Sciences (FMG) - Psychology Research Institute (PsyRes)
Abstract

Twenty-nine teams involving 61 analysts used the same data set to address the same research question: whether soccer referees are more likely to give red cards to dark-skin-toned players than to light-skin-toned players. Analytic approaches varied widely across the teams, and the estimated effect sizes ranged from 0.89 to 2.93 (Mdn = 1.31) in odds-ratio units. Twenty teams (69%) found a statistically significant positive effect, and 9 teams (31%) did not observe a significant relationship. Overall, the 29 different analyses used 21 unique combinations of covariates. Neither analysts’ prior beliefs about the effect of interest nor their level of expertise readily explained the variation in the outcomes of the analyses. Peer ratings of the quality of the analyses also did not account for the variability. These findings suggest that significant variation in the results of analyses of complex data may be difficult to avoid, even by experts with honest intentions. Crowdsourcing data analysis, a strategy in which numerous research teams are recruited to simultaneously investigate the same research question, makes transparent how defensible, yet subjective, analytic choices influence research results.
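
As an illustrative aside, the quantities reported in the abstract can be sketched in a few lines of Python. This is a minimal sketch, not the method of any of the 29 teams: the 2x2 counts and the helper name `odds_ratio` are hypothetical and chosen only to show what an effect size "in odds-ratio units" means. Only the team counts (20 and 9 of 29) come from the abstract.

```python
def odds_ratio(exposed_events, exposed_non_events,
               unexposed_events, unexposed_non_events):
    """Odds ratio for a 2x2 table: odds in the first group / odds in the second."""
    odds_exposed = exposed_events / exposed_non_events
    odds_unexposed = unexposed_events / unexposed_non_events
    return odds_exposed / odds_unexposed


# Hypothetical counts (NOT from the study's data set):
# red cards vs. no red cards for two groups of players.
example_or = odds_ratio(40, 960, 20, 980)

# An odds ratio above 1 means higher odds of a red card in the first group;
# a value of 1 means no difference between the groups.
print(f"example odds ratio: {example_or:.2f}")

# Shares of teams reported in the abstract: 20 and 9 out of 29.
teams_total = 29
teams_significant = 20
teams_not_significant = teams_total - teams_significant

pct_significant = round(100 * teams_significant / teams_total)
pct_not_significant = round(100 * teams_not_significant / teams_total)
print(f"{pct_significant}% significant, {pct_not_significant}% not significant")
```

On this reading, the reported range of 0.89 to 2.93 runs from slightly lower odds of a red card for dark-skin-toned players to nearly three times the odds, depending on the analytic choices each team made.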

Document type Article
Note Corrigendum published in: Advances in Methods and Practices in Psychological Science (2018) Vol. 1, iss. 4, p. 580. - With supplementary material.
Language English
Published at https://doi.org/10.1177/2515245917747646
Other links https://doi.org/10.1177/2515245918810 https://osf.io/gvm2z/ https://www.scopus.com/pages/publications/85116837831