Multiverse analysis is a scientific method that specifies and then runs a set of plausible alternative models or statistical tests for a single hypothesis.[1] It is a method to address the issue that the "scientific process confronts researchers with a multiplicity of seemingly minor, yet nontrivial, decision points, each of which may introduce variability in research outcomes".[2] A problem also known as Researcher degrees of freedom[3] or as the garden of forking paths. It is a method arising in response to the credibility and replication crisis taking place in science, because it can diagnose the fragility or robustness of a study's findings. Multiverse analyses have been used in the fields of psychology[4] and neuroscience.[5] It is also a form of meta-analysis allowing researchers to provide evidence on how different model specifications impact results for the same hypothesis, and thus can point scientists toward where they might need better theory or causal models.
Details
editMultiverse analysis most often produces a large number of results that tend to go in all directions. This means that most studies do not offer consensus or specific rejection of an hypothesis. Its strongest utilities thus far are instead to provide evidence against conclusions based on findings from single studies or to provide evidence about which model specifications are more or less likely to cause larger or more robust effect sizes (or not).
Evidence against single studies or statistical models, is useful in identifying potential false positive results. For example, a now infamous study concluded that female gender named hurricanes are more deadly than male gender named hurricanes.[6] In a follow up study,[7] researchers ran thousands of models using the same hurricane data, but making various plausible adjustments to the regression model. By plotting a density curve of all regression coefficients, they showed that the coefficient of the original study was an extreme outlier.
In a study of birth order effects,[8] researchers visualized a multiverse of plausible models using a specification curve which allows researchers to visually inspect a plot of all model outcomes against various model specifications. They could show that their findings supported previous research of birth order on intellect, but provided evidence against an effect on life satisfaction and various personality traits.
References
edit- ^ Steegen, Sara; Tuerlinckx, Francis; Gelman, Andrew; Vanpaemel, Wolf (September 2016). "Increasing Transparency Through a Multiverse Analysis". Perspectives on Psychological Science. 11 (5): 702–712. doi:10.1177/1745691616658637.
- ^ Breznau, Nate; et al. (28 October 2022). "Observing many researchers using the same data and hypothesis reveals a hidden universe of uncertainty". Proceedings of the National Academy of Sciences. 119 (44). doi:10.1073/pnas.2203150119. hdl:2066/285367.
- ^ Wicherts, Jelte M.; Veldkamp, Coosje L. S.; Augusteijn, Hilde E. M.; Bakker, Marjan; van Aert, Robbie C. M.; van Assen, Marcel A. L. M. (2016). "Degrees of Freedom in Planning, Running, Analyzing, and Reporting Psychological Studies: A Checklist to Avoid p-Hacking". Frontiers in Psychology. 7: 1832. doi:10.3389/fpsyg.2016.01832. PMC 5122713. PMID 27933012.
- ^ Harder, Jenna A. (2020). "The Multiverse of Methods: Extending the Multiverse Analysis to Address Data-Collection Decisions". Perspectives on Psychological Science. 15 (5): 1158–1177. doi:10.1177/1745691620917678. ISSN 1745-6916.
- ^ Clayson, Peter E. (2024-03-01). "Beyond single paradigms, pipelines, and outcomes: Embracing multiverse analyses in psychophysiology". International Journal of Psychophysiology. 197: 112311. doi:10.1016/j.ijpsycho.2024.112311. ISSN 0167-8760.
- ^ Jung, Kiju; Shavitt, Sharon; Viswanathan, Madhu; Hilbe, Joseph M. (17 June 2014). "Female hurricanes are deadlier than male hurricanes". Proceedings of the National Academy of Sciences. 111 (24): 8782–8787. doi:10.1073/pnas.1402786111. PMC 4066510. PMID 24889620.
- ^ Muñoz, John; Young, Cristobal (August 2018). "We Ran 9 Billion Regressions: Eliminating False Positives through Computational Model Robustness". Sociological Methodology. 48 (1): 1–33. doi:10.1177/0081175018777988.
- ^ Rohrer, Julia M.; Egloff, Boris; Schmukle, Stefan C. (December 2017). "Probing Birth-Order Effects on Narrow Traits Using Specification-Curve Analysis". Psychological Science. 28 (12): 1821–1832. doi:10.1177/0956797617723726. hdl:21.11116/0000-0000-3826-7.