Advancing Computational Reproducibility in the Dataverse Data Repository Platform


Trisovic A, Durbin P, Schlatter T, Durand G, Barbosa S, Brooke D, Crosas M. Advancing Computational Reproducibility in the Dataverse Data Repository Platform, in P-RECS '20: Proceedings of the 3rd International Workshop on Practical Reproducible Evaluation of Computer Systems. ; 2020 :15–20.


Recent reproducibility case studies have raised concerns showing that much of the deposited research has not been reproducible. One of their conclusions was that the way data repositories store research data and code cannot fully facilitate reproducibility due to the absence of a runtime environment needed for the code execution. New specialized reproducibility tools provide cloud-based computational environments for code encapsulation, thus enabling research portability and reproducibility. However, they do not often enable research discoverability, standardized data citation, or long-term archival like data repositories do. This paper addresses the shortcomings of data repositories and reproducibility tools and how they could be overcome to improve the current lack of computational reproducibility in published and archived research outputs.

Publisher's Version

Last updated on 10/19/2020