Over the last decade, our Data Science team at Harvard's Institute for Quantitative Social Science has been iteratively developing Dataverse, a data repository framework to facilitate and enhance data sharing, preservation, citation, reuse and analysis. The open source Dataverse software has been installed as a research data repository at multiple institutions worldwide. The Dataverse repository hosted at Harvard University is open to all researchers, and currently has over 53,000 data sets containing 734,000 files. During the last two years, based on user feedback and community practices, we have implemented extensible data publishing workflows and effective ways to link publications to data. In this talk, I'll present what we have learned in the process, and how it has helped us define data publishing.
by Mercè Crosas