Strategic Goals, Roadmap, and Releases

Strategic Goals

The Strategic Goals of the Dataverse Project are our highest-level guide.  These goals are to:

  1. increase adoption (users, dataverses, datasets, installations, journals)
  2. finish Dataverse 4 migration features
  3. develop capability to handle sensitive, large scale, and streaming data
  4. expand data and metadata features for existing and new disciplines
  5. expand archival and preservation features
  6. increase interoperability through implementation of standards
  7. increase contributions from the open-source development community
  8. improve UX and UI
  9. continue to increase the quality of the software

Throughout the year, we'll identify big steps that we can take to focus on one or more of these goals. These big steps are represented on our Roadmap. The Roadmap items that we're about to work on will be well defined, but those Roadmap items that are further out may just be big problems we know we need to solve in some way. Although we are committed to Roadmap items below, the timeframe of the items further out might vary slightly as critical issues, other priorities or dependencies rise.

Once we know what features and enhancements we'll add in order to honor the steps on the roadmap, we'll plan a Release. If the release text is hyperlinked, you can click on it to be taken to our task board to see the status of the release's tasks.

Q2 2017

  • Tabular Mapping

  • Customization

  • Community Release

4.6.2 Tabular Mapping (released June 7th)

Tabular files containing geospatial information may be tagged and mapped via the publicly available WorldMap platform maintained by Harvard University's Center for Geographic Analysis.

 

4.7 Customization and Community Release (released June 23rd)

A new, customizable homepage, header, and footer will improve the user experience for researchers visiting and using Dataverse. 

At the 2017 Dataverse Community Meeting Hackathon, attendees worked on items of interest to their installations and research communities. These items will be included in this release.

Q3 2017

  • Administrative Dashboard
  • AWS S3 Support
  • Support for Large Data

4.7.1. Dashboard (released July 14th)

Administrators will be able to manage installation users and superusers through a user interface. This also provides the base for further UI administrative functionality.
 

 

4.8 AWS S3 Support

Administrators will be able to run Dataverse on AWS S3, which is more cost effective than other AWS options. This also provides a new cloud-based option to run Dataverse for current and new installations.
 
 

4.9 Large Data Upload Integration

Dataverse integration with the Data Capture Module, an optional component for deposition of large datasets (both large number of files and large files).  Specific support for large datasets includes client-side checksums, non-http uploads (currently supporting rsync over ssh) and preservation of in-place directory hierarchy. This will expand Dataverse to other disciplines and will allow the project to handle large scale data.

Q4 2017

  • Data Provenance
  • Support for Sensitive Data

5.0 Support for Sensitive Data and Data Provenance

By implementing DataTags file-level security and access requirements, integration with DataTags interview tool, and the PSI differential privacy tool, Dataverse will be able to support sensitive data. 

Integrating with a data provenance system will allow users to track of where data files and datasets came from and how they were modified. This expansion of the data and metadata features of Dataverse increases reproducibility.