Skip to main content

Data Management

Resources in documenting, storing and preserving research data

Choosing a Data Repository

At the end of your research project, it is good practice to deposit your research data in a repository that ensures the data’s long-term preservation and accessibility.  Your choice of repository will depend on factors like:

  • Funder requirements
  • Anticipated publication venues (e.g. journals, conferences, etc.) & their data sharing policies
  • Institutional data policies
  • Repository platform features (Is the repository searchable? Does the repository provide metadata? Etc.)

Having trouble deciding on a repository? Library Data Services (kate.barron@sjsu.edu) can help you choose a repository that best fits your needs. In addition, the Data Repository Comparison Chart (template courtesy of MIT Libraries) can help you to compare several choices along the lines of scope, metadata and indexing, deposit policies and workflows, and more. Remember, you can submit your data to more than one repository if you so choose.

Domain-Specific Repository

There are many data repositories specific to research domain. Use the Registry of Research Data Repositories to search and/or browse repositories by subject. Each listed repository will include information about sponsoring institutions, terms of data deposit and access, and scope. You might also consult Open Access Directory's List of Disciplinary Repositories.

Notable, domain-specific repositories to be aware of include:

  • ICPSR: The Inter-university Consortium for Political and Social Research “maintains a data archive of more than 250,000 files of research in the social and behavioral sciences.” In addition to data archiving, ICPSR provides leadership/education in and conducts research on social science data curation and analysis.
  • DataONE: Data Observation Network for Earth. This repository hosts environmental research data, and also provides a variety of educational materials in data literacy and data management.
  • Dryad: A repository for scientific and medical research data, which can host a variety of data formats.
  • FigShare: A multi-disciplinary repository, which includes research materials from the physical sciences, social sciences, arts and humanities.

Institutional Repository: SJSU ScholarWorks

SJSU ScholarWorks is a digital repository of the research, scholarship, and creative works of San José State University faculty, students, and staff. The repository can accept any file type for deposit, including research data. Deposited files are indexed in Google and Google Scholar, which improves worldwide discovery of one's work. ScholarWorks is a good option for hosting artefacts from all research phases—for example, SJSU ScholarWorks can host a copy of your research article, along with the data supporting that article. For more information, contact ScholarWorks Coordinators at scholarworks@sjsu.edu.

Self-Published Repository: Dataverse

Dataverse is an open-source software created & distributed by the Institute for Quantitative Social Science (IQSS) in collaboration with the Harvard University Library and Harvard University Information Technology. You can download the software and use it to host “dataverse(s)” (essentially, data repositories). Dataverse is a good option if you’d like extra control in the distribution of your research data, as the software allows you to not only provide your research data and documentation, but also to manage its delivery medium. Visit The Dataverse Project website for more information.

Alternatively, if you like the Dataverse’s style and features, but would rather not manage your own repository, you can create and/or submit repository(ies) and dataset(s) to Harvard’s Dataverse.