Publication of Data
Repositories are used to archive, document and publish digital objects. They are storage locations for data, which enable the publication and archiving of data independent of the scholarly article itself in terms of time and space.
Depending on the repository, data, data sets, descriptions of experiments and evaluations, audio-visual objects such as image and video files, models of simulations and also software can be published. In some repositories, entire research data sets can be stored in their complex form as a single unit (e.g. “collection” in PANGAEA). read more
Types of repositories
Repositories can be classified according to various aspects. Most often, they are distinguished by whether they are discipline-specific, cross-discipline/generic or institutional.
Discipline-specific or disciplinary repositories offer the advantage of visibility in the research community and are already established institutions in some disciplines. However, not all academic subject areas have yet established discipline-specific repositories.
These repositories usually offer discipline-specific knowledge and know-how in the field of research data management, e.g. with regard to the curation of data or to special services (e.g. visualisation tools), but also specific tools for the analysis, research and visualisation.
Examples of discipline-specific repositories:
- ARCHE (https://arche.acdh.oeaw.ac.at/) a Resource Centre for Humanities Related Research in Austria
- AUSSDA (https://aussda.at/) the Austrian Social Science Data Archive
- GenBank (http://www.ncbi.nlm.nih.gov/genbank) for genetic research data
- Integrated Ocean Observing System (https://ioos.noaa.gov) for data from marine environmental research
- NoMaD (http://nomad-repository.eu) for data on materials in the field of physics
- PANGAEA (https://pangaea.de) for earth science data
- TextGrid repository (https://textgridrep.org/) for the humanities (XML/TEI-based).
For interdisciplinary research, the assignment of the resulting data to a subject area may be difficult. Cross-disciplinary repositories offer a solution here. They generally accept very different types of data and provide a good search function. In most cases, however, they do not curate the data or offer other forms of quality control.
Examples of cross-disciplinary, generic repositories:
- ZENODO (https://zenodo.org): Digital data from all research areas, a product from the EU OpenAIRE project
- DRYAD (http://datadryad.org): Focus on life sciences, not free of charge
Institutional repositories are also emerging more and more. Currently, they offer an alternative if no suitable discipline-specific repository is available. Compared to generic repositories, they have the advantage that control over the data remains in-house and at the same time the visibility of one’s own institution is increased. Legal framework conditions for handling data can also speak in favour of publication in one’s own repository. Institutional repositories are generally available and can be used free of charge for all of the institution’s own subject areas.
Some examples of institutional repositories in Austria:
- TU Vienna (https://researchdata.tuwien.ac.at/)
- University Vienna (https://phaidra.univie.ac.at/)
- TU Graz (https://repository.tugraz.at/)
- IST Austria (https://research-explorer.app.ist.ac.at/)
- FH St. Pölten (https://phaidra.fhstp.ac.at/)
Selection of a repository
In order to find an appropriate repository, the cross-disciplinary directory re3data (https://www.re3data.org) can be used. This is a DFG-funded project that lists German and international repositories for research data, with more than 2.700 entries at present (status from 02.08.2021). Here you may select the discipline, type of data or country. It is also possible to filter by very detailed criteria, for example for repositories that charge a fee for data upload or where data use is restricted.
When selecting a repository for data publication, the following order of priority is recommended:
- an external discipline-specific repository that is recognised in the discipline
- an institutional repository, or
- a cost-free multidisciplinary repository
Specific criteria for selecting a suitable repository:
- certification, e.g. Core Trust Seal
- (automated) assignment of persistent identifiers, e.g. DOI, handle
- access to data: open, restricted or inaccessible
The options for choosing a license may also influence the choice of a suitable repository.
(Adapted from: Biernacka, K., Bierwirth, M., Buchholz, P., Dolzycka, D., Helbig, K., Neumann, J., Odebrecht, C., Wiljes, C. and Wuttke, U. (2020). “Train-the-Trainer Concept on Research Data Management” Version 3.0. Berlin, DOI: https://doi.org/10.5281/zenodo.4071471, Creative Commons Attribution 4.0 International)
Duration: 9:00 mins
Content: “Research data for publications” is an educational video that uses a fictional example to show how to deal with research data that form the basis of a publication. It explains how such data can be consciously selected, structured in a comprehensible way and finally made publicly available on a repository for reuse.
Schmitz, D., Hausen, D., Trautwein-Bruns, U. (2018). “Research data to publications” RWTH Aachen University, DOI: 10.18154/RWTH-2019-10063)
License: CC BY 4.0
- Generalist Repository Comparison Chart:
This chart is designed to assist researchers in finding a generalist repository should no domain repository be available to preserve their research data.
- Fact Sheet: Research Data Repositories
With this fact sheet, we would like to provide all interested parties with important information about research data repositories.
- Research data repositories:
A short description of repositories with examples and furhter links.
- Data Guidelines European Commission
This page provides information about research data that needs to be include when publishing an article in Open Research Europe, where your data can be stored, and how your data should be presented.
FAIR Data Austria (2021). “Publication of Data”. In: Research Data Management Open Educational Resources Collection. (https://fair-office.at/index.php/publikation-von-daten/?lang=en).
License: CC BY 4.0 unless otherwise stated