Skip to main content

Data management

SciLifeLab and NBIS can support you with data management planning early on in the project to make data sharing more efficient. We can also assist you in identifying relevant repositories and common international standards for describing and publishing your data, as well as guide you through the submission process.

Data management resources provided by NBIS

A Brief Introduction to Data Management

SciLifeLab Data Guidelines - data type specific guidelines for good data management practices

SciLifeLab DS Wizard, DSW - a customised tool for creating Data Management Plans (DMPs) with templates conforming to the requirements of the Swedish Research Council and other stakeholders

Free Consultations - data stewards from NBIS are available to discuss and provide support regarding data management questions

Depositing data in a public repository

By depositing data in a public repository, you accept that the data will be published and may be used and redistributed according to specific terms set by yourself or the repository in question. Data deposition is considered important and you may get credit and acknowledgement for sharing data. A publication often results in getting a unique identifier for your dataset, which is often required for a journal to accept a manuscript for publication. Data sharing is often required by funders.

If required, data can often be published with a set moratorium, which means that data will not go public until a date that you set.

Note that data from human individuals might need special considerations when it comes to publishing.

Storing data at your university or SNIC

In Sweden, raw measurement data that is generated in a project belongs to the university, while the results are owned by the researcher - the so called teacher’s exception. Unless the data needs protection because of contractual obligations or personal privacy, for example in sensitive personal data such as biomedical measurements of humans, data is considered public.

The university has the legal responsibility to archive the data. Data publishing as supported here is important for the research process, but does not replace the legal responsibility of the university. As a researcher, you are recommended to ensure long term storage of your data in facilities provided by your university. While a project is active, compute and storage services can be provided by Swedish National Infrastructure for Computing, SNIC.

Metadata

Metadata provides ‘data about data’ , and may include information on the methodology used to collect the data, analytical and procedural information, definitions of variables, units of measurement, any assumptions made, the format and file type of the data and software used to collect and/or process the data. Researchers are strongly encouraged to use community metadata standards where these are in place. It is highly recommended to, from the very beginning of the project, structure e.g. sample metadata in a way that enables sequence data submission without having to reformat the metadata.

Here are our metadata guidelines for each specific data type:

MINSEQE (Minimal Information about a high throughput SEQuencing Experiment) is the preferred minimal metadata standard for transcriptomics data in general. For viral data, consider using the ENA virus pathogen reporting standard checklist.


For proteomics, use the minimal information model specified in MIAPE, and fill them using the controlled vocabularies specified by the Proteomics Standards Initiative: PSI CVs.


More information about metadata standards

Other data management resources

The ELIXIR Research Data Management Kit, RDMkit

Swedish National Data Service - Manage data