Data management is the process of validating, organizing, protecting, maintaining, and processing scientific data to ensure the accessibility, reliability, and quality of the data for its users.
Proper data management helps maintain scientific rigor and research integrity. Keeping good track of data and associated documentation lets researchers and collaborators use data consistently and accurately. Carefully storing and documenting data also allows more people to use the data in the future, potentially leading to more discoveries beyond the initial research.
- Information from the NIH website
If you are looking for repositories, please check the Where should I publish? Datasets section.
If you are looking for NIH specific information, please scroll down to our NIH section.
We encourage researchers to manage their data while keeping in mind FAIR principles. FAIR stands for:
Examples of how to make data FAIR include, but are not limited to: having DOIs, using Creative Commons or other clear and accessible data usage licenses, and using standardized language and organization. Following these principles helps ensure that data will not disappear or be forgotten. You can learn more about how to apply FAIR at the GO FAIR website.
As of January 25th, 2023, the NIH expects applicants to submit a plan for how they will manage and share their data and allows applicants to include certain costs associated with data management and sharing in their budget. This includes all NIH-supported research regardless of funding level, including extramural grants, extramural contracts, intramural research projects, and other funding agreements. Data is defined as any data needed to validate and replicate research findings, and does not include laboratory notebooks, preliminary analyses, completed case report forms, drafts of scientific papers, plans for future research, peer reviews, communications with colleagues, or physical objects such as laboratory specimens.
Your data plan should include the following:
For further information, you can check the NIH website or the OSF Data Management checklist.
The NIH has an official example of a data management plan, and the OSF Data Management working group also has a searchable database of example data management plans. The NIH has also provided a suggested list of NIH preferred repositories.
PLoS Computational Biology has also published a paper called Ten simple rules for maximizing the recommendations of the NIH data management and sharing plan.
For more information, please contact CRIO or email the NIH directly at sharing@nih.gov.