HomeResearch DataManage Data

Manage Data

NEW: 2023 NIH Data Sharing Policy

What are the essential components of data management?

  • Plan for data management when you start your research project
  • Organize your data (preferably according to a schema using established data and metadata standards)
  • Document your data so that it can be understood in context later
  • Store data with reuse and security in mind ⁠— keep original data files, use version control, and back up data in multiple locations
  • Secure your data by following all cybersecurity protocols, based on your data's risk
  • Validate your data, and assess for data quality
  • Share your data
  • Cite your data

Learn more in this Research Data Management guide.


What is research data management?

Research data management is the care and maintenance of data produced during research. It starts when your project starts, and continues through the end of the project, and sometimes extends beyond that. It has many components, but in summary, it involves planning, organizing, documenting, storing, securing, assessing, citing, and sharing your data alongside your research.

Why should you care about research data management?

Good research data management helps you:

  • Find, analyze, and reuse your own data — even within your own team 
  • Explain your data to others
  • Increase the rigor of your data and methods, which can increase your research impact
  • Stay publication-ready
  • Contribute to the scientific record
  • Share your data and make it reusable
  • Stay compliant with institutional and funder requirements
What are Yale's policies regarding data management?

Many of Yale's pertinent policies are summarized below:

Policy Summary
Research Data and Materials Policy This policy applies to all research data and materials generated with Yale resources, and covers data ownership, retention, transfer, sharing, and access policies. Notable requirements include that Yale researchers must make their data publicly available "to the extent feasible while minimizing harm" and that data and materials must be retained for at least three years after publication or final reporting.
Data Classification Policy This policy explains data risk level definitions and how to choose secure data systems based on the data's risk level. For more assistance, read the policy guidelines, and take the data classification questionnaire to determine your data's risk.
What are funder policies regarding data management?

Below, basic information as it pertains to data management is summarized for several major funders. Most government agencies require data management plans, and data sharing upon project completion. Though we make an effort to keep this information updated, please consult information from your funder of choice as well before moving forward with an application.

Funding Organization Data management plan required? DMPTool template available? Additional Information
U.S. National Institutes of Health (NIH) Yes Yes The NIH Data Management and Sharing Policy will be updated on January 23, 2023.
U.S. National Science Foundation Yes Yes Requirements can vary depending on the scientific concentration.
U.S. Department of Defense Yes Yes  
U.S. Department of Energy Yes Yes Requirements can vary across different offices, such as the Office of Science and Office of Energy.
United Kingdom Research & Innovation (UKRI) Councils Yes - for BBSRC. No Requirements differ across councils such as the Medical Research Council (MRC), Biotechnology and Biological Sciences Research Council (BBSRC), and Engineering and Physical Sciences Research Council (EPSRC).

(To see more data management and sharing requirements, view this list from SPARC.)

Request a Research Data Management workshop

Email the data librarian for the health sciences, Kaitlin Throgmorton, at kaitlin.throgmorton@yale.edu to discuss and schedule a custom research data management workshop for your department, class, lab, or team.


Additional Resources

Popular Data Management Tools

  • DMPTool — This tool has data management plan (DMP) templates for most major funders, including the NIH and NSF, that you can select, complete, and use to generate a DMP.
  • FairSharing.org — This website allows you to search for relevant data and metadata standards and policies across many subject areas.
  • re3data.org — This registry of data repositories allows you to search for places to find and deposit data by keyword.
  • Dryad — This digital repository enables finding and depositing of data. Yale is an institutional member of this service, which means you can deposit data in Dryad for free.
  • LabArchives — Licensed by Yale and free for those with a Yale NetID, this cloud-based electronic lab notebook (ELN) allows users to store and manage data in one place.