Research Data


Research Data Services for the Health Sciences

This site is a collection of information and services for researchers using health data. The collection is curated by the Cushing/Whitney Medical Library. If you have questions about the site or its content, email

What is Research Data?

"Recorded factual material commonly accepted in the scientific community as necessary to document and support research findings. This does not mean summary statistics or tables; rather, it means the data on which summary statistics and tables are based." (NIH Data Sharing Policy and Implementation Guidance)

Types of Research Data:

  • Observational data

  • Experimental data

  • Simulation data


Research Data Stages:

  • Raw data

  • Processed data

  • Intermediate data

  • Derived data

Research Data Does Not Include:

  • Summary statistics, tables, or visualizations

  • Physical objects such as gels or lab specimens


What is Research Data Management?

The care and maintenance of data produced during research, through:

  • File and folder organization
  • Data backups
  • Applying appropriate security measures
  • Preserving the context and meaning of the data through documentation and metadata
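
As one illustration of the backup item above, the following minimal sketch checks that a backup copy is byte-for-byte identical to the original by comparing SHA-256 checksums. The function names `sha256_of` and `backup_matches` are hypothetical helpers written for this example, not part of any library tool, and the approach is only one of several reasonable ways to verify a copy.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def backup_matches(original: Path, backup: Path) -> bool:
    """Hypothetical helper: True when both files have the same checksum."""
    return sha256_of(original) == sha256_of(backup)
```

Recording the checksum alongside the data also gives later users a way to confirm that a file has not changed since it was deposited.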

Throughout the research process, you should complete the following data management activities:

  1. Plan your data management efforts early, with a data management plan
  2. Include data management costs in your application budget
  3. Use descriptive file naming conventions
  4. Store your data in multiple locations
  5. Define roles and assign responsibilities for data management within your research team
  6. Identify and use relevant metadata standards
  7. Deposit your data into an appropriate repository
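
To make the file naming step concrete, here is a minimal sketch of one possible convention: project, short description, ISO-style date, and a zero-padded version number, joined with underscores. The function `build_file_name` and the pattern itself are assumptions chosen for illustration; your lab or funder may prescribe a different scheme.

```python
import re
from datetime import date


def build_file_name(project: str, description: str, collected: date,
                    version: int, extension: str) -> str:
    """Hypothetical helper: build a descriptive, sortable file name."""

    def slug(text: str) -> str:
        # Lowercase the text and collapse anything that is not a letter or digit into "-".
        return re.sub(r"[^a-z0-9]+", "-", text.lower()).strip("-")

    suffix = extension.lstrip(".").lower()
    # YYYYMMDD dates sort chronologically; zero-padded versions sort numerically.
    return f"{slug(project)}_{slug(description)}_{collected:%Y%m%d}_v{version:02d}.{suffix}"


name = build_file_name("Sleep Study", "Actigraphy raw", date(2024, 3, 5), 2, ".CSV")
print(name)  # sleep-study_actigraphy-raw_20240305_v02.csv
```

Avoiding spaces and special characters keeps names portable across operating systems, and placing the date and version in fixed positions lets files sort predictably in a folder listing.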

For more information about Research Data Management, see this Library Guide.

What skills do biomedical data scientists need?
  • Dunn, M. C., & Bourne, P. E. (2017). Building the biomedical data science workforce. PLoS Biology, 15(7), e2003082.
  • Attwood, T. K., Blackford, S., Brazas, M. D., Davies, A., & Schneider, M. V. (2019). A global perspective on evolving bioinformatics and data science training needs. Briefings in Bioinformatics, 20(2), 398–404.

Learn about funder requirements and get feedback on your data management plan (DMP)

Data Management Plans

Discover and compare helpful tools

Data Tools & Software

Learn about upcoming data classes and view class materials

Data Classes & Materials

Find the repository you need for the data you want

Find Datasets

Find places to store your data

Data Storage

Learn about best practices and definitions for data management projects.

Best Practices & Definitions

Explore different data support groups available across Yale.

Data Support Groups at Yale

Meet and greet

Consultations & Drop-Ins