Research Data Management and Sharing: Finding Data Sources
A guide for researchers on how to manage and share the data they generate.
Finding Data Sources
Many datasets are freely available online. The list below is a starting point and is not exhaustive.
- Data.gov: Search and access over 100,000 federal datasets in science, health, research, education, and other topics.
- HealthData.gov: Data on health topics from a number of government agencies, including the U.S. Department of Health and Human Services, the Centers for Medicare and Medicaid Services, the Centers for Disease Control and Prevention, the Food and Drug Administration, and the Agency for Healthcare Research and Quality.
- Data Discovery at the National Library of Medicine: A platform that provides access to datasets from selected NLM resources.
- ICPSR: The Inter-university Consortium for Political and Social Research (ICPSR) maintains a data archive of more than 250,000 files of research in the social and behavioral sciences. It hosts 21 specialized collections of data in education, aging, criminal justice, substance abuse, terrorism, and other fields.
- DANDI Archive: An archive from the NIH BRAIN Initiative for publishing and sharing neurophysiology data, including electrophysiology, optophysiology, and behavioral time series, as well as images from immunostaining experiments. The open dataset is also listed at https://registry.opendata.aws/dandiarchive/
- National Center for Health Statistics: NCHS collects, analyzes, and disseminates timely, relevant, and accurate health data and statistics.
- Global Health Data Exchange: The Global Health Data Exchange (GHDx) is a data catalog created and supported by the Institute for Health Metrics and Evaluation (IHME) at the University of Washington.
- NIAID Data Discovery Portal: A tool from the National Institute of Allergy and Infectious Diseases that allows researchers to find infectious and immune-mediated disease (IID) data across many repositories.