Skip to Main Content

Artificial Intelligence

The intersection of AI, healthcare, cancer research, and publishing

The Importance of Data

AI projects need data to train, test, and validate the systems being developed. Below are some sources for existing datasets, along with repositories where datasets can be saved and shared. (Note that some of the items listed under datasets may also serve as specialized data repositories.)

A resource's listing is not an endorsement of the quality of the data available. Always carefully appraise the information you are using to develop your AI tools. 

Looking for information on the NIH Data Management and Sharing policy? Visit our guide.

Datasets

Data Repositories

Need more information on selecting a repository or on the NIH data management policy? Visit our NIH Data Management and Sharing Policy guide.