Skip to Main Content

Artificial Intelligence

The intersection of AI, healthcare, cancer research, and publishing

The Importance of Data

AI projects need data to train, test, and validate the systems being developed. Below are some sources for existing datasets, along with repositories where datasets can be saved and shared. (Note that some of the items listed under datasets may also serve as specialized data repositories.)

A resource's listing is not an endorsement of the quality of the data available. Always carefully appraise the information you are using to develop your AI tools. 

Datasets

Data Repositories

Need more information on selecting a repository/the NIH data management policy? Visit our NIH Data Management and Sharing Policy guide.