AI projects need data to train, test, and validate the systems being developed. Below are some sources for existing datasets, along with repositories where datasets can be saved and shared. (Note that some of the items listed under datasets may also serve as specialized data repositories.)
Listing a resource here is not an endorsement of the quality of its data. Always carefully appraise the data you use to develop your AI tools.
Need more information on selecting a repository or on the NIH Data Management and Sharing Policy? Visit our guide to the policy.