The physical space of the MSK Library is permanently closed to visitors as of Friday, May 17, 2024. Please visit this guide for more information.
AI projects need data to train, test, and validate the systems being developed. Below are some sources for existing datasets, along with repositories where datasets can be saved and shared. (Note that some of the items listed under datasets may also serve as specialized data repositories.)
A resource's listing is not an endorsement of the quality of the data available. Always carefully appraise the information you are using to develop your AI tools.
Need more information on selecting a repository/the NIH data management policy? Visit our NIH Data Management and Sharing Policy guide.