Document Type

Journal Article

Publication Title

ACM Computing Surveys

Volume

56

Issue

5

Publisher

Association for Computing Machinery

School

School of Science

RAS ID

64779

Funders

National Natural Science Foundation of China / Zhejiang University / National Key R&D Program of China

Comments

Li, J., Zhu, G., Hua, C., Feng, M., Bennamoun, B., Li, P., . . . Bennamoun, M. (2023). A systematic collection of medical image datasets for deep learning. ACM Computing Surveys, 56(5), article 116. https://doi.org/10.1145/3615862

Abstract

The astounding success made by artificial intelligence in healthcare and other fields proves that it can achieve human-like performance. However, success always comes with challenges. Deep learning algorithms are data dependent and require large datasets for training. Many junior researchers face a lack of data for a variety of reasons. Medical image acquisition, annotation, and analysis are costly, and their usage is constrained by ethical restrictions. They also require several other resources, such as professional equipment and expertise. That makes it difficult for novice and non-medical researchers to have access to medical data. Thus, as comprehensively as possible, this article provides a collection of medical image datasets with their associated challenges for deep learning research. We have collected the information of approximately 300 datasets and challenges mainly reported between 2007 and 2020 and categorized them into four categories: head and neck, chest and abdomen, pathology and blood, and others. The purpose of our work is to provide a list, as up-to-date and complete as possible, that can be used as a reference to easily find the datasets for medical image analysis and the information related to these datasets.

DOI

10.1145/3615862

Creative Commons License

Creative Commons Attribution-Share Alike 4.0 License
This work is licensed under a Creative Commons Attribution-Share Alike 4.0 License.

Share

 
COinS