NIH Clinical Center releases dataset of 32,000 CT images
Friday, July 20, 2018
The National Institutes of Health’s Clinical Center has made a large-scale dataset of CT images publicly available to help the scientific community improve detection accuracy of lesions. While most publicly available medical image datasets have less than a thousand lesions, this dataset, named DeepLesion, has over 32,000 annotated lesions identified on CT images.
The images, which have been thoroughly anonymized, represent 4,400 unique patients, who are partners in research at the NIH.
Once a patient steps out of a CT scanner, the corresponding images are sent to a radiologist to interpret. Radiologists at the Clinical Center then measure and mark clinically meaningful findings with an electronic bookmark tool. Similar to a physical bookmark, radiologists save their place and mark significant findings to be able to come back to at a later time. These bookmarks are complex – they provide arrows, lines, diameters, and text that can tell the exact location and size of a lesion so experts can identify growth or new disease.