Notice: Function Pods Templates was called incorrectly. Pod Template PHP code is no longer actively supported and will be completely removed in Pods 3.3 Please see Debugging in WordPress for more information. (This message was added in version 3.0.) in /var/www/html/wp-includes/functions.php on line 6085
Deprecated: Pod Template PHP code has been deprecated, please use WP Templates instead of embedding PHP. has been deprecated since Pods version 2.3 with no alternative available. in /var/www/html/wp-content/plugins/pods/includes/general.php on line 1037
Pretreat-MetsToBrain-Masks | A Large Open Access Dataset of Brain Metastasis 3D Segmentations on MRI with Clinical and Imaging Feature Information
DOI: 10.7937/6be1-r748 | Data Citation Required | Image Collection
Location | Species | Subjects | Data Types | Cancer Types | Size | Status | Updated | |
---|---|---|---|---|---|---|---|---|
Brain | Human | 200 | MR, Segmentation, Demographic, Diagnosis, Measurement, Follow-Up | Breast Cancer, Small Cell Lung Cancer, Non-small Cell Lung Cancer, Melanoma | Clinical, Image Analyses, Software/Source Code | Public, Complete | 2023/12/19 |
Summary
Patient images were collected from three sources: the Yale New Haven Health database (2013-2021), the Yale tumor board registry (2021), and the Yale gamma knife registry (2017-2021). Inclusion criteria included a pathologically proven diagnosis of brain metastasis and availability of a pretreatment scan with standard MRI sequences (T1 w, T1 post-gadolinium, T2w, and FLAIR). Exclusion criteria included lack of pretreatment scan or one of the standard MR sequences and significant motion artifact in any of the standard sequences. Images were acquired using a 1.5T or 3T scanner. Most patients had T1 post-gadolinium sequences acquired using the MPRAGE protocol with a small subset having only spin echoes. Segmentations of core and necrotic tumor components were performed on T1 post-gadolinium sequences while whole tumor was segmented on FLAIR sequences. All segmentations were performed manually on DICOM images in research PACS using a volumetric tool after transfer from clinical PACS. All segmentations were approved by two neuroradiologists. The four standard sequences for each patient were exported from research PACS into NifTI format, co-registered to the SRl24 anatomical template, resampled to a uniform isotropic resolution (1 mm3), and skull stripped. Core, necrosis, and whole segmentation masks were exported individually from research PACS into NifTI format, combined into one mask while retaining their positions in native space, and registered to the SRl24 anatomical template before resampling to a uniform isotropic resolution. The sequence and segmentation NifTI files were manually checked and corrected by a neuroradiologist on the ITK-SNAP software. Demographic data, including sex, ethnicity, age at diagnosis, smoking pack-year history, and presence of extranodal metastasis were obtained using the electronic medical record (EMR). Survival was calculated by subtracting the date of diagnosis from the date of death or from the date of last EMR note for censored patients. Qualitative imaging features, including infratentorial involvement and intratumoral susceptibility in at least one lesion on SWI sequence, were obtained from visual assessment of images. Quantitative imaging features were extracted from the NifTI segmentation mask for each patient and included total enhancing tumor volume, total necrotic tumor volume, total peritumoral edema volume, ratio of necrotic to enhancing volume, ratio of peritumoral edema to enhancing volume, number of enhancing lesions, number of necrotic lesions, and number of lesions with peritumoral edema. Finally, origin of metastasis was also obtained from the EMR from previous oncological and/or pathological reports. Dates of birth, diagnosis, death, and last note were all anonymized preserving duration between events. Clinical and imaging feature information are stored in an Excel file. Currently, many published brain metastasis segmentation algorithms have only been trained and validated on single institution datasets, leading to poor model generalizability. The availability of public datasets is important for algorithm generalizability and implementation into clinical practice. Our dataset is unique in its inclusion of many sub centimeter brain metastases, for which there are yet no robust algorithms. In addition, the dataset has many lesions with necrotic segmentations, which are historically not provided with dataset segmentations. We also include survival outcomes manually obtained from the EMR, allowing for future correlation of our imaging data with clinical outcomes. In addition, the heterogeneity of sources from which we obtained our patient data will allow for algorithms to be trained on a real-world hospital dataset that is not highly curated, thus enhancing model applicability to clinical practice. Cohort Selection:
Inclusion/exclusion criteria:
Imaging acquisition/parameters:
Segmentation protocol:
Image preprocessing:
Clinical and imaging data collection:
Data Access
Version 1: Updated 2023/12/19
Title | Data Type | Format | Access Points | Subjects | License | |||
---|---|---|---|---|---|---|---|---|
Images and Segmentations | MR, Segmentation | NIFTI | Download requires IBM-Aspera-Connect plugin |
200 | 1,000 | CC BY 4.0 | ||
Clinical data and data dictionary | Demographic, Diagnosis, Measurement, Follow-Up | XLSX | 200 | CC BY 4.0 |
Additional Resources for this Dataset
The following external resources have been made available by the data submitters. These are not hosted or supported by TCIA, but may be useful to researchers utilizing this collection.
- CaPTk software was used to apply the BraTS Pre-processing Pipeline for image registration and brain extraction on these data https://cbica.github.io/CaPTk/preprocessing_brats.html
Citations & Data Usage Policy
Data Citation Required: Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution must include the following citation, including the Digital Object Identifier:
Data Citation |
|
Ramakrishnan, D., Jekel, L., Chadha, S., Janas, A., Moy, H., Maleki, N., Sala, M., Kaur, M., Petersen, G. C., Merkaj, S., von Reppert, M., Baid, U., Bakas, S., Kirsch, C., Davis, M., Bousabarah, K., Holler, W., Lin, M., Westerhoff, M., Aneja, S., Memon, F., Aboian, M. (2023). A Large Open Access Dataset of Brain Metastasis 3D Segmentations on MRI with Clinical and Imaging Feature Information (Version 1) [dataset]. The Cancer Imaging Archive. DOI: https://doi.org/10.7937/6be1-r748 |
Acknowledgements
We would like to acknowledge the individuals and institutions that have provided data for this collection:
Yale Department of Radiology and Biomedical Imaging
- The SAIC Metadata Content Development Team identified existing or created new caDSR Common Data elements (CDEs) to describe the harmonized components of this dataset. The Metadata Content Development Team is supported by CBIIT under Task Order 140D0421F0008 from NCI.
Related Publications
Publications by the Dataset Authors
The authors recommended this paper as the best source of additional information about this dataset:
The following publications are recommended by the data submitters that may be useful to researchers utilizing this collection:
- Ramakrishnan, D., Jekel, L., Chadha, S. et al. A large open access dataset of brain metastasis 3D segmentations on MRI with clinical and imaging information. Sci Data 11, 254 (2024). https://doi.org/10.1038/s41597-024-03021-9
- Ramakrishnan, D., Jekel, L., Chadha, S., Janas, A., Moy, H., Maleki, N., Sala, M., Kaur, M., Petersen, G. C., Merkaj, S., von Reppert, M., Baid, U., Bakas, S., Kirsch, C., Davis, M., Bousabarah, K., Holler, W., Lin, M., Westerhoff, M., … Aboian, M. S. (2023). A Large Open Access Dataset of Brain Metastasis 3D Segmentations with Clinical and Imaging Feature Information (Version 2). arXiv. https://doi.org/10.48550/ARXIV.2309.05053
Research Community Publications
TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you’d like to add please contact TCIA’s Helpdesk.