The Brain Imaging and Neurophysiology Dataset of large-scale multimodal neural data.
Authors
Affiliations (8)
Affiliations (8)
- Department of Neurology, Beth Israel Deaconess Medical Center (BIDMC), Boston, MA, USA. [email protected].
- Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
- Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Charlestown, MA, USA.
- Stanford University, Palo Alto, USA.
- Computer Science & Artificial Intelligence Lab, EECS, MIT, Cambridge, MA, USA.
- Department of Neurology, Beth Israel Deaconess Medical Center (BIDMC), Boston, MA, USA.
- Amazon Web Services, Seattle, USA.
- Yale University, New Haven, CT, USA.
Abstract
The Brain Imaging and Neurophysiology Dataset (BIND) represents one of the largest multi-institutional, multimodal, clinical neuroimaging repositories, comprising 1.8 million brain scans from 38,942 patients, linked to full-text reports and neurophysiological recordings. This comprehensive dataset addresses critical limitations in neuroimaging research by providing a rich and diverse set of large-scale multimodal data. BIND integrates de-identified data from Massachusetts General Hospital, Brigham and Women's Hospital, and Stanford University, including 1,723,699 MRI scans (1.5, 3 and 7 Tesla), 54,137 CT scans, 5,093 PET scans, and 526 SPECT scans, converted to standardized NIfTI format following BIDS organization. The dataset spans the full age spectrum and encompasses diverse neurological conditions alongside healthy subjects. We deployed Large Language Models to extract structured clinical metadata from 84,960 reports to extract standardized clinical information. All imaging data are linked to previously published EEG and polysomnography recordings, facilitating future multimodal analyses. BIND is freely accessible for academic research through the Brain Data Science Platform (https://bdsp.io/). This resource facilitates large-scale neuroimaging studies, machine learning applications, and multimodal brain research to accelerate discoveries in clinical neuroscience.