Publications

2024

Mehrtash A, Ziegler E, Idris T, Somarouthu B, Urban T, LaCasce AS, Jacene H, Van Den Abbeele AD, Pieper S, Harris G, Kikinis R, Kapur T. Evaluation of mediastinal lymph node segmentation of heterogeneous CT data with full and weak supervision. Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society. 2024;111:102312.

Accurate lymph node size estimation is critical for staging cancer patients, initial therapeutic management, and assessing response to therapy. Current standard practice for quantifying lymph node size is based on a variety of criteria that use uni-directional or bi-directional measurements. Segmentation in 3D can provide more accurate evaluations of the lymph node size. Fully convolutional neural networks (FCNs) have achieved state-of-the-art results in segmentation for numerous medical imaging applications, including lymph node segmentation. Adoption of deep learning segmentation models in clinical trials often faces numerous challenges. These include lack of pixel-level ground truth annotations for training, generalizability of the models on unseen test domains due to the heterogeneity of test cases and variation of imaging parameters. In this paper, we studied and evaluated the performance of lymph node segmentation models on a dataset that was completely independent of the one used to create the models. We analyzed the generalizability of the models in the face of a heterogeneous dataset and assessed the potential effects of different disease conditions and imaging parameters. Furthermore, we systematically compared fully-supervised and weakly-supervised methods in this context. We evaluated the proposed methods using an independent dataset comprising 806 mediastinal lymph nodes from 540 unique patients. The results show that performance achieved on the independent test set is comparable to that on the training set. Furthermore, neither the underlying disease nor the heterogeneous imaging parameters impacted the performance of the models. Finally, the results indicate that our weakly-supervised method attains 90%- 91% of the performance achieved by the fully supervised training.

Elaldi A, Gerig G, Dey N. E(3) × SO(3)-Equivariant Networks for Spherical Deconvolution in Diffusion MRI. Proceedings of machine learning research. 2024;227:301–319.

We present Roto-Translation Equivariant Spherical Deconvolution (RT-ESD), an E(3)×SO(3) equivariant framework for sparse deconvolution of volumes where each voxel contains a spherical signal. Such 6D data naturally arises in diffusion MRI (dMRI), a medical imaging modality widely used to measure microstructure and structural connectivity. As each dMRI voxel is typically a mixture of various overlapping structures, there is a need for blind deconvolution to recover crossing anatomical structures such as white matter tracts. Existing dMRI work takes either an iterative or deep learning approach to sparse spherical deconvolution, yet it typically does not account for relationships between neighboring measurements. This work constructs equivariant deep learning layers which respect to symmetries of spatial rotations, reflections, and translations, alongside the symmetries of voxelwise spherical rotations. As a result, RT-ESD improves on previous work across several tasks including fiber recovery on the DiSCo dataset, deconvolution-derived partial volume estimation on real-world in vivo human brain dMRI, and improved downstream reconstruction of fiber tractograms on the Tractometer dataset. Our implementation is available at https://github.com/AxelElaldi/e3so3_conv.

Mehrtash A, Ziegler E, Idris T, Somarouthu B, Urban T, LaCasce AS, Jacene H, Van Den Abbeele AD, Pieper S, Harris G, Kikinis R, Kapur T. Evaluation of mediastinal lymph node segmentation of heterogeneous CT data with full and weak supervision. Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society. 2024;111:102312.

Accurate lymph node size estimation is critical for staging cancer patients, initial therapeutic management, and assessing response to therapy. Current standard practice for quantifying lymph node size is based on a variety of criteria that use uni-directional or bi-directional measurements. Segmentation in 3D can provide more accurate evaluations of the lymph node size. Fully convolutional neural networks (FCNs) have achieved state-of-the-art results in segmentation for numerous medical imaging applications, including lymph node segmentation. Adoption of deep learning segmentation models in clinical trials often faces numerous challenges. These include lack of pixel-level ground truth annotations for training, generalizability of the models on unseen test domains due to the heterogeneity of test cases and variation of imaging parameters. In this paper, we studied and evaluated the performance of lymph node segmentation models on a dataset that was completely independent of the one used to create the models. We analyzed the generalizability of the models in the face of a heterogeneous dataset and assessed the potential effects of different disease conditions and imaging parameters. Furthermore, we systematically compared fully-supervised and weakly-supervised methods in this context. We evaluated the proposed methods using an independent dataset comprising 806 mediastinal lymph nodes from 540 unique patients. The results show that performance achieved on the independent test set is comparable to that on the training set. Furthermore, neither the underlying disease nor the heterogeneous imaging parameters impacted the performance of the models. Finally, the results indicate that our weakly-supervised method attains 90%- 91% of the performance achieved by the fully supervised training.

Wei R, Ganglberger W, Sun H, Hadar PN, Gollub RL, Pieper S, Billot B, Au R, Iglesias JE, Cash SS, Kim S, Shin C, Westover B, Thomas RJ. Linking brain structure, cognition, and sleep: insights from clinical data. Sleep. 2024;47(2).

STUDY OBJECTIVES: To use relatively noisy routinely collected clinical data (brain magnetic resonance imaging (MRI) data, clinical polysomnography (PSG) recordings, and neuropsychological testing), to investigate hypothesis-driven and data-driven relationships between brain physiology, structure, and cognition.

METHODS: We analyzed data from patients with clinical PSG, brain MRI, and neuropsychological evaluations. SynthSeg, a neural network-based tool, provided high-quality segmentations despite noise. A priori hypotheses explored associations between brain function (measured by PSG) and brain structure (measured by MRI). Associations with cognitive scores and dementia status were studied. An exploratory data-driven approach investigated age-structure-physiology-cognition links.

RESULTS: Six hundred and twenty-three patients with sleep PSG and brain MRI data were included in this study; 160 with cognitive evaluations. Three hundred and forty-two participants (55%) were female, and age interquartile range was 52 to 69 years. Thirty-six individuals were diagnosed with dementia, 71 with mild cognitive impairment, and 326 with major depression. One hundred and fifteen individuals were evaluated for insomnia and 138 participants had an apnea-hypopnea index equal to or greater than 15. Total PSG delta power correlated positively with frontal lobe/thalamic volumes, and sleep spindle density with thalamic volume. rapid eye movement (REM) duration and amygdala volume were positively associated with cognition. Patients with dementia showed significant differences in five brain structure volumes. REM duration, spindle, and slow-oscillation features had strong associations with cognition and brain structure volumes. PSG and MRI features in combination predicted chronological age (R2 = 0.67) and cognition (R2 = 0.40).

CONCLUSIONS: Routine clinical data holds extended value in understanding and even clinically using brain-sleep-cognition relationships.

2023

Chen Y, Zhang C, Xue T, Song Y, Makris N, Rathi Y, Cai W, Zhang F, O’Donnell LJ. Deep Fiber Clustering: Anatomically Informed Fiber Clustering with Self-supervised Deep Learning for Fast and Effective Tractography Parcellation. NeuroImage. 2023;273:120086.

White matter fiber clustering is an important strategy for white matter parcellation, which enables quantitative analysis of brain connections in health and disease. In combination with expert neuroanatomical labeling, data-driven white matter fiber clustering is a powerful tool for creating atlases that can model white matter anatomy across individuals. While widely used fiber clustering approaches have shown good performance using classical unsupervised machine learning techniques, recent advances in deep learning reveal a promising direction toward fast and effective fiber clustering. In this work, we propose a novel deep learning framework for white matter fiber clustering, Deep Fiber Clustering (DFC), which solves the unsupervised clustering problem as a self-supervised learning task with a domain-specific pretext task to predict pairwise fiber distances. This process learns a high-dimensional embedding feature representation for each fiber, regardless of the order of fiber points reconstructed during tractography. We design a novel network architecture that represents input fibers as point clouds and allows the incorporation of additional sources of input information from gray matter parcellation. Thus, DFC makes use of combined information about white matter fiber geometry and gray matter anatomy to improve the anatomical coherence of fiber clusters. In addition, DFC conducts outlier removal naturally by rejecting fibers with low cluster assignment probability. We evaluate DFC on three independently acquired cohorts, including data from 220 individuals across genders, ages (young and elderly adults), and different health conditions (healthy control and multiple neuropsychiatric disorders). We compare DFC to several state-of-the-art white matter fiber clustering algorithms. Experimental results demonstrate superior performance of DFC in terms of cluster compactness, generalization ability, anatomical coherence, and computational efficiency.

Neumann PE, Halle MW, Kong J, Kikinis R. West Meets East: Taking a Stab at Acupuncture Point Names. Clin Anat. 2023;35:641–50.

Acupuncture point names written in Chinese Han characters often provide clinically useful information in both their literal and figurative meanings about location and therapeutic use. The World Health Organization (WHO) standard acupuncture nomenclature includes these names in Han characters in an unusual array that includes both "original" forms and, in parentheses, simplified forms. Construction of a multilingual table of acupuncture point names during development of a database revealed that the assumption that the "original" form in the WHO nomenclature was the traditional Chinese character was frequently false. The Han character forms in the pdf of the 2009 reprint of WHO Standard Acupuncture Point Locations were carefully compared with Han characters used in traditional and simplified Chinese, Japanese and Korean writing systems. This work utilized three online tools: UnicodePlus, Unihan Database Lookup, and Wiktionary. Only 48% of the "original" character forms were traditional Chinese characters. The Unicode number was correct in 99%, but in most cases the East Asian font used was not a traditional Chinese one. The issue about Han character forms was also found in all earlier versions of the WHO standard acupuncture nomenclature. Other detected problems included the use of wrong characters for an "original" character form in one name and for a simplified character form in another name. The WHO standard acupuncture nomenclature should be revised with a focus on accuracy in the usage of Han characters.

Taymourtash A, Schwartz E, Nenning KH, Sobotka D, Licandro R, Glatter S, Diogo MC, Golland P, Grant E, Prayer D, Kasprian G, Langs G. Fetal Development of Functional Thalamocortical and Cortico-cortical Connectivity. Cerebral Cortex. 2023;33(9):5613–5624.

Measuring and understanding functional fetal brain development in utero is critical for the study of the developmental foundations of our cognitive abilities, possible early detection of disorders, and their prevention. Thalamocortical connections are an intricate component of shaping the cortical layout, but so far, only ex-vivo studies provide evidence of how axons enter the sub-plate and cortex during this highly dynamic phase. Evidence for normal in-utero development of the functional thalamocortical connectome in humans is missing. Here, we modeled fetal functional thalamocortical connectome development using in-utero functional magnetic resonance imaging in fetuses observed from 19th to 40th weeks of gestation (GW). We observed a peak increase of thalamocortical functional connectivity strength between 29th and 31st GW, right before axons establish synapses in the cortex. The cortico-cortical connectivity increases in a similar time window, and exhibits significant functional laterality in temporal-superior, -medial, and -inferior areas. Homologous regions exhibit overall similar mirrored connectivity profiles, but this similarity decreases during gestation giving way to a more diverse cortical interconnectedness. Our results complement the understanding of structural development of the human connectome and may serve as the basis for the investigation of disease and deviations from a normal developmental trajectory of connectivity development.

Tregidgo HFJ, Soskic S, Althonayan J, Maffei C, Van Leemput K, Golland P, Insausti R, Lerma-Usabiaga G, Caballero-Gaudes C, Paz-Alonso PM, Yendiki A, Alexander DC, Bocchetta M, Rohrer JD, Iglesias JE. Accurate Bayesian Segmentation of Thalamic Nuclei Using Diffusion MRI and an Improved Histological Atlas. NeuroImage. 2023;274:120129.

The human thalamus is a highly connected brain structure, which is key for the control of numerous functions and is involved in several neurological disorders. Recently, neuroimaging studies have increasingly focused on the volume and connectivity of the specific nuclei comprising this structure, rather than looking at the thalamus as a whole. However, accurate identification of cytoarchitectonically designed histological nuclei on standard in vivo structural MRI is hampered by the lack of image contrast that can be used to distinguish nuclei from each other and from surrounding white matter tracts. While diffusion MRI may offer such contrast, it has lower resolution and lacks some boundaries visible in structural imaging. In this work, we present a Bayesian segmentation algorithm for the thalamus. This algorithm combines prior information from a probabilistic atlas with likelihood models for both structural and diffusion MRI, allowing segmentation of 25 thalamic labels per hemisphere informed by both modalities. We present an improved probabilistic atlas, incorporating thalamic nuclei identified from histology and 45 white matter tracts surrounding the thalamus identified in ultra-high gradient strength diffusion imaging. We present a family of likelihood models for diffusion tensor imaging, ensuring compatibility with the vast majority of neuroimaging datasets that include diffusion MRI data. The use of these diffusion likelihood models greatly improves identification of nuclear groups versus segmentation based solely on structural MRI. Dice comparison of 5 manually identifiable groups of nuclei to ground truth segmentations show improvements of up to 10 percentage points. Additionally, our chosen model shows a high degree of reliability, with median test-retest Dice scores above 0.85 for four out of five nuclei groups, whilst also offering improved detection of differential thalamic involvement in Alzheimer's disease (AUROC 81.98%). The probabilistic atlas and segmentation tool will be made publicly available as part of the neuroimaging package FreeSurfer.

Diffusion MRI tractography is an advanced imaging technique that enables in vivo mapping of the brain's white matter connections. White matter parcellation classifies tractography streamlines into clusters or anatomically meaningful tracts. It enables quantification and visualization of whole-brain tractography. Currently, most parcellation methods focus on the deep white matter (DWM), whereas fewer methods address the superficial white matter (SWM) due to its complexity. We propose a novel two-stage deep-learning-based framework, Superficial White Matter Analysis (SupWMA), that performs an efficient and consistent parcellation of 198 SWM clusters from whole-brain tractography. A point-cloud-based network is adapted to our SWM parcellation task, and supervised contrastive learning enables more discriminative representations between plausible streamlines and outliers for SWM. We train our model on a large-scale tractography dataset including streamline samples from labeled long- and medium-range (over 40 mm) SWM clusters and anatomically implausible streamline samples, and we perform testing on six independently acquired datasets of different ages and health conditions (including neonates and patients with space-occupying brain tumors). Compared to several state-of-the-art methods, SupWMA obtains highly consistent and accurate SWM parcellation results on all datasets, showing good generalization across the lifespan in health and disease. In addition, the computational speed of SupWMA is much faster than other methods.

Zheng J, Yang Q, Makris N, Huang K, Liang J, Ye C, Yu X, Tian M, Ma T, Mou T, Guo W, Kikinis R, Gao Y. Three-Dimensional Digital Reconstruction of the Cerebellar Cortex: Lobule Thickness, Surface Area Measurements, and Layer Architecture. Cerebellum. 2023;22(2):249–60.

The cerebellum is ontogenetically one of the first structures to develop in the central nervous system; nevertheless, it has been only recently reconsidered for its significant neurobiological, functional, and clinical relevance in humans. Thus, it has been a relatively under-studied compared to the cerebrum. Currently, non-invasive imaging modalities can barely reach the necessary resolution to unfold its entire, convoluted surface, while only histological analyses can reveal local information at the micrometer scale. Herein, we used the BigBrain dataset to generate area and point-wise thickness measurements for all layers of the cerebellar cortex and for each lobule in particular. We found that the overall surface area of the cerebellar granular layer (including Purkinje cells) was 1,732 cm2 and the molecular layer was 1,945 cm2. The average thickness of the granular layer is 0.88 mm (± 0.83) and that of the molecular layer is 0.32 mm (± 0.08). The cerebellum (both granular and molecular layers) is thicker at the depth of the sulci and thinner at the crowns of the gyri. Globally, the granular layer is thicker in the lateral-posterior-inferior region than the medial-superior regions. The characterization of individual layers in the cerebellum achieved herein represents a stepping-stone for investigations interrelating structural and functional connectivity with cerebellar architectonics using neuroimaging, which is a matter of considerable relevance in basic and clinical neuroscience. Furthermore, these data provide templates for the construction of cerebellar topographic maps and the precise localization of structural and functional alterations in diseases affecting the cerebellum.