Centre for Youth Mental Health - Research Publications

Permanent URI for this collection

Search Results

Now showing 1 - 3 of 3
  • Item
    No Preview Available
    Accommodating site variation in neuroimaging data using normative and hierarchical Bayesian models
    Bayer, JMM ; Dinga, R ; Kia, SM ; Kottaram, AR ; Wolfers, T ; Lv, J ; Zalesky, A ; Schmaal, L ; Marquand, A (ACADEMIC PRESS INC ELSEVIER SCIENCE, 2022-10-29)
    The potential of normative modeling to make individualized predictions from neuroimaging data has enabled inferences that go beyond the case-control approach. However, site effects are often confounded with variables of interest in a complex manner and can bias estimates of normative models, which has impeded the application of normative models to large multi-site neuroimaging data sets. In this study, we suggest accommodating for these site effects by including them as random effects in a hierarchical Bayesian model. We compared the performance of a linear and a non-linear hierarchical Bayesian model in modeling the effect of age on cortical thickness. We used data of 570 healthy individuals from the ABIDE (autism brain imaging data exchange) data set in our experiments. In addition, we used data from individuals with autism to test whether our models are able to retain clinically useful information while removing site effects. We compared the proposed single stage hierarchical Bayesian method to several harmonization techniques commonly used to deal with additive and multiplicative site effects using a two stage regression, including regressing out site and harmonizing for site with ComBat, both with and without explicitly preserving variance caused by age and sex as biological variation of interest, and with a non-linear version of ComBat. In addition, we made predictions from raw data, in which site has not been accommodated for. The proposed hierarchical Bayesian method showed the best predictive performance according to multiple metrics. Beyond that, the resulting z-scores showed little to no residual site effects, yet still retained clinically useful information. In contrast, performance was particularly poor for the regression model and the ComBat model in which age and sex were not explicitly modeled. In all two stage harmonization models, predictions were poorly scaled, suffering from a loss of more than 90% of the original variance. Our results show the value of hierarchical Bayesian regression methods for accommodating site variation in neuroimaging data, which provides an alternative to harmonization techniques. While the approach we propose may have broad utility, our approach is particularly well suited to normative modeling where the primary interest is in accurate modeling of inter-subject variation and statistical quantification of deviations from a reference model.
  • Item
    Thumbnail Image
    Site effects how-to and when: An overview of retrospective techniques to accommodate site effects in multi-site neuroimaging analyses
    Bayer, JMM ; Thompson, PM ; Ching, CRK ; Liu, M ; Chen, A ; Panzenhagen, AC ; Jahanshad, N ; Marquand, A ; Schmaal, L ; Samann, PG (FRONTIERS MEDIA SA, 2022-10-31)
    Site differences, or systematic differences in feature distributions across multiple data-acquisition sites, are a known source of heterogeneity that may adversely affect large-scale meta- and mega-analyses of independently collected neuroimaging data. They influence nearly all multi-site imaging modalities and biomarkers, and methods to compensate for them can improve reliability and generalizability in the analysis of genetics, omics, and clinical data. The origins of statistical site effects are complex and involve both technical differences (scanner vendor, head coil, acquisition parameters, imaging processing) and differences in sample characteristics (inclusion/exclusion criteria, sample size, ancestry) between sites. In an age of expanding international consortium research, there is a growing need to disentangle technical site effects from sample characteristics of interest. Numerous statistical and machine learning methods have been developed to control for, model, or attenuate site effects - yet to date, no comprehensive review has discussed the benefits and drawbacks of each for different use cases. Here, we provide an overview of the different existing statistical and machine learning methods developed to remove unwanted site effects from independently collected neuroimaging samples. We focus on linear mixed effect models, the ComBat technique and its variants, adjustments based on image quality metrics, normative modeling, and deep learning approaches such as generative adversarial networks. For each method, we outline the statistical foundation and summarize strengths and weaknesses, including their assumptions and conditions of use. We provide information on software availability and comment on the ease of use and the applicability of these methods to different types of data. We discuss validation and comparative reports, mention caveats and provide guidance on when to use each method, depending on context and specific research questions.
  • Item
    Thumbnail Image
    Charting brain growth and aging at high spatial precision
    Rutherford, S ; Fraza, C ; Dinga, R ; Kia, SM ; Wolfers, T ; Zabihi, M ; Berthet, P ; Worker, A ; Verdi, S ; Andrews, D ; Han, LK ; Bayer, JM ; Dazzan, P ; McGuire, P ; Mocking, RT ; Schene, A ; Sripada, C ; Tso, IF ; Duval, ER ; Chang, S-E ; Penninx, BW ; Heitzeg, MM ; Burt, SA ; Hyde, LW ; Amaral, D ; Nordahl, CW ; Andreasssen, OA ; Westlye, LT ; Zahn, R ; Ruhe, HG ; Beckmann, C ; Marquand, AF (eLIFE SCIENCES PUBL LTD, 2022-02-01)
    Defining reference models for population variation, and the ability to study individual deviations is essential for understanding inter-individual variability and its relation to the onset and progression of medical conditions. In this work, we assembled a reference cohort of neuroimaging data from 82 sites (N=58,836; ages 2-100) and used normative modeling to characterize lifespan trajectories of cortical thickness and subcortical volume. Models are validated against a manually quality checked subset (N=24,354) and we provide an interface for transferring to new data sources. We showcase the clinical value by applying the models to a transdiagnostic psychiatric sample (N=1985), showing they can be used to quantify variability underlying multiple disorders whilst also refining case-control inferences. These models will be augmented with additional samples and imaging modalities as they become available. This provides a common reference platform to bind results from different studies and ultimately paves the way for personalized clinical decision-making.