The Mahalanobis distance (DM; (Mahalanobis, 1936)) is a multivariate extension of the Euclidean distance, measuring the distance of each member of a set of multivariate measures to the mean of their multivariate distribution. For each subject, DM is calculated using the following formula:
where corresponds to the set of multivariate neuroimaging observations for each individual, μ is the mean of the multivariate distribution of neuroimaging measures, and S is the variance-covariance matrix between measures. For example, if m measures are collected from each individual, and μ correspond to a 1 × m vector, S corresponds to an m × m matrix, and DM from Eq. (1) is a scalar. In this way, DM accounts for the variance of individual observations as well as the covariance between the set of observations, homologous to the Euclidean distance in univariate analysis. In the present study, we estimate DM for individuals with ASD from their corresponding DTI brain measures, using the TDC as the population reference. In this case, DM corresponds to how close an ASD individual's brain measures are to the multivariate mean of the TDC population, where larger DM represents increased distance from the center of the typically developing population.
In constructing DM from longitudinal measurements, it is important to account for neurodevelopmental processes (Dean et al., 2014b, Dean et al., 2014a, Lebel and Beaulieu, 2011, Lebel et al., 2012, Snook et al., 2005). Generalized additive mixed models (GAMM's) were fit to the regional developmental trajectories of the DTI parameters (FA, MD, AD, and RD) of the TDC group to characterize the observed age-related white matter changes and establish a normative growth trajectory for each brain region. Generalized additive mixed models were utilized to characterize the age-related changes as these models have been designed specifically for cohort-sequential longitudinal designs (Wood, 2006, Wood, 2012) and for their ability to account for repeated measurements from the same individual. Furthermore, since the growth model that describes white matter is unknown, the semi-parametric nature of these spline models provides flexibility in capturing subtle developmental changes compared to parametric growth models (Travers et al., 2015b). Longitudinal modeling analyses were performed using R version 3.2.1 (R Development Team, 2014), while accounting for the nuisance variables of head coil (due to upgrade discussed earlier) and total motion index.
Upon determining the best fit model of the TDC group, these models were used to predict FA, MD, AD and RD along the modeled TDC growth trajectory for every ASD participant at each time point and for each brain region. The difference between the participants' parameter values from these predicted values (i.e. the model residuals) were calculated, corresponding to the vertical distance between the participant parameter measurements and the TDC reference growth trajectory. DM for each time point was calculated from these residuals using eq. (1), where corresponds to the difference between observed measurements () and modeled values (μ), and S is the variance-covariance matrix of the modeled residuals from the TDC group.
DM was similarly calculated for each TDC individual. However, to avoid including an individual's measurements in the model fitting when establishing the reference growth trajectory, a leave-one-out approach was used when modeling regional FA, MD, AD and RD developmental trajectories. For a given TDC subject, FA, MD, AD, and RD longitudinal measurements were removed prior to modeling and participant-specific residuals and DM was calculated as before. This process was repeated for each TDC participant.
After calculating DM for each longitudinal time point, an average across these time points was computed for each participant, providing a single, representative DM value for each individual. Supplementary Fig. 1 displays a representative schematic illustrating the process of calculating DM. Distributions of Mahalanobis distances were generated for both the ASD and TDC groups and the Bhattacharyya coefficient (Mo et al., 2015) was computed to assess the degree of overlap between the group distributions, where smaller Bhattacharyya coefficients correspond to a lesser degree of overlap. DM was additionally calculated by considering regional DTI parameters (FA, MD, AD, and RD) separately.
Do you have any questions about this protocol?
Post your question to gather feedback from the community. We will also invite the authors of this article to respond.
Tips for asking effective questions
+ Description
Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.