The central task in our problem is to select a small set of voxels that can effectively classify different conditions. Thus we would like to fit a sparse model such that at most locations the regression coefficient β1(sj) is zero or nearly zero. To this end, we modified the covariance matrix in the prior distribution in Eq. (3) as
where Dα = diag[ã1τ1, …, ãnτn], ãj = 1 if αj = 0 and ãj = cj if αj = 1, α = (α1, …, αn) is an indicator vector for variable selection. We set τj (> 0) small and cj (always > 1) large to make those β1(sj) with αj = 0 clustered around 0 with variance τj, whereas those β1(sj) with αj = 1 more dispersed with variance cjτj. Under this setting, the prior of β1(sj) corresponds to a mixture of two normal distributions, and is more effective to differentiate the two groups of β1(sj)’s [33]. In addition, we assume that α1, …, αn are i.i.d. observations from the Bernoulli distribution P(αj = 1) = 1 − P(αj = 0) = pj = 0.5. The prior distribution for β0 is specified as π(β0) = N(0, c3), where c3 = 105. The prior of γ depends on the model of the correlation function H(γ). In the Appendix, we consider an example with a Matérn correlation function with a decay parameter γ >0, for which we use π(γ) = N(a2, b2) truncated to be positive. We will discuss the choice of a2 and b2 using the empirical semivariogram in the Results Section. All the priors are assumed to be independent. A graphical representation of our BPSV model that depicts the dependency relationship between the variables is presented in Figure 2.
Graphical representation of our proposed model with m observations and n predictors.
Do you have any questions about this protocol?
Post your question to gather feedback from the community. We will also invite the authors of this article to respond.