Simulation framework

Christoph D. Rau; Natalia M. Gonzales; Joshua S. Bloom; Danny Park; Julien Ayroles; Abraham A. Palmer; Aldons J. Lusis; Noah Zaitlen

Concise Method

This method is extracted from research article: PLoS Genet, Oct 2020

Modeling epistasis in mice and yeast using the proportion of two or more distinct genetic backgrounds: Evidence for “polygenic epistasis”

DOI: 10.1371/journal.pgen.1009165

We created sets of simulated phenotypes based on the genotypes of the HMDP RI panel, an admixed population in which the B6 strain contributes, on average, 50% of each strain’s DNA. For each simulated phenotype, we drew a SNP (75% > MAF > 25%) at random from the HMDP genotypes and created a phenotype based on β, the genetic effect size; ϕ, the effect size of the interaction between global ancestry (θ) and the chosen SNP; and a multivariate normal (mvn) term derived from three variance terms: $σ_{g}^{2}$, the proportion of variance attributable to genetic effects; $σ_{θ}^{2}$, the proportion of variance attributable to Gxθ effects; and $σ_{ε}^{2}$, the residual proportion of variance attributable to all combined sources of error and variation not considered in this study. Phenotypes were generated both with and without the Gxθ variance term to ascertain whether a second GRM ($K^{A}$), which would correct for relatedness in ancestry, needed to be incorporated into the algorithm.
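The generative step described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' code: the additive form `beta * x + phi * (x * theta) + mvn(...)`, the 0/1 genotype coding (1 = non-B6 allele), the reading of the MAF filter as an allele-frequency window of 25% to 75%, and the toy matrices `K` and `K_A` are all assumptions introduced here, since the text does not give the model equation or the construction of the GRMs.

```python
import numpy as np

def draw_common_snp(genotypes, rng, low=0.25, high=0.75):
    """Return one randomly chosen SNP column whose allele frequency lies in (low, high)."""
    freq = genotypes.mean(axis=0)  # frequency of the allele coded 1 (assumed non-B6)
    eligible = np.flatnonzero((freq > low) & (freq < high))
    return genotypes[:, rng.choice(eligible)]

def simulate_phenotype(x, theta, K, K_A, beta, phi, var_g, var_theta, var_e, rng):
    """Assumed model: SNP effect + SNP-by-ancestry effect + one multivariate normal draw."""
    n = x.shape[0]
    # Covariance built from the three variance terms; K and K_A are the two GRMs.
    cov = var_g * K + var_theta * K_A + var_e * np.eye(n)
    background = rng.multivariate_normal(np.zeros(n), cov)
    return beta * x + phi * (x * theta) + background

# Toy stand-in data (hypothetical; not the HMDP genotypes).
rng = np.random.default_rng(0)
genotypes = rng.integers(0, 2, size=(50, 200))   # 50 strains x 200 SNPs
theta = genotypes.mean(axis=1)                   # per-strain proportion of non-B6 alleles
Z = genotypes - genotypes.mean(axis=0)
K = Z @ Z.T / genotypes.shape[1]                 # simple genotype-based GRM
t = theta - theta.mean()
K_A = np.outer(t, t) / t.var()                   # toy ancestry GRM (illustrative only)

x = draw_common_snp(genotypes, rng)
y = simulate_phenotype(x, theta, K, K_A, beta=0.5, phi=0.0,
                       var_g=0.4, var_theta=0.2, var_e=0.4, rng=rng)
```

Setting `var_theta=0.0` gives the variant without the Gxθ variance term, so the same function covers both cases described in the text.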

We simulated four distinct phenotypes for our analysis, two in each of the following classes (one with and one without the Gxθ variance term):

Phenotypes generated by including a SNP effect

Phenotypes generated by including a Gxθ effect

In each phenotype, $σ_{g}^{2}$ was set to 0.4. When incorporated, $σ_{θ}^{2}$ was set to 0.2, and $σ_{ε}^{2}$ was set to the remainder of the variance (0.6 without the Gxθ variance term, 0.4 with it). The power of our model and the independence of our β and β_Gxθ terms (β_Gxθ being the effect size of the Gxθ interaction) were assessed by varying either β or β_Gxθ from 0 to 1 (200 values spaced 0.005 apart), with 1,000 simulated phenotypes at each step (200,000 total simulations per phenotype).
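The parameter settings above reduce to a small amount of arithmetic, sketched here in Python. The names are hypothetical. The grid is assumed to start at 0 and end at 0.995. The text says only "from 0 to 1" with 200 values 0.005 apart, so a grid running from 0.005 to 1 would fit equally well.

```python
import numpy as np

VAR_G = 0.4          # genetic variance proportion, fixed in every phenotype
VAR_THETA = 0.2      # Gxtheta variance proportion, used only when incorporated
N_STEPS = 200        # number of effect-size values
STEP = 0.005         # spacing between consecutive effect sizes
N_REPLICATES = 1000  # simulated phenotypes per effect size

def variance_components(include_gxtheta):
    """Return (var_g, var_theta, var_e) with the residual taking the remainder."""
    var_theta = VAR_THETA if include_gxtheta else 0.0
    var_e = 1.0 - VAR_G - var_theta
    return VAR_G, var_theta, var_e

# Assumed grid: 0, 0.005, ..., 0.995 (200 values).
effect_grid = STEP * np.arange(N_STEPS)

# Simulations per phenotype: one batch of replicates at every grid point.
total_simulations = N_STEPS * N_REPLICATES
```

In a full run, either β or β_Gxθ would be swept over `effect_grid` while the other is held fixed, with `N_REPLICATES` phenotypes drawn at each value.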

Using the same panel and set of SNPs described above, we drew one SNP at random to be our test SNP and one to ten additional SNPs to act as simulated epistatically interacting SNPs (10,000 simulations per number of interacting SNPs, 100,000 simulations in total). For each simulation, we created a composite SNP in which only strains carrying the minor (non-B6) allele at every one of the test and interacting SNPs carried that allele in the composite SNP. We used this composite SNP to generate a phenotype as described above, with the χ² test statistic for the 2–11 interacting SNPs set to a large value of 20 to ensure a consistent and observable effect that could be recovered using either regular GWAS with pyLMM or our approach.
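The composite SNP amounts to a logical AND across the test SNP and its interacting SNPs. The Python sketch below illustrates that construction. It assumes a 0/1 coding with 1 for the minor (non-B6) allele and assumes the SNPs are drawn without replacement; the function names are hypothetical. Converting the χ² statistic of 20 into an effect size is not shown, because the text does not describe that step.

```python
import numpy as np

def draw_snp_set(n_snps, n_interacting, rng):
    """Pick one test SNP plus n_interacting partner SNPs (assumed without replacement)."""
    idx = rng.choice(n_snps, size=1 + n_interacting, replace=False)
    return int(idx[0]), [int(i) for i in idx[1:]]

def composite_snp(genotypes, test_idx, interacting_idx):
    """A strain gets allele 1 only if it carries allele 1 at every selected SNP."""
    cols = [test_idx, *interacting_idx]
    return genotypes[:, cols].all(axis=1).astype(int)

# Small worked example: 4 strains x 3 SNPs, 1 = minor (non-B6) allele.
G = np.array([[1, 1, 1],
              [1, 0, 1],
              [0, 1, 1],
              [1, 1, 0]])
both_partners = composite_snp(G, 0, [1, 2])   # only the first strain carries all three
one_partner = composite_snp(G, 0, [2])        # first two strains carry SNPs 0 and 2
```

Because every added SNP can only remove carriers, the composite allele becomes rarer as the number of interacting SNPs grows from one to ten.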

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
