A rice (Oryza sativa) data set [15], comprising genotype records and flowering time in Arkansas for 413 individuals, was used to demonstrate the problem of spurious associations. After removing records with missing trait data and SNPs with a minor allele frequency below 0.05, 374 individuals and 33,701 SNPs remained in the analysis. The data are publicly available from the Rice Diversity Panel (http://www.ricediversity.org). Window-based inference of association with Bayesian regression models used non-overlapping genomic windows of 1 Mb, which divided the genome into 378 windows. The value of T was set to , where N = 378. The lower quartile, median and upper quartile of the number of SNPs per window were 60, 82 and 106, respectively. Seven windows containing more than 175 SNPs (i.e., upper quartile + 1.5 × (upper quartile − lower quartile) = 106 + 1.5 × 46) were identified as dense.
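The filtering, windowing and dense-window steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the function names are hypothetical, genotypes are assumed to be coded as alternate-allele counts (0, 1 or 2, with `None` for missing calls), and SNP positions are assumed to be 1-based base-pair coordinates. The threshold of 175 corresponds to the upper quartile plus 1.5 times the interquartile range, using the quartiles reported in the text (60 and 106).

```python
from collections import Counter

WINDOW_SIZE_BP = 1_000_000  # 1 Mb windows, as stated in the text


def minor_allele_frequency(genotypes):
    """MAF for one SNP; genotypes are alternate-allele counts (assumed 0/1/2 coding)."""
    calls = [g for g in genotypes if g is not None]  # skip missing calls
    p = sum(calls) / (2 * len(calls))                # alternate-allele frequency
    return min(p, 1 - p)


def window_index(position_bp, window_size=WINDOW_SIZE_BP):
    """0-based index of the non-overlapping window holding a 1-based position."""
    return (position_bp - 1) // window_size


def snps_per_window(snps, window_size=WINDOW_SIZE_BP):
    """Count SNPs per (chromosome, window index); snps is an iterable of (chrom, pos)."""
    return Counter((chrom, window_index(pos, window_size)) for chrom, pos in snps)


def dense_threshold(lower_quartile, upper_quartile, k=1.5):
    """Upper fence: upper quartile + k * interquartile range."""
    return upper_quartile + k * (upper_quartile - lower_quartile)


def dense_windows(counts, threshold):
    """Windows whose SNP count strictly exceeds the threshold."""
    return sorted(w for w, n in counts.items() if n > threshold)


# Quartiles reported in the text: lower = 60, upper = 106.
threshold = dense_threshold(60, 106)
print(threshold)  # 175.0
```

Applied to the real marker map, `snps_per_window` would be called on the SNPs that pass the 0.05 MAF filter, and `dense_windows(counts, threshold)` would return the windows flagged as dense.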