Genome-wide association study (GWAS)

Anil Kumar Nalini Chandran; Jaspreet Sandhu; Larissa Irvin; Puneet Paul; Balpreet K. Dhatt; Waseem Hussain; Tian Gao; Paul Staswick; Hongfeng Yu; Gota Morota; Harkamal Walia

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Genome-wide association study (GWAS)

AC Anil Kumar Nalini Chandran

JS Jaspreet Sandhu

LI Larissa Irvin

PP Puneet Paul

BD Balpreet K. Dhatt

WH Waseem Hussain

TG Tian Gao

PS Paul Staswick

HY Hongfeng Yu

GM Gota Morota

HW Harkamal Walia

This method is extracted from research article: Front Plant Sci, Oct 2022

Rice Chalky Grain 5 regulates natural variation for grain quality under heat stress

DOI: 10.3389/fpls.2022.1026472

Request a Protocol

Ask a question

Favorite

A 700K high-density rice array marker dataset was used to run the GWAS (McCouch et al., 2016). In total, 411,066 SNPs were retained after filtering for missing data (< 20%) and minor allele frequency (< 5%). The population structure of the studied accessions was assessed using principal component analysis (PCA) on the constructed genomic relationship matrix (Zheng et al., 2012) ( Figure S1 ). GWAS was conducted in rrBLUP R package (Endelman, 2011) using the linear mixed model described earlier (Dhatt et al., 2021). SNP markers were declared significant using the P-value threshold of –log10(P) > 6.5, based on method of Li and Ji (2005) using effective number of markers (Li and Ji, 2005; Hussain et al., 2020). Manhattan plot and Q-Q plot were created using R package qqman (Turner, 2018). Phenotypic variance (R² ) explained by each SNP was estimated using the mixed.solve () function from the rrBLUP R package (Endelman, 2011) with SNP having variance equal to Kσ²u, where K is the design matrix of SNP and u is the random effect of the SNP. Additionally, R² explained by the locus having all the significant SNPs was estimated using BGLR R package (Pérez and De Los Campos, 2014). For this, all the SNPs were fitted jointly accounting the LD between the markers via a genomic restricted maximum likelihood method (Dhatt et al., 2021).

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol