Calculate main effect kinship matrix by partitioning high-dimensional markers into blocks

Wenchao Zhang; Xinbin Dai; Shizhong Xu; Patrick X Zhao

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Calculate main effect kinship matrix by partitioning high-dimensional markers into blocks

WZ Wenchao Zhang

XD Xinbin Dai

SX Shizhong Xu

PZ Patrick X Zhao

This method is extracted from research article: NAR Genom Bioinform, Oct 2019

GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS

DOI: 10.1093/nargab/lqz009

Ask a question

Favorite

After SNP and genetic variant calling (27,28), the genotype at one specific locus is represented by AA, Aa and aa for one homozygote, the heterozygote and the other homozygote, respectively. The three genotypes are then numerically coded by −1, 0, 1 or 0, 1, 2 for the additive effect and 0, 1, 0 for the dominance effect. From Equations (5) and (6), we can easily calculate the two main effect kinship matrices, e.g. equation M101 and equation M102 . With the high-throughput sequencing technology, we can easily generate millions of SNP data. The large volume of data is hard to handle using traditional matrix multiplications. Luckily, we can take advantage of the linearity of matrix theory and partition the whole genotype data into many blocks. Kinship matrix is calculated for each block, then the block specific kinship matrices are added to generate the final kinship matrix.

Let equation M103 be a equation M104 matrix for the additive or dominance genotype effect, where equation M105 is the number of markers and n is the sample size. If we partition equation M106 into equation M107 blocks, and each block equation M108 has equation M109 individuals and equation M110 markers so that equation M111 . The partitioned matrix can be written as equation M112 , according to matrix block theory, it is easy to prove that

Figure Figure1A1A illustrates the principle to generate the coded additive and dominance genotypic effect and calculate the main kinship matrix through partitioning the additive/dominance marker into blocks.

Mathematical principle to calculate the high-dimensional kinship matrices. (A) Principle to generate the coded additive and dominant genotypic effect and calculate the main kinship matrix through partitioning the additive/dominance marker into blocks. (B) Principle to generate all of the epistatic genotypic marker pairs and calculate the epistatic kinship matrix through partitioning the marker pairs into blocks.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact moc.puo@snoissimrep.slanruoj

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol