Calculation of analysable genome size

AC Alex Cagan
AB Adrian Baez-Ortega
NB Natalia Brzozowska
FA Federico Abascal
TC Tim H. H. Coorens
MS Mathijs A. Sanders
AL Andrew R. J. Lawson
LH Luke M. R. Harvey
SB Shriram Bhosle
DJ David Jones
RA Raul E. Alcantara
TB Timothy M. Butler
YH Yvette Hooks
KR Kirsty Roberts
EA Elizabeth Anderson
SL Sharna Lunn
EF Edmund Flach
SS Simon Spiro
IJ Inez Januszczak
EW Ethan Wrigglesworth
HJ Hannah Jenkins
TD Tilly Dallas
NM Nic Masters
MP Matthew W. Perkins
RD Robert Deaville
MD Megan Druce
RB Ruzhica Bogeska
MM Michael D. Milsom
BN Björn Neumann
FG Frank Gorman
FC Fernando Constantino-Casas
LP Laura Peachey
DB Diana Bochynska
ES Ewan St. John Smith
MG Moritz Gerstung
PC Peter J. Campbell
EM Elizabeth P. Murchison
MS Michael R. Stratton
IM Iñigo Martincorena
request Request a Protocol
ask Ask a question
Favorite

To estimate the somatic mutation rate, it was first necessary to establish the size of the analysable nuclear genome (that is, the portion of the genome in which variant calling could be performed reliably) for each sample (Supplementary Table 4). For both substitutions and indels, the analysable genome of a sample was defined as the complement of the union of the following genomic regions: regions reported as ‘not analysed’ by the CaVEMan variant caller; regions failing the ‘chromosome and contig’ filter; regions failing the ‘N-tract and contig-end’ filter; and regions failing the ‘sequencing coverage’ filter (see ‘Variant filtering’). For the analysis of mitochondrial variants, the analysable genome of a sample was defined as the portion of mtDNA that satisfied the ‘sequencing coverage’ filter (see ‘Mitochondrial variant calling and filtering’), after subtracting the hypervariable region (D-loop).

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

post Post a Question
0 Q&A