Calculation of s(Cseg)

Richard Smith-Unna; Chris Boursnell; Rob Patro; Julian M. Hibberd; Steven Kelly

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Calculation of s(Cseg)

RS Richard Smith-Unna

CB Chris Boursnell

RP Rob Patro

JH Julian M. Hibberd

SK Steven Kelly

This method is extracted from research article: Genome Res, Aug 2016

TransRate: reference-free quality assessment of de novo transcriptome assemblies

DOI: 10.1101/gr.196469.115

Request a Protocol

Ask a question

Favorite

The per-nucleotide read coverage data is used to evaluate this score. To evaluate the probability that the contig originates from a single transcript (i.e., it is not chimeric), a Bayesian segmentation analysis of the per-nucleotide coverage depth is performed. For a correctly assembled contig, it is assumed that the distribution of per-nucleotide coverage values in that contig is best described by a single Dirichlet distribution, i.e., all nucleotides in the same transcript should have the same expression level, and thus should be best modeled as a stochastic sample from a single distribution. In contrast, a contig that is a chimera derived from concatenation of two or more transcripts will have per-nucleotide coverage values that are best described by two or more different Dirichlet distributions. The probability that the distribution of per-nucleotide read coverage values comes from a single Dirichlet distribution is evaluated using a Bayesian segmentation algorithm previously developed for analysis of changes in nucleotide composition (Liu and Lawrence 1999). To facilitate the use of this method, the per-nucleotide coverage along the contig is encoded as a sequence of symbols in an unordered alphabet by taking log₂ of the read depth rounded to the nearest integer. As the probability will be a value between 0 and 1, this probability is used directly as s(C_seg).

This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

0/150

tip Tips for asking effective questions

+ Description

Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol