2.5. Calculation of Statistical Significance for maxV(n1)
This protocol is extracted from research article:
Multiple Alignment of Promoter Sequences from the Arabidopsis thaliana L. Genome
Genes (Basel), Jan 21, 2021; DOI: 10.3390/genes12020135

We used the Monte Carlo method to estimate the statistical significance of maxV(n1). Sequence S was randomly shuffled to obtain 200 random sequences. Then, matrix maxW was included in the Q set described in Section 2.2, which allowed taking into account the effectiveness of maxW alignment with random sequences. Then, each of these sequences were treated as described in Section 2.2, Section 2.3, Section 2.4 and maxW was calculated for each, producing 200 maxV(n1). Then, the mean maxV(n1)¯ and variance D(maxV(n1)) were calculated and used to compute Z:

where maxV(n1) was calculated for sequences S1 and S in Section 2.4.

Z was obtained for each MSA and the average Z value for random S sequences was estimated according to Formula (7) after each sequence had been subjected to the procedures described in Section 2.2, Section 2.3, Section 2.4 and here. As a result, the mean Z = 1.8, and we can assume that the MSA is non-random at Z > 6.0.

Note: The content above has been extracted from a research article, so it may not display correctly.

Please log in to submit your questions online.
Your question will be posted on the Bio-101 website. We will send your questions to the authors of this protocol and Bio-protocol community members who are experienced with this method. you will be informed using the email address associated with your Bio-protocol account.

We use cookies on this site to enhance your user experience. By using our website, you are agreeing to allow the storage of cookies on your computer.