The intronic regions were defined as the longest continuous segments within genes that do not overlap any annotated exons (including exons of other genes). The intronic regions were extended by 10 nts into the flanking exons to enable the identification of CCRs that overlap splice sites such as regions R1–R5 in the human Ate1 gene36. Next, these regions were intersected with the set of conserved RNA elements (phastConsElements track for the alignment of 99 vertebrates genomes to the human genome99) using bedtools100. We additionally excluded genomic regions that overlap intervals from the UCSC Genome Browser tRNA track101105, sno/miRNA track106112, TFBS Conserved track, which consists of Conserved Transcription Factor Binding Sites generated using the Transfac Matrix and Factor databases113, RepeatMasker track114 (all categories including SINE (short interspersed nuclear elements) and LINE (long interspersed nuclear elements)), SimpleTandemRepeats located by Tandem Repeats Finder115, and Human Nuclear mitochondrial sequences116118. Genomic regions marked as “snRNA,” “miRNA,” “miscRNA,” “snoRNA,” “rRNA,” and “tRNA” in GENCODE genome annotation were also excluded. The resulting set of CIRs was used as an input for the next step, which is described below.

Note: The content above has been extracted from a research article, so it may not display correctly.

Please log in to submit your questions online.
Your question will be posted on the Bio-101 website. We will send your questions to the authors of this protocol and Bio-protocol community members who are experienced with this method. you will be informed using the email address associated with your Bio-protocol account.

We use cookies on this site to enhance your user experience. By using our website, you are agreeing to allow the storage of cookies on your computer.