Ab initio assembly of transcripts from RNA-seq data

Chan Zhou; Samuel R. York; Jennifer Y. Chen; Joshua V. Pondick; Daniel L. Motola; Raymond T. Chung; Alan C. Mullen

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Ab initio assembly of transcripts from RNA-seq data

CZ Chan Zhou

SY Samuel R. York

JC Jennifer Y. Chen

JP Joshua V. Pondick

DM Daniel L. Motola

RC Raymond T. Chung

AM Alan C. Mullen

This method is extracted from research article: Genome Med, Mar 2016

Long noncoding RNAs expressed in human hepatic stellate cells form networks with extracellular matrix proteins

DOI: 10.1186/s13073-016-0285-0

Request a Protocol

Ask a question

Favorite

We mapped each replicate of directional paired-end RNA-seq data to the human reference genome (hg19/GRCh37) using TopHat v2.0.10 [59, 60] before assembling transcripts using both Cufflinks [61] and Scripture [62]. The TopHat settings were as follows:

tophat -p 8 --library-type fr-firststrand --mate-inner-dist 50 --mate-std-dev 50 --microexon-search --GTF genes.gtf -o < output-folder > <index of reference genome > Reads_end1.fastq Reads_end2.fastq

The reference genes in GTF file format (genes.gtf) were downloaded from the University of California, Santa Cruz (UCSC) genome browser [63]. We then assembled transcripts through the following settings of Cufflinks using TopHat output bam file as input:

cufflinks -p 8 --max-bundle-frags 100000000 --library-type fr-firststrand --frag-bias-correct --multi-read-correct -o < output_folder > <tophat_output_bam_file>

Max-bundle-frags was set to 100,000,000 such that highly expressed genes would be included in the output.

For analysis in Scripture, we used the following TopHat settings:

tophat -p 4 --microexon-search --GTF genes.gtf -o < output_folder > <index_of_reference_genome > <Reads_one_end.fastq>

We used Scripture (beta2 version) to assemble transcripts by following the protocol for transcript assembly (http://www.broadinstitute.org/software/scripture/). All transcripts assembled in Cufflinks and/or Scripture were then merged into one list through Cuffmerge [61].

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

0/150

tip Tips for asking effective questions

+ Description

Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol