Gene Expression Profile Data and Clinical Parameters

EL Enhao Li
XY Xiaobao Yang
YD Yuzhang Du
GW Guanzheng Wang
DC David W. Chan
DW Di Wu
PX Peiqing Xu
PN Peihua Ni
DX Dakang Xu
YH Yiqun Hu
request Request a Protocol
ask Ask a question
Favorite

The transcriptome of RNA-seq data and clinical parameters of CRC cases (normal samples, 41 cases; tumor samples, 473 cases) were downloaded from The Cancer Genome Atlas Program (TCGA) repository of the National Cancer Institute (https://cancergenome.nih.gov/). The data parameters were as follows: primary site (colon), data category with transcriptome profiling with gene expression and quantification with FPKM and counts, experimental strategy with RNA-Seq analysis and workflow type with HTSeq-FPKM and HTSeq-Counts. Default settings were used for the other filters.

In order to test and verify the discovery in the TCGA cohort, we downloaded GSE14333 and GSE38832 from the Gene Expression Omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo/). Expression series matrix files of both datasets were based on GPL570. The former included 290 samples of primary colorectal cancer patients, while the latter had expression data of 122 human samples. To get a larger size of samples, two cohorts were combined for analyses.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

post Post a Question
0 Q&A