4.1. Colon Cancer Dataset and Preprocessing

JZ Jie Zhu
WK Weikaixin Kong
ZX Zhengwei Xie
request Request a Protocol
ask Ask a question
Favorite

The workflow of our study is shown in Figure S1A. Public gene-expression data and full clinical annotation were searched for in the GEO and TCGA databases. Patients without survival information were removed from further evaluation. For microarray data from Affymetrix®, we applied the “limma” package in R for the analysis. As to datasets in TCGA, RNA sequencing data (FPKM value) of gene expression were downloaded from the Genomic Data Commons (GDC, https://portal.gdc.cancer.gov/ (accessed on 8 June 2020)). The somatic mutation data were acquired from the TCGA database. Data were analyzed with the R (version 3.6.2, 2019-12-12) software and R Bioconductor packages.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

0/150

tip Tips for asking effective questions

+ Description

Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.

post Post a Question
0 Q&A