Data preprocessing and batch effect correction

CC Christopher Cherry
DM David R Maestas
JH Jin Han
JA James I Andorko
PC Patrick Cahan
EF Elana J Fertig
LG Lana X Garmire
JE Jennifer H Elisseeff
ask Ask a question
Favorite

Seurat was used for most processing steps where other software is not specified58. All cell counts were pruned of cells with UMI counts below 250, cells with more than 10% mitochondrial genes, and genes expressed in fewer than 0.1% of cells. We then normalized and scaled the data with regression on UMI count to account for batch effect and percent mitochondrial genes and calculated principle components using the top 2000 most variable genes. For muscle data sets we then corrected the principle components for batch effect using Harmony59. There are inevitably differences between experimental batches when running single cell RNA sequencing. Harmony and other batch effect correction algorithms are designed to align different batches prior to downstream clustering and dimensional reduction. UMAP and shared nearest neighbor graph construction with subsequent Louvain clustering was then run on principle components.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

post Post a Question
0 Q&A