Integration of scRNA-Seq and scATAC-Seq data

Anna Maria Ranzoni; Andrea Tangherloni; Ivan Berest; Simone Giovanni Riva; Brynelle Myers; Paulina M. Strzelecka; Jiarui Xu; Elisa Panada; Irina Mohorianu; Judith B. Zaugg; Ana Cvejic

Improve Research Reproducibility A Bio-protocol resource

Concise Method

This method is extracted from the research article: Cell Stem Cell, Mar 2021

Integrative Single-Cell RNA-Seq and ATAC-Seq Analysis of Human Developmental Hematopoiesis

DOI: 10.1016/j.stem.2020.11.015

We integrated scRNA-Seq and scATAC-Seq data using the method developed by Stuart et al. (2019). Specifically, we used our scRNA-Seq data as the reference dataset to train the classifier and automatically assign a cell type to each scATAC-Seq cell. The classifier was trained on 511 CD34+ CD38- cells from our scRNA-Seq experiment. To ensure a suitable number of cells per cell type for training, we considered only scRNA-Seq clusters with at least 20 cells (i.e., HSC/MPPs, HSC/MPPs-Cycle, MEMPs, MEMPs-Cycle, GPs, and LMPs).

We generated a gene expression matrix from our scATAC-Seq dataset by assigning each peak to a gene based on the genomic coordinates of the gene body ± 3 kb. We applied the Seurat function FindTransferAnchors (query.assay equal to RNA_promoter, features equal to the counts of the RNA_promoter, and k.anchor equal to 6) in the Canonical Correlation Analysis (CCA) space because, compared to the LSI space, it was more suitable for capturing the shared feature correlation structure between scRNA-Seq and scATAC-Seq data.

We assigned cell types to the scATAC-Seq cells by applying the Seurat function TransferData to the first 50 LSI components corrected by Harmony, using the calculated anchors (refdata equal to the six scRNA-Seq clusters). To avoid assignments based on a low score, all cells with a prediction score lower than 40% were labeled as unknown (for comparison, a uniform distribution over six clusters would give 16.67% per cluster).
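
The two numeric rules in this method, the gene body ± 3 kb window and the 40% prediction-score cutoff, can be illustrated with a minimal Python sketch. This is not the authors' code, which relied on Seurat in R; it only mirrors the logic described in the text. The function names, the tuple layout of peaks and genes, the "any overlap" criterion, and the example coordinates and scores are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

FLANK_BP = 3_000            # gene body extended by 3 kb on each side (from the text)
MIN_SCORE = 0.40            # prediction-score cutoff of 40% (from the text)
N_CLUSTERS = 6              # six scRNA-Seq reference clusters (from the text)
UNIFORM_BASELINE = 1 / N_CLUSTERS   # about 0.1667, i.e. 16.67%

Interval = Tuple[str, int, int]     # (chromosome, start, end); hypothetical layout


def genes_for_peak(peak: Interval,
                   genes: Dict[str, Interval],
                   flank: int = FLANK_BP) -> List[str]:
    """Return the genes whose body, extended by `flank` bp, overlaps the peak.

    Assumption: any overlap counts, and strand is ignored.
    """
    chrom, start, end = peak
    hits = []
    for name, (g_chrom, g_start, g_end) in genes.items():
        if g_chrom != chrom:
            continue
        window_start = max(0, g_start - flank)
        window_end = g_end + flank
        if start < window_end and end > window_start:
            hits.append(name)
    return hits


def assign_label(scores: Dict[str, float],
                 min_score: float = MIN_SCORE) -> str:
    """Pick the best-scoring cluster, or 'unknown' if its score is below the cutoff."""
    best = max(scores, key=scores.get)
    return best if scores[best] >= min_score else "unknown"


# Hypothetical gene and per-cell scores, for illustration only.
example_genes = {"GENE_A": ("chr1", 10_000, 20_000)}
confident_cell = {"HSC/MPPs": 0.62, "HSC/MPPs-Cycle": 0.10, "MEMPs": 0.08,
                  "MEMPs-Cycle": 0.05, "GPs": 0.10, "LMPs": 0.05}
ambiguous_cell = {"HSC/MPPs": 0.30, "HSC/MPPs-Cycle": 0.25, "MEMPs": 0.15,
                  "MEMPs-Cycle": 0.10, "GPs": 0.10, "LMPs": 0.10}
```

With these hypothetical inputs, `genes_for_peak(("chr1", 21_000, 21_500), example_genes)` returns `["GENE_A"]` because the peak lies within 3 kb of the gene end, `assign_label(confident_cell)` returns `"HSC/MPPs"`, and `assign_label(ambiguous_cell)` returns `"unknown"` because its best score (0.30) is below the 0.40 cutoff even though it exceeds the uniform baseline of about 0.167.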

This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
