EpiScanpy chromatin data integration workflow

Anna Danese; Maria L. Richter; Kridsadakorn Chaichoompu; David S. Fischer; Fabian J. Theis; Maria Colomé-Tatché

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

EpiScanpy chromatin data integration workflow

AD Anna Danese

MR Maria L. Richter

KC Kridsadakorn Chaichoompu

DF David S. Fischer

FT Fabian J. Theis

MC Maria Colomé-Tatché

This method is extracted from research article: Nat Commun, Sep 2021

EpiScanpy: integrated single-cell epigenomic analysis

DOI: 10.1038/s41467-021-25131-3

Request a Protocol

Ask a question

Favorite

In the advent of having multiple datasets of the same omic (single-cell ATAC-seq or DNA methylation) to analyse jointly, it is important to remove potential batch effects. EpiScanpy offers this possibility using the bbKNN^³³ batch correction method. In order to integrate the different batches, it is required to use a common feature space. Thus, a preliminary step is to build count matrices using a shared set of features like windows or a common set of peaks between datasets. To obtain a good embedding of the different datasets together, it is important that the set of features used is representative of all datasets. For that, we select the most variable features on each dataset separately. Then we concatenate the datasets keeping the intersect of the variable features. Alternatively, epiScanpy can merge the datasets using the union of the different feature spaces. Additional quality controls and filtering are recommended to remove features that are not covered in enough cells, and cells which do not contain enough covered features. Finally, we proceed to library size normalisation and run the integration method on this concatenated matrix.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol