Pangenome Analysis

José Luis Maturana; Juan P. Cárdenas

Concise Method

This method is extracted from research article: Front Microbiol, Apr 2021

Insights on the Evolutionary Genomics of the Blautia Genus: Potential New Species and Genetic Content Among Lineages

DOI: 10.3389/fmicb.2021.660920

The pangenome of the Blautia dataset was reconstructed with Roary (Page et al., 2015), panX (Ding et al., 2018) and PEPPAN (Zhou et al., 2020). Input files for all three programs were generated with Prokka (default settings, ‘--kingdom Bacteria’) (Seemann, 2014). Roary and PEPPAN took the GFF files as input, whereas panX took the GenBank files. Roary was run with the options ‘-e -n -p 24 -v -r -i 80 --group_limit 100000’. PEPPAN and panX were run with default options. The output from PEPPAN was parsed using PEPPAN_parser with the settings ‘-t -c -a 95’. Python scripts then used the resulting files (allele.fna, PEPPAN.gff and PEPPAN.gene_content.Rtab) to generate a multi-FASTA file containing the pangenome. The rarefaction curves for this pangenome were taken from the file PEPPAN.gene_content.curve and plotted using pandas (Reback et al., 2020) and matplotlib (Hunter, 2007). For panX, the file geneCluster.json was parsed with pandas to generate a gene presence/absence matrix, from which basic statistics about the pangenome were obtained. To estimate whether the pangenome is open or closed, this matrix was passed to the R library micropan v2.1 (Snipen and Liland, 2015) and an alpha value was estimated.
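The matrix-building and summary steps described above can be illustrated with a minimal sketch. It is not the authors' script: the record fields `cluster` and `genomes`, the toy data and the function names are all hypothetical, and they do not reproduce the actual layout of geneCluster.json. The permutation-based rarefaction shown here is a generic approach and is not necessarily how PEPPAN computes the values in PEPPAN.gene_content.curve.

```python
import numpy as np
import pandas as pd

# Hypothetical toy records: each gene cluster lists the genomes that carry it.
# These field names are assumptions, not the real geneCluster.json schema.
clusters = [
    {"cluster": "GC_0001", "genomes": ["g1", "g2", "g3", "g4"]},
    {"cluster": "GC_0002", "genomes": ["g1", "g2", "g3", "g4"]},
    {"cluster": "GC_0003", "genomes": ["g1", "g2"]},
    {"cluster": "GC_0004", "genomes": ["g3"]},
    {"cluster": "GC_0005", "genomes": ["g4", "g4"]},  # paralogs in one genome
]


def presence_absence(records):
    """Build a clusters x genomes matrix of 0/1 values."""
    genomes = sorted({g for r in records for g in r["genomes"]})
    names = [r["cluster"] for r in records]
    matrix = pd.DataFrame(0, index=names, columns=genomes)
    for r in records:
        # set() collapses paralogs so each cell is presence (1) or absence (0)
        matrix.loc[r["cluster"], list(set(r["genomes"]))] = 1
    return matrix


def summarize(matrix):
    """Count pangenome, core (all genomes) and unique (one genome) clusters."""
    n_genomes = matrix.shape[1]
    occupancy = matrix.sum(axis=1)
    return {
        "genomes": n_genomes,
        "pangenome": int((occupancy > 0).sum()),
        "core": int((occupancy == n_genomes).sum()),
        "unique": int((occupancy == 1).sum()),
    }


def rarefaction(matrix, n_perm=100, seed=0):
    """Mean pan and core sizes as genomes are added in random order."""
    rng = np.random.default_rng(seed)
    values = matrix.to_numpy().T  # genomes x clusters
    n = values.shape[0]
    k = np.arange(1, n + 1)[:, None]  # number of genomes sampled so far
    pan = np.zeros((n_perm, n))
    core = np.zeros((n_perm, n))
    for p in range(n_perm):
        cum = np.cumsum(values[rng.permutation(n)], axis=0)
        pan[p] = (cum > 0).sum(axis=1)    # seen in at least one genome
        core[p] = (cum == k).sum(axis=1)  # seen in every genome so far
    return pd.DataFrame({
        "genomes": np.arange(1, n + 1),
        "pan_mean": pan.mean(axis=0),
        "core_mean": core.mean(axis=0),
    })


matrix = presence_absence(clusters)
stats = summarize(matrix)
curve = rarefaction(matrix, n_perm=50)
print(matrix)
print(stats)
print(curve)
```

The `curve` table has one row per number of sampled genomes, so it could be plotted with matplotlib in the same way as the PEPPAN curve file. The alpha estimate itself is left to micropan, as in the text, and is not reimplemented here.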

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
