Construction of AMPs profiles (text mining)

Olalekan Olanrewaju Bakare; Marshall Keyster; Ashley Pretorius

This method is extracted from research article: BMC Mol Cell Biol, Nov 2020

Identification of biomarkers for the accurate and sensitive diagnosis of three bacterial pneumonia pathogens using in silico approaches

DOI: 10.1186/s12860-020-00328-4

HMMER version 2.3.2 [52], a software package based on profile hidden Markov models, was used to construct pathogen-specific models (profiles) from the respective training datasets. All HMMER profiles were built on the Ubuntu 12.04 LTS operating system. The work was carried out in a terminal; the command lines used to build each profile were written according to the corresponding algorithm, and the steps involved in their construction were as follows:

For the first step, the training datasets of each target class were aligned using the ClustalW alignment tool [53]. The alignment was carried out using the command line:

In plain terms, the command says: "align the upper-case sequences found in the FASTA-format input file “target class.fasta”, using ClustalW as the multiple alignment tool, and produce GCG Postscript output for graphical printing". The command produces a file of aligned sequences called “target class.msf”. These aligned sequences were used as input in the next step.
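The exact command line is not reproduced in this text, so the following is only a minimal sketch of how such an alignment call could be assembled. It assumes the standard ClustalW options `-align`, `-infile`, `-outfile`, `-output=gcg` and `-case=upper`; the file names `target_class.fasta` and `target_class.msf` and the helper names are hypothetical placeholders, not the authors' own.

```python
import shlex
import shutil
import subprocess


def clustalw_align_command(fasta_path, msf_path, executable="clustalw"):
    """Return an argv list asking ClustalW for a full multiple alignment."""
    return [
        executable,
        "-align",                 # perform a full multiple alignment
        f"-infile={fasta_path}",  # FASTA-format training sequences
        f"-outfile={msf_path}",   # file that receives the alignment
        "-output=gcg",            # GCG/MSF output format
        # Assumption: included to mirror the "upper case" wording of the text.
        # The ClustalW help describes -case in connection with GDE output,
        # so it may have no effect on GCG output.
        "-case=upper",
    ]


def run_if_installed(command):
    """Run the command only if its executable is on PATH; otherwise return None."""
    if shutil.which(command[0]) is None:
        return None
    return subprocess.run(command, check=True, capture_output=True, text=True)


# Hypothetical file names; substitute the real training dataset of a target class.
command = clustalw_align_command("target_class.fasta", "target_class.msf")
print(shlex.join(command))
```

Passing the arguments as a list rather than a single shell string means that file names containing spaces, such as “target class.fasta”, need no extra quoting. Calling `run_if_installed(command)` would execute the alignment on a machine where ClustalW is available.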

The next step builds the profiles of the target class sequences, capturing the common motifs/signatures within them. To achieve this, the “Build profiles” step was run using the following command:
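The command itself is missing from this text. As a hedged illustration only, HMMER 2.x builds a profile with `hmmbuild`, which takes the output HMM file followed by the alignment file. The sketch below assumes that usage and the `-F` (force overwrite) option; the file names and the helper name are hypothetical.

```python
import shlex


def hmmbuild_command(hmm_path, alignment_path, force=False, executable="hmmbuild"):
    """Return an argv list for building a profile HMM from an alignment."""
    command = [executable]
    if force:
        # Assumption: -F lets hmmbuild overwrite an existing HMM file.
        command.append("-F")
    # HMMER 2.x argument order: output HMM file first, then the alignment.
    command += [hmm_path, alignment_path]
    return command


# Hypothetical file names standing in for one target class.
command = hmmbuild_command("target_class.hmm", "target_class.msf")
print(shlex.join(command))
```

The list can be handed to `subprocess.run(command, check=True)` on a machine where HMMER 2.3.2 is installed.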

To enhance the sensitivity of the profiles, the file generated in the profile-building step (“target class.hmm”) was calibrated using the command line:
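The calibration command is likewise not reproduced here. In HMMER 2.x this step is performed by `hmmcalibrate`, which rewrites the HMM file in place with fitted extreme value distribution (EVD) parameters. The sketch below assumes that behaviour, the `--seed` option, and the presence of an `EVD` line in a calibrated HMMER 2 file; the function names and the sample text are illustrative.

```python
import shlex


def hmmcalibrate_command(hmm_path, seed=None, executable="hmmcalibrate"):
    """Return an argv list for calibrating an HMM file in place."""
    command = [executable]
    if seed is not None:
        # Assumption: --seed fixes the random seed so reruns are comparable.
        command += ["--seed", str(seed)]
    command.append(hmm_path)
    return command


def looks_calibrated(hmm_text):
    """Heuristic check: a calibrated HMMER 2 file is assumed to hold an EVD line."""
    return any(line.split()[:1] == ["EVD"] for line in hmm_text.splitlines())


# Hypothetical file name and seed value.
command = hmmcalibrate_command("target_class.hmm", seed=42)
print(shlex.join(command))

# Made-up header fragment, used only to exercise the check.
sample_header = "NAME  target_class\nEVD   -12.3   0.71\nHMM"
print(looks_calibrated(sample_header))
```

Checking for the `EVD` line before the evaluation step is one simple way to catch a profile that was built but never calibrated.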

The resulting calibrated profile, “target class.hmm”, was then evaluated by testing it on an independent AMP dataset.
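The text does not say how the test on the independent dataset was run or scored. One plausible way, sketched here under stated assumptions, is to query the dataset with `hmmsearch` from HMMER 2.x (HMM file first, then the sequence file, with an optional `-E` E-value cutoff) and to report the fraction of known AMPs that appear among the hits. The cutoff, file names, sequence identifiers and the scoring helper are all hypothetical and are not taken from the source.

```python
import shlex


def hmmsearch_command(hmm_path, fasta_path, e_value=None, executable="hmmsearch"):
    """Return an argv list for searching a sequence file with a profile HMM."""
    command = [executable]
    if e_value is not None:
        # Assumption: -E sets the E-value reporting cutoff.
        command += ["-E", str(e_value)]
    command += [hmm_path, fasta_path]
    return command


def fraction_recovered(known_positive_ids, hit_ids):
    """Share of known positive sequences that appear among the reported hits."""
    known = set(known_positive_ids)
    if not known:
        raise ValueError("at least one known positive sequence is required")
    return len(known & set(hit_ids)) / len(known)


# Hypothetical file names and cutoff.
command = hmmsearch_command("target_class.hmm", "independent_amps.fasta", e_value=0.01)
print(shlex.join(command))

# Made-up identifiers: two of the four known AMPs are among the hits.
print(fraction_recovered(["amp1", "amp2", "amp3", "amp4"], ["amp1", "amp3", "other"]))
```

The hit identifiers would in practice be parsed from the `hmmsearch` report; that parsing is left out because the report layout is not described in the source.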

Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
