Composition-based features

SG Sudheer Gupta
MM Midhun K. Madhu
AS Ashok K. Sharma
VS Vineet K. Sharma
request Request a Protocol
ask Ask a question
Favorite

Amino acid composition (AAC) is the percentages of each amino acid in the given length of amino acid sequence. AAC has widely been applied in different peptide and protein composition based classification method [28, 29]. Since there are 20 amino acids, each peptide/protein is represented by 20 types of compositions or a vector size of 20.

where, AAC(i) is the amino acid composition of the amino acid (i) and amino acid (i) is one of the 20 amino acids.

Similar to the AAC, dipeptide composition (DPC) has also been extensively applied in sequence-based classifications, particularly in the immune epitope prediction algorithms [30, 31]. DPC differs from AAC in having pair of amino acids and thus also provides information on local arrangement. The percentage of every possible pair (dipeptide) of amino acids was calculated. The following equation has been used for this calculation:

where, DPC(i) is the dipeptide frequency of dipeptide (i) and the dipeptide (i) is one out of 400 dipeptides.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

0/150

tip Tips for asking effective questions

+ Description

Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.

post Post a Question
0 Q&A