Dipeptide composition (DC) is the occurrence frequency of each pair of adjacent amino acid residues in a sequence. There are 20 × 20 = 400 possible amino acid pairs. Unlike amino acid composition (AAC), DC captures some local sequence-order information. It can be calculated as:
Here, m_i is the number of occurrences of the i-th dipeptide in the protein sequence, and L is the length of the protein sequence.
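As an illustration, the calculation can be sketched in Python. This is a minimal sketch, not the authors' implementation. It assumes that the count m_i is normalized by L − 1, the number of adjacent residue pairs in a sequence of length L; some implementations divide by L instead, so check which convention applies before comparing values. The function name `dipeptide_composition` and the choice to skip pairs containing non-standard residues are also assumptions made for this example.

```python
from itertools import product

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 20 x 20 = 400 ordered dipeptides, in a fixed order.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(sequence):
    """Return a dict mapping each of the 400 dipeptides to its frequency.

    Assumption: frequencies are m_i / (L - 1), where L - 1 is the number
    of adjacent residue pairs. Pairs containing a non-standard residue
    (for example 'X') are skipped but still count toward L - 1.
    """
    sequence = sequence.upper()
    if len(sequence) < 2:
        raise ValueError("sequence must contain at least two residues")

    counts = dict.fromkeys(DIPEPTIDES, 0)
    # Slide a window of width 2 along the sequence.
    for first, second in zip(sequence, sequence[1:]):
        pair = first + second
        if pair in counts:
            counts[pair] += 1

    n_pairs = len(sequence) - 1
    return {dp: counts[dp] / n_pairs for dp in DIPEPTIDES}


dc = dipeptide_composition("ACAC")
print(len(dc), dc["AC"], dc["CA"])
```

For the toy sequence "ACAC", the adjacent pairs are AC, CA, and AC, so under this normalization AC has frequency 2/3, CA has frequency 1/3, and the remaining 398 dipeptides have frequency 0.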