Advanced Search
Last updated date: Apr 25, 2022 Views: 587 Forks: 0
Use HHblits (1) to align multiple sequences from the clustered UniProtdatabase.
To measure residue-residue coevolution, use DCA (2, 3) with default parameters: x = 0:2; λ = 0:5 (for more info: http://dca.rice.edu/portal/dca/home). DCA outputs a directinformation (DI) score per pair of residues, rankingevolutionary correlation. Only coevolution pairs at a sequence distance ≥5 are considered.
Given a DI-ranked list of coevolution pairs, keep for further analysis only the first n pairs that maximize the Matthew's Correlation Coefficient (MCC) resulting from the prediction of contacts (<10 Å) in the initial structure. Given a list of n selected coevolution pairs, MCC is calculated as:
where TP is the number of contacts in the selected list, while FP corresponds to the selected pairs that are not in contact, TN to the non-selected pairs that are not in contact, and FN to contacts that have not been selected.
Coevolution pairs that are far apart in the initial structure are used to guide an initial, exploratory conformational sampling. Concretely, run a pulling trajectory for each distant coevolution pair i-j using discrete (d) MD. According to the standarddMD algorithm, the protein Hamiltonian is defined as a series of flat square wells (in this case Go-like single well, see (4)) and particles (Cα) move at constant velocity until a collision occurs, where momentum and energy conservation rules are enforced. To favor the formation of coevolutionary contacts (i-j) bias the trajectory by inverting velocities of particles i and j every 100 simulation steps if i and j arenot approaching each other, and keeping them unaltered otherwise. Maintain the trajectory for downstream analysis if i-j are in contact at some point of the trajectory.
To evaluate the coincidence between trajectories and coevolution maps, check whether contacts are spontaneously established along the pulling trajectory (k − l, where k,l ≠ i,j are indeed coevolution pairs). For this, compute receiver-operating characteristic curves (ROC; sensitivity versus 1 − specificity) to quantify the agreement between conformations generated inthe trajectory and the list of n coevolution pairs (filter out those contactsat ≤6.5 Å to exclude trivial trajectories). In this framework, the ROC space is defined as follows:
Sensitivity=TP/TP+FN;
Specificity=TN/FP+TN;
where TP counts contacts generated along the trajectory that are in turn selected coevolution pairs, FN is the number of coevolution pairs that are not in contact along the trajectory, TN are the pairs that are not in contact during the trajectory and, accordingly, are not coevolution pairs, and FP are the pairs that are in contact but do not coevolve.
Given the pulling trajectories described above, build a multiple SBM that would enable a single dMD trajectory to explore the nmodels ensemble. To this aim, shape particle-particle interactions to reflect the variability spanned by exploratory trajectories. Concretely, take the original PDB and the last snapshot of the accepted pulling trajectories, and describe the potential energyinteractions by meansof double-well squarepotentials when a pair of particles i-j distance change across the ensemble of structures, and single-well square potential otherwise. The wells are centered at (the distance in the initial PDB structure), and, when needed, a second one centered at the coevolution interaction distance between residues i −j (
, which is set considering all nmodels i −j distances coming from the accepted pulling trajectories; eq.2); to increase the importance of short-distance contacts, which could be obscured by larger ones, introduce a weight factor in the averaging (
; eq.3). If nmodels is the number of structures used to build the SBM, the center of the coevolution well is:
where is the residues i − j distance in the kth model, and the hard-core radius of the particles RHC is 2 Å.
For more details on the above protocol, the sampling strategy and the selection and evaluation of the sampled structures please refer to Sfriso et al. (5).
References
Category
Do you have any questions about this protocol?
Post your question to gather feedback from the community. We will also invite the authors of this article to respond.
Tips for asking effective questions
+ Description
Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.
Share
Bluesky
X
Copy link