From any browser, access the phosphositeplus website (www.phosphosite.org)
Navigate to Downloads > Datasets from PSP
Accept the terms and conditions
Download the Methylation_site_dataset.gz file
Open file using Microsoft Excel
From the main file, duplicate the sheet (rename "Lysine Methylation")
Sort MOD_RSD column alphabetically, and keep only rows with K methylation
For KxP methylation sites:
Create a new duplicate sheet (rename "KxP methylation")
Use the SITE_+/-_7AA column for the searches below (in our spreadsheet, it was column J)
Create a column "K?P" --> use function =SEARCH("K?P",J2) and then drag down to create the column for all rows
In our spreadsheet, it was column K
This will provide all AA sequences that have a KxP motif, with the position of the K indicated as a number
Sort column "K?P" in ascending order, and keep only rows with values present
Create another column "K?P"--> use function =MID(J2,SEARCH("K?P",J2,3) and drag down to create a column for all rows
In our spreadsheet, it was column L
This will provide all the KxP motifs from the SITE_+/-_7AA column
Create another column "k?P" --> use function =FIND("k", L2) and drag down to create a column for all rows
In our spreadsheet, it was column M
This will provide a value of 1 if the lysine from the KxP is UNcapitalized, i.e., it is the site that is being modified
If the motif from column L has a methylated lysine at another spot, it will return 2 or 3
If the motif from column L had a CAPITALIZED K, then this column will return #VALUE
Create a new duplicate sheet (rename "KxP no repeats")
Delete all rows in which the "k?P" column returned #VALUE
Sort sheet by GENE column (in our spreadsheet, it was column A)
Use Data --> remove duplicate function and select column A
For KxxGG methylation sites
Create a new duplicate spreadsheet from the "Lysine Methylation" sheet (rename "KxxG methylation")
Repeat above steps using "K??G" instead of "K?P", and "k??G" instead of "k?P"
Retain the rows in which the "k??G" =FIND("k",L2) function returned the value of 1
Make the sheet "KxxG methylation no repeats"
Make a new sheet (rename "KxP_KxxG")
Copy all rows from the "KxP no repeats" and "KxxG no repeats"
Create a new duplicate sheet (rename "KxP_KxxG no repeats")
Sort by GENE (column A)
Use Data --> remove duplicate function and select column A
Spreadsheet used in our analysis available upon request once the protocol has been utilized (please contact cheryl.walker@bcm.edu)
Copyright: Content may be subjected to copyright.
How to cite:
Readers should cite both the Bio-protocol preprint and the original research article where this protocol was used:
Seervai, R and Walker, C(2021). In silico SETD2 methylation site analysis. Bio-protocol Preprint. bio-protocol.org/prep730.
Seervai, R. N. H., Jangid, R. K., Karki, M., Tripathi, D. N., Jung, S. Y., Kearns, S. E., Verhey, K. J., Cianfrocco, M. A., Millis, B. A., Tyska, M. J., Mason, F. M., Rathmell, W. K., Park, I. Y., Dere, R. and Walker, C. L.(2020). The Huntingtin-interacting protein SETD2/HYPB is an actin lysine methyltransferase . Science Advances 6(40). DOI: 10.1126/sciadv.abb7854
Post your question to gather feedback from the community. We will also invite the authors of this
article to respond.
0/150
Tips for asking effective questions
+ Description
Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.
Spinning
Post a Question
0 Q&A
Spinning
This protocol preprint was submitted via the "Request
a Protocol" track.