Prediction of the functional impact of coding nsSNPs using PolyPhen-2

Zeynep Kosaloglu; Julia Bitzer; Niels Halama; Zhiqin Huang; Marc Zapatka; Andreas Schneeweiss; Dirk Jäger; Inka Zörnig

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Prediction of the functional impact of coding nsSNPs using PolyPhen-2

ZK Zeynep Kosaloglu

JB Julia Bitzer

NH Niels Halama

ZH Zhiqin Huang

MZ Marc Zapatka

AS Andreas Schneeweiss

DJ Dirk Jäger

IZ Inka Zörnig

This method is extracted from research article: BMC Cancer, Nov 2016

In silico SNP analysis of the breast cancer antigen NY-BR-1

DOI: 10.1186/s12885-016-2924-7

Request a Protocol

Ask a question

Favorite

PolyPhen-2 combines information on sequence features, multiple alignments with homologous proteins, and structural parameters to predict the impact of a SNP on protein function.

For sequence-based assessment, PolyPhen-2 tries to identify the query as an entry in the UniProtKB/Swiss-Prot database. Using the feature table of the corresponding entry, PolyPhen-2 checks if a given SNP occurs at functional relevant site, e.g. if the SNP lies within a transmembrane, signal peptide, or binding region.

Similar to SIFT, PolyPhen-2 also assesses the degree of conversation of the position where the SNP occurs by utilizing a multiple sequence alignment of homologous sequences. For each variant PolyPhen-2 calculates a position-specific independent counts (PSIC) score. The PSIC score difference between the two variants describes the impact of a particular amino acid substitution: the higher the PSIC score difference, the higher functional impact the substitution is likely to have.

A BLAST query of the query sequence against protein structure databases is carried out to identify corresponding 3D protein structures. If corresponding structures are found, they are used to assess, whether the SNP is likely to destroy the hydrophobic core, interactions with ligands or other important features of the protein.

Finally, all parameters are taken together and empirical prediction rules are applied to make the final decision, whether the SNP is damaging or benign.

PolyPhen-2 is available online at http://genetics.bwh.harvard.edu/pph2/. We used the option ‘Batch query’ and submitted the list of genomic coordinates and variants of our filtered 191 nsSNPs.

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol