Similarity network fusion

Mariam Pirashvili; Lee Steinberg; Francisco Belchi Guillamon; Mahesan Niranjan; Jeremy G. Frey; Jacek Brodzki

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Similarity network fusion

MP Mariam Pirashvili

LS Lee Steinberg

FG Francisco Belchi Guillamon

MN Mahesan Niranjan

JF Jeremy G. Frey

JB Jacek Brodzki

This method is extracted from research article: J Cheminform, Nov 2018

Improved understanding of aqueous solubility modeling through topological data analysis

DOI: 10.1186/s13321-018-0308-5

Ask a question

Favorite

Similarity Network Fusion (SNF) [38] is a recent computational method for data integration. Briefly, SNF combines many different types of measurements for a given set of samples. For n data points with m different types of measurements, m different $n \times n$ distance matrices are constructed, which can be thought of as a network on n points, with the distances being the weights on the edges. First, these are transformed into similarity matrices W by using an exponential similarity function. The SNF implementation takes these similarity matrices as input. To compute the fused matrix from multiple types of measurements, a full similarity matrix P and a sparse similarity matrix S are defined for each measurement. For the first, P is constructed by performing a form of normalisation on W, in the following way:

The matrix S is constructed using K nearest neighbours. For each i, let $N_{i}$ represent the K nearest neighbours of i, including i itself, giving

Next, the matrices P are iteratively updated to converge to a single similarity matrix. In the case $m = 2$ , the initial matrices are $P_{t = 0}^{(1)} = P^{(1)}$ , and $P_{t = 0}^{(2)} = P^{(2)}$ . The iterative step is given by

After t steps, the overall status matrix is computed as

We transform our $H_{0}$ and $H_{1}$ distance matrices into similarity matrices using the same exponential function described above and then use SNF to combine them into one matrix.

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol