2.5. Chemical Space Visualization

Ana L. Chávez-Hernández; Norberto Sánchez-Cruz; José L. Medina-Franco

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

2.5. Chemical Space Visualization

AC Ana L. Chávez-Hernández

NS Norberto Sánchez-Cruz

JM José L. Medina-Franco

This method is extracted from research article: Biomolecules, Nov 2020

Fragment Library of Natural Products and Compound Databases for Drug Discovery †

DOI: 10.3390/biom10111518

Request a Protocol

Ask a question

Favorite

Morgan fingerprints with radius 2 (Morgan2, 1024-bits) were generated for each compound and fragment data set. To generate a visual representation of the chemical space, we used the recently developed algorithm TMAP (Tree MAP). This method allows the visual representation of many molecules that are difficult to visualize using other standard methods such as principal component analysis. Basically, TMAP allows the visualization of large data sets (such as the ones studied in this work—Table 1) through the distance between the clusters and the cluster’s detailed structure through branches and sub-branches [26,27]. Fingerprints for each data set (input data) were indexed in a local sensitive hashing (LSH) forest data structure, enabling c-approximate k-nearest neighbor (k-NN). Fingerprints were encoded using the MinHash algorithm. An undirected weighted c-approximate k-nearest neighbor graph (c-k-NNG) is constructed from the data points indexed in the LSH forest. This graph takes two arguments, k, the number of nearest-neighbors, and kc, the factor used by the augmented query algorithm. In this work, we used k = 50 and kc =10. Further details of the TMAP approach are published elsewhere [28].

Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol