Deep Learning allows computational models composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have already dramatically improved the state of the art in speech recognition, visual object recognition, and object detection, as well as many other domains such as drug discovery and genomics. A Deep Learning model can discover complicated structure in datasets by using the back-propagation algorithm and, according to the discovered structure, adjust its internal parameters; the internal parameters of each layer are computed from the output of the previous layer^47. The number of layers required varies with the complexity of the dataset. We think that although the relation between clinical side effects and the adaptability of drugs may not be strong, there may be a deeper relation between clinical expressiveness and final efficacy; that is, clinical expressiveness will affect final efficacy. This relation suits the main idea of Deep Learning, which is to discover deeper relations within the data through a multi-layer network and the back-propagation algorithm. Therefore, we establish a Deep Learning model to verify this idea. Based on the number of samples and the dimensionality of the drug datasets processed in this paper, we propose a four-layer Deep Learning model to deal with these datasets. The proposed Deep Learning model is shown in Fig. 5.
In the four-layer Deep Learning model constructed in this paper, $x_k$ represents the data of each input node and $S_j$ represents the data of each output node. $u$ is the weight between the input layer and the first hidden layer, $v$ is the weight between the first hidden layer and the second hidden layer, and $w$ is the weight between the second hidden layer and the output layer.
The numbers of nodes in the input layer and the output layer of the Deep Learning network are fixed by the dimensionality of the input data and of the output, respectively. The number of nodes in each hidden layer is calculated using the following empirical equation:

$$h = \sqrt{m + n} + \alpha \qquad (1)$$

where $h$ is the number of hidden-layer nodes, $m$ is the number of input-layer nodes, $n$ is the number of output-layer nodes, and $\alpha$ is an adjustment constant, generally between 1 and 10.
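As a minimal sketch of this heuristic in Python (the function name hidden_layer_size and the choice $\alpha = 5$ are our own illustrative assumptions, not part of the original method):

```python
import math

def hidden_layer_size(m: int, n: int, alpha: int = 5) -> int:
    """Empirical hidden-layer size h = sqrt(m + n) + alpha,
    with alpha an adjustment constant between 1 and 10 (equation (1))."""
    if not 1 <= alpha <= 10:
        raise ValueError("alpha should lie between 1 and 10")
    return round(math.sqrt(m + n)) + alpha

# Example (assumed dimensions): 200 input features, 10 output nodes
print(hidden_layer_size(200, 10))  # -> 19
```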
Let $w_{ij}$ denote the weight between node $i$ and node $j$, $\theta_j$ the threshold of node $j$, and $y_j$ the output value of node $j$. The output value of each node in the current layer is determined by the output values of all nodes in the previous layer; the weighted inputs and the threshold of each node are combined and passed through an activation function. The equations are as follows:

$$net_j = \sum_i w_{ij}\, y_i - \theta_j \qquad (2)$$

$$y_j = f(net_j) \qquad (3)$$
where $f$ is the activation function, represented here by the sigmoid function:

$$f(x) = \frac{1}{1 + e^{-x}} \qquad (4)$$
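The forward pass of equations (2)-(4) can be sketched in a few lines of NumPy. The function names sigmoid and forward, the weight names u, v, w (matching the notation above), and the toy dimensions are our illustrative assumptions:

```python
import numpy as np

def sigmoid(x):
    """Activation function f(x) = 1 / (1 + exp(-x)), equation (4)."""
    return 1.0 / (1.0 + np.exp(-x))

def forward(x, u, v, w, theta1, theta2, theta3):
    """Forward pass through the four-layer network of Fig. 5.
    u, v, w are the weight matrices between consecutive layers;
    theta* are the node thresholds, subtracted as in equation (2)."""
    y1 = sigmoid(x @ u - theta1)   # input layer -> first hidden layer
    y2 = sigmoid(y1 @ v - theta2)  # first hidden -> second hidden layer
    s = sigmoid(y2 @ w - theta3)   # second hidden -> output layer
    return y1, y2, s

# Toy setup (assumed): 4 inputs, two hidden layers of 3 nodes, 2 outputs
rng = np.random.default_rng(0)
x = rng.random(4)
u = rng.standard_normal((4, 3))
v = rng.standard_normal((3, 3))
w = rng.standard_normal((3, 2))
theta1, theta2, theta3 = np.zeros(3), np.zeros(3), np.zeros(2)
_, _, s = forward(x, u, v, w, theta1, theta2, theta3)
print(s)
```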
The computation proceeds from top to bottom and then from left to right, and this order must be followed strictly to complete the entire forward process.
After finishing the forward process, we construct the reverse transfer process. The key step in the reverse transfer process is the adjustment of the weights and thresholds between each pair of adjacent layers. The specific adjustment steps are as follows (a consolidated code sketch is given after Step 5):
Step 1. Assume that the target outputs of the output layer are $T_j$ and the actual outputs are $S_j$; the error function is as follows:

$$E = \frac{1}{2}\sum_j (T_j - S_j)^2 \qquad (5)$$
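A minimal sketch of equation (5); the function name error and the toy target/output values are assumptions for illustration:

```python
import numpy as np

def error(target, output):
    """Squared-error function E = (1/2) * sum_j (T_j - S_j)^2, equation (5)."""
    return 0.5 * np.sum((target - output) ** 2)

# Toy values (assumed): targets T_j and network outputs S_j
print(error(np.array([1.0, 0.0]), np.array([0.8, 0.3])))  # 0.5*(0.04+0.09) = 0.065
```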
Step 2. According to the gradient descent method, the weights and thresholds are modified iteratively in order to minimize the error function; the correction of the weight vector is taken along the negative gradient of $E$ at the current position. For the output node $j$:

$$\Delta w_{ij} = -\eta\, \frac{\partial E}{\partial w_{ij}} \qquad (6)$$

where $\eta$ is the learning rate.
Step 3. To calculate the corrections to the weights and thresholds between the second hidden layer and the output layer, we first differentiate the activation function of equation (4), obtaining $f'(x) = f(x)(1 - f(x))$; then, applying the chain rule through equations (7) and (8), we obtain the error term $\delta_j$ of output node $j$; finally, $\Delta w_{ij}$ and $\Delta \theta_j$ are calculated by equations (9) and (10):

$$\frac{\partial E}{\partial w_{ij}} = \frac{\partial E}{\partial S_j}\,\frac{\partial S_j}{\partial net_j}\,\frac{\partial net_j}{\partial w_{ij}} \qquad (7)$$

$$\delta_j = (T_j - S_j)\, S_j\, (1 - S_j) \qquad (8)$$

$$\Delta w_{ij} = \eta\, \delta_j\, y_i \qquad (9)$$

$$\Delta \theta_j = -\eta\, \delta_j \qquad (10)$$
Step 4. Calculate the error terms for the two hidden layers. In equations (11) and (12), we suppose that $v_{mn}$ is the weight between node $m$ of the first hidden layer and node $n$ of the second hidden layer, and $u_{km}$ is the weight between node $k$ of the input layer and node $m$ of the first hidden layer:

$$\Delta v_{mn} = -\eta\, \frac{\partial E}{\partial v_{mn}} \qquad (11)$$

$$\Delta u_{km} = -\eta\, \frac{\partial E}{\partial u_{km}} \qquad (12)$$

The error terms $\delta_n$ and $\delta_m$ of the hidden nodes are calculated by equations (13) and (14):

$$\delta_n = y_n\, (1 - y_n) \sum_j \delta_j\, w_{nj} \qquad (13)$$

$$\delta_m = y_m\, (1 - y_m) \sum_n \delta_n\, v_{mn} \qquad (14)$$
Step 5. According to the gradient descent method and the formulas above, equations (15) and (16) adjust the weights and thresholds between the second hidden layer and the output layer, equations (17) and (18) adjust those between the two hidden layers, and equations (19) and (20) adjust those between the input layer and the first hidden layer:

$$w_{ij}(t+1) = w_{ij}(t) + \eta\, \delta_j\, y_i \qquad (15)$$

$$\theta_j(t+1) = \theta_j(t) - \eta\, \delta_j \qquad (16)$$

$$v_{mn}(t+1) = v_{mn}(t) + \eta\, \delta_n\, y_m \qquad (17)$$

$$\theta_n(t+1) = \theta_n(t) - \eta\, \delta_n \qquad (18)$$

$$u_{km}(t+1) = u_{km}(t) + \eta\, \delta_m\, x_k \qquad (19)$$

$$\theta_m(t+1) = \theta_m(t) - \eta\, \delta_m \qquad (20)$$
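The following sketch consolidates Steps 1-5 into a single reverse-transfer update, following the reconstructed equations (8)-(20). The function name backward and the learning rate $\eta = 0.1$ are illustrative assumptions; y1, y2, s are the layer outputs from the forward sketch given after equation (4):

```python
import numpy as np

def backward(x, y1, y2, s, t, u, v, w, theta1, theta2, theta3, eta=0.1):
    """One reverse-transfer (back-propagation) step, Steps 1-5.
    Error terms use the sigmoid derivative f'(x) = f(x)(1 - f(x));
    thresholds move opposite to the weights because they are
    subtracted in the forward pass."""
    delta3 = (t - s) * s * (1 - s)            # output-layer error term, eq. (8)
    delta2 = y2 * (1 - y2) * (delta3 @ w.T)   # second hidden layer, eq. (13)
    delta1 = y1 * (1 - y1) * (delta2 @ v.T)   # first hidden layer, eq. (14)

    w += eta * np.outer(y2, delta3)           # eq. (15)
    theta3 -= eta * delta3                    # eq. (16)
    v += eta * np.outer(y1, delta2)           # eq. (17)
    theta2 -= eta * delta2                    # eq. (18)
    u += eta * np.outer(x, delta1)            # eq. (19)
    theta1 -= eta * delta1                    # eq. (20)
    return u, v, w, theta1, theta2, theta3
```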
This completes the reverse transfer process of the Deep Learning method proposed in this paper. To complete the learning process of the entire network, the weights and thresholds must be adjusted continuously; an error threshold or a maximal number of cycles can be set as the stopping criterion to terminate the learning process.
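Putting the pieces together, a minimal training loop with the two stopping criteria mentioned above might look as follows; it reuses the forward, error, and backward functions from the sketches above, and the data, target, and criterion values are assumed toy choices:

```python
import numpy as np

# Toy data and target (assumed), matching the dimensions used earlier
rng = np.random.default_rng(0)
x, t = rng.random(4), np.array([1.0, 0.0])
u = rng.standard_normal((4, 3))
v = rng.standard_normal((3, 3))
w = rng.standard_normal((3, 2))
theta1, theta2, theta3 = np.zeros(3), np.zeros(3), np.zeros(2)

max_cycles, error_threshold = 10_000, 1e-4   # stop criteria (assumed values)
for cycle in range(max_cycles):
    y1, y2, s = forward(x, u, v, w, theta1, theta2, theta3)
    if error(t, s) < error_threshold:
        break                                # error threshold reached
    u, v, w, theta1, theta2, theta3 = backward(
        x, y1, y2, s, t, u, v, w, theta1, theta2, theta3)
```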