The features used by the ANN are described below (the number in parentheses represents the number of input neurons each feature occupies); a hypothetical sketch of how these features could be packed into the input vector follows the list:
Aligned/unaligned percentages flanking the novel adjacency (5)
Alignment E values flanking the novel adjacency (2)
Relative alignment bit scores flanking the novel adjacency (2)
Alignment identities flanking the novel adjacency (2)
The fraction of mismatches in alignments flanking the novel adjacency (2)
The fraction of gaps in alignments flanking the novel adjacency (2)
SV complexity: the number of coexisting SVs found at the novel adjacency (1)
Total number of alignments found on read (1)
Total number of SVs that appear to be captured by the read (1)
Number of different chromosomes the read aligns to (1)
The fraction of alignments shorter than 5% of the read length (1)
Number of breakend-supporting reads B (1)
The fraction of breakend-supporting reads B over the total read depth B + O (1)
If the SV is an insertion or deletion, the size of the inserted or deleted segment (1)
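As a point of reference, the following sketch shows one way the fourteen feature groups above could map onto the 23-dimensional input vector. The dictionary keys are hypothetical names chosen for illustration and are not taken from the original implementation; only the per-feature neuron counts come from the list above.

```python
# Hypothetical mapping of the 14 feature groups to the 23 input neurons.
FEATURE_DIMS = {
    "flank_aligned_pct": 5,     # aligned/unaligned percentages
    "flank_evalue": 2,          # alignment E values
    "flank_rel_bitscore": 2,    # relative alignment bit scores
    "flank_identity": 2,        # alignment identities
    "flank_mismatch_frac": 2,   # fraction of mismatches
    "flank_gap_frac": 2,        # fraction of gaps
    "sv_complexity": 1,         # coexisting SVs at the novel adjacency
    "n_alignments_on_read": 1,
    "n_sv_on_read": 1,
    "n_chromosomes": 1,
    "short_alignment_frac": 1,  # alignments < 5% of read length
    "n_breakend_reads": 1,      # B
    "breakend_read_frac": 1,    # B / (B + O)
    "indel_size": 1,            # insertion/deletion size, if applicable
}
assert sum(FEATURE_DIMS.values()) == 23  # matches the input layer size
```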
The value of each feature is scaled to the range [0, 1] by min-max normalization. The Python library Keras [41], with TensorFlow [42] as its backend engine, was used to build the ANN model and perform inference. The neural network is a feed-forward model consisting of a 23-neuron input layer, two hidden layers of 12 and 5 neurons, and a single-neuron output layer. The rectified linear unit (ReLU) activation function is used for the two hidden layers, while the sigmoid activation function is used for the output layer. Dropout regularization is applied after each hidden layer, with probabilities of 0.4 and 0.3, respectively. If $y_{k,i}$ denotes the value of the $i$th neuron in the $k$th layer, we have that

$$y_{k,i} = F\left(\sum_{j} w^{(k)}_{ji}\, y_{k-1,j}\right),$$

where $F(x) = \max(x, 0)$ denotes the ReLU non-linearity and $w^{(k)}_{ji}$ is the neural weight between the $j$th neuron of the $(k-1)$th layer and the $i$th neuron of the $k$th layer.
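A minimal Keras sketch of the architecture described above (23-12-5-1 feed-forward network with ReLU and sigmoid activations, and dropout of 0.4 and 0.3 after the hidden layers). The `min_max_scale` helper is an assumption about how the normalization might be written, not the authors' code.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

def min_max_scale(x):
    """Scale each feature column to [0, 1] (min-max normalization)."""
    x = np.asarray(x, dtype="float32")
    return (x - x.min(axis=0)) / (x.max(axis=0) - x.min(axis=0))

# Feed-forward network: 23 -> 12 -> 5 -> 1, as described in the text.
model = keras.Sequential([
    layers.Input(shape=(23,)),
    layers.Dense(12, activation="relu"),    # first hidden layer
    layers.Dropout(0.4),                    # dropout after first hidden layer
    layers.Dense(5, activation="relu"),     # second hidden layer
    layers.Dropout(0.3),                    # dropout after second hidden layer
    layers.Dense(1, activation="sigmoid"),  # single-neuron output layer
])
```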
Ten million in silico 3GS reads generated from a simulated genome containing 61,316 mixed-zygosity SVs were used to train a binary classifier ANN model through supervised learning. The ten million reads were distributed randomly into 20 sub-datasets before read-depth clustering to reduce the sequencing depth to 1X. The entire training dataset consists of 933,351 true and 41,186 false examples of novel adjacencies. Another simulated dataset (4X) with a different SV profile was used as the test dataset. Binary cross-entropy was used as the loss function, and stochastic gradient descent (SGD) was used as the optimizer, both with their default parameters. Classification accuracy was collected and reported as the metric for assessing model performance. Sixty-three epochs were performed for model training, with each epoch drawing 12,000 true and 12,000 false randomly selected examples and using a batch size of 400 examples per iteration.
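The per-epoch balanced resampling could be expressed as follows. This is a sketch under stated assumptions: `X_true` and `X_false` stand for the normalized feature matrices of the 933,351 true and 41,186 false examples and are placeholders introduced here for illustration; `model` is the network from the previous sketch, and the loss, optimizer, and metric are those named above.

```python
import numpy as np

rng = np.random.default_rng()
EPOCHS, PER_CLASS, BATCH_SIZE = 63, 12_000, 400

# Loss, optimizer, and metric as stated in the text (default parameters).
model.compile(loss="binary_crossentropy", optimizer="sgd",
              metrics=["accuracy"])

for epoch in range(EPOCHS):
    # Draw a balanced sample: 12,000 true and 12,000 false examples.
    t = rng.choice(len(X_true), PER_CLASS, replace=False)
    f = rng.choice(len(X_false), PER_CLASS, replace=False)
    X = np.concatenate([X_true[t], X_false[f]])
    y = np.concatenate([np.ones(PER_CLASS), np.zeros(PER_CLASS)])
    model.fit(X, y, batch_size=BATCH_SIZE, epochs=1, verbose=0)
```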