Each block contained an FFN that consisted of two linear transformations with a Gaussian Error Linear Unit (GELU) activation function in between:

\[ \mathrm{FFN}(x) = \mathrm{GELU}(x W_1 + b_1)\, W_2 + b_2, \]

where $W_1 \in \mathbb{R}^{d_{\mathrm{model}} \times d_{\mathrm{ffn}}}$ and $W_2 \in \mathbb{R}^{d_{\mathrm{ffn}} \times d_{\mathrm{model}}}$ were the weight matrices of the linear transformations. Here, $d_{\mathrm{ffn}}$ was the FFN dimension, $d_{\mathrm{model}}$ was the model's hidden dimension, and $b_1$ and $b_2$ were the bias vectors of the linear transformations. The GELU function was defined as:

\[ \mathrm{GELU}(x) = x\,\Phi(x), \]

where $\Phi(x)$ represented the cumulative distribution function of the standard Gaussian distribution. Additionally, two dropout layers were applied in this network: the first after the GELU activation function and the second after the final linear transformation.
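As a concrete illustration, the sketch below implements an FFN with this structure in PyTorch. The framework, layer names, and the dropout probability of 0.1 are assumptions for illustration only and are not specified above; only the ordering (linear, GELU, dropout, linear, dropout) follows the description.

```python
import torch
import torch.nn as nn


class FeedForward(nn.Module):
    """Position-wise FFN: Linear -> GELU -> Dropout -> Linear -> Dropout.

    d_model and d_ffn correspond to the hidden and FFN dimensions in the
    text; the dropout probability p is illustrative, not taken from the text.
    """

    def __init__(self, d_model: int, d_ffn: int, p: float = 0.1):
        super().__init__()
        self.linear1 = nn.Linear(d_model, d_ffn)   # x W_1 + b_1
        self.linear2 = nn.Linear(d_ffn, d_model)   # (.) W_2 + b_2
        self.gelu = nn.GELU()                      # GELU(x) = x * Phi(x)
        self.dropout1 = nn.Dropout(p)              # first dropout, after the activation
        self.dropout2 = nn.Dropout(p)              # second dropout, after the final linear

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.dropout1(self.gelu(self.linear1(x)))
        return self.dropout2(self.linear2(x))


# Usage example with hypothetical sizes: batch of 2 sequences of length 16,
# hidden dimension 512, FFN dimension 2048.
ffn = FeedForward(d_model=512, d_ffn=2048)
out = ffn(torch.randn(2, 16, 512))   # output shape: (2, 16, 512)
```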