A Simple Protocol for Informative Visualization of Enriched Gene Ontology Terms

Titouan Bonnot; Morgane B. Gillard; Dawn H. Nagel

doi:10.21769/BioProtoc.3429

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Peer-reviewed

A Simple Protocol for Informative Visualization of Enriched Gene Ontology Terms

Titouan Bonnot email

MG Morgane B. Gillard

DN Dawn H. Nagel email

Published: Nov 20, 2019 DOI: 10.21769/BioProtoc.3429 Views: 18414

Edited by: Marisa Rosa Reviewed by: Yuko Kurita

PDF

Ask a question

How to cite

Favorite

Cited by

Abstract

In genome-scale datasets, Gene Ontology (GO) enrichment is a common analysis to highlight functions over-represented or under-represented in a subset of differentially expressed genes to elucidate the biological significance of the results. However, despite the diversity of existing tools to analyze GO enrichment, it is often difficult to integrate results in an article figure with sufficient clarity. This is partly due to the high number and to the redundancy of the enriched GO terms, especially when looking at large sets of differentially expressed genes. Here, we provide a simple method to plot representative enriched GO terms. The list of representative enriched GO terms is obtained using existing tools Panther and REVIGO and results are represented in different plots generated from a homemade R script and the ggplot2 R package. The generated plots are publication-quality figures. The diversity of represented parameters makes the plots highly informative (number of genes associated with the enriched GO terms, fold enrichment and level of statistical significance). Comparison of GO enrichment between different lists of genes in a single plot is possible. As proof of concept, we performed this analysis on an Arabidopsis heat responsive transcriptome dataset recently published.

Keywords: Omics data

Transcriptomics

Gene Ontology Enrichment

Biological Processes

R plot

ggplot2

Materials and Reagents

User determined list(s) of differentially expressed genes (Similar to the one provided in Supplementary Data 1). In our example, we used the lists of up-regulated and down-regulated genes in response to heat in two different genotypes [wild type (WT) and a clock double mutant (cca1-1/lhy-20) recently published in Blair et al. (2019).
Data file (Similar to the one provided in Supplementary Data 2)
R-script file (Similar to the one provided in Supplementary Data 3)
Data file (Similar to the one provided in Supplementary Data 4)

Equipment

Computer
Computer that can run one of the following operating system:
Microsoft^® Windows^® XP (or later)
Mac^® OS X^® 10.4 (or later)
Ubuntu 14.04 LTS (or later)

Software

R (https://r-project.org/)
RStudio (https://rstudio.com/products/rstudio/), RStudio is an optional user interface for R

Procedure

Category

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

16 Q&A

Hello,Thank you so much for sharing the protocol and the script....

1 Answer 12 Views Feb 17, 2022

Hello, Thank you for the script and the protocol. I tried the...

1 Answer 17 Views Nov 17, 2021

Hi,I ran the following script, to make Figure 2 but I got an...

1 Answer 18 Views Oct 29, 2021

Thank you very much, the truth was that this protocol helped...

1 Answer 10 Views Oct 6, 2021

Thanks for providing such information and I am going to use...

1 Answer 7 Views Sep 16, 2021

Hi,I ran the following script with Suppl. data 2.txt file, to...

1 Answer 13 Views Sep 14, 2021

Hello,Thanks for the protocol. Quite simple and clear. However...

1 Answer 12 Views Aug 18, 2021

Hello, I am trying to generate graphs similar to figures 3 and...

1 Answer 9 Views Aug 10, 2021

Another question is that how did you retrieve corresponding...

1 Answer 10 Views May 18, 2021

Hello, thanks for your protocol. Could you tell me how did you...

1 Answer 10 Views May 18, 2021

Hi, I am not able to run the script from line1 to line 54. I...

1 Answer 11 Views May 12, 2021

Dear Titouan Bonnot,Is there a problem to download the supplemental...

1 Answer 11 Views Mar 1, 2021

Dear authors,Thank you for nice article and tutorial. I been...

1 Answer 14 Views Dec 28, 2020

HiSupplementary Data S3 is corrupted. Could you please correct...

5 Answers 11 Views Jan 18, 2020

Seems that the initial GO analysis is not correcting for sampling bias (tissue/technology) by using the study specific background?

2 Answers 68 Views Nov 19, 2022

Can you provide a word document for the R-script? ?

2 Answers 83 Views Oct 29, 2022