GSEA uses complicated statistics, so it requires a computer program to run the calculations. GSEA has become standard practice, and there are many websites and downloadable programs that will provide the data sets and run the analysis.
MOET Multi-Ontology Enrichment Tool (MOET) is a web-based ontology analysis tool that provides functionality for multiple ontologies, including Disease, GO, Pathway, Phenotype, and Chemical entities (ChEBI) for multiple species, including rat, mouse, human, bonobo, squirrel, dog, pig, chinchilla, naked mole-rat and vervet (green monkey). It outputs a downloadable graph and a list of statistically overrepresented terms in the user's list of genes using hypergeometric distribution. MOET also displays the corresponding
Bonferroni correction and
odds ratio on the results page. It is simple to use, and results are provided with a few clicks in seconds; no software installations or programming skills are required. In addition, MOET is updated weekly, providing the user with the most recent data for analyses.
NASQAR NASQAR (Nucleic Acid SeQuence Analysis Resource) is an open source, web-based platform for high-throughput sequencing data analysis and visualization. GSEA can be run using the R-based clusterProfiler package. NASQAR currently supports GO Term and
KEGG Pathway enrichment with all organisms supported by an Org.Db database.
PlantRegMap The
gene ontology (GO) annotation for 165 plant species and GO enrichment analysis is available.
MSigDB The Molecular Signatures Database hosts an extensive collection of annotated gene sets that can be used with most GSEA Software.
Broad Institute The
Broad Institute website is in cooperation with MSigDB and has a downloadable GSEA software, as well a general tutorial.
WebGestalt WebGestalt is a web based gene set analysis toolkit. It supports three well-established and complementary methods for enrichment analysis, including Over-Representation Analysis (ORA), Gene Set Enrichment Analysis (GSEA), and Network Topology-based Analysis (NTA). Analysis can be performed against 12 organisms and 321,251 functional categories using 354 gene identifiers from various databases and technology platforms.
Enrichr Enrichr is a gene set enrichment analysis tool for mammalian gene sets. It contains background libraries for transcription regulation, pathways and protein interactions, ontologies including GO and the human and mouse phenotype ontologies, signatures from cells treated with drugs, gene sets associated with human diseases, and expression of genes in different cells and tissues. The background libraries are from over 200 resources and contain over 450,000 annotated gene sets. The tool can be accessed through API and provides different ways to visualize the results.
GeneSCF GeneSCF is a real-time based functional enrichment tool with support for multiple organisms and is designed to overcome the problems associated with using outdated resources and databases. Advantages of using GeneSCF: real-time analysis, users do not have to depend on enrichment tools to get updated, easy for computational biologists to integrate GeneSCF with their NGS pipeline, it supports multiple organisms, enrichment analysis for multiple gene list using multiple source database in single run, retrieve or download complete GO terms/Pathways/Functions with associated genes as simple table format in a plain text file.
DAVID DAVID is the database for annotation, visualization and integrated discovery, a
bioinformatics tool that pools together information from most major bioinformatics sources, with the aim of analyzing large gene lists in a
high-throughput manner. DAVID goes beyond standard GSEA with additional functions like switching between gene and protein identifiers on the genome-wide scale, which can have a considerable impact on practical interpretation of results. However, A most recent update was performed in 2021
Metascape integrates pathway enrichment analysis, protein complex analysis, and multi-list meta-analysis into one seamless workflow accessible through a significantly simplified user interface.
Metascape maintains analysis accuracy by updating its 40 underlying knowledgebases monthly.
Metascape presents results using easy-to-interpret graphics, spreadsheets, and publication quality presentations, and is freely available.
AmiGO 2 The
Gene Ontology (GO) consortium has also developed their own online GO term enrichment tool, allowing species-specific enrichment analysis versus the complete database, coarser-grained GO slims, or custom references.
GREAT Genomic region enrichment of annotations tool (GREAT) is a software which takes advantage of
regulatory domains to better associate gene ontology terms to genes. Its primary purpose is to identify pathways and processes that are significantly associated with factor regulating activity. This method maps genes with regulatory regions through a
hypergeometric test over genes, inferring proximal gene regulatory domains. It does this by using the total fraction of the genome associated with a given ontology term as the expected fraction of input regions associated with the term by chance. Enrichment is calculated by all regulatory regions, and several experiments were performed to validate GREAT, one of which being enrichment analyses done on 8
ChIP-seq datasets. is mainly used for the functional enrichment and network analysis of
Omics data.
FuncAssociate The FuncAssociate tool enables Gene Ontology and custom enrichment analyses. It allows inputting ordered sets as well as weighted gene space files for background.
InterMine Instances of
InterMine automatically provide enrichment analysis for uploaded sets of genes and other biological entities.
ToppGene suite ToppGene is a one-stop portal for gene list enrichment analysis and candidate gene prioritization based on functional annotations and protein interactions network. Developed and maintained by the Division of Biomedical Informatics at
Cincinnati Children's Hospital Medical Center.
QuSAGE Quantitative Set Analysis for Gene Expression (QuSAGE) is a computational method for gene set enrichment analysis. QuSAGE improves power by accounting for inter-gene correlations and quantifies gene set activity with a complete
probability density function (PDF). From this PDF,
P values and
confidence intervals can be easily extracted. Preserving the PDF also allows for post-hoc analysis (e.g., pair-wise comparisons of gene set activity) while maintaining statistical traceability. The applicability of QuSAGE has been extended to
longitudinal studies by adding functionality for general linear mixed models. QuSAGE was used by the NIH/NIAID to identify baseline transcriptional signatures that were associated with human
influenza vaccination responses. QuSAGE is available as a R/
Bioconductor package.
Blast2GO Blast2GO is a bioinformatics platform for functional annotation and analysis of genomic datasets. This tool allows to perform gene set enrichment analysis, among other functions.
g:Profiler g:Profiler is a toolset for finding biological categories enriched in gene lists, conversions between gene identifiers and mappings to their orthologs. g:Profiler relies on Ensembl as a primary data source and follows their quarterly release cycle while updating the other data sources simultaneously. g:Profiler supports close to 500 species and strains, including vertebrates, plants, fungi, insects and parasites. == Applications ==