GO Tools: Functional Similarity

FastSemSim

Platform
Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
Bioinformatics Group, Department of Information Engineering, University of Padova, Padova, Italy.
Contact
Marco Mina
License
Free for academic use
GO data updates
not applicable (e.g. because users provide data)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers
Open Source (OSI) Logo

FastSemSim is a package that implements several semantic similarity measures. It is both a library and an end-user application, featuring an intuitive graphical user interface (GUI). It has been implemented with the aim of being fast, expandable, and easy to use. It allows the user to work with the most updated version of GO database and customizable annotation corpora. It provides a set of logically-organized classes that can be easily exploited to both integrate semantic similarity into different analysis pipelines and extend the library with new measures.

  • software library
  • functional similarity
  • semantic similarity
  • graphical user interface

Tool listing last updated 14 November 2011

FunSimMat

Platform
Online tool
Developer
Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrucken, Germany.
Contact
FunSimMat Team
Publications
PMID:17932054
PMID:19923227
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

FunSimMat is a comprehensive resource of semantic and functional similarity values. It allows ranking disease candidate proteins and searching for functional similarity values for proteins (extracted from UniProt), and protein families (Pfam, SMART). FunSimMat provides several different semantic and functional similarity measures for each protein pair using the Gene Ontology annotation from UniProtKB.

  • database or data warehouse
  • functional similarity
  • semantic similarity
  • disease gene candidate prioritization

Tool listing last updated 03 June 2010

g:Profiler

Platform
Online tool
Developer
BIIT Group, Institute of Computer Science, University of Tartu, Tartu, Estonia.
Contact
BIIT support
Publications
PMID:17478515
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

g:Profiler is a public web server for characterising and manipulating gene lists from high-throughput genomic data. g:Profiler has a simple user-friendly web interface with powerful visualisation. g:Profiler currently supports 85 species, including mammals, fungi, plants, insects, etc, from the Ensembl and Ensembl Genomes databases. g:Profiler consists of the following tools:

  • g:GOSt retrieves most significant Gene Ontology (GO) terms, KEGG and REACTOME pathways, and TRANSFAC motifs to a user-specified group of genes, proteins or microarray probes. g:GOSt also allows analysis of ranked or ordered lists of genes, visual browsing of GO graph structure, interactive visualisation of retrieved results, and many other features. Multiple testing corrections are applied to extract only statistically important results.
  • g:Convert allows conversion between gene or protein names, database IDs and microarray probes of more than 100 types. A mix of various IDs may be presented as input; output options include HTML, text and XLS spreadsheet.
  • g:Orth retrieves orthologs for a given set of genes, proteins or probes in a selected organism. Graphical representation also shows orthologs present in all g:Profiler organisms.
  • g:Sorter searches for similar expression profiles to a given gene, protein or probe in a large set of public microarray datasets from the Gene Expression Omnibus (GEO) database.
  • g:Cocoa provides a compact interface for comparing enrichments of multiple gene lists.
  • ontology or annotation visualization
  • statistical analysis
  • slimmer-type tool
  • term enrichment
  • protein interactions
  • functional similarity
  • other analysis
  • coexpression similarity search, Gene ID conversion, network enrichment analysis., orthology mapping, search by genomic locus

Tool listing last updated 26 January 2011

GOTaxExplorer

Platform
Online tool Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrucken, Germany.
Contact
GOTax Team
Publications
PMID:17346342
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

GOTaxExplorer presents a new approach to comparative genomics that integrates functional information and families with the taxonomic classification. It integrates UniProt, Gene Ontology, NCBI Taxonomy, Pfam and SMART in one database. GOTaxExplorer provides four different query types: selection of entity sets, comparison of sets of Pfam families, semantic comparison of sets of GO terms, functional comparison of sets of gene products.
This permits to select custom sets of GO terms, families or taxonomic groups. For example, it is possible to compare arbitrarily selected organisms or groups of organisms from the taxonomic tree on the basis of the functionality of their genes. Furthermore, it enables to determine the distribution of specific molecular functions or protein families in the taxonomy. The comparison of sets of GO terms allows to assess the semantic similarity of two different GO terms. The functional comparison of gene products makes it possible to identify functionally equivalent and functionally related gene products from two organisms on the basis of GO annotations and a semantic similarity measure for GO.

  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • functional similarity
  • semantic similarity
  • other analysis
  • comparative genomics analysis

Tool listing last updated 03 June 2010

OWLSim

Platform
Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
Berkeley Bioinformatics Open-Source Projects, Lawrence Berkeley National Lab, Berkeley, California, USA.
Contact
Chris Mungall
License
Free for academic use
GO data updates
no fixed update schedule
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers
Open Source (OSI) Logo

The OWLSim package provides the ability to do a number of standard semantic similarity methods and includes novel methods for combining these with dynamic selection of anonymous grouping classes.

  • functional similarity
  • semantic similarity

Tool listing last updated 06 February 2012

PiNGO

Platform
Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
Department of Plant Systems Biology, VIB / Ghent University, Ghent, Belgium.
Contact
Steven Maere
License
Free for academic use
GO data updates
no fixed update schedule
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers
Open Source (OSI) Logo

PiNGO is a Java-based tool to easily find unknown genes in a network that are significantly associated with user-defined target Gene Ontology (GO) categories. PiNGO is implemented as a plugin for Cytoscape, a popular open source software platform for visualizing and integrating molecular interaction networks. PiNGO predicts the categorization of a gene based on the annotations of its neighbors, using the enrichment statistics of its sister tool BiNGO. Networks can either be selected from the Cytoscape interface or uploaded from file.

  • ontology or annotation search engine
  • statistical analysis
  • term enrichment
  • functional similarity
  • functional prediction

Tool listing last updated 25 January 2011

ToppGene Suite

Platform
Online tool
Developer
Cincinnati Childrens Hospital Medical Center, Cincinnati, Ohio, USA.
Contact
Anil Jegga
Publications
PMID:19465376
License
Free for academic use
GO data updates
no fixed update schedule
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

ToppGene Suite is a one-stop portal for (i) gene list functional enrichment, (ii) candidate gene prioritization using either functional annotations or network analysis and (iii) identification and prioritization of novel disease candidate genes in the interactome. Functional annotation-based disease candidate gene prioritization uses a fuzzy-based similarity measure to compute the similarity between any two genes based on semantic annotations. The similarity scores from individual features are combined into an overall score using statistical meta-analysis. A P-value of each annotation of a test gene is derived by random sampling of the whole genome.

  • database or data warehouse
  • slimmer-type tool
  • term enrichment
  • protein interactions
  • functional similarity
  • other analysis
  • disease gene ranking based on functional annotation similarity

Tool listing last updated 21 January 2011

G-SESAME

Platform
Online tool
Developer
Clemson Bioinformatics Center, South Carolina, USA.
Publications
PMID:17344234
License
Free for academic use

G-SESAME contains a set of tools. They include: tools for measuring the semantic similarity of GO terms; tools for measuring the functional similarity of genes; and tools for clustering genes based on their GO term annotation information.

  • functional similarity
  • semantic similarity

Tool listing submitted before 2009; tool may be unsupported or inactive.