The Gene Ontology project provides controlled vocabularies of defined terms representing gene product properties. These cover three domains: Cellular Component, the parts of a cell or its extracellular environment; Molecular Function, the elemental activities of a gene product at the molecular level, such as binding or catalysis; and Biological Process, operations or sets of molecular events with a defined beginning and end, pertinent to the functioning of integrated living units: cells, tissues, organs, and organisms.
The GO ontology is structured as a directed acyclic graph where each term has defined relationships to one or more other terms in the same domain, and sometimes to other domains. The GO vocabulary is designed to be species-agnostic, and includes terms applicable to prokaryotes and eukaryotes, and single and multicellular organisms.
In an example of GO annotation, the gene product "cytochrome c" can be described by the Molecular Function term "oxidoreductase activity", the Biological Process terms "oxidative phosphorylation" and "induction of cell death", and the Cellular Component terms "mitochondrial matrix" and "mitochondrial inner membrane".
These terms describe a component of a cell that is part of a larger object, such as an anatomical structure (e.g. rough endoplasmic reticulum or nucleus) or a gene product group (e.g. ribosome, proteasome or a protein dimer).
A biological process term describes a series of events accomplished by one or more organized assemblies of molecular functions. Examples of broad biological process terms are "cellular physiological process" or "signal transduction". Examples of more specific terms are "pyrimidine metabolic process" or "alpha-glucoside transport". The general rule to assist in distinguishing between a biological process and a molecular function is that a process must have more than one distinct steps.
A biological process is not equivalent to a pathway. At present, the GO does not try to represent the dynamics or dependencies that would be required to fully describe a pathway.
Molecular function terms describes activities that occur at the molecular level, such as "catalytic activity" or "binding activity". GO molecular function terms represent activities rather than the entities (molecules or complexes) that perform the actions, and do not specify where, when, or in what context the action takes place. Molecular functions generally correspond to activities that can be performed by individual gene products, but some activities are performed by assembled complexes of gene products. Examples of broad functional terms are "catalytic activity" and "transporter activity"; examples of narrower functional terms are "adenylate cyclase activity" or "Toll receptor binding".
It is easy to confuse a gene product name with its molecular function; for that reason GO molecular functions are often appended with the word "activity".
Details about the ontologies
- Ontology Structure: information about the structure of GO terms and the ontology.
- Ontology Relations: documentation on the inter-term relations used in GO
- Cellular Component Ontology: Rules governing content and stylistic aspects of GO terms in the cellular component ontology.
- Molecular Function Ontology: Rules governing content and stylistic aspects of GO terms, standard definitions and term relationships in the molecular function ontology.
- Biological Process Ontology: Rules governing content and stylistic aspects of GO terms, standard definitions and term relationships in the biological process ontology.
- Species-Specific Terms: How the Gene Ontology deals with words or phrases where the meaning varies depending on the organism.
- Documentation on specific areas of the ontology:
- GO Slim Guide: information about GO slims, cut-down versions of the ontologies useful for providing an overview of GO
- OBO 1.4 File Format Guide: the ontology file format used and recommended by the GO Consortium