the Gene Ontology

  • Open menus
  • Home
  • FAQ
  • Downloads
  • Ontologies
  • Annotations
  • Database
  • Mappings to GO
  • Teaching Resources
  • Other files
  • FTP and CVS downloads
  • Tools
  • Browsers
  • Microarray tools
  • Annotation tools
  • Other tools
  • Submit New Tools
  • Documentation
  • Introduction
  • Annotation Guide
  • Evidence Code Guide
  • Component Ontology
  • Function Ontology
  • Process Ontology
  • File Format Guide
  • GO Database Guide
  • GO Slim Guide
  • Meeting minutes
  • Editorial Style Guide
  • About GO
  • GO Consortium
  • Publications
  • Citation Policy
  • Mailing lists
  • Interest Groups
  • GO People
  • Funding
  • Acknowledgements
  • Newsletter
  • Projects
  • Cardiovascular
  • Immunology
  • Reference Genomes
  • Contact GO
  • Site Map

Cellular Component Ontology Guidelines

The cellular component ontology describes locations, at the levels of subcellular structures and macromolecular complexes. Examples of cellular components include nuclear inner membrane, with the synonym inner envelope, and the ubiquitin ligase complex, with several subtypes of these complexes represented.

Generally, a gene product is located in or is a subcomponent of a particular cellular component. The cellular component ontology includes multi-subunit enzymes and other protein complexes, but not individual proteins or nucleic acids. Cellular component also does not include multicellular anatomical terms.

  • The Cell
  • Protein Complexes
  • Membranes and Envelopes
  • Terms and Structure
  • Standard Definitions
  • Standard Synonyms
  • Membrane Proteins
  • Maintaining complete is_a and part_of trees in cellular component

The Cell

The cell is defined in GO as all components within and including the plasma membrane and any external encapsulating structures, such as the cell wall and the cell envelope. intracellular ; GO:0005622 is defined as the contents of the cell excluding the plasma membrane and any structures outside the plasma membrane. For this reason, cell projection ; GO:0042995 is a direct child of cell ; GO:0005623; cell projection membrane ; GO:0031253 is part_of cell projection ; GO:0042995 and plasma membrane ; GO:0005886.

Back to top

Protein Complexes

A cellular component should include more than one gene product; complexes of one gene product with a cofactor, e.g. heme and chlorophyll, should not be included. Homomultimeric proteins, e.g. the homodimeric alcohol dehydrogenase, may be included as cellular component terms, as should heteromultimeric proteins, e.g. hemoglobin with alpha and beta chains.

All complexes in the component ontology should be given parentage under the general term protein complex ; GO:0043234.

To distinguish cellular components from functions, use 'complex' in the term name of a component, and append enzyme names with the word 'activity'. For example, the molecular function term pyruvate dehydrogenase activity ; GO:0004738 describes the enzyme activity whereas the cellular component term pyruvate dehydrogenase complex ; GO:0045254 describes the multi-subunit structure in which the enzyme activity resides.

Back to top

Membranes and Envelopes

Terms and structure

GO distinguishes single and double membranes surrounding organelles: an organelle envelope is defined as two lipid bilayers plus the space, or lumen, between them, whereas an organelle membrane is defined as a single bilayer. For double-membrane organelles, the membrane term refers to either of the lipid bilayers, but excludes the intermembrane space. The envelope is part_of the organelle and is_a organelle envelope ; GO:0031967; the membrane is part_of the envelope, and inner membrane and outer membrane terms can be included:

chloroplast
[p] chloroplast envelope
---[p] chloroplast membrane
------[i] chloroplast inner membrane
------[i] chloroplast outer membrane
...
organelle envelope
[i] chloroplast envelope

History

Prior to December 2005, GO:0005635 was named nuclear membrane, with nuclear envelope as a synonym; this reflected a usage fairly common in the literature. For consistency with other organelle envelope and membrane terms, GO:0005635 is now named nuclear envelope, consistent with its definition, and a separate term, nuclear membrane ; GO:0031965, has been added.

Standard Definitions

organelle envelope
The double lipid bilayer enclosing the organelle and separating its contents from the rest of the cytoplasm; includes the intermembrane space.
organelle membrane, organelle with a single membrane
The lipid bilayer surrounding a(n) organelle.
organelle membrane, organelle with a double membrane
Either of the lipid bilayers that enclose the organelle and form the organelle envelope.
organelle inner membrane
The inner, i.e. lumen-facing, lipid bilayer of the organelle envelope.
organelle outer membrane
The outer, i.e. cytoplasm-facing, lipid bilayer of the organelle envelope.
organelle membrane lumen
The region between the inner and outer lipid bilayers of the organelle envelope.

Standard synonyms

The following synonym can be added to terms as long as the synonym string makes sense and does not have alternative meanings. Note that the term name and synonym can be switched depending on typical usage.

organelle membrane lumen
exact_synonym: organelle intermembrane space

Back to top

Membrane Proteins

As GO cellular component terms describe locations where a gene product may act, rather than physical features of proteins or RNAs, the terms integral membrane protein and peripheral membrane protein are present only as non-exact synonyms.

GO distinguishes classes of membrane-related location:

extrinsic to membrane ; GO:0019898 refers to gene products that are associated with membranes, but are neither directly embedded in the membrane nor anchored by covalent bonds to any moiety embedded in the membrane.

intrinsic to membrane ; GO:0031224 refers to gene products that have some covalently attached moiety embedded in the membrane, and is further split into integral to membrane ; GO:0016021 and anchored to membrane ; GO:0031225. The former refers to proteins in which some part of the peptide sequence spans all or part of the membrane (in theory, it could also be used for RNAs embedded in a membrane, if any such exist); the latter refers to gene products tethered to a membrane by a covalently attached anchor, such as a lipid moiety, which is embedded in the membrane.

Each of these terms can have child terms referring to specific membranes, for example integral to plasma membrane ; GO:0031226 or extrinsic to vacuolar membrane ; GO:0000306.

Note that even these terms actually describe spatial relationships between a membrane and a gene product, and therefore do not fit the strictest interpretation of a 'location'. They are retained nevertheless because of their considerable utility.

The cellular component ontology does not include terms for type I, II, etc., membrane proteins, because these classifications are not locations, but instead describe a different feature of the proteins, namely topological orientation with respect to the membrane and other cellular components. Furthermore, the wording type I integral membrane protein describes a class of gene products.

The DAG structure is as follows:

membrane
[p] intrinsic to membrane
---[i] anchored to membrane
[p] extrinsic to membrane

Diagram of different types of membrane proteins

Back to top

Maintaining complete is_a and part_of trees in cellular component

The cellular component ontology is now is_a complete, meaning that every term has a path to the root node which passes solely through is_a relationships. This should be preserved; the following guidelines should help maintain this structure.

  • Make sure the term has an is_a path to the root, i.e. there are is_a parent terms by at least one path all the way to cellular component.
  • Make sure the term has at least one part_of relation in its ancestry, to ensure that there are no part_of orphans. It does not need to be an immediate part_of parent, but every term has to be part of something. So, for complex Y, this would be okay for example:

    cell
    [p] complex X
    ---[i] complex Y

    because complex Y is transitively part_of cell.
  • Ensure that all logical is_a parents are added. So, for example, if your term is a protein complex, make sure it has the parent protein complex. Or if your term, or one of its parents, is part_of cell, it will need to be is_a cell part, or have cell part in its ancestry.
  • Check none of the relations you create are redundant. You can check for this in OBO-Edit by using the reasoner, and then using the link filter [self] [self] [is redundant].

Back to top


Open Biomedical Ontologies logo

Last modified Monday, 08-Jan-2007 06:10:07 PST
Cite GO • Terms of use • GO helpdesk
Copyright © 1999-Friday, 09-May-2008 18:18:38 PDT the Gene Ontology