Cellular Component Ontology Guidelines

The cellular component ontology describes locations, at the levels of subcellular structures and macromolecular complexes. Examples of cellular components include 'nuclear inner membrane', with the synonym 'inner envelope', and the 'ubiquitin ligase complex', with several subtypes of these complexes represented.

Generally, a gene product is located in or is a subcomponent of a particular cellular component. The cellular component ontology includes multi-subunit enzymes and other protein complexes, but not individual proteins or nucleic acids. It also does not include multicellular anatomical terms.

Membranes And Envelopes

Terms and structure

GO distinguishes single and double membranes surrounding organelles: an organelle envelope ; GO:0031967 is defined as two lipid bilayers plus the space, or lumen, between them, whereas an organelle membrane ; GO:0031090 is defined as a single bilayer. For double-membrane organelles, the membrane term refers to either of the lipid bilayers, but excludes the intermembrane space. The envelope is part of the organelle and is an organelle envelope ; GO:0031967; the membrane is part of the envelope, and inner membrane and outer membrane terms can be included:
  • chloroplast
    • [p] chloroplast envelope
      • [p] chloroplast membrane
        • [i] chloroplast inner membrane
        • [i] chloroplast outer membrane


  • organelle envelope
    • [i] chloroplast envelope
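
The chloroplast structure above can be written down as a small edge list. The following Python sketch is only an illustration and is not part of any GO tooling: the names EDGES, parents and is_part_of are hypothetical, and the rule that a chain of is_a and part_of links containing at least one part_of implies part_of is an assumption of this sketch.

```python
# Hypothetical encoding of the chloroplast example as (child, relation, parent).
EDGES = [
    ("chloroplast envelope", "part_of", "chloroplast"),
    ("chloroplast envelope", "is_a", "organelle envelope"),
    ("chloroplast membrane", "part_of", "chloroplast envelope"),
    ("chloroplast inner membrane", "is_a", "chloroplast membrane"),
    ("chloroplast outer membrane", "is_a", "chloroplast membrane"),
]


def parents(term, relation):
    """Return the direct parents of term for one relation type, sorted."""
    return sorted(p for c, r, p in EDGES if c == term and r == relation)


def is_part_of(term, whole):
    """True if term reaches whole through a chain with at least one part_of link."""
    stack = [(term, False)]  # (current node, has a part_of link been crossed?)
    visited = set()
    while stack:
        node, crossed = stack.pop()
        if (node, crossed) in visited:
            continue
        visited.add((node, crossed))
        if node == whole and crossed:
            return True
        for child, rel, parent in EDGES:
            if child == node:
                stack.append((parent, crossed or rel == "part_of"))
    return False


print(parents("chloroplast envelope", "is_a"))
print(is_part_of("chloroplast inner membrane", "chloroplast"))
```

In this toy graph the inner membrane is part of the chloroplast only indirectly, through the membrane and envelope terms, which is the arrangement the lists above describe.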


Prior to December 2005, nuclear envelope ; GO:0005635 was named 'nuclear membrane', with 'nuclear envelope' as a synonym; this reflected a usage fairly common in the literature. For consistency with other organelle envelope and membrane terms, GO:0005635 is now named 'nuclear envelope', consistent with its definition, and a separate term, nuclear membrane ; GO:0031965, has been added.

Standard Definitions

organelle envelope
The double lipid bilayer enclosing the organelle and separating its contents from the rest of the cytoplasm; includes the intermembrane space.
organelle membrane, organelle with a single membrane
The lipid bilayer surrounding a(n) organelle.
organelle membrane, organelle with a double membrane
Either of the lipid bilayers that enclose the organelle and form the organelle envelope.
organelle inner membrane
The inner, i.e. lumen-facing, lipid bilayer of the organelle envelope.
organelle outer membrane
The outer, i.e. cytoplasm-facing, lipid bilayer of the organelle envelope.
organelle membrane lumen
The region between the inner and outer lipid bilayers of the organelle envelope.
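
Because these definitions follow a fixed pattern, they can be generated mechanically by substituting an organelle name. The Python sketch below is illustrative only: the TEMPLATES table, its keys and the standard_term function are hypothetical names, the article choice for "a(n)" is a simple vowel heuristic, and adjectival forms such as 'nuclear' or 'mitochondrial' are not handled.

```python
# Hypothetical templates: kind -> (term name pattern, definition pattern).
TEMPLATES = {
    "envelope": (
        "{name} envelope",
        "The double lipid bilayer enclosing the {name} and separating its "
        "contents from the rest of the cytoplasm; includes the intermembrane space.",
    ),
    "membrane_single": (
        "{name} membrane",
        "The lipid bilayer surrounding {article} {name}.",
    ),
    "membrane_double": (
        "{name} membrane",
        "Either of the lipid bilayers that enclose the {name} and form the {name} envelope.",
    ),
    "inner_membrane": (
        "{name} inner membrane",
        "The inner, i.e. lumen-facing, lipid bilayer of the {name} envelope.",
    ),
    "outer_membrane": (
        "{name} outer membrane",
        "The outer, i.e. cytoplasm-facing, lipid bilayer of the {name} envelope.",
    ),
    "membrane_lumen": (
        "{name} membrane lumen",
        "The region between the inner and outer lipid bilayers of the {name} envelope.",
    ),
}


def standard_term(organelle, kind):
    """Return (term name, definition) for an organelle and a template kind."""
    # Vowel heuristic for "a(n)"; an assumption, not a GO rule.
    article = "an" if organelle[0].lower() in "aeiou" else "a"
    name_pattern, def_pattern = TEMPLATES[kind]
    return (
        name_pattern.format(name=organelle),
        def_pattern.format(name=organelle, article=article),
    )


for kind in TEMPLATES:
    print(standard_term("chloroplast", kind))
```

Generated text should still be read by a curator before use, since the substitution does not check that the resulting wording makes biological sense.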

Standard synonyms

The following synonym can be added to terms as long as the synonym string makes sense and does not have alternative meanings. Note that the term name and synonym can be switched depending on typical usage.
organelle membrane lumen
exact_synonym: organelle intermembrane space

Protein Complexes

Definition of a Protein Complex

A cellular component should include more than one gene product; complexes of one gene product with a cofactor, e.g. heme or chlorophyll, should not be included. Homomultimeric proteins, e.g. the homodimeric alcohol dehydrogenase, may be included as cellular component terms, as should heteromultimeric proteins, e.g. hemoglobin with alpha and beta chains. All complexes in the component ontology should be given parentage under the general term protein complex ; GO:0043234. To distinguish cellular components from functions, use 'complex' in the term name of a component, and append the word 'activity' to enzyme names in function terms. For example, the molecular function term pyruvate dehydrogenase activity ; GO:0004738 describes the enzyme activity, whereas the cellular component term pyruvate dehydrogenase complex ; GO:0045254 describes the multi-subunit structure in which the enzyme activity resides.
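
The naming convention can be expressed as a simple check on the term name. This Python sketch is a hypothetical helper, not an existing GO tool: the function name expected_ontology and the returned labels are assumptions, and it looks only at the final word of the name.

```python
def expected_ontology(term_name):
    """Guess the ontology a term name belongs to from its final word.

    Returns 'cellular_component' for names ending in 'complex',
    'molecular_function' for names ending in 'activity', and None when
    the naming convention gives no hint.
    """
    name = term_name.strip().lower()
    if name.endswith(" complex"):
        return "cellular_component"
    if name.endswith(" activity"):
        return "molecular_function"
    return None


print(expected_ontology("pyruvate dehydrogenase activity"))
print(expected_ontology("pyruvate dehydrogenase complex"))
```

A check like this flags only the name; whether a complex term also has parentage under protein complex ; GO:0043234 has to be verified against the ontology graph.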

Receptor-ligand complexes

As a rule, GO terms that represent the association of a receptor with its ligand should not be created, because such complexes may not always be stable and the number of possible terms could explode. However, we should allow for exceptions. The IntAct database would not curate a receptor-ligand complex that consists of a single chain of each, but it will curate complexes in which the ligand is not monomeric and the receptors oligomerize upon ligand binding. An example of this is GO:1990270 'platelet-derived growth factor receptor-ligand complex', where the ligands are always dimeric and the receptor dimerizes upon ligand binding. (Also, in the case of GO:1990270, the complex has been shown to exist in a variety of experiments, including crystals, pull downs, comigrations, competition assays and more.)

Integration with SAO (Subcellular Anatomy Ontology)

The primary use of the GO Cellular Component Ontology is for GO annotation, but it has also been used for phenotype annotation, and for the annotation of images. Another ontology with similar scope is the Subcellular Anatomy Ontology (SAO), part of the Neuroscience Information Framework Standard (NIFSTD) suite of ontologies. The SAO also covers cell components, but in the domain of neuroscience.

Recently, the GO Cellular Component Ontology was enriched in content and links to the Biological Process and Molecular Function branches of GO as well as to other ontologies. This was achieved in several ways, one of which was amalgamation of SAO terms with GO Cellular Component ones. As a result, nearly 100 new neuroscience-related terms were added to the GO.

A recent paper describes this effort, along with other recent developments in the GO Cellular Component Ontology.

Maintaining complete 'is_a' and 'part_of' trees in cellular component

The cellular component ontology is is_a complete, meaning that every term has a path to the root node which passes solely through is_a relationships. This should be preserved; the following guidelines should help maintain this structure.
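
The is_a completeness property can be checked with a simple graph walk. The Python sketch below works on a toy dictionary rather than a real ontology file; the names IS_A, has_is_a_path_to_root and terms_missing_is_a_path are hypothetical, as are the toy terms, and the root is assumed to be named 'cellular component'.

```python
ROOT = "cellular component"

# Toy data: each term maps to its direct is_a parents only.
IS_A = {
    "complex Y": ["complex X"],
    "complex X": ["protein complex"],
    "protein complex": [ROOT],
    "orphan complex": [],  # deliberately lacks an is_a parent
}


def has_is_a_path_to_root(term, is_a, root=ROOT):
    """True if term reaches root by following is_a links only."""
    stack = [term]
    seen = set()
    while stack:
        node = stack.pop()
        if node == root:
            return True
        if node in seen:
            continue
        seen.add(node)
        stack.extend(is_a.get(node, ()))
    return False


def terms_missing_is_a_path(is_a, root=ROOT):
    """Return, sorted, the terms that break is_a completeness."""
    return sorted(t for t in is_a if not has_is_a_path_to_root(t, is_a, root))


print(terms_missing_is_a_path(IS_A))
```

Because part_of links are left out of the dictionary entirely, a term that reaches the root only through part_of is reported as missing an is_a path, which is the condition the guideline is meant to catch.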

Make sure the term has an is_a path to the root, i.e. there are is_a parent terms along at least one path all the way to 'cellular component'.

Make sure the term has at least one part_of relation in its ancestry, to ensure that there are no part_of orphans. The part_of parent does not need to be an immediate parent, but every term has to be part_of something. For example, this would be acceptable for complex Y:

  • cell
    • [p] complex X
      • [i] complex Y
because complex Y is transitively part_of cell.

Ensure that all logical is_a parents are added. For example, if your term is a protein complex, make sure it has the parent 'protein complex'. Or if your term, or one of its parents, is part_of cell, it will need to be is_a 'cell part', or have 'cell part' in its ancestry.

Check that none of the relations you create are redundant. You can check for this in OBO-Edit by using the reasoner, and then using the link filter [self] [self] [is redundant]. As an added check, a weekly job runs every Monday night (US West Coast time) to remove any redundant relationships.
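
The part_of and redundancy checks can be illustrated on a toy graph. The Python sketch below is a simplified stand-in and does not reproduce the OBO-Edit reasoner or the weekly job: all names and the toy edges are hypothetical, the graph is assumed to be acyclic, and a link is treated as redundant when another path implies the same relation between the same two terms.

```python
# Toy edges as (child, relation, parent).
EDGES = [
    ("complex X", "part_of", "cell"),
    ("complex X", "is_a", "protein complex"),
    ("complex Y", "is_a", "complex X"),
    ("complex Y", "is_a", "protein complex"),  # implied via complex X
    ("protein complex", "is_a", "cellular component"),
    ("cell", "is_a", "cellular component"),
]


def reachable(start, edges, skip=None):
    """Return (ancestor, relation) pairs implied for start, ignoring edge skip."""
    found = set()
    stack = [(p, r) for c, r, p in edges if c == start and (c, r, p) != skip]
    while stack:
        node, rel = stack.pop()
        if (node, rel) in found:
            continue
        found.add((node, rel))
        for child, edge_rel, parent in edges:
            if child == node:
                # Assumed rule: any part_of link in the chain makes it part_of.
                combined = "part_of" if "part_of" in (rel, edge_rel) else "is_a"
                stack.append((parent, combined))
    return found


def part_of_orphans(terms, edges):
    """Return, sorted, the terms with no part_of relation in their ancestry."""
    return sorted(
        t for t in terms
        if not any(rel == "part_of" for _, rel in reachable(t, edges))
    )


def redundant_edges(edges):
    """Return the edges whose relation is already implied by another path."""
    return [e for e in edges if (e[2], e[1]) in reachable(e[0], edges, skip=e)]


print(part_of_orphans(["complex X", "complex Y", "protein complex"], EDGES))
print(redundant_edges(EDGES))
```

In this toy graph 'protein complex' is deliberately left without any part_of ancestry so that the orphan check has something to report, and the direct is_a link from complex Y to 'protein complex' is flagged because it is already implied through complex X.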