the Gene Ontology

Search
  • Open menus
  • Home
  • FAQ
  • Downloads
    • Ontologies
    • Annotations
    • Database
    • Mappings to GO
    • Teaching Resources
    • Other files
    • FTP and CVS downloads
  • Tools
    • Browsers
    • Microarray tools
    • Annotation tools
    • Other tools
    • Submit New Tools
  • Documentation
    • Introduction
    • Ontology...
      • Ontology structure
      • Ontology relations
      • Cellular Component
      • Molecular Function
      • Biological Process
      • GO Slim Guide
      • OBO v1.2 format
    • Annotation...
      • Annotation Guide
      • Evidence Codes
      • Conventions
      • SOPs
      • File Format
    • Database...
      • GO Database Guide
      • Database schema
    • File Formats...
      • File Format Guide
      • Annotation
      • OBO v1.2
      • OBO v1.0
      • GO RDF-XML
    • Meeting minutes
  • About GO
    • GO Consortium
    • Publications
    • Citation Policy
    • Mailing lists
    • Interest Groups
    • GO People
    • Funding
    • Acknowledgements
    • Newsletter
  • Projects
    • Reference Genomes
    • Cardiovascular
    • Renal
  • Contact GO
    • News
    • RSS
    • twitter

Current Annotations

  • Annotation Details and Downloads
  • Filtered files
  • Unfiltered files
  • gp2protein files

Annotation Details and Downloads

The gene association files submitted by GO Consortium members are shown in the tables below. Files are in the GO annotation file format and are compressed using the UNIX gzip utility. Please see the appropriate README file for further details on the annotation set. Any errors or omissions in annotations should be reported by writing to the GO helpdesk.

Ontology and annotation data is integrated in the mySQL and XML files. See the GO database guide for more information.

These files can also be downloaded via FTP; we recommend this method for the larger files, such as the UniProt dataset, as the web-based download may not work correctly.

Filtered Files

These files are taxon-specific and reflect the work of specific projects, primarily the model organisms database groups, to provide comprehensive, non-redundant annotation files for their organism. All the files in this table have been filtered using the annotation file QC checks script. A major component to the filtering is the requirement that particular taxon IDs can only be included within the association files provided by specific projects; please see the list of the authoritative groups for the major model organisms.

Statistics as of June 30, 2009

Filtered Annotation File Downloads
Species, Database Gene Products Annotated Annotations Submission date MM/DD/YYYY Download filtered files
Anaplasma phagocytophilum HZ
TIGR
1289 3474
(3474 non-IEA)
3/14/2009
  • annotations [39.7 kb]
  • README
Agrobacterium tumefaciensstr. C58
PAMGO
83 250
(250 non-IEA)
9/12/2008
  • annotations [3.4 kb]
  • README
Arabidopsis thaliana
TAIR/TIGR
52844 150860
(110166 non-IEA)
6/30/2009
  • annotations [3.0 mb]
  • README
Aspergillus nidulans
AspGD
3289 15492
(5576 non-IEA)
6/28/2009
  • annotations [162.4 kb]
  • README
Bacillus anthracis Ames
TIGR
5280 13115
(13115 non-IEA)
3/14/2009
  • annotations [152.9 kb]
  • README
Bos taurus
GO Annotations @ EBI
23776 99215
(3655 non-IEA)
5/30/2009
  • annotations [1.1 mb]
  • README
Carboxydothermus hydrogenoformans Z-2901
TIGR
2611 6397
(6397 non-IEA)
3/14/2009
  • annotations [82.0 kb]
  • README
Caenorhabditis elegans
WormBase
18251 97320
(47742 non-IEA)
6/3/2009
  • annotations [907.0 kb]
  • README
Campylobacter jejuni RM1221
TIGR
1829 4649
(4649 non-IEA)
3/14/2009
  • annotations [61.2 kb]
  • README
Candida albicans
CGD
1568 6712
(6712 non-IEA)
6/28/2009
  • annotations [144.8 kb]
  • README
Clostridium perfringens ATCC13124
TIGR
2892 7457
(7457 non-IEA)
3/14/2009
  • annotations [94.7 kb]
  • README
Colwellia psychrerythraea 34H
TIGR
4752 12117
(12117 non-IEA)
3/14/2009
  • annotations [144.6 kb]
  • README
Coxiella burnetii RSA 493
TIGR
2033 5172
(5172 non-IEA)
3/14/2009
  • annotations [59.6 kb]
  • README
Danio rerio
ZFIN
15007 100187
(22580 non-IEA)
6/30/2009
  • annotations [1.6 mb]
  • README
Dehalococcoides ethenogenes 195
TIGR
1584 3952
(3952 non-IEA)
3/14/2009
  • annotations [48.6 kb]
  • README
Dictyostelium discoideum
dictyBase
7403 30941
(20158 non-IEA)
6/28/2009
  • annotations [412.5 kb]
  • README
Drosophila melanogaster
FlyBase
12507 71311
(55819 non-IEA)
6/5/2009
  • annotations [1.1 mb]
  • README
Escherichia coli
EcoCyc & EcoliHub
3721 42477
(6079 non-IEA)
6/8/2009
  • annotations [480.2 kb]
  • README
Ehrlichia chaffeensis Arkansas
TIGR
1091 2861
(2861 non-IEA)
3/14/2009
  • annotations [35.7 kb]
  • README
Gallus gallus
GO Annotations @ EBI
16359 64173
(1942 non-IEA)
5/30/2009
  • annotations [708.4 kb]
  • README
Geobacter sulfurreducens PCA
TIGR
3410 8852
(8852 non-IEA)
3/14/2009
  • annotations [103.2 kb]
  • README
Homo sapiens
GO Annotations @ EBI
20059 160498
(64568 non-IEA)
5/30/2009
  • annotations [6.0 mb]
  • README
Hyphomonas neptunium ATCC 15444
TIGR
3108 7820
(7820 non-IEA)
3/14/2009
  • annotations [108.9 kb]
  • README
Leishmania major
Sanger GeneDB
3573 11440
(28 non-IEA)
5/20/2009
  • annotations [150.8 kb]
  • README
Listeria monocytogenes 4b F2365
TIGR
2819 7022
(7022 non-IEA)
3/14/2009
  • annotations [88.4 kb]
  • README
Magnaporthe grisea
PAMGO
11533 29269
(29269 non-IEA)
5/16/2009
  • annotations [359.6 kb]
  • README
Methylococcus capsulatus Bath
TIGR
2920 7037
(7037 non-IEA)
3/14/2009
  • annotations [94.0 kb]
  • README
Mus musculus
MGI
18256 162799
(62728 non-IEA)
6/26/2009
  • annotations [1.9 mb]
  • README
Neorickettsia sennetsu Miyayama
TIGR
929 2434
(2434 non-IEA)
3/14/2009
  • annotations [31.0 kb]
  • README
Oomycetes
PAMGO
30 126
(126 non-IEA)
2/13/2008
  • annotations [2.3 kb]
  • README
Oryza sativa
Gramene
41581 51110
(50427 non-IEA)
4/18/2009
  • annotations [874.4 kb]
  • README
Protein Data Bank [multispecies]
GO Annotations @ EBI
20793 118225
(0 non-IEA)
5/28/2009
  • annotations [669.4 kb]
  • README
Plasmodium falciparum
Sanger GeneDB
2207 4653
(4653 non-IEA)
3/14/2009
  • annotations [79.1 kb]
  • README
Pseudomonas aeruginosa PAO1
PseudoCAP
1519 7343
(7343 non-IEA)
3/14/2009
  • annotations [132.6 kb]
  • README
Pseudomonas fluorescens Pf-5
TIGR
3691 9707
(9707 non-IEA)
3/14/2009
  • annotations [105.2 kb]
  • README
Pseudomonas syringae DC3000
TIGR
4006 10259
(10259 non-IEA)
4/18/2009
  • annotations [118.2 kb]
  • README
Pseudomonas syringae pv. phaseolicola 1448A
TIGR
3506 9031
(9031 non-IEA)
3/14/2009
  • annotations [116.4 kb]
  • README
Rattus norvegicus
RGD
20285 184418
(101616 non-IEA)
6/27/2009
  • annotations [3.7 mb]
  • README
Reactome [multispecies]
CSHL & EBI
261 6499
(6499 non-IEA)
6/24/2009
  • annotations [35.3 kb]
  • README
Saccharomyces cerevisiae
SGD
6353 87950
(45545 non-IEA)
6/27/2009
  • annotations [1.2 mb]
  • README
Schizosaccharomyces pombe
Sanger GeneDB
5267 34128
(30005 non-IEA)
6/18/2009
  • annotations [597.6 kb]
  • README
Shewanella oneidensis MR-1
TIGR
4843 13598
(13598 non-IEA)
3/14/2009
  • annotations [143.3 kb]
  • README
Silicibacter pomeroyi DSS-3
TIGR
4252 10860
(10860 non-IEA)
3/14/2009
  • annotations [139.3 kb]
  • README
Solanaceae
SGN
38 68
(68 non-IEA)
4/26/2008
  • annotations [2.4 kb]
  • README
Trypanosoma brucei
Sanger GeneDB
2977 10516
(10516 non-IEA)
6/27/2009
  • annotations [176.4 kb]
  • README
UniProt [multispecies]
GO Annotations @ EBI
4644675 37255593
(23200 non-IEA)
6/23/2009
  • annotations [306.0 mb]
  • README
UniProt [multispecies] IEA annotations removed
GO Annotations @ EBI
5460 23200
(23200 non-IEA)
6/24/2009
  • annotations [333.6 kb]
  • README
Vibrio cholerae
TIGR
3858 9428
(9428 non-IEA)
3/14/2009
  • annotations [98.9 kb]
  • README
Species, Database Gene Products Annotated Annotations Submission date MM/DD/YYYY Download filtered files

Unfiltered Files

These files have not been filtered with the annotation file QC checks script. The most important difference between these files and the filtered files above is that gene products from certain taxa are not stripped out of the file; they may also contain annotations to obsolete terms or outdated IEA annotations. Please see the annotation file QC script documentation for full details of the checks performed.

Please note that if you use unfiltered files in conjunction with filtered files, there may be duplicated annotations.

Statistics as of June 30, 2009

Unfiltered Annotation File Downloads
Species, Database Gene Products Annotated Annotations Submission date MM/DD/YYYY Download unfiltered files
Arabidopsis thaliana
GO Annotations @ EBI
22375 107856
(29760 non-IEA)
5/28/2009
  • annotations [1.5 mb]
  • README
Mus musculus
GO Annotations @ EBI
35166 210699
(81269 non-IEA)
5/28/2009
  • annotations [2.8 mb]
  • README
Rattus norvegicus
GO Annotations @ EBI
27868 139330
(26314 non-IEA)
5/28/2009
  • annotations [1.6 mb]
  • README
Danio rerio
GO Annotations @ EBI
30749 107191
(5879 non-IEA)
5/28/2009
  • annotations [1.2 mb]
  • README
Protein Data Bank [multispecies]
GO Annotations @ EBI
33323 190650
(0 non-IEA)
5/28/2009
  • annotations [1000.3 kb]
  • README
Reactome [multispecies]
CSHL & EBI
3746 22480
(22480 non-IEA)
6/24/2009
  • annotations [193.9 kb]
  • README
UniProt [multispecies]
GO Annotations @ EBI
5308250 42170586
(569306 non-IEA)
5/29/2009
  • annotations [336.2 mb]
  • README
Species, Database Gene Products Annotated Annotations Submission date MM/DD/YYYY Download unfiltered files

In the tables above gene association counts are provided for all evidence codes and separately for everything except IEA, Inferred from Electronic Annotation. The IEA code means there has been no human involvement in the assignment of the association; see the GO evidence code documentation for more details.

Back to top

gp2protein files

The gp2protein directory contains files that map between model organism database object IDs and UniProt accessions.

Back to top


Open Biomedical Ontologies logo Last modified Wednesday, 10-Jun-2009 15:40:21 PDT
Help • Cite • Terms of use • Site Map
Copyright © 1999-Friday, 03-Jul-2009 20:04:58 PDT the Gene Ontology