the Gene Ontology

Current Annotations

  • Annotation Details and Downloads
  • Filtered files
  • Unfiltered files
  • gp2protein files

Annotation Details and Downloads

The gene association files submitted by GO Consortium members are shown in the tables below. Files are in the GO annotation file format and are compressed using the UNIX gzip utility. Please see the appropriate README file for further details on the annotation set. Any errors or omissions in annotations should be reported by writing to the GO helpdesk.
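As a rough illustration of working with a gzip-compressed annotation file, the Python sketch below reads one line by line. It assumes, beyond what this page states, that the file is tab-delimited text and that lines beginning with "!" are comments; the function names and sample content are hypothetical and made up for the demonstration.

```python
import gzip
import io


def iter_annotations(source):
    """Yield each annotation row of a gzip-compressed file as a list of fields.

    `source` may be a file path or a binary file-like object.
    """
    with gzip.open(source, mode="rt", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            line = line.rstrip("\r\n")
            # Assumption: blank lines and lines starting with "!" carry no annotation.
            if not line or line.startswith("!"):
                continue
            yield line.split("\t")


def make_sample_gz():
    """Build a tiny in-memory gzip file with made-up content for demonstration."""
    text = (
        "!hypothetical header comment\n"
        "DB1\tID0001\tgeneA\t\tGO:0000001\tREF:1\tIEA\n"
        "DB1\tID0002\tgeneB\t\tGO:0000002\tREF:2\tIDA\n"
    )
    return io.BytesIO(gzip.compress(text.encode("utf-8")))


rows = list(iter_annotations(make_sample_gz()))
print(len(rows), "annotation rows read")
```

Passing a path to a downloaded file in place of the in-memory sample should work the same way, since `gzip.open` accepts either.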

Ontology and annotation data are integrated in the MySQL and XML files. See the GO database guide for more information.

Filtered Files

These files are taxon-specific and reflect the work of specific projects, primarily the model organism database groups, to provide comprehensive, non-redundant annotation files for their organisms. All the files in this table have been filtered using the annotation file QC checks script. A major component of the filtering is the requirement that particular taxon IDs can only be included in the association files provided by specific projects; please see the list of the authoritative groups for the major model organisms.
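The taxon rule described above could be sketched roughly as follows. This is not the QC checks script itself. It assumes the taxon sits in the 13th tab-delimited column in the form "taxon:NNNN"; the `AUTHORITY` table, the function names, and the sample rows are hypothetical and do not reproduce the real list of authoritative groups.

```python
TAXON_COL = 12  # assumed: 13th column holds the taxon, e.g. "taxon:10090"

# Hypothetical authority table: taxon ID -> the only group allowed to submit it.
AUTHORITY = {"taxon:10090": "MGI", "taxon:4932": "SGD"}


def filter_by_taxon_authority(rows, submitting_group, authority=AUTHORITY):
    """Keep rows whose taxon is unclaimed or claimed by the submitting group."""
    kept = []
    for fields in rows:
        # Assumption: if several taxa are listed, the first is the annotated organism.
        taxon = fields[TAXON_COL].split("|")[0]
        owner = authority.get(taxon)
        if owner is None or owner == submitting_group:
            kept.append(fields)
    return kept


def make_row(object_id, taxon):
    """Build a made-up 15-column row with only the fields this sketch needs."""
    fields = [""] * 15
    fields[1] = object_id
    fields[TAXON_COL] = taxon
    return fields


submitted = [
    make_row("A1", "taxon:10090"),
    make_row("A2", "taxon:4932"),
    make_row("A3", "taxon:7955"),
]
kept = filter_by_taxon_authority(submitted, "MGI")
print([fields[1] for fields in kept])
```

In this sketch, a file submitted by the group labeled "MGI" keeps its own taxon and any taxon nobody claims, while rows for a taxon claimed by another group are dropped.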

Numbers as of May 8, 2008

Each entry provides a gzip-compressed annotations file (size shown) and a README.

Species | Database | Gene products annotated | Annotations | Non-IEA annotations | Submission date (MM/DD/YYYY) | Annotations file size
Anaplasma phagocytophilum HZ | TIGR | 1292 | 3495 | 3495 | 3/22/2008 | 38.3 kb
Agrobacterium tumefaciens str. C58 | PAMGO | 31 | 50 | 50 | 2/2/2008 | 1.8 kb
Arabidopsis thaliana | TAIR/TIGR | 35596 | 108366 | 85808 | 5/8/2008 | 2.5 mb
Bacillus anthracis Ames | TIGR | 5287 | 13160 | 13160 | 3/22/2008 | 146.6 kb
Bos taurus | GO Annotations @ EBI | 22843 | 87981 | 3274 | 5/7/2008 | 1.1 mb
Carboxydothermus hydrogenoformans Z-2901 | TIGR | 2615 | 6421 | 6421 | 3/22/2008 | 78.5 kb
Caenorhabditis elegans | WormBase | 13772 | 81238 | 36263 | 4/19/2008 | 740.4 kb
Campylobacter jejuni RM1221 | TIGR | 1833 | 4678 | 4678 | 3/22/2008 | 59.0 kb
Candida albicans | CGD | 3728 | 16614 | 5423 | 5/8/2008 | 265.4 kb
Clostridium perfringens ATCC13124 | TIGR | 2895 | 7496 | 7496 | 3/22/2008 | 90.6 kb
Colwellia psychrerythraea 34H | TIGR | 4810 | 12391 | 12194 | 3/22/2008 | 141.1 kb
Coxiella burnetii RSA 493 | TIGR | 2036 | 5191 | 5191 | 3/22/2008 | 57.2 kb
Danio rerio | ZFIN | 14307 | 87846 | 21284 | 5/5/2008 | 1.3 mb
Dehalococcoides ethenogenes 195 | TIGR | 1584 | 3973 | 3973 | 3/22/2008 | 46.7 kb
Dictyostelium discoideum | dictyBase | 6961 | 29022 | 17682 | 5/4/2008 | 351.8 kb
Drosophila melanogaster | FlyBase | 12408 | 69572 | 53546 | 5/2/2008 | 1.0 mb
Ehrlichia chaffeensis Arkansas | TIGR | 1094 | 2881 | 2881 | 3/22/2008 | 34.5 kb
Gallus gallus | GO Annotations @ EBI | 16581 | 61169 | 1933 | 5/7/2008 | 774.4 kb
Geobacter sulfurreducens PCA | TIGR | 3417 | 8886 | 8886 | 3/22/2008 | 99.2 kb
Homo sapiens | GO Annotations @ EBI | 35551 | 183797 | 58071 | 5/7/2008 | 2.6 mb
Hyphomonas neptunium ATCC 15444 | TIGR | 3116 | 7913 | 7864 | 3/22/2008 | 104.6 kb
Leishmania major | Sanger GeneDB | 3616 | 11255 | 30 | 3/31/2008 | 157.1 kb
Listeria monocytogenes 4b F2365 | TIGR | 2823 | 7048 | 7048 | 3/22/2008 | 84.5 kb
Magnaporthe grisea | PAMGO | 12876 | 51711 | 29275 | 4/19/2008 | 588.1 kb
Methylococcus capsulatus Bath | TIGR | 2924 | 7065 | 7065 | 3/22/2008 | 90.4 kb
Mus musculus | MGI | 18099 | 154253 | 65138 | 5/2/2008 | 1.7 mb
Neorickettsia sennetsu Miyayama | TIGR | 930 | 2454 | 2454 | 3/22/2008 | 29.8 kb
Oomycetes | PAMGO | 30 | 126 | 126 | 2/13/2008 | 2.3 kb
Oryza sativa | Gramene | 52082 | 64119 | 64119 | 5/3/2008 | 688.5 kb
Protein Data Bank [multispecies] | GO Annotations @ EBI | 29571 | 154252 | 0 | 5/7/2008 | 801.4 kb
Plasmodium falciparum | Sanger GeneDB | 3243 | 11646 | 4671 | 3/31/2008 | 161.6 kb
Pseudomonas aeruginosa PAO1 | PseudoCAP | 1519 | 7381 | 7381 | 3/22/2008 | 129.5 kb
Pseudomonas fluorescens Pf-5 | TIGR | 4164 | 10744 | 9730 | 3/22/2008 | 120.0 kb
Pseudomonas syringae DC3000 | TIGR | 3902 | 9650 | 9650 | 3/22/2008 | 104.0 kb
Pseudomonas syringae pv. phaseolicola 1448A | TIGR | 3511 | 9065 | 9065 | 3/22/2008 | 111.7 kb
Rattus norvegicus | RGD | 25991 | 197009 | 74445 | 5/3/2008 | 3.1 mb
Saccharomyces cerevisiae | SGD | 6348 | 77236 | 38184 | 5/3/2008 | 1.1 mb
Schizosaccharomyces pombe | Sanger GeneDB | 5235 | 34674 | 28664 | 5/3/2008 | 585.4 kb
Shewanella oneidensis MR-1 | TIGR | 4850 | 13662 | 13662 | 3/22/2008 | 138.0 kb
Silicibacter pomeroyi DSS-3 | TIGR | 4257 | 10899 | 10899 | 3/22/2008 | 133.7 kb
Solanaceae | SGN | 38 | 68 | 68 | 4/26/2008 | 2.4 kb
Trypanosoma brucei | Sanger GeneDB | 3898 | 18858 | 10572 | 5/3/2008 | 292.5 kb
Trypanosoma brucei chr 2 | TIGR | 292 | 896 | 896 | 2/16/2008 | 9.6 kb
UniProt [multispecies] | GO Annotations @ EBI | 3486636 | 25417941 | 22199 | 4/12/2008 | 201.8 mb
Vibrio cholerae | TIGR | 3863 | 9449 | 9449 | 3/22/2008 | 96.3 kb

Unfiltered Files

These files have not been filtered with the annotation file QC checks script. The most important difference between these files and the filtered files above is that gene products from certain taxa are not stripped out of the file; the unfiltered files may also contain annotations to obsolete terms or outdated IEA annotations. Please see the annotation file QC script documentation for full details of the checks performed.

Please note that if you use unfiltered files in conjunction with filtered files, there may be duplicated annotations.
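One way to guard against such duplicates when combining files is to keep only the first occurrence of each annotation. The sketch below is illustrative only: the choice of columns that identify an annotation (`KEY_COLS`) is an assumption, and the function name and sample rows are hypothetical.

```python
# Assumed identity of an annotation: database, object ID, qualifier,
# GO ID, reference, evidence code, and with/from (0-based column indexes).
KEY_COLS = (0, 1, 3, 4, 5, 6, 7)


def merge_without_duplicates(*row_sets):
    """Concatenate several row collections, keeping the first copy of each annotation."""
    seen = set()
    merged = []
    for rows in row_sets:
        for fields in rows:
            key = tuple(fields[i] for i in KEY_COLS)
            if key not in seen:
                seen.add(key)
                merged.append(fields)
    return merged


# Made-up rows: the first unfiltered row repeats the filtered one.
filtered_rows = [
    ["DB1", "ID1", "geneA", "", "GO:0000001", "REF:1", "IDA", ""],
]
unfiltered_rows = [
    ["DB1", "ID1", "geneA", "", "GO:0000001", "REF:1", "IDA", ""],
    ["DB1", "ID1", "geneA", "", "GO:0000002", "REF:2", "IEA", ""],
]

merged = merge_without_duplicates(filtered_rows, unfiltered_rows)
print(len(merged), "unique annotations")
```

An exact-match key like this would not catch the same gene product recorded under different identifiers in different files; handling that case would need an identifier mapping step first.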

Numbers as of May 8, 2008

Each entry provides a gzip-compressed annotations file (size shown) and a README.

Species | Database | Gene products annotated | Annotations | Non-IEA annotations | Submission date (MM/DD/YYYY) | Annotations file size
Arabidopsis thaliana | GO Annotations @ EBI | 21453 | 84754 | 7277 | 5/7/2008 | 1.2 mb
Mus musculus | GO Annotations @ EBI | 33710 | 189108 | 65858 | 5/7/2008 | 2.7 mb
Rattus norvegicus | GO Annotations @ EBI | 28241 | 119606 | 13397 | 5/7/2008 | 1.5 mb
Danio rerio | GO Annotations @ EBI | 31274 | 112883 | 4571 | 5/7/2008 | 1.4 mb
Protein Data Bank [multispecies] | GO Annotations @ EBI | 45700 | 239167 | 0 | 5/7/2008 | 1.2 mb
TIGR Gene Index [multispecies] | TIGR | 281994 | 788343 | 0 | 10/2/2005 | 12.5 mb
UniProt [multispecies] | GO Annotations @ EBI | 3825597 | 28216336 | 467894 | 4/11/2008 | 228.7 mb

In the tables above, annotation counts are given for all evidence codes and, separately, for all evidence codes except IEA (Inferred from Electronic Annotation). The IEA code means that no human was involved in assigning the association; see the GO evidence code documentation for more details.
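For readers who want to compute comparable figures from a file, the following sketch tallies gene products, total annotations, and non-IEA annotations. It assumes the database and object ID occupy the first two tab-delimited columns and the evidence code the seventh; whether the published counts are derived exactly this way is not stated here, and the function name and sample rows are hypothetical.

```python
EVIDENCE_COL = 6  # assumed: 7th column holds the evidence code


def summarize(rows):
    """Count distinct gene products, all annotations, and non-IEA annotations."""
    products = set()
    total = 0
    non_iea = 0
    for fields in rows:
        total += 1
        # Assumption: a gene product is identified by (database, object ID).
        products.add((fields[0], fields[1]))
        if fields[EVIDENCE_COL] != "IEA":
            non_iea += 1
    return {"gene_products": len(products), "annotations": total, "non_iea": non_iea}


# Made-up rows: two gene products, three annotations, one of them IEA.
sample_rows = [
    ["DB1", "ID1", "geneA", "", "GO:0000001", "REF:1", "IDA"],
    ["DB1", "ID1", "geneA", "", "GO:0000002", "REF:2", "IEA"],
    ["DB1", "ID2", "geneB", "", "GO:0000001", "REF:3", "TAS"],
]

print(summarize(sample_rows))
```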

gp2protein files

The gp2protein directory contains files that map between model organism database object IDs and UniProt accessions.

Last modified Wednesday, 22-Aug-2007 08:10:09 PDT
Copyright © 1999-2008 the Gene Ontology