Submit GO annotations

From GO Wiki
Jump to navigation Jump to search

What is needed to submit GO annotations to the GOC

GO annotations are disseminated in a 17 column tab de-limited GAF 2.0 format. But the following bits of information are sufficient to complete/make a GAF file. You can choose to construct a GAF file using the documentation or simply contact the GOC at go-helpdesk@mailman.stanford.edu for step-by-step assistance.
Minimum information needed to make a GAF file:

  • Stable IDs for the gene products or objects that are being annotated. If the gene product IDs are not in UniProt or NCBI, please submit the IDs to one of these databases
  • GO ID that each gene product can be associated with
  • Evidence that allows you to make the association
  • Reference (published paper or reference describing the methodology used to make the geneproductToGOterm association)
  • taxon_id for the gene products for which the associations are made

Checklist for GAF

Following details have to be considered before taking annotations from a new group.

  • If a group contacts GOC offering GO annotations, you want to ask them if they are willing to serve as the species owner (do they have resources, support to maintain the annotations for the species etc). For example MGI is the species owner for mouse. If they are able to serve as the species owner, then they have to provide us the following along with the GO annotations.
Example xrefs stanza
abbreviation: SGD
database: Saccharomyces Genome Database
object: Identifier for SGD Loci
synonym: SGDID
example_id: SGD:S000006169
local_id_syntax: ^S[0-9]{9}$
generic_url: http://www.yeastgenome.org/
url_syntax: http://db.yeastgenome.org/cgi-bin/locus.pl?dbid=[example_id]
url_example: http://db.yeastgenome.org/cgi-bin/locus.pl?dbid=S000006169
Example Config file
project_name=Saccharomyces Genome Database (SGD)
contact_email=sgd-helpdesk@lists.stanford.edu
project_url=http://www.yeastgenome.org/
funding_source=NHGRI of US National Institutes of Health, HG001315
email_report=sgd-go-curator@genome.stanford.edu
  • If the contributing group is the sole species owner, then the taxon filter will be added to the filtering script (implying they will provide all the annotations for that taxonID) and they should be able to absorb/integrate all external annotations into their GAF.
  • If the contributing group is unable to be the owner of the species, then request them to submit their annotations to GOA group. Ask them what they want the assigned_by column should say.
  • Also if the group is submitting IEAs, it is important to consider how their IEAs compare with the IEAs made by GOA for the same proteins (how much value are they adding to the annotation set for that taxon ID?)