Elements of an annotation

From GO Wiki

This page describes the different annotation fields.

Annotation Subject

  • Annotation subjects consist of valid database identifiers, such as WB:WBGene00003721, SGD:S000001048, or UniProtKB:P99999.
  • Annotation subjects may be genes or gene products (e.g. proteins, including specific isoforms; ncRNAs; and protein complexes).
  • The list of valid database prefixes can be found on the GO website.
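
The identifiers above all follow a prefix:local-ID pattern. As a minimal sketch of how such a subject could be split and checked, the snippet below uses hypothetical helper names (`split_identifier`, `has_known_prefix`) and a deliberately tiny prefix set taken from the examples on this page; the authoritative list of prefixes is the one on the GO website.

```python
from typing import Tuple

# Illustrative subset only (assumption): the real list of valid
# database prefixes is maintained on the GO website.
KNOWN_PREFIXES = {"WB", "SGD", "UniProtKB"}


def split_identifier(identifier: str) -> Tuple[str, str]:
    """Split 'PREFIX:LOCAL_ID' at the first colon."""
    prefix, sep, local_id = identifier.partition(":")
    if not sep or not prefix or not local_id:
        raise ValueError(f"not a prefixed identifier: {identifier!r}")
    return prefix, local_id


def has_known_prefix(identifier: str) -> bool:
    """Return True if the identifier's prefix is in the illustrative set."""
    prefix, _ = split_identifier(identifier)
    return prefix in KNOWN_PREFIXES


if __name__ == "__main__":
    for subject in ("WB:WBGene00003721", "SGD:S000001048", "UniProtKB:P99999"):
        print(subject, split_identifier(subject), has_known_prefix(subject))
```

Splitting at the first colon only keeps any later colons inside the local identifier intact.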


  • Annotation Subjects and GO terms are linked by a Relation from the Relations Ontology.
  • The relations applicable to each aspect of GO as well as usage guidelines can be found in the Annotation_Relations page.


  • The NOT statement indicates that the gene product does not enable a Molecular Function, is not part of a Biological Process or is not located in a specific Cellular Component.
  • NOT statements are only used when a user might expect that the gene product would have a specific biological property (MF, BP or CC).
  • Contrary to positive annotations, NOT statements propagate down the ontology, such that the annotation "gene product NOT enables protein kinase activity" means that the gene product does not enable protein serine/threonine kinase activity or protein tyrosine kinase activity either.
  • Both positive and NOT statements can be used in cases where there are conflicting experimental findings in the literature.
  • If an isoform has a different function from the main isoform represented by the gene-centric entity, a NOT annotation can be captured together with the isoform identifier.
  • NOT annotations can be supported by experiments that show the lack of an activity (or of involvement in a BP, or of localization to a CC), or by sequence data showing that the gene product has lost essential residues and is therefore unlikely to be able to carry out a function, participate in a process, or be found in a certain location. In the latter case, the evidence code is IKR (Inferred from Key Residues).
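
The direction of propagation described above can be illustrated with a toy hierarchy. This is only a sketch under stated assumptions: the `PARENTS` map is a hand-written fragment rather than the real ontology, "kinase activity" is included as an assumed parent purely for illustration, and the function names are hypothetical.

```python
# Toy child -> parents map (assumption: a hand-written fragment,
# not loaded from the actual ontology).
PARENTS = {
    "protein serine/threonine kinase activity": ["protein kinase activity"],
    "protein tyrosine kinase activity": ["protein kinase activity"],
    "protein kinase activity": ["kinase activity"],
    "kinase activity": [],
}


def ancestors(term):
    """All terms reachable by following parent links upward."""
    seen = set()
    stack = list(PARENTS.get(term, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(PARENTS.get(current, []))
    return seen


def descendants(term):
    """All terms that have `term` among their ancestors."""
    return {t for t in PARENTS if term in ancestors(t)}


def implied_terms(term, negated):
    """Positive annotations imply ancestors; NOT annotations imply descendants."""
    return descendants(term) if negated else ancestors(term)


if __name__ == "__main__":
    print(sorted(implied_terms("protein kinase activity", negated=True)))
    print(sorted(implied_terms("protein kinase activity", negated=False)))
```

With `negated=True` the query on protein kinase activity returns its two more specific child terms; with `negated=False` it returns only the more general parent.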

When NOT to use negation

  • The NOT qualifier should not be used to capture every experimental result.
    • For example, in a subcellular localization experiment, locations where the gene product is not found should not be captured unless this is explicitly needed. If a protein is found in the nucleus but not in the mitochondrion, a NOT located in mitochondrion annotation would usually be inappropriate.
  • The NOT qualifier should not be used to annotate negative or inconclusive experimental results.
    • For example, if a mutant still develops a specific anatomical structure, this does not imply that the gene product does not contribute to the process; the experiment may simply not allow a positive conclusion.
  • The NOT qualifier should not be used to describe experimental conditions or specific contexts in which the gene product is not active (i.e., it should not be used in combination with an extension).


  • Example of an inappropriate NOT annotation: MNN4 (CGD:CAL0000174110) NOT "biological process involved in interspecies interaction between organisms", from PMID:15271989, based on the result that loss of cell wall mannosylphosphate in Candida albicans does not influence macrophage recognition. This NOT annotation would mean that MNN4 is never involved in any biological process involved in interspecies interaction between organisms, a conclusion that the data in the paper do not support.

GO term

Annotation Extensions

  • Annotation extensions may be added to GO annotations to provide additional contextual information for the assertion.
  • Annotation extensions are structured text that uses a relation from the Relations Ontology and an appropriate biological concept or entity to modify the GO annotation, e.g. nucleus 'part of' epithelial cell.
  • Detailed documentation on curation using annotation extensions can be found here: Annotation_Extension
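
To make the "relation plus entity" structure of an extension concrete, here is a minimal sketch that models the nucleus 'part of' epithelial cell example as data. The class and field names (`Extension`, `ExtendedTerm`, `describe`) are hypothetical and use human-readable labels instead of identifiers; this is not the file syntax used in annotation files.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Extension:
    """One extension: a relation plus the concept or entity it points to."""
    relation: str  # e.g. "part of"
    target: str    # e.g. "epithelial cell"


@dataclass(frozen=True)
class ExtendedTerm:
    """A GO term together with zero or more contextual extensions."""
    go_term: str
    extensions: Tuple[Extension, ...] = ()

    def describe(self) -> str:
        """Render the term and its extensions as readable text."""
        parts = [self.go_term]
        parts += [f"'{e.relation}' {e.target}" for e in self.extensions]
        return " ".join(parts)


if __name__ == "__main__":
    term = ExtendedTerm("nucleus", (Extension("part of", "epithelial cell"),))
    print(term.describe())
```

Keeping the extension separate from the GO term mirrors the idea that the extension adds context to the assertion rather than replacing the term.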




Every annotation is marked with the name of the group that made the annotation. This group may be different from the database that manages the identifiers and/or the annotation file.


The date the annotation was made or last edited, in YYYYMMDD format.
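
As a small illustration of the YYYYMMDD format, the sketch below parses and re-serializes such a date with Python's standard `datetime` module. The function names are hypothetical and the sample date is arbitrary.

```python
from datetime import date, datetime


def parse_annotation_date(text: str) -> date:
    """Parse an 8-digit YYYYMMDD string into a date object."""
    if len(text) != 8 or not (text.isascii() and text.isdigit()):
        raise ValueError(f"expected YYYYMMDD, got {text!r}")
    # strptime also rejects impossible dates such as month 13.
    return datetime.strptime(text, "%Y%m%d").date()


def format_annotation_date(value: date) -> str:
    """Serialize a date object back to YYYYMMDD."""
    return value.strftime("%Y%m%d")


if __name__ == "__main__":
    parsed = parse_annotation_date("20240131")  # arbitrary sample value
    print(parsed.isoformat(), format_annotation_date(parsed))
```

The explicit length and digit check is there because `strptime` alone can accept shorter inputs such as single-digit months.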

Review Status

Last reviewed: 2022-04-19