Difference between revisions of "Guide to GO Evidence Codes"

From GO Wiki
(Introduction)
Line 5: Line 5:
 
A GO annotation consists of a GO term associated with a specific reference that describes the work or analysis upon which the association between a specific GO term and gene product is based. Each annotation must also include an evidence code to indicate how the annotation to a particular term is supported. Although evidence codes do reflect the type of work or analysis described in the cited reference which supports the GO term to gene product association, they are not necessarily a classification of types of experiments/analyses. Note that these evidence codes are intended for use in conjunction with GO terms, and should not be considered in isolation from the terms. If a reference describes multiple methods that each provide evidence to make a GO annotation to a particular term, then multiple annotations with identical GO identifiers and reference identifiers but different evidence codes may be made.
 
A GO annotation consists of a GO term associated with a specific reference that describes the work or analysis upon which the association between a specific GO term and gene product is based. Each annotation must also include an evidence code to indicate how the annotation to a particular term is supported. Although evidence codes do reflect the type of work or analysis described in the cited reference which supports the GO term to gene product association, they are not necessarily a classification of types of experiments/analyses. Note that these evidence codes are intended for use in conjunction with GO terms, and should not be considered in isolation from the terms. If a reference describes multiple methods that each provide evidence to make a GO annotation to a particular term, then multiple annotations with identical GO identifiers and reference identifiers but different evidence codes may be made.
  
Out of all the evidence codes available, only Inferred from Electronic Annotation (IEA) is not assigned by a curator. Manually-assigned evidence codes fall into four general categories: experimental, computational analysis, author statements, and curatorial statements.
+
Evidence codes are '''not''' statements of the quality of the annotation. Within each evidence code classification, some methods produce annotations of higher confidence or greater specificity than other methods, in addition the way in which a technique has been applied or interpreted in a paper will also affect the quality of the resulting annotation. Thus evidence codes '''cannot''' be used as a measure of the quality of the annotation.
  
 +
Evidence codes fall into eight general categories as described below.
 +
 +
= Experimental Evidence Codes =
 
Use of an experimental evidence code in a GO annotation indicates that the cited paper displayed results from a physical characterization of a gene or gene product that has supported the association of a GO term. The '''Experimental Evidence Codes''' are:
 
Use of an experimental evidence code in a GO annotation indicates that the cited paper displayed results from a physical characterization of a gene or gene product that has supported the association of a GO term. The '''Experimental Evidence Codes''' are:
  
Line 15: Line 18:
 
*[[Inferred from Genetic Interaction (IGI)]]
 
*[[Inferred from Genetic Interaction (IGI)]]
 
*[[Inferred from Expression Pattern (IEP)]]
 
*[[Inferred from Expression Pattern (IEP)]]
 +
 +
= High Throughput Experimental Evidence Codes =
  
 
High throughput (HTP) evidence codes may be used to make annotations based upon high throughput methodologies.  Use of HTP evidence codes should be carefully considered and follow the GOC's guidelines for their use.  The '''High Throughput Experimental Evidence Codes''' are:
 
High throughput (HTP) evidence codes may be used to make annotations based upon high throughput methodologies.  Use of HTP evidence codes should be carefully considered and follow the GOC's guidelines for their use.  The '''High Throughput Experimental Evidence Codes''' are:
Line 24: Line 29:
 
*[[Inferred from High Throughput Expression Pattern (HEP)]]
 
*[[Inferred from High Throughput Expression Pattern (HEP)]]
  
Use of the computational analysis evidence codes indicates that the annotation is based on an in silico analysis of the gene sequence and/or other data as described in the cited reference. The evidence codes in this category also indicate a varying degree of curatorial input. The '''Computational Analysis Evidence Codes''' are:
+
= Similarity Evidence Codes =
 +
Use of the similarity evidence codes indicates that the annotation is based on an in silico analysis of the gene sequence described in the cited reference. The evidence codes in this category also indicate a varying degree of curatorial input. The '''Similarity Evidence Codes''' are:
  
 
*[[Inferred from Sequence or structural Similarity (ISS)]]
 
*[[Inferred from Sequence or structural Similarity (ISS)]]
Line 35: Line 41:
 
*[[Inferred from Key Residues (IKR)]]
 
*[[Inferred from Key Residues (IKR)]]
 
*[[Inferred from Rapid Divergence(IRD)]]
 
*[[Inferred from Rapid Divergence(IRD)]]
 +
 +
= Combinatorial Evidence Codes =
 
*[[Inferred from Reviewed Computational Analysis (RCA)]]
 
*[[Inferred from Reviewed Computational Analysis (RCA)]]
  
 +
= Author Statements =
 
Author statement codes indicate that the annotation was made on the basis of a statement made by the author(s) in the reference cited. The '''Author Statement Evidence Codes''' used by GO are:
 
Author statement codes indicate that the annotation was made on the basis of a statement made by the author(s) in the reference cited. The '''Author Statement Evidence Codes''' used by GO are:
  
Line 42: Line 51:
 
*[[Non-traceable Author Statement (NAS)]]
 
*[[Non-traceable Author Statement (NAS)]]
  
 +
= Curator Inference =
 
Use of the curatorial statement evidence codes indicates an annotation made on the basis of a curatorial judgement that does not fit into one of the other evidence code classifications. The '''Curatorial Statement Evidence Codes''' are:
 
Use of the curatorial statement evidence codes indicates an annotation made on the basis of a curatorial judgement that does not fit into one of the other evidence code classifications. The '''Curatorial Statement Evidence Codes''' are:
  
 
*[[Inferred by Curator (IC)]]
 
*[[Inferred by Curator (IC)]]
 +
 +
= No Biological Data =
 
*[[No biological Data available (ND) evidence code]]
 
*[[No biological Data available (ND) evidence code]]
  
All of the above evidence codes are assigned by curators. However, GO also used one evidence code that is assigned by automated methods, without curatorial judgement. The '''Automatically-Assigned Evidence Code''' is:
+
= Automatic Assertion =
 +
GO also used one evidence code that is assigned by automated methods, without curatorial judgement. The '''Automatically-Assigned Evidence Code''' is:
  
 
*[[Inferred from Electronic Annotation (IEA)]]
 
*[[Inferred from Electronic Annotation (IEA)]]
 
Evidence codes are '''not''' statements of the quality of the annotation. Within each evidence code classification, some methods produce annotations of higher confidence or greater specificity than other methods, in addition the way in which a technique has been applied or interpreted in a paper will also affect the quality of the resulting annotation. Thus evidence codes '''cannot''' be used as a measure of the quality of the annotation.
 
 
 
  
 
[[Category: Annotation]]
 
[[Category: Annotation]]
 
[[Category:Evidence Codes]]
 
[[Category:Evidence Codes]]

Revision as of 07:04, 18 April 2018

  • These guidelines describe the standard usage of the GO evidence codes.
  • Annotators may also find the evidence code decision tree useful in selecting the correct evidence code for an annotation.

Introduction

A GO annotation consists of a GO term associated with a specific reference that describes the work or analysis upon which the association between a specific GO term and gene product is based. Each annotation must also include an evidence code to indicate how the annotation to a particular term is supported. Although evidence codes do reflect the type of work or analysis described in the cited reference which supports the GO term to gene product association, they are not necessarily a classification of types of experiments/analyses. Note that these evidence codes are intended for use in conjunction with GO terms, and should not be considered in isolation from the terms. If a reference describes multiple methods that each provide evidence to make a GO annotation to a particular term, then multiple annotations with identical GO identifiers and reference identifiers but different evidence codes may be made.
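The last point can be illustrated with a minimal Python sketch. The `Annotation` record, its field names, the helper `codes_by_assertion`, and all identifier values are hypothetical placeholders chosen for illustration; they are not a GO file format or API.

```python
from collections import defaultdict
from typing import NamedTuple


class Annotation(NamedTuple):
    """Hypothetical, simplified annotation record (not an official GO format)."""
    gene_product: str
    go_id: str
    reference: str
    evidence_code: str


# Placeholder data: one reference describes two methods that each support
# the same GO term, so two annotations differ only in their evidence code.
annotations = [
    Annotation("GENE1", "GO:0000001", "REF:1", "IGI"),
    Annotation("GENE1", "GO:0000001", "REF:1", "IEP"),
    Annotation("GENE1", "GO:0000002", "REF:2", "NAS"),
]


def codes_by_assertion(records):
    """Group evidence codes by (gene product, GO term, reference)."""
    grouped = defaultdict(set)
    for record in records:
        key = (record.gene_product, record.go_id, record.reference)
        grouped[key].add(record.evidence_code)
    return dict(grouped)


grouped = codes_by_assertion(annotations)
for key, codes in grouped.items():
    print(key, sorted(codes))
```

In this sketch the evidence code is part of what makes an annotation distinct, so the two `REF:1` records are kept as separate annotations rather than merged.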

Evidence codes are not statements of the quality of the annotation. Within each evidence code classification, some methods produce annotations of higher confidence or greater specificity than other methods. In addition, the way in which a technique has been applied or interpreted in a paper also affects the quality of the resulting annotation. Thus evidence codes cannot be used as a measure of the quality of the annotation.

Evidence codes fall into eight general categories as described below.
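As a rough sketch, the eight categories can be modelled as a lookup table in Python. The names `Category`, `CODE_TO_CATEGORY`, `category_of`, and `is_automatic` are assumptions made for illustration. The table lists only the codes visible in this revision diff and is therefore incomplete, and the placement of each code follows the order in which it appears under the headings in that diff.

```python
from enum import Enum


class Category(Enum):
    """The eight category headings used in this revision."""
    EXPERIMENTAL = "Experimental Evidence Codes"
    HIGH_THROUGHPUT = "High Throughput Experimental Evidence Codes"
    SIMILARITY = "Similarity Evidence Codes"
    COMBINATORIAL = "Combinatorial Evidence Codes"
    AUTHOR_STATEMENT = "Author Statements"
    CURATOR_INFERENCE = "Curator Inference"
    NO_BIOLOGICAL_DATA = "No Biological Data"
    AUTOMATIC_ASSERTION = "Automatic Assertion"


# Partial table: only codes that appear in the diff on this page.
CODE_TO_CATEGORY = {
    "IGI": Category.EXPERIMENTAL,
    "IEP": Category.EXPERIMENTAL,
    "HEP": Category.HIGH_THROUGHPUT,
    "ISS": Category.SIMILARITY,
    "IKR": Category.SIMILARITY,
    "IRD": Category.SIMILARITY,
    "RCA": Category.COMBINATORIAL,
    "NAS": Category.AUTHOR_STATEMENT,
    "IC": Category.CURATOR_INFERENCE,
    "ND": Category.NO_BIOLOGICAL_DATA,
    "IEA": Category.AUTOMATIC_ASSERTION,
}


def category_of(code):
    """Return the category for a code, or None if it is not in the table."""
    return CODE_TO_CATEGORY.get(code.upper())


def is_automatic(code):
    """True when the code is assigned by automated methods (IEA here)."""
    return category_of(code) is Category.AUTOMATIC_ASSERTION


print(category_of("ISS").value)
print(is_automatic("IEA"), is_automatic("IC"))
```

The table deliberately carries no ranking or score, because evidence codes are not a measure of annotation quality.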

Experimental Evidence Codes

Use of an experimental evidence code in a GO annotation indicates that the cited paper presents results from a physical characterization of a gene or gene product that support the association of a GO term. The Experimental Evidence Codes are:

High Throughput Experimental Evidence Codes

High throughput (HTP) evidence codes may be used to make annotations based upon high throughput methodologies. Use of HTP evidence codes should be carefully considered and follow the GOC's guidelines for their use. The High Throughput Experimental Evidence Codes are:

Similarity Evidence Codes

Use of the similarity evidence codes indicates that the annotation is based on an in silico analysis of the gene sequence described in the cited reference. The evidence codes in this category also indicate a varying degree of curatorial input. The Similarity Evidence Codes are:

Combinatorial Evidence Codes

Author Statements

Author statement codes indicate that the annotation was made on the basis of a statement made by the author(s) in the reference cited. The Author Statement Evidence Codes used by GO are:

Curator Inference

Use of the curatorial statement evidence codes indicates an annotation made on the basis of a curatorial judgement that does not fit into one of the other evidence code classifications. The Curatorial Statement Evidence Codes are:

No Biological Data

Automatic Assertion

GO also uses one evidence code that is assigned by automated methods, without curatorial judgement. The Automatically-Assigned Evidence Code is Inferred from Electronic Annotation (IEA).