Guidance for updating deprecated Annotation Extension Relations: Difference between revisions
mNo edit summary |
|||
Line 140: | Line 140: | ||
'''Example 1''' | '''Example 1''' | ||
Gene product: UniProtKB: | Gene product: UniProtKB: Q12510 (Cmr1) | ||
GO ID: GO:0034399 nuclear periphery | GO ID: GO:0034399 nuclear periphery | ||
Extension: in_presence_of(CHEBI:25255 methyl methanesulfonate) | Extension: in_presence_of(CHEBI:25255 methyl methanesulfonate) | ||
'''''Action:''''' In this case the paper (PMID:25817432) states that Cmr1 localizes to the nuclear periphery in response to methyl methanesulfonate, a DNA damaging agent. This type of information is out of scope for GO, however there is a term GO:0072702 response to methyl methanesulfonate that should be considered for a direct annotation to Cmr1 instead. | |||
==Protein identifiers== | ==Protein identifiers== |
Revision as of 10:55, 13 October 2015
Dependent_on
ChEBI identifiers
When dependent_on is used with a ChEBI ID, consider the following;
- Whether the chemical in the annotation extension is already mentioned in the GO term definition. If it is, there is no reason to add the chemical in the extension
- If the definition does not mention the chemical, should the definition be updated to include the chemical, i.e. is it always required?
- Is the chemical physiologically relevant? For example using a chemical to induce a physiological response: if an author uses hydrogen peroxide to induce oxidative stress then consider making an annotation to a GO term concerned with oxidative stress
- Do not capture assay conditions in the annotation extension. If this has been done, delete the annotation extension
- If the chemical only represents an assay condition, delete the extension
- If none of the above applies, consider using a different relation e.g. “activated_by”
Example 1
Gene product: UniProtKB:P25632 GO ID: GO:0015616 DNA translocase activity Extension: dependent_on(CHEBI:15422 ATP)
Action: ATP is in the definition of the GO term, therefore no need to put ATP in the extension (Catalysis of the reaction: ATP + H2O = ADP + phosphate, to drive movement along a single- or double-stranded DNA molecule.).
Example 2
Gene product: UniProtKB:A0A060VXQ0 GO ID: GO:0090632 N-glycolylneuraminic acid (Neu5Gc) cytidylyltransferase activity Extension: dependent_on (CHEBI:18420 magnesium(2+))
Action: In this case the Mg2+ is not in the definition of the GO term. It appears that Mg2+ is essential for this activity, therefore consider re-writing the GO term definition, or alternatively using the “activated_by” relation instead.
GO identifiers
When dependent_on is used with a GO ID, consider the following;
- If the GO ID in the extension is a Biological Process or Molecular Function, consider changing the relation to part_of, if you consider the primary GO term used is part of the GO term used in the extension (see example below)
- If the GO ID in the extension is a Biological Process or Molecular Function, consider reversing the annotation, i.e. make the primary annotation to the GO term in the extension and using the other GO term in the extension with either “part_of” or “causally_upstream_of” (see examples below)
- If the primary GO term used is a Cellular Component and the GO ID in the extension is a Biological Process, consider changing the relation to exists_during. (see example below)
Example 1
Gene product: UniProtKB: O43474 GO ID: GO:0045944 positive regulation of transcription from RNA polymerase II promoter Extension: dependent_on(GO:0000165 MAPK cascade)
Action: Consider changing relation to part_of
Example 2
Gene product: UniProtKB: Q60855 GO ID: GO:0097527 necroptotic signaling pathway Extension: dependent_on(GO:0004672 protein kinase activity)
Action: Consider reversing the GO IDs and changing the relation to part_of;
Gene product: UniProtKB: Q60855 GO ID: GO:0004672 protein kinase activity Extension: part_of(GO:0097527 necroptotic signaling pathway)
Example 3
Gene product: UniProtKB: Q15485 GO ID: GO:0001867 complement activation, lectin pathway Extension: dependent_on(GO: 0003823 antigen binding)
Action: Consider reversing the GO IDs and changing the relation to causally_upstream_of;
Gene product: UniProtKB: Q15485 GO ID: GO:0003823 antigen binding Extension: dependent_on(GO:0001867 complement activation, lectin pathway)
Example 4
Gene product: UniProtKB: P07737 GO ID: GO:0005938 cell cortex Extension: dependent_on(GO:0006939 smooth muscle contraction)
Action: Consider changing the relation to exists_during.
Protein identifiers
When dependent_on is used with a protein ID, consider the following;
- If the protein stated in the extension is required for the Biological Process or Molecular Function described by the primary GO term, the protein in the extension should instead be annotated directly to that BP or MF
- If the protein in the extension acts together with the protein in the primary annotation, consider annotating the Complex Portal ID for the complex directly with the BP or MF.
- If the primary GO term that is annotated is a Cellular Component then consider annotating the protein ID in the extension with “protein localization to [component]”
Example 1
Gene product: UniProtKB: Q9C0C7 GO ID: GO:0043552 positive regulation of phosphatidylinositol 3-kinase activity Extension: dependent_on(UniProtKB:O60260 Parkin)
Action: In this case, consider whether Parkin should be annotated to GO:0043552 directly. Don’t attempt to put all proteins involved in a pathway into the extension.
Example 2
Gene product: UniProtKB: Q9ESD1 GO ID: GO:0005615 extracellular space Extension: dependent_on(UniProtKB:O70362 Phospholipase D)
Action: Here it appears that phospholipase D is responsible for localization of Q9ESD1, in which case the annotation should instead be;
Gene product: UniProtKB: O70362 GO ID: GO:0071692 protein localization to extracellular region Extension: transports_or_maintains_localization_of(UniProtKB:Q9ESD1 Prostasin)
Pfam identifiers
When dependent_on is used with a Pfam ID, consider the following;
- Is the Pfam domain just indicating the region of the protein being bound? If so, this could be captured using a more specific protein binding term
Example 1
Gene product: UniProtKB: O60132 (Tea4) GO ID: GO:0005515 protein binding With/from: PomBase: SPBC776.02c (Dis2) Extension: dependent_on(Pfam:PF00018 SH3 domain)
Action: In this case the paper (PMID:24554432) describes that Tea4 requires it’s SH3 domain in order to bind to Dis2. This information should not be captured in the annotation for Tea4 but instead the annotation for Dis2 should use the GO term GO:0017124 SH3 domain binding with the Tea4 identifier in the with/from field.
Sequence Ontology identifiers
When dependent_on is used with a Sequence Ontology ID, consider the following;
- If the SO ID is referring to a region that is being bound the gene product being annotated, consider using the occurs_at relation instead.
Example 1
Gene product: UniProtKB: Q9UIF9 GO ID: GO:0042393 histone binding With/from: UniProtKB: P62805 (Histone H4) Extension: dependent_on(SO:0001729 H4K16 acylation site)
Action: The extension appears to be describing the region of Histone H4 that is being bound, so consider using occurs_at instead.
In_presence_of
ChEBI identifiers
When in_presence_of is used with a ChEBI ID, consider the following;
- Whether the chemical in the annotation extension is already mentioned in the GO term definition. If it is, there is no reason to add the chemical in the extension
- If the definition does not mention the chemical, should the definition be updated to include the chemical, i.e. is it always required?
- Is the chemical physiologically relevant? For example using a chemical to induce a physiological response: if an author uses hydrogen peroxide to induce oxidative stress then consider making an annotation to a GO term concerned with oxidative stress
- Do not capture assay conditions in the annotation extension. If this has been done, delete the annotation extension
- If the chemical only represents an assay condition, delete the extension
- If none of the above applies, consider using a different relation e.g. “activated_by”
Example 1
Gene product: UniProtKB: Q12510 (Cmr1) GO ID: GO:0034399 nuclear periphery Extension: in_presence_of(CHEBI:25255 methyl methanesulfonate)
Action: In this case the paper (PMID:25817432) states that Cmr1 localizes to the nuclear periphery in response to methyl methanesulfonate, a DNA damaging agent. This type of information is out of scope for GO, however there is a term GO:0072702 response to methyl methanesulfonate that should be considered for a direct annotation to Cmr1 instead.
Protein identifiers
When in_presence_of is used with a protein ID, consider the following;
- If the protein stated in the extension is required for the Biological Process or Molecular Function described by the primary GO term, the protein in the extension should instead be annotated directly to that BP or MF
- If the protein in the extension acts together with the protein in the primary annotation, consider annotating the Complex Portal ID directly with the BP or MF.
- If the primary GO term that is annotated is a Cellular Component then consider annotating the protein ID in the extension with “protein localization to [component]”
Example 1
Gene product: UniProtKB: P40096 GO ID: GO:0003713 transcription coactivator activity Extension: in_presence_of(UniProtKB:Q92317)
Action: If the proteins form a complex to perform this activity then each of them should be annotated to the GO term. Consider also annotating the Complex Portal ID for the complex with this GO term. If one of the proteins is regulating the activity they can be annotated to the activity using the contributes_to qualifier.