2010 GO camp Use of Regulation issues: Difference between revisions

From GO Wiki
Jump to navigation Jump to search
mNo edit summary
(30 intermediate revisions by 5 users not shown)
Line 1: Line 1:
[[Category:Workshops]]
=Background=
=Background=


David Hill's [[Media:regulates.pdf | slides]]
David Hill's [[Media:regulates.pdf | slides]]
Line 276: Line 276:
* example 1: [http://amigo.geneontology.org/cgi-bin/amigo/gp-assoc.cgi?gp=GeneDB_Spombe:SPCC1223.13&session_id=1462amigo1269990854]
* example 1: [http://amigo.geneontology.org/cgi-bin/amigo/gp-assoc.cgi?gp=GeneDB_Spombe:SPCC1223.13&session_id=1462amigo1269990854]


=Proposed annotation policy=
=Proposed annotation policy - Discussion on May 26=
Discussed at the May 26, 2010 conference call
*''' Attendees: Ruth Lovering, Jane Lomax, Pascale Gaudet, Varsha Khodiyar, Kimberly VanAuken, Mary Dolan, Rachael Huntley, Emily Dimmer, Li Ni, Rama Balakrishnan, Yasmin Alam-Faruque, Jody Hirchman'''
----
'''Proposed guidelines'''


# A process is composed of a series of molecular functions:
# A process is composed of a series of molecular functions:
Line 284: Line 286:
## If you DON'T KNOW exactly how your gp is involved in the process, annotate to the PROCESS term, which is broader.
## If you DON'T KNOW exactly how your gp is involved in the process, annotate to the PROCESS term, which is broader.
#Use your biological knowledge. If it is a well-known pathway and hasn't been fully represented in GO, then background knowledge is needed to decide if the function is part of the pathway, regulates the pathway, or does both. Things to consider:
#Use your biological knowledge. If it is a well-known pathway and hasn't been fully represented in GO, then background knowledge is needed to decide if the function is part of the pathway, regulates the pathway, or does both. Things to consider:
##. How much is known about the process?
## How much is known about the process?
##. Is there a defined pathway for this process in which the major players have been identified?
## Is there a defined pathway for this process in which the major players have been identified?
##. Is the gene product being annotated believed to be a major player in the process or pathway or outside of it?
## Is the gene product being annotated believed to be a major player in the process or pathway or outside of it?
#When deciding whether to annotate to a parent process term versus a regulation term based upon a mutant phenotype, curators should consider the assay, the nature of the allele used in the studies (null versus reduction-of-function), and perhaps also the identity of the gene product to choose an appropriate annotation.
#'''IMP'''. When deciding whether to annotate to a parent process term versus a regulation term based upon a mutant phenotype, curators should consider the assay, the nature of the allele used in the studies (null versus reduction-of-function), and perhaps also the identity of the gene product to choose an appropriate annotation.
#* (from the discussion): Experiments on mutants can be confusing because of the definition of the 'regulation' terms: "Any process that modulates the frequency, rate or extent of any biological process, quality, or function". Mutations in genes that are part of a pathway can give a phenotype of aberrant "frequency, rate or extent of (some) biological process", so that fits the 'regulation' definition.
#*<font color = "red">Action item for Jane and Kimberly</font>: check annotations to 'regulation' with IMP codes to see if the misannotation rate is high enough to warrant a QC on all such annotations.  
#If the gene product or pathway is not fully described, then try to reflect what the paper you are reading is saying. The author should give hints about what is happening, and will be the experts in this field.
#If the gene product or pathway is not fully described, then try to reflect what the paper you are reading is saying. The author should give hints about what is happening, and will be the experts in this field.
#Processes should have a defined beginning and end. If this isn't clear from the definition, then you may need to start up a dialogue (tracker, mailing lists) to get the term redefined.
#Processes should have a defined beginning and end. If this isn't clear from the definition, then you may need to start up a dialogue (tracker, mailing lists) to get the term redefined.
#If new information changes the view of a gene product's role in a process, older annotations should be checked and possibly removed for consistency. Annotations should reflect the most up-to-date view of a gene product's role.
#If new information changes the view of a gene product's role in a process, older annotations should be checked and possibly removed for consistency. Annotations should reflect the most up-to-date view of a gene product's role.
===<font color="red">Comments</font>===
* In principle everyone agree with those guidelines. The problem is that they are a bit too high level and dont always help annotators to make a decision about some experiments. To deal with this, we will collect a list of examples where it's right to annotate to 'regulation of x', and where it's not.
** <font color="red">Action item for Kimberly, Jodi, Ruth, and all : </font> Collect a list of examples where it's right to annotate to 'regulation of x', and where it's not.
* Another issue is that in the GO the beginning and end of processes are not clear. The signaling group has decided that in general ligand binding to a receptor is part of that pathway, see placement of GO:0023007 ligand binding to T cell receptor [http://www.ebi.ac.uk/QuickGO/GTerm?id=GO:0023007 QuickGO], GO:0023007 ligand binding to T cell receptor is effectively a part_of GO:0050852 T cell receptor signaling pathway. However, a ligand is likely to be the rate limiting step in a signaling pathway and therefore will be annotated to both the signaling pathway and the regulation of the signaling pathway.
* One possible way to handle this would be :
# some gene products involved in synthesis, transport, etc of ligand which trigger PATHWAY X could be annotated to 'regulation of pathway x'
# gene products in the pathway that are directly shown to affect the rate,or extent of the pathway would be annotated to 'regulation of pathway x'
# Members of the pathway would be annotated directly to 'pathway x' (for exceptions, see #2 above)
# Downstream effects would be 'regulated by' the pathway, but not part of the pathway.
** <font color="red">Action item for Ontology development group (Jane will ask Alex)</font> Provide some guidance about pathway beginning and end
** <font color="red">Action item for Jane and Kimberly</font> Collect list of gene products annotated to both 'pathway x' and 'regulation of pathway x', and determine whether the annotation errors are high enough to implement a QC on those.


=Examples (papers) and discussion of GO annotation issues =
=Examples (papers) and discussion of GO annotation issues =


*'''Vote [http://www.doodle.com/eyhpiduamtfum65v here] for how you would annotate in each of these examples:'''
The poll results and additional comments are available here: page [[:Image:Regulation annotation poll.jpg | 1]], [[:Image:Regulation annotation poll_2.jpg | 2]], [[:Image:Regulation annotation poll_3.jpg | 3]], [[:Image:Regulation annotation poll_4.jpg | 4]]


==Example 1==
==Example 1==
Line 324: Line 340:


[Comment from Rachael] Example 2: at the least I would use regulation of autophagy because of Abstract 1 stating that Atg1 negatively feedsback to TOR thereby regulating autophagy. However, as Val points out I think the definition should be updated as to the start of the process. Since the authors use the phrase 'initiation of autophagy' quite a lot, I would side on 'regulation of autophagy' from what is in David's slides - 'If ‘Y’ regulates ‘X’, then it stops it, starts it, slows it down, or speeds it up.'. However, this is not clear in the definition of some regulation terms e.g. regulation of autophagy 'Any process that modulates the frequency, rate or extent of autophagy...' - no mention of starting or stopping.
[Comment from Rachael] Example 2: at the least I would use regulation of autophagy because of Abstract 1 stating that Atg1 negatively feedsback to TOR thereby regulating autophagy. However, as Val points out I think the definition should be updated as to the start of the process. Since the authors use the phrase 'initiation of autophagy' quite a lot, I would side on 'regulation of autophagy' from what is in David's slides - 'If ‘Y’ regulates ‘X’, then it stops it, starts it, slows it down, or speeds it up.'. However, this is not clear in the definition of some regulation terms e.g. regulation of autophagy 'Any process that modulates the frequency, rate or extent of autophagy...' - no mention of starting or stopping.
'''The Atg1 kinase complex is involved in the regulation of protein recruitment to initiate sequestering vesicle formation for nonspecific autophagy in Saccharomyces cerevisiae.'''
'''Cheong H, Nair U, Geng J, Klionsky DJ.'''
ABSTRACT: Autophagy is the major degradative process for recycling cytoplasmic constituents and eliminating unnecessary organelles in eukaryotic cells. Most autophagy-related (Atg) proteins are recruited to the phagophore assembly site (PAS), a proposed site for vesicle formation during either nonspecific or specific types of autophagy. Therefore, appropriate recruitment of Atg proteins to this site is critical for their function in autophagy. Atg11 facilitates PAS recruitment for the cytoplasm-to-vacuole targeting pathway, which is a specific, autophagy-like process that occurs under vegetative conditions. In contrast, it is not known how Atg proteins are recruited to the PAS, nor which components are involved in PAS formation under nonspecific autophagy-inducing, starvation conditions. Here, we studied PAS assembly during nonspecific autophagy, using an atg11Delta mutant background to eliminate the PAS formation that occurs during vegetative growth. We found that protein complexes containing the Atg1 kinase have two roles for PAS formation during nonspecific autophagy. The Atg1 C terminus mediates an interaction with Atg13 and Atg17, facilitating a structural role of Atg1 that is needed to efficiently organize an initial step of PAS assembly, whereas Atg1 kinase activity affects the dynamics of protein movement at the PAS involved in Atg protein cycling. http://www.ncbi.nlm.nih.gov/pubmed/18077553
and it's addendum:
'''Dual role of Atg1 in regulation of autophagy-specific PAS assembly in Saccharomyces cerevisiae.'''
'''Cheong H, Klionsky DJ.'''
ABSTRACT: Most autophagy-related (Atg) proteins are assembled at the phagophore assembly site or pre-autophagosomal structure (PAS), which is a potential site for vesicle formation during vegetative or starvation conditions. To understand the initial step of vesicle formation, it is important to know how Atg proteins are recruited to the PAS. Atg11 facilitates PAS assembly for the cytoplasm to vacuole targeting (Cvt) pathway in vegetative conditions. To examine autophagy-specific PAS formation, an ATG11 deletion mutant was used to eliminate the PAS formation that occurs in vegetative conditions. We found that Atg1, Atg13 and Atg17 play a similar role for PAS formation under autophagy-inducing conditions as seen for Atg11 during vegetative growth. In particular, Atg1 is proposed to have dual roles for autophagy-specific PAS recruitment. Atg1 plays a structural role for efficient recruitment of Atg proteins to the PAS, which is mediated by interaction with Atg13 and Atg17. In contrast, Atg1 kinase activity is needed for dissociation of Atg proteins from the PAS during autophagy inducing conditions, a function which is also critical for autophagy activity. http://www.ncbi.nlm.nih.gov/pubmed/18552550
[comment from Jodi and Rama regarding the above 2 papers from the Klionsky lab] SGD curators discussed both papers and agreed that this isn't a good example of a
case where you'd assign both a BP term and a regulation of the same BP
term.  (Rama has an example of a yeast gene that will work much better
for that.)  In terms of these
papers, we didn't feel that the evidence was solid enough to assign a
'regulation of' term.  It there was more evidence, we'd probably assign
a regulation term like 'regulation of protein localization' or
'regulation of protein localization during autophagic vacuole formation'
but not necessarily a corresponding 'protein localization' BP term.
The BP term would likely be 'autophagic vacuole assembly' (which we already have for this gene).


==Example 3==
==Example 3==

Revision as of 11:48, 12 April 2019

Background

David Hill's slides Notes extracted from this presentation:

  • annotators should annotate to the most granular term possible - in the context of understanding the biology represented in the curated paper as best as they can, and decide which term is the best.
  • this isn't straightfoward; we don't understand all of biology. It is not always clear from a phenotype if a gene product is carrying out a regulatory process or is carrying out a function that is an integral part of the process.
  • And the regulates relationship is hard to fully define. This is partly due to human nature: people like to show that what they are working on is important, and biology is a series of regulatory events; almost everything that happens is regulatory.

Current guidelines about using the regulation terms in GO: - Use when a gene product is modulating the rate, frequency or extent of 'X'

--- 'X' is something that happens, a process or a function

--- If 'Y' regulates 'X', then it stops it, starts it, slows it down, or speeds it up.

--- The key is defining 'X'. If we can define 'X' with respect to where it starts, stops and its parts, then it should be easy to tell if 'Y' regulates 'X'.

--- If 'Y' regulates 'X', then it stops it, starts it, slows it down, or speeds it up. Gene product 'Y' will have this effect on the beginning, end or some part of the process. Therefore it becomes important that molecular functions are linked with biological processes (all biological processes can then be thought of as a series of molecular functions; one that starts it, one or more that are in the middle, and one that stops it. If another process modulates the activity of one of those molecular functions, then it regulates the process.

  • However, the definition of a process is subjective, and GO needs to reflect the community consensus about what functions are part of a process and what functions are not. With certain metabolic pathways, working out what molecular functions are core to a process will be easy. In contrast, some developmental pathways - such as metamorphosis - this will be impossible.
  • In addition a gene product that is centrally involved in a pathway can also be involved in a feedback loop that regulates the same pathway Insert example here. In such a case, the enzyme in the pathway negatively_regulates the pathway and would also be part_of the pathway.
  • Even for the easy biological processes - this is a huge amount of work for ontology developers - as different pathways will involve different molecular functions in different organisms, and the same molecular function will contribute to different pathways.

Therefore:

1. Use your biological knowledge. If it is a well-known pathway and hasn't been fully represented in GO, then background knowledge is needed to decide if the function is part of the pathway, regulates the pathway, or does both.

2. If the gene product or pathway is not fully described, then try to reflect what the paper you are reading is saying. The author should give hints about what is happening, and will be the experts in this field.

3. If there is still doubt - annotate to the less specific parent (e.g. to the biological process term instead of the regulation of BP term).

http://wiki.geneontology.org/index.php/File:LigandReceptorSignalingPathways.pdf

Discussion on 14-04-2010

Present: Jane Lomax, Rachael Huntley, Emily Dimmer, Jennifer Smith, Val Wood, Kimberly Van Auken, David Hill, Fiona McCarthy, Maria Costanzo, Jodi Hirschman, Varsha Khodiyar, Ruth Lovering, Pascale Gaudet. Chair: Jane

Minutes: Emily

Minutes:

Ruth - with regards to annotation of the least granular term when the curator is uncertainly - doesn't this contradit the statement that regulation terms should not be considered as part_of a process

David - the regulation relationship is a restriction on the process term - it is wrong to use the part_of relationship between 'regulation of BP X' and 'BP X' terms, however the regulation relationship carries more information as to the aspect of the process it is involved with.

Jane - this means that it is not possible to distinguish between those gene products which are known, core components of a biological pathway, and those that are peripheral regulation members. Users will be unable to distinguish between these two categories of members of a BP.

David - this is true. It is difficult to distinguish between integral and regulation members. However to annotate to terms that describe regulation when you are not sure that contribution a gp provides is worse.

Val - however, results from a HTP screen are likely to implicate many gp's which are involved in upstream processes - I wouldn't like to annotate all of these gp's to the BP. Should we indicate if regulation is direct or if it is indirect?

David - this is the caviate that comes with analysing results from such screens. Direct annotation to BP terms are more appropriate as you can say that the gp is involved in the process, however we do not know how.

Kimberly - to confidently annotate to a regulation term, a curator may sometimes take into account the moelcular identity of the gene product - not only the phenotype. For example when looking at a range of genes that are implicated in the locomotion process, if I know which gp is a transcription factor and which is a myosin chain, I may be able to make a more informed annotation decision. E.g. Transcription factor - regulation of locomotion; Myosin - locomotion.

David - Agree. This is a central job for curators - we need to include curation knowledge when determining the correct annotation.

Ruth - when talking with a researcher - he found that when analysing a microarray dataset - the mouse phenotype data was useful in certain analyses, and the GO annotation data more useful in other circumstances. I have a concern regarding the overlap of phenotype and biological process annotation. Aren't we in danger of overcurating when we only have phenotype information?

Pascale - agree. Sometimes it is just phenotypic data we annotate to.

David - however we should represent this information from papers. It is important to annotate to some downstream effects, esp for biomedical users - who are looking at diseases. We need to be able to say if a protein .e.g a insulin receptor has effect x.

Ruth - Regulation terms are defined as 'Any process that modulates the frequency, rate or extent of a biological process....'. Shouldn't we define such terms more discretely?

David - BP terms should be primarily defined via the functions, using BP->MF links

Pascale - how much should we care at the distinction between a 'Regulation of BP' and a 'BP' term?

David - the distinction is going to be nebulous if the field is still not fully characterised.

Val - how far from the beginning of a Biological Process should you annotate to a regulation term? How far upstream should a curator go? Should direct/indirect regulation be indicated?

David - this should be possible eventually with the ontology, its difficult to accurately determine which process is direct and which is indirect. We'll get closer to this through MF->BP links

Jane - could curators contribute BP-specific MF's?

David - yes. the editors would then need to ensure these are correct.

Looked at Serenella's example (see below) - David - we have to reflect what the community defines as a process. A signalling pathway consists of multiple regulation events. We need to know where the community considers the beginning/end of the process - and then decide if a MF is part of, or regulates the pathway. In Serenella's example I would consider that the protein is a part of the pathway.

Emily - how to increase consistency, through Qcs. Could PAINT help identify where closely related organisms have been differentially annotated to regulation of BP/BP terms for the same ortholog?

Val - in adidtion the Matrix tool available from AmiGO labs could also help to look at intersections between processes.

Val - I have an example that I discussed with SGD regarding autophagy and the gene ATG1 - this gp is involved in an on/off switch - it was not regulation a process, although the authors stated it did have a regulation role. It starts the process of - it does not regulate rate, therefore would consider part of the process and not regulating it. David - we need to ensure that we have information on what is considered part of the process - if ATG1's MF is not included in that definition, then it should be annotated as regulating the process. ACTION: Val will circulate example


Action items:

- Jane to send Doodle to settle on a date for the next meeting (in approx. 2 weeks time)

- Val to send ATG1 example

- Between now and the next meeting, curators in the WG should look at the examples below. And make a decision as to how they would annotate the data - using the regulation of BP or the BP term directly.

- Curators should then vote (and make short comments) using the Doodle poll that Jane will email.

- Val to send round URL for AmiGO Labs matrix and example query.

Longer term action items proposed:

- Guidance for curators who would like to suggest MF->BP links to editors, or BP-specific MFs.
- Ensure documentation described above is generally accepted.
- Annotate the documentation with examples drawn from a range of species/biological processes
- Any QC methods possible to help locate inconsistent usage of terms? (identified by PAINT/ AmiGO Labs matrix tool?)

Discussion on 05-05-2010

In attendance: Jane Lomax, Emily Dimmer, Pascale Gaudet, Rachael Huntley, Yasmin Alam-Faruque, Varsha Khodiyar, Rama Balakrishnan, Ruth Lovering, Kimberly Van Auken (minutes)

Discussion of annotation examples:

Example 1: receptor internalization vs regulation of receptor internalization

Looking at the Doodle poll (http://www.doodle.com/eyhpiduamtfum65v), a number of curators initially thought that the regulation term would be chosen, but the term definition makes that annotation difficult.

Does the regulation term include the activation step? Doesn't seem to.

Is this process (receptor internalization upon ligand binding) considered transport or signaling?

Does receptor internalization regulate a signaling pathway?

Ruth: Does ligand internalization get used similarly to a protein getting phosphorylated, i.e. a target of the process?

Pascale: My interpretation of the experiment is that receptor internalization is a way to downregulate the pathway. First the ligand binds, then internalization. Two functions of ligand: activate kinase, activate signaling pathway.

Rama: Is receptor internalization a consequence of the kinase activity?

Ruth: Receptors are recycled. Is the ligand involved in the process of receptor internalization or is it the target of internalization?

One possible conclusion: Annotate to neither term, create a new term.

Kimberly: Would ligands be annotated to both activation and down regulation?

Need to modify the receptor internalization definition. Where does the process of receptor internalization start?

Check Chris' SourceForge item.

Ruth: By analogy, should cholesterol be annotated to endocytosis?

Jane: Receptor internalization is a child of transport.

Does receptor internalization always have an effect on signaling?

Jane: are we talking about a single receptor or a population of receptors?

Pascale: We agree that the ligand shouldn't be annotated to either of the terms in the doodle poll, but are not yet clear on what new term(s) would be needed

Ruth: if you look at the ontology, you couldn't put ligand annotations under transport there's a whole machinery that effect endocytosis, regardless of what's being internalized could consult experts on endocytosis, consult Alex? is it useful to have cytokines and receptors annotated to receptor internalization terms?

Pascale: send this issue around to the GO list for more opinions.

Action Item: Rachael and Ruth will send this example out to the GO list.


Example 2: autophagic vacuole assembly versus regulation of autophagic vacuole assembly

Rama: SGD's curatorial approach to regulation annotations: Is the gene product making a decision for that pathway? For Atg1, the 'go' or 'no go' decision has already been made, and a more upstream gene is actually regulating the pathway.

Pascale: This makes sense, and this may be a case where authors use regulation which can mislead curators.

Rachael: Need to look at the negative feedback data available for Atg1.

Kimberly: The definition of the regulation terms can fit many, many phenotypes.

Rama: The definitions may need some work.

Pascale: Use of regulation terms can be trickey for IMP annotations; curators should be careful.

Pascale: Do we agree that Atg1 is required for the process or does it regulate the process?

Kimberly: Annotation to process term fit with paper that SGD annotated.

Rama: Is a kinase always a regulator?

Kimberly: What are the general principles?

Pascale: Be careful about using regulation terms with IMP.

Rama: What is the assay, are the key players known, is this gene new/outside of the pathway? David had said, "When in doubt, annotate to the parent term."

Ruth: It'd be useful to add to the doodle pool another row to see what conclusion we reached.

Pascale: Looking at the paper, authors say Atg1 is involved in regulation of the autophagic process.

Rama:Tthere is another paper that came later; the existing annotation reflects combined knowledge.

Kimberly: Should annotations be changed if the view of the pathway changes?

Emily, Pascale: Annotations should reflect the most up-to-date view of a gene's role in a process.

Jane: The term definition is vague, should try to pin down definitions on where they start and finish.

Rama: The gene product detecting the stimulus regulates the pathway. TOR is upstream of autophagic vacuole assembly.

Rama: Some regulation terms are not part_of the process, they just have regulation relationships to the parent process. Are there any regulation terms that are part_of the process? Rama will try to find an example.

Jane: We are trying to break the processes down into individual steps.

Pascale: Look at the work the signaling group is doing - try to standardize with them?

Jane: Yes, this makes sense.

Pascale: Review the signaling definition - very broad.

Rama: Looking at definition of autophagy - thought that process started when nutrient starvation is detected

Pascale: Should autophagy be a child of a signaling term?

Rama: It is a child of response to stress.

Jane: We will need to add start and stop to as many definitions as possible.

Kimberly, Rama: Will be easier for process terms that describe molecular pathways, not so easy for more general process terms like body morphology.

Action Item: Rama will check negative feedback info, but it looks like the parent process annotation is the preferred annotation; SGD will check the beginning of autophagic vacuole assembly.

Example 3: Much more of a consensus here, not discussed further.

Example 4: Not discussed due to time constraints.

Notes added after meeting (Kimberly): Some general principles that seem to be emerging and could perhaps be used as a starting point for writing guidelines:

1. Process term definitions should state, wherever possible, the beginning and end points of a process.

2. When choosing whether or not to annotate to a regulation term, curators should consider:

a) How much is known about the process?

b) Is there a defined pathway for this process in which the major players have been identified?

c) Is the gene product being annotated believed to be a major player in the process or pathway or outside of it?

3) If it's not known where in a process or pathway a gene product functions, annotations to the parent term are preferred.

4) If new information changes the view of a gene product's role in a process, older annotations should be checked and possibly removed for consistency. Annotations should reflect the most up-to-date view of a gene product's role.

5) When deciding whether to annotate to a parent process term versus a regulation term based upon a mutant phenotype, curators should consider the assay, the nature of the allele used in the studies (null versus reduction-of-function), and perhaps also the identity of the gene product to choose an appropriate annotation.

6) If needed, GO should consult with experts in the field to make sure that the definitions of parent and regulation process terms, as well as the annotations to those terms, reflect the generally accepted knowledge of researchers in that field.

Review of current GO annotation practices

Question 1: How do groups decide on when to annotate to 'regulates process x' or 'process x'? For instance does SLIT regulate axon guidance or involved in the process of axon guidance? Val - this depends on the defined start/end of a process and somethings can be annotated to BOTH the regulates term and also directly to the process term. Some groups decide that if removing the activity of a gene product produces an all/nothing event - then they define it as being part of the process. When should annotations be inherited up the regulates relationship?

Question 2: Annotations could be checked more efficiently if GO term definitions could include the beginning and end of a process.

Question 3 (from Ruth): Should we also consider how the 'activity' of a protein is defined when making these regulation/process decisions. The definition for transcription factor activity is: The function of binding to a specific DNA sequence in order to modulate transcription.... This suggests that a transcription factor is involved in the process of regulating transcription rather than involved in the process of transcription itself. However, some of us in the UK feel that many transcription factors are involved in the transcription process itself definition: The synthesis of either RNA on a template of DNA or DNA on a template of RNA. Does the definition for transcription factor activity need modification to enable this annotation, or are only the polymerases involved in the synthesis?)

Question 3: Regulation of processes, especially with IMP;

Proposed annotation policy - Discussion on May 26

  • Attendees: Ruth Lovering, Jane Lomax, Pascale Gaudet, Varsha Khodiyar, Kimberly VanAuken, Mary Dolan, Rachael Huntley, Emily Dimmer, Li Ni, Rama Balakrishnan, Yasmin Alam-Faruque, Jody Hirchman

Proposed guidelines

  1. A process is composed of a series of molecular functions:
    1. If you know that your gp PERFORMS one of the functions in the process, then the annotation should be to the PROCESS term.
    2. If you know that your gp REGULATES one of the functions, the annotation should be to the REGULATION term.
    3. If you DON'T KNOW exactly how your gp is involved in the process, annotate to the PROCESS term, which is broader.
  2. Use your biological knowledge. If it is a well-known pathway and hasn't been fully represented in GO, then background knowledge is needed to decide if the function is part of the pathway, regulates the pathway, or does both. Things to consider:
    1. How much is known about the process?
    2. Is there a defined pathway for this process in which the major players have been identified?
    3. Is the gene product being annotated believed to be a major player in the process or pathway or outside of it?
  3. IMP. When deciding whether to annotate to a parent process term versus a regulation term based upon a mutant phenotype, curators should consider the assay, the nature of the allele used in the studies (null versus reduction-of-function), and perhaps also the identity of the gene product to choose an appropriate annotation.
    • (from the discussion): Experiments on mutants can be confusing because of the definition of the 'regulation' terms: "Any process that modulates the frequency, rate or extent of any biological process, quality, or function". Mutations in genes that are part of a pathway can give a phenotype of aberrant "frequency, rate or extent of (some) biological process", so that fits the 'regulation' definition.
    • Action item for Jane and Kimberly: check annotations to 'regulation' with IMP codes to see if the misannotation rate is high enough to warrant a QC on all such annotations.
  4. If the gene product or pathway is not fully described, then try to reflect what the paper you are reading is saying. The author should give hints about what is happening, and will be the experts in this field.
  5. Processes should have a defined beginning and end. If this isn't clear from the definition, then you may need to start up a dialogue (tracker, mailing lists) to get the term redefined.
  6. If new information changes the view of a gene product's role in a process, older annotations should be checked and possibly removed for consistency. Annotations should reflect the most up-to-date view of a gene product's role.

Comments

  • In principle everyone agree with those guidelines. The problem is that they are a bit too high level and dont always help annotators to make a decision about some experiments. To deal with this, we will collect a list of examples where it's right to annotate to 'regulation of x', and where it's not.
    • Action item for Kimberly, Jodi, Ruth, and all : Collect a list of examples where it's right to annotate to 'regulation of x', and where it's not.
  • Another issue is that in the GO the beginning and end of processes are not clear. The signaling group has decided that in general ligand binding to a receptor is part of that pathway, see placement of GO:0023007 ligand binding to T cell receptor QuickGO, GO:0023007 ligand binding to T cell receptor is effectively a part_of GO:0050852 T cell receptor signaling pathway. However, a ligand is likely to be the rate limiting step in a signaling pathway and therefore will be annotated to both the signaling pathway and the regulation of the signaling pathway.
  • One possible way to handle this would be :
  1. some gene products involved in synthesis, transport, etc of ligand which trigger PATHWAY X could be annotated to 'regulation of pathway x'
  2. gene products in the pathway that are directly shown to affect the rate,or extent of the pathway would be annotated to 'regulation of pathway x'
  3. Members of the pathway would be annotated directly to 'pathway x' (for exceptions, see #2 above)
  4. Downstream effects would be 'regulated by' the pathway, but not part of the pathway.
    • Action item for Ontology development group (Jane will ask Alex) Provide some guidance about pathway beginning and end
    • Action item for Jane and Kimberly Collect list of gene products annotated to both 'pathway x' and 'regulation of pathway x', and determine whether the annotation errors are high enough to implement a QC on those.

Examples (papers) and discussion of GO annotation issues

The poll results and additional comments are available here: page 1, 2, 3, 4

Example 1

(from Rachael): PMID:19424712 - Ang1 is the ligand for TIE2, so it activates TIE2 which is then internalized and degraded. (Fig. 5a) HUVECs were incubated with Ang1 and the amount of Tie2 remaining at cell surface was decreased, therefore it has been internalized - question is whether Ang1 should be annotated to the process 'receptor internalization' or the regulation of? Or is GO:0007171 'activation of transmembrane receptor protein tyrosine kinase activity' enough?

Example 2

(from Val) I queried with SGD why ATG1 was annotated to "autophagic vacuole assembly"rather than a "regulation of" (see 3 abstracts below). I think that the general conclusion was that the annotation was OK, because researchers considered that ATG1 was part of the autophagy process.

  • The general consensus was that this is an integral part of autophagy and the arguments for this were convincing.
  • This seems OK, but I still felt that something is missing. Maybe, if this is an example of where the definition of the process needs to be updated to say exactly which step is the beginning of the process. Not knowing much about this process, with these papers I know I would have annotated to "regulation of", mainly bacause it's a kinase and because of the use of "regulation" in the titles and abstracts by the authors. Abstracts:

Scott RC, et al. (2007)

Direct induction of autophagy by Atg1 inhibits cell growth and induces apoptotic cell death. Curr Biol 17(1):1-11

BACKGROUND: To survive starvation and other forms of stress, eukaryotic cells undergo a lysosomal process of cytoplasmic degradation known as autophagy. Autophagy has been implicated in a number of cellular and developmental processes, including cell-growth control and programmed cell death. However, direct evidence of a causal role for autophagy in these processes is lacking, resulting in part from the pleiotropic effects of signaling molecules such as TOR that regulate autophagy. Here, we circumvent this difficulty by directly manipulating autophagy rates in Drosophila through the autophagy-specific protein kinase Atg1. RESULTS: We find that overexpression of Atg1 is sufficient to induce high levels of autophagy, the first such demonstration among wild-type Atg proteins. In contrast to findings in yeast, induction of autophagy by Atg1 is dependent on its kinase activity. We find that cells with high levels of Atg1-induced autophagy are rapidly eliminated, demonstrating that autophagy is capable of inducing cell death. However, this cell death is caspase dependent and displays DNA fragmentation, suggesting that autophagy represents an alternative induction of apoptosis, rather than a distinct form of cell death. In addition, we demonstrate that Atg1-induced autophagy strongly inhibits cell growth and that Atg1 mutant cells have a relative growth advantage under conditions of reduced TOR signaling. Finally, we show that Atg1 expression results in negative feedback on the activity of TOR itself. CONCLUSIONS: Our results reveal a central role for Atg1 in mounting a coordinated autophagic response and demonstrate that autophagy has the capacity to induce cell death. Furthermore, this work identifies autophagy as a critical mechanism by which inhibition of TOR signaling leads to reduced cell growth.

Evolution of Atg1 function and regulation. Chan EY, Tooze SA.

The serine/threonine kinase Atg1 plays an essential role downstream of TOR for the regulation of autophagy. In yeast, where Atg1 was first identified, a complex regulatory mechanism has been described that includes at least seven other interacting proteins and a phosphorylation-dependent switch. Recent findings confirm that the mammalian Atg1 homologues ULK1 and 2 have autophagy regulatory roles. However, we and others have also demonstrated mechanistic differences with the yeast model and between these two Atg1 family members. Here, we elaborate on our growing understanding of Atg1 function, incorporating findings from yeast, C. elegans, D. melanogaster and mammalian cells. We propose that through evolution, Atg1 proteins have adopted additional cellular functions and regulatory mechanisms, which could involve multiple gene family isoforms working within multifunctional protein complexes. The gene family expansion observed in higher eukaryotes might reflect an increased functional diversity of Atg1 proteins in cell growth, differentiation and survival. PMID:19411825

The role of the Atg1/ULK1 complex in autophagy regulation. Mizushima N.

The Atg1/ULK complex plays an essential role in the initiation of autophagy: receiving signals of cellular nutrient status, recruiting downstream Atg proteins to the autophagosome formation site, and governing autophagosome formation. Recent studies of mammalian Atg1 homologs (ULK1 and ULK2) have identified several novel interacting proteins, FIP200, mAtg13, and Atg101. FIP200 and Atg101 are not conserved in Saccharomyces cerevisiae, despite the high conservation rates of other downstream Atg proteins between the yeast and mammals. Furthermore, through studies of the Atg1/ULK1 complex, the molecular mechanism by which (m)TORC1 regulates autophagy is now being clarified in detail. Copyright 2009 Elsevier Ltd. PMID:20056399

Extra info added (Ruth) Figure ATG1 and autophagy [2] Within the yeast cell, nearly all Atg proteins collect at the pre-autophagosomal structure (PAS) near the vacuolar membrane, and the PAS is thought to be the site of formation of the Cvt vesicle and autophagosome. Function of the Atg1/ULK1 complex in autophagy: Although the Atg1/ULK1 complex receives information regarding the cellular nutrient status from (m)TORC1, it is also essential for mTOR-independent autophagy. Lithium induces autophagy through the inhibition of inositol monophosphatase in an mTOR-independent manner..... Thus, the Atg1/ULK1 complex plays a general role in autophagy regulation. ... A systematic analysis performed in yeast showed that the Atg1 complex plays an essential role in recruiting other Atg proteins to the PAS.

[Comment from Rachael] Example 2: at the least I would use regulation of autophagy because of Abstract 1 stating that Atg1 negatively feedsback to TOR thereby regulating autophagy. However, as Val points out I think the definition should be updated as to the start of the process. Since the authors use the phrase 'initiation of autophagy' quite a lot, I would side on 'regulation of autophagy' from what is in David's slides - 'If ‘Y’ regulates ‘X’, then it stops it, starts it, slows it down, or speeds it up.'. However, this is not clear in the definition of some regulation terms e.g. regulation of autophagy 'Any process that modulates the frequency, rate or extent of autophagy...' - no mention of starting or stopping.

The Atg1 kinase complex is involved in the regulation of protein recruitment to initiate sequestering vesicle formation for nonspecific autophagy in Saccharomyces cerevisiae. Cheong H, Nair U, Geng J, Klionsky DJ.

ABSTRACT: Autophagy is the major degradative process for recycling cytoplasmic constituents and eliminating unnecessary organelles in eukaryotic cells. Most autophagy-related (Atg) proteins are recruited to the phagophore assembly site (PAS), a proposed site for vesicle formation during either nonspecific or specific types of autophagy. Therefore, appropriate recruitment of Atg proteins to this site is critical for their function in autophagy. Atg11 facilitates PAS recruitment for the cytoplasm-to-vacuole targeting pathway, which is a specific, autophagy-like process that occurs under vegetative conditions. In contrast, it is not known how Atg proteins are recruited to the PAS, nor which components are involved in PAS formation under nonspecific autophagy-inducing, starvation conditions. Here, we studied PAS assembly during nonspecific autophagy, using an atg11Delta mutant background to eliminate the PAS formation that occurs during vegetative growth. We found that protein complexes containing the Atg1 kinase have two roles for PAS formation during nonspecific autophagy. The Atg1 C terminus mediates an interaction with Atg13 and Atg17, facilitating a structural role of Atg1 that is needed to efficiently organize an initial step of PAS assembly, whereas Atg1 kinase activity affects the dynamics of protein movement at the PAS involved in Atg protein cycling. http://www.ncbi.nlm.nih.gov/pubmed/18077553

and it's addendum:

Dual role of Atg1 in regulation of autophagy-specific PAS assembly in Saccharomyces cerevisiae. Cheong H, Klionsky DJ.

ABSTRACT: Most autophagy-related (Atg) proteins are assembled at the phagophore assembly site or pre-autophagosomal structure (PAS), which is a potential site for vesicle formation during vegetative or starvation conditions. To understand the initial step of vesicle formation, it is important to know how Atg proteins are recruited to the PAS. Atg11 facilitates PAS assembly for the cytoplasm to vacuole targeting (Cvt) pathway in vegetative conditions. To examine autophagy-specific PAS formation, an ATG11 deletion mutant was used to eliminate the PAS formation that occurs in vegetative conditions. We found that Atg1, Atg13 and Atg17 play a similar role for PAS formation under autophagy-inducing conditions as seen for Atg11 during vegetative growth. In particular, Atg1 is proposed to have dual roles for autophagy-specific PAS recruitment. Atg1 plays a structural role for efficient recruitment of Atg proteins to the PAS, which is mediated by interaction with Atg13 and Atg17. In contrast, Atg1 kinase activity is needed for dissociation of Atg proteins from the PAS during autophagy inducing conditions, a function which is also critical for autophagy activity. http://www.ncbi.nlm.nih.gov/pubmed/18552550

[comment from Jodi and Rama regarding the above 2 papers from the Klionsky lab] SGD curators discussed both papers and agreed that this isn't a good example of a case where you'd assign both a BP term and a regulation of the same BP term. (Rama has an example of a yeast gene that will work much better for that.) In terms of these papers, we didn't feel that the evidence was solid enough to assign a 'regulation of' term. It there was more evidence, we'd probably assign a regulation term like 'regulation of protein localization' or 'regulation of protein localization during autophagic vacuole formation' but not necessarily a corresponding 'protein localization' BP term. The BP term would likely be 'autophagic vacuole assembly' (which we already have for this gene).

Example 3

(from Ruth):

Upregulation of CLIC4 is required for Ca2+-induced keratinocyte differentiation

To determine if upregulation of CLIC4 is essential for Ca2+-induced differentiation in keratinocytes, we reduced CLIC4 level by expressing CLIC4 antisense in an adenoviral vector (AS-CLIC4 Ad) in keratinocytes exposed to 0.5 mM (Hi) Ca2+ medium. We have previously shown that this vector reduces CLIC4 expression in a dose-dependent manner (Suh et al., 2005 ). Under conditions in which AS-CLIC4 adenovirus (Ad) reduced CLIC4 and prevented the Ca2+-induced increase in CLIC4 expression, K1 and K10 proteins were not elevated at 24 hours after Ca2+ addition, and the expression of the late differentiation marker filaggrin was inhibited after 48 hours (Fig. 3A). Expression of AS-CLIC4 did not alter the endogenous level of PKC (Fig. 3A), suggesting that CLIC4 upregulation is an important component of the differentiation response to elevated Ca2+. To verify this observation by an independent differentiation marker, cell cycle parameters were monitored to determine if the rapid G1 cell cycle arrest associated with Ca2+-induced differentiation, was altered (Dotto, 1999 ). Within 24 hours of 0.5 mM Ca2+ exposure, BrdU pulse-labeled cells were absent when assayed by flow cytometry whereas the number in G0-G1 increased substantially (Fig. 3B,C). However, these changes were prevented by expression of CLIC4 antisense, and the proliferating population was maintained up to 48 hours as in the 0.05 mM Ca2+ medium (Fig. 3C). To verify CLIC4 is a specific CLIC family member that mediates the keratinocyte differentiation response, non-specific (NS) and specific shRNA constructs were developed for CLIC1 (sR-1), CLIC4 (sR-4) and CLIC5 (sR-5) and used for a transient knockdown of CLIC members (Fig. 3D,E). When transfected cells are induced to differentiate by 0.5 mM Ca2+, CLIC4 shRNA prevented upregulation of K10 but the other shRNAs did not. This result also distinguishes CLIC4 from other CLIC proteins that might have been influenced by AS-CLIC4 expression (Suh et al., 2005).

Which GO term should be associated with Q9QYB1 mouse Clic4:

'GO:0030216 ; keratinocyte differentiation' IMP
OR
GO:0045618 positive regulation of keratinocyte differentiation IMP?

The term GO:0071277 cellular response to calcium ion was also associated with this gene

Example 4

(from Serenella)

  • David said that 'if another process modulates the activity of one of those molecular functions, then it regulates the process'.
  • This makes me wonder what if I have a protein A which is a regulatory subunit annotated as 'molecular function regulator' .
  • Lets say that the 'molecular function' is part of a 'Biological process'.
  • Should this protein A be annotated as 'Biological process regulation' or 'Biological process'?
Example:

Protein A:

  • P85A_HUMAN (P27986)
  • MF:
  • GO; GO:0035014; phosphoinositide 3-kinase regulator activity.
  • BP:
  • DR GO; GO:0060396; growth hormone receptor signaling pathway.
  • DR GO; GO:0008286; insulin receptor signaling pathway.
  • DR GO; GO:0048009; insulin-like growth factor receptor signaling pathway.
  • DR GO; GO:0014065; phosphoinositide 3-kinase cascade.
  • DR GO; GO:0046854; phosphoinositide phosphorylation.

Suggestions for Quality Control procedures


Back to 2010_GO_camp_Meeting_Agenda


slide from Rachael