Binding Terms Conference Call Information

From GO Wiki
Jump to navigation Jump to search

Back to binding terms working group discussion

What problems are we trying to solve?

This issue was originally brought up in the GOC meeting in Oregon [binding minutes]

This meeting identified that

  1. The documentation is confusing on the proper use of binding
  2. There were conflicting views about whether or not GO should include catalytic substrate annotations such as 'ATP binding' and the problem of including both substrate and product from a catalytic reaction.
  3. Most people agreed that GO should capture non-transformative binding, eg. binding of X resulting in an allosteric change to the thing doing the binding.
  4. Perhaps cross product annotations should be used to describe majority of binding annotations (see Annotation_Cross_Products#binding_example)
  5. There was a concern about how limiting 'binding' annotations to non-catalytic interactions may affect queries for genes involved in 'ATP binding', for example, researchers might reasonably expect to get back kinases by such a query.
  6. It was unclear whether there should be a transfer of 'binding' term annotations via ISS/ISO

ACTION ITEMS: Peter (lead), Ruth, Debby, Jim form a working group to examine the issues raised in the discussion. Should GO capture catalytic binding? Mike, Ben, Emily, David also joined this working group.

Binding terms survey

This survey was written to address the issue: Should GO capture catalytic binding?

The way we capture catalytic binding may change the decision on whether or not we should capture catalytic binding. However, this is a bit of a chicken and an egg situation as deciding that catalytic binding should not be captured at all will mean that discussions about how to capture catalytic binding would be irrelevant.

For more information/comments about options to capture catalytic binding please see:

The results for the survey are available as a chart and in brackets (x) within the options of a copy of the survey.

Comments from Debby

The current binding terms discussion started in response to the issue of consistency of annotation among curators in the use of binding terms. Some groups are using binding terms to annotate substrate binding for enzymes and transport proteins and others aren't. I think that the major source of confusion and inconsistency is that for most curators it makes sense to annotate that an enzyme or transporter binds its substrate. From my point of view as a curator, my recommendations would be to allow GO:0005488 "binding" to be used for substrate binding, but discourage curators from using it for this purpose. This can be achieved by making the GO documentation on binding terms clearer, rewriting GO term definitions, adding appropriate usage suggestions to Amigo, and remove usage suggestions that suggest annotating a catalytic or transport activity to a binding term.

I think that this approach would answer the concerns expressed about deleting information from GO. It would also make it easier to deal with situations such as the one Emily raised (re PMID:10980193) where binding of GTP has been experimentally determined, but GTPase activity, while likely, is still only predicted based on amino acid sequence similarity. There should probably also be guidelines that restrict the creation of new child binding terms, which is another way of educating curators about correct usage. Personally, I think it makes sense to capture the identity of whatever is being bound by using column 16 to hold the CHEBI or UniProt ID, but this may be a different, altho' related issue.

Comments from Ben

Plus additional comments

I think there is a fundamental difference in how some (annotation) groups view GO. I (and curators at SGD share this opinion) feel that one should not use the GO to attempt to annotate any and all information about gene products that appear in any given literature reference. It is simply a ridiculous task for the GOC to attempt this, and I feel that this is very clearly illustrated in the "divide" regarding Binding terms.

I feel that while every little datum is certainly important, and deserves to be captured _somewhere_, adding a GO term annotation for it is often inappropriate, and generally harmful to the homogeneity of GO annotations. The GOC has always, for better or worse, allowed individual MODs and curation groups to decide what and how to curate, and while this flexibility is often warranted, based on the scope and depth of the literature (for a given organism or group of organisms) it also leads to large discrepancies in annotation practice and makes comparing cross-organism GO information "fun", to say the least.

That being said - I feel that lists of substrates and cofactors, including allosteric interactions aka "nontransformative binding" - i.e, "X binding" should NOT be captured in GO.

The last grey areas remaining are situations where an experiment demonstrates binding to "X" and possibly infers some catalytic activity thereof, but does not demonstrate it. I agree that "partial" information is tricky to deal with. If we can denote it in a sensible way, i.e, "this gene product has the property 'ATP binding' (via IDA) but the purpose of that binding is currently Unknown" I would be in favor of using binding terms in this way.

The advantage of removing 'substrate binding terms' would be to:

  • enable GO curators to concentrate on annotations which are unique to GO rather than those which are covered by other databases.
  • reduce the inconstancies that exist between different databases and different annotation procedures.

Comment from Peter

... so annotations like these should not be allowed? Both the computational ones and the manual one?

Comments from Jim

  • Focusing on the question at hand: Should GO capture catalytic binding?. I agree with Ben that GO annotation should not try to annotate substrates for enzymes. Capturing "catalytic binding" would involve a change in the publicly goals of GO in existing documentation and publications.
  • However, given as long as there are "x binding" terms in the ontology, I predict that no amount of documentation will lead to uniform annotation practice with respect to the use of these terms (see Peter's question above). This will lead to inconsistencies in data mining annotations to "x binding" that we should simply accept as a weakness of GO where the proposed cures are worse than the disease.
    • I do not understand Ben's objection to 'substrate binding terms' because as far as I know 'substrate binding' does not currently exist in GO; x binding terms do not distinguish between substrate binding and other forms of binding, AFAICT. My extremely long-winded other comments are somewhat independent of whether or not "catalytic binding" is embraced or not. "Nontransformative binding" should be discussed separately. It may be possible to remove x binding terms altogether. However, I would argue includes terms such as "x carrier activity" or "x transporter activity" face the same proliferation issues that I raised for binding, which Chris does not think are serious (He hasn't convinced me, but I expect to be outvoted). How to handle, x binding remains a problem, in my view, but is beyond the scope of the present discussion.

Bottom line, I recommend no change in existing policy on substrates (recommend that curators don't annotate them), but no additional effort for enforcement of this policy beyond adding usage notes to AmiGO and GONUTS.