Ontology meeting 2012-05-23

From GO Wiki
Jump to: navigation, search

MINUTES: JANE

45-minute call


SVN WORKFLOW

Chris will expand on the following:

"Before editing:

cd editors/

svn update

if there are any conflicts, choose "(p)" postpone. Then edit the gene_ontology_write.obo file and resolve the conflicts.

This should be exactly the same as the old cvs workflow.

After you have resolved these, type:

svn resolved gene_ontology_write.obo


PS: What is the svn command for checking out a new folde if we want to grab a clean copy? (svn co [path] isn't sufficient it seems).

Jane and TOny have solved this... just remove the file (rm) and update (up) to get a new version (as long as folder is in place).

FOLLOW-UP ON CHEBI ALIGNMENT

  • Follow-up on all action items from last week:

http://wiki.geneontology.org/index.php/Ontology_meeting_2012-05-16#MINUTES

  • Reviewing links to be removed

http://build.berkeleybop.org/job/build-go-xp-chebi/lastSuccessfulBuild/artifact/main/go-unentailed-assertions.txt

This is the build artefact of most interest - it contains all of the links in the GO process graph between defined classes that *cannot* be re-inferred using CHEBI. We only try and re-infer links between two classes where both classes have a logical definition referencing CHEBI.

(T/D) Still something off:

SubClassOf(GO:0015746 'citrate transport' GO:0015711 'organic anion transport')

but

go_xp_chebi.obo

  [Term]
  id: GO:0015746 ! citrate transport
  intersection_of: GO:0006810 ! transport
  intersection_of: results_in_transport_of CHEBI:16947 ! citrate(3-)

and

http://www.ebi.ac.uk/chebi/chebiOntology.do?chebiId=CHEBI:16947


  • Open issues with adding chemical xps are listed in this wiki (to be edited as we go along):

http://wiki.geneontology.org/index.php/Adding_missing_CHEBI_xps

In particular: shall we go ahead and delete the xps for "secretion" terms for now? And/or resolve on a more complex relation such as e.g.

results_in_transport_out_of_anatomical_structure?


  • Also: how are we going to deal with metabolic terms in GO where the CHEBI entry is only a role in ChEBI?

E.g. see http://sourceforge.net/tracker/?func=detail&aid=3524826&group_id=36855&atid=440764

secondary metabolite (CHEBI:26619) is_a role only in ChEBI.


  • Related: what parentage for GO terms such as GO:0042417 ! dopamine metabolic process?

See email thread on the go-ontology mailing list; Chris suggests the following as an alternate plan:

"we treat CHEBI roles **as if they were structural classifications**, and if we want to indicate the context, **we do this ourselves using GO, rather than the CHEBI role hierarchy**".


 secondary metabolite metabolism ; GO:NEW
 --[isa]endocrocin metabolic process ; GO:1900600

  • DH: chemical HAS_ROLE in CHEBI means that the chemical can have the role. The classic example is dopamine, which is NOT ALWAYS a neurotransmitter (e.g. outside of the nervous system and in plants). Therefore, the problem with putting all these chemicals under 'secondary metabolite metabolism' is that they might not always be secondary metabolites in all organisms.

At least 3 options:

  1. Add the 'x metabolism' terms under 'secondary metabolite metabolism ; GO:NEW'. Remove this parentage IF a paper shows that the chemical is NOT a secondary metabolite in a given organism/context.
  2. Create child terms under 'secondary metabolite metabolism ; GO:NEW'. E.g. endocrosin secondary metabolite metabolism ; GO:NEW'. Would create alot of work, and alot of redundant terms.
  3. Capture this at the annotation stage by linking the two terms. Annotate to 'endocrocin metabolic process ; GO:1900600' with 'is_a:secondary metabolite metabolism ; GO:NEW', or 'HAS_ROLE: secondary metabolite' in column 16.

Progress on involved_in logical definitions

There are four categories of terms:

  1. straightforward (GO:0043689),
  2. terms where a genus needs to be added to make the cross products (GO:0021812)
  3. terms where additional relationships are appropriate that are not necessarily specified by the logical definitions (GO:0086071)
  4. terms that have more than one differentium (GO:0060136).

BP-CC cross-products

http://cvsweb.geneontology.org/cgi-bin/cvsweb.cgi/go/scratch/xps/biological_process_xp_cellular_component.obo

Current relationships in this document:

  results_in_transport_of
  results_in_transport_from
  results_in_transport_to
  results_in_transport_towards
  results_in_transport_involving
  results_in_transport_along
  results_in_transport_across
  results_in_transport_to_from_or_in
  results_in_assembly_of
  results_in_disassembly_of
  results_in_organization_of
  results_in_fusion_of
  results_in_fission_of
  results_in_localization_of
  results_in_morphogenesis_of
  results_in_connection_of
  results_in_distribution_of
  results_in_remodeling_of
  results_in_maturation_of
  mediated_by
  occurs_in
  has_output
  dependent_on
  has_level
  part_of

*AI: Cam-Eds to take care of these. Either need to arrange another call with the annotators (e.g. Val, Midori, Emily) and editors (including David, Chris) to get the relationships sorted. Or just go ahead and put the transport XPs straight in the file.

  • DH: 'has_target' is more specific than 'has_input'. E.g. for protein phosphorylation, both ATP and the protein substrate are inputs, but only the protein substrate is the target.
  • This is subjective, depending on what you're interested in. E.g. the MENGO group are interested in by-products and consider these to be the important output for a reaction.
  • This may not matter because at the annotation stage, for the protein phosphorylation example, you would only put the protein substrate ID in the annotation extension column.
  • For 'regulation of transcription', what is the target? The gene or the protein that comes out.