Difference between revisions of "Annotation Call November 11, 2014"

From GO Wiki
Jump to: navigation, search
(Minutes)
(Col-16)
Line 11: Line 11:
 
<pre> PMID:22426534 uses ChIP to show that pombe Pfh1
 
<pre> PMID:22426534 uses ChIP to show that pombe Pfh1
 
localizes to regions within chromatin where the DNA sequence contains
 
localizes to regions within chromatin where the DNA sequence contains
replication fork barriers (e.g. SO:0001914) and at some specific highly transcribed genes.  I can easily annotate to nuclear chromatin in CC, but there's
+
replication fork barriers (e.g. SO:0001914) and at some specific highly transcribed genes.  I can easily  
 +
annotate to nuclear chromatin in CC, but there's
 
important specificity to put in extensions if possible.</pre>
 
important specificity to put in extensions if possible.</pre>
  

Revision as of 08:18, 11 November 2014

Agenda

GAF checks on Jenkins Dashborad

http://build.berkeleybop.org/view/GAF/

Col-16

1) For terms like transport and transporter activity, transcription factor/transcription do we have guidelines on if we should add col-16 data to both aspects?

2) chromatin, new relationship 'RO_0002008: Conincident with' (no underscore)

 PMID:22426534 uses ChIP to show that pombe Pfh1
localizes to regions within chromatin where the DNA sequence contains
replication fork barriers (e.g. SO:0001914) and at some specific highly transcribed genes.  I can easily 
annotate to nuclear chromatin in CC, but there's
important specificity to put in extensions if possible.

Application of the new RO_0002008: Conincident with relationship: definition "A relation that holds between two linear structures that are adjacent or overlapping and are approximately parallel to each other for their entire length. "^^string comment "Example: if we define region of chromosome as any subdivision of a chromosome along its long axis, then we can define a region of chromosome that contains only gene x as 'chromosome region' that coincident_with some 'gene x', where the term gene X corresponds to a genomic sequence."

Resulting annotation: pombe Pfh1 GO:0000790 nuclear chromatin IDA, Annotation extension:RO_0002008: Coincident with SO:0001914 replication fork barriers

Annotation of ChIP data

Discussed [wiki page] and [source forge] In summary A ChIP experiment does not indicate whether interaction is direct or not since chromatin complexes are all cross-linked with DNA, they only show colocalization.

General agreement that it is acceptable to annotate to the Cellular Component term 'chromatin' (or child terms) with SO and Gene IDs in the annotation extension field using the 'Coincident with' relationship (as above). Use SO terms to specify the DNA element (e.g. UAS, FRE, TCS etc)

Need clarification on annotation to a Molecular Function term: chromatin binding (or child terms)

Is it acceptable to annotate to chromatin binding (MF term, or child terms) based on ChiP data; with occurs_at SO and/or has_direct_input Gene IDs in the annotation extension field?

Need to consider whether the authors have used appropriate controls if including SO and Gene IDs

Is it acceptable to annotate to DNA binding (MF term, or child terms) based on ChiP data if the authors are confident that the protein is a DNA binding protein? (e.g. http://www.pombase.org/spombe/result/SPBC32H8.11).

Next Call (Preview)

small conjugating enzyme ontology development (DavidH and Val)

1) The ontology edits are complete for all of the ubiquitin and other small conjugating enzymes. There are three parents: activating enzyme activity (E1), conjugating enzyme activity (E2) and ligase activity (E3)

2) Now all of the annotations for these need to be checked. We need a strategy for this re-annotation.

Checkpoint terms guidelines

Minutes

In attendance: Rama, Moni, Suzi, Edith, Kimberly, Midori, Donghui, Stacia, Pascale, Rachael, Prudence, Judy, Susan, Jane, Paola, Alex, David


  • Jenkins dashboard
    • several different files
    • GAF validation report - tells which are the offending rows and why
    • we want feedback if the reports are not informative enough
    • will report both hard and soft checks (e.g., do not manually annotate)
    • GAF summary - how many rows were flagged, what ontologies were used, etc.
    • gene_association - inferences made using inter-ontology links
    • Please look at the validation reports and if the report doesn’t make sense give feedback on how to make it better.
    • Blue ball - all good
    • Yellow - a few offending rows
    • Red - lots of offending rows
    • Rachael - not all GOA files have links to error reports
      • Rama - submitted a github ticket for this, but will need to follow up with Heiko
  • Web site editing - use Drupal system
    • Can get an account from Seth if needed
  • Column 16
    • transport, transporter activity
    • transcription, transcription factor activity
    • Are targets captured in both MF and BP?
    • Groups do different things - some in both, others just in MF.
    • Kimberly - add to function for enzyme/substrate
    • need to check about transcription - may add to both
    • Ruth - add to both
    • David - may not be consistent with other groups
    • will need to review annotations once ontology group works out what relations to use
    • have put extensions in both MF and BP, but these may be chained
    • ontology group has been going through the annotation relations, some are being removed, others better defined
    • ontology group will come up with guidelines