10 NOV 2009 RefGen Phone Conference (Archived)

From GO Wiki
Revision as of 12:04, 9 November 2009 by Karadolinski (talk | contribs) (Proposed SOP for incorporating GAF files)

Jump to: navigation, search

Incorporating new protein family annotations via GAF files

  1. How are you handling externally submitted GAF files right now? Primarily, these are the GAF files from GOA. What is the existing process ? For example:
    • Are there manual and/or automated verifications (redundancies, quality of annotations) before a file from GOA gets integrated in your database?
    • How are existing IEA annotations handled ?
    • What is the frequency of incorporation ?
    • How are the accepted annotations from GOA loaded into the MOD ?
    • What appears in their own GAF files they send to the GO site as the annotation source ?

Proposed SOP for incorporating GAF files

Steps to the review process (monthly to start):

  1. Start on the central protein family annotation page here: http://wiki.geneontology.org/index.php/GAFs_for_trees-based_annotations
  2. View notes/summary on family: http://wiki.geneontology.org/index.php/PANTHER10977
  3. Review the annotations: from the above page, we will have summaries, visual representations of the annotations, and tables to peruse. Are the two display options for viewing the annotations just discussed adequate for this review process? While reviewing these proposed annotations is up to the MOD, if you do review them, we especially encourage you to answer the following for each PAINT ISS:
    1. Is the annotation consistent with what you know about the gene to date?
    2. Is the inference justified by the relationship of the protein to the proteins from which the inference was drawn?
    • Review of the reference genome GAFs will necessarily be different than GOA. Unlike GOA, which provides a mix of IEA and literature based annotations, these will all be ISS.
    • The primary issue is whether you know of any evidence that might contradict the ISS that has been made based on protein family.
    • Eventually you may simply incorporate them automatically without manual review
  4. Provide feedback to the particular PAINT curator, or if all is well, just download the GAF. Final GAFs will be available from central GO site.
  5. Are there ISS annotations for your species ?
    1. If yes: then if you see mistakes or have questions contact the ref. genome curator who made the inference for corrections. Note the reason for the correction in the annotation guidelines for the future
    2. If no: then you're done
  6. Repeat the above until all questionable annotations have been resolved
  7. Download the GAF file from the geneontology web site
  8. Load the GAF file into your MOD
  9. Submit comprehensive GAF file for your species as usual

Back to Conference_Calls