10 AUG 2010 RefGen Phone Conference (Archived)

From GO Wiki
Revision as of 11:56, 9 August 2010 by Pascale (talk | contribs)
Jump to navigation Jump to search

Present

wnt project

WNTs have cytokine activity [Ruth/Alex]

  • see PMID:18986540
  • Wnts identified as affecting the formation of synapses is controlling an effector function of cells.
  • Note due to lack of information provided in this paper only Wnt5a (R&D Systems only sells mouse Wnt5a) and Wnt7a (R&D Systems only sells human Wnt7a) can be annotated.

Update on annotations from all groups

  • Many groups seem to be up to date. Are there any comments/problems?
  • We have asked the ontology development to process ref genome SF items with higher priority. Please use the SF priority settings and set to a higher priority when you submit a ref genome request.


Tree annotation

  • Mike

Subfamilies annotation report from Mary Dolan

I have put together a report (subfamilies.xlsx posted ftp://ftp.informatics.jax.org/pub/curatorwork/Pascale/) that summarizes the GO annotation status (as of August 1,2010) of Panther subfamilies according to the type of annotation for each refG organism. This is similar to something I had done for the old PPOD clusters.

There are separate sheets for function, process, and component. For each subfamily that contains a refG organism member, I display the type (color-coded) of annotation: experimental (green), ISS-type (yellow), other (red), or none (an organism entry for the subfamily indicates that it has a subfamily member, the slot will be empty if the organism has none). I include ICs as experimental annotations; ND annotations are not counted. I have added a few mappings from panther ids to ga file ids for completeness.

The numbers on the 'annotation_counts' sheet should be useful information for the GO grant renewal in that they give an indication of how much data that is already available in MOD ga files from years of annotation effort might be used by other annotation groups via the refG/Panther project. I think the numbers are quite impressive.

For example, of the 69,566 panther subfamilies: 32,991 have one or more model organism members (14,583 have two or more refG members). Of these 32,991 refG member subfamilies: 24,265 have some GO function annotation to one or more organisms; 15,276 have either experimental or ISS annotation; and 11,653 have experimental annotation. The numbers are similar for process and component.

I see this as complementary to the careful group by group literature curation followed by PAINT review, which, of course, provides the best comprehensive annotation. This provides a view of the maximum possible gain of shared information we could get for 'free' for other refG MODs or other genome annotation groups for their organism's Panther subfamily genes.

The report also has sheets with various counts of things -- e.g. how many genes for each organism in each subfamily. I have also done this report for panther families but I think the idea of shared information might apply more to subfamilies.

Hoping this will be useful, I look forward to your comments. Thanks. Mary

Mary E. Dolan, Ph.D.

Action items from last call


Action items for the Sept GOC meeting

  • Action: All annotation groups to review PAINT annotation sets with a view to integrating them into their association files [all groups]. Manager: Pascale
    • Are all groups okay with integrating annotations in terms of the review process?
  • Action: Reference Genome group needs to explore ways of improving the target list selection method to ensure a greater participation by all curation groups [Pascale and Kara].
    • Now doing wnt pathway. Do we have suggestions for future projects?
  • Action: Figure out a mechanism by which taxonomic restrictions derived from PAINT could be fed back into the ontology taxon restrictions file [Paul and Jane]. Manager: Jane.
    • Pascale emailed Jane and Paul to check the status of that one.

PAINT GAF files

Following last month's call: Suzi will write a script to run on a regular basis (daily? weekly?) either from Berkeley or at Stanford. This script will generate 13 GAF files and place them in the go/gene-associations directory where the other gaf files are kept. These files will be named thusly:

gene_association.paint_XXX.gz (where XXX will be substituted by the name of one of the ref. genome organisms) gene_association.paint_other.gz (for the platypus, opossums, and all the other non-ref. genome organisms)

If a MOD curator decides they would like to look at a particular annotation they would still have the option of retrieving all the files needed to load that particular family into PAINT from the file directory located here: go/gene-associations/submission/paint/PTHRnnnnn (where nnnnn is the number of that family). The script will be using these as the source to build the files listed above.