Reference Genome progress report for 2010 (Archived): Difference between revisions

From GO Wiki
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
Line 2: Line 2:
* Conference calls are held every month to discuss annotation target prioritization, annotation issues, development of the PAINT software, requirements for uploading PAINT generated GAF files, and other issues relevant to the reference genome group.  
* Conference calls are held every month to discuss annotation target prioritization, annotation issues, development of the PAINT software, requirements for uploading PAINT generated GAF files, and other issues relevant to the reference genome group.  


* Mi H, Dong Q, Muruganujan A, Gaudet P, Lewis S, Thomas PD. 2010. PANTHER version 7: improved phylogenetic trees, orthologs and collaboration with the Gene Ontology Consortium. Nuclei Acids Res 8(Database issue): D204-D210.


* Gaudet P, Lee E, Livestone, M, Dolinski K, Lewis SE, Thomas P. Functional Gene Ontology Annotation across Species using PAINT. Poster presented at the ISMB meeting. Boston, July 2010.


* Fisk D et al. (SGD), Title: "Annotation Across Species: SGD and the Reference Genome Project"; Model Organism to Human Biology Conference. June 2010


===Topic covered: ===
===Topic covered: ===

Revision as of 12:19, 17 November 2010

Publications/communications

  • Conference calls are held every month to discuss annotation target prioritization, annotation issues, development of the PAINT software, requirements for uploading PAINT generated GAF files, and other issues relevant to the reference genome group.
  • Mi H, Dong Q, Muruganujan A, Gaudet P, Lewis S, Thomas PD. 2010. PANTHER version 7: improved phylogenetic trees, orthologs and collaboration with the Gene Ontology Consortium. Nuclei Acids Res 8(Database issue): D204-D210.
  • Gaudet P, Lee E, Livestone, M, Dolinski K, Lewis SE, Thomas P. Functional Gene Ontology Annotation across Species using PAINT. Poster presented at the ISMB meeting. Boston, July 2010.
  • Fisk D et al. (SGD), Title: "Annotation Across Species: SGD and the Reference Genome Project"; Model Organism to Human Biology Conference. June 2010

Topic covered:

  • General description of the GO – Ontology, annotation, tools and technical aspects
  • GO browsers: AmiGO and QuickGO
  • Comparison of methods of annotation propagation: PAINT, Compara, HAMAP
  • Discussion of annotation practices: 'binding', 'response to', 'downstream effects', 'regulation', high throughput data, annotation of protein complexes

Outcome

Curation Targets

As of November 2009, the selection of the curation targets is done from a 'systems' perspective, for example with respect to a biological pathway. The rationale is that if targets encompass a single biological phenomenon the annotation will be more accurate and more complete since the curators will be able to familiarize themselves with the subject. The advantages of this approach are :

  • facilitates coordination with ontology development
  • makes it easier to do the annotations because we're addressing a single general area of biology
  • makes it possible to solicit the help of experts to help review the annotations and ensure that nothing is missing.

The first project done using this approach was lung branching morphogenesis. This project was difficult because available experimental data does not allow to clearly infer how specific proteins individually influence the development of the lung (which is the level at which GO annotations are captured)

In the it next phase we have started to annotate the Wnt_signaling_Pathway. We have annotated 9 families of proteins implicated in this pathway, both at the primary level and using PAINT to propagate annotations across all the proteins from the 48 species currently in Panther.

Progress for primary annotations

  • 461 families have been annotated by MODs.
  • 8,000 proteins annotated


PAINT: Software for annotation propagation

The software being developed for annotation propagation, PAINT, is now at version beta29. Many improvements have been made in terms of speed and functionality of the software. There are still some improvements to be made, but beta29 is a 'working version', in that it allows to produce valid GAF files that can be uploaded by the GO databases and the Model Organism Databases.


PAINT-based annotations

As of Nov 17, there are 17 families with annotations in the GO cvs directory: http://cvsweb.geneontology.org/cgi-bin/cvsweb.cgi/go/gene-associations/submission/paint/#dirlist

Can someone do stats - how many annotations, species, etc?


Visualizing PAINT annotations with Pantree

Pantree is a new website live since October 2010 that has been developed by Paul Thomas' group and that allows to view the annotations to the families done with PAINT. http://pantree.org/

Visualizing Reference Genome annotations in AmiGO

http://amigo.berkeleybop.org/cgi-bin/amigo/amigo?mode=homolset_summary&session_id=


Annotation reports

We now generate reports on the annotation status of PAINT families. Those reports indicate how many species contain homologs in a given family, how many members of each family exist in every species, how many members have experimental annotations associated with them, the date a member of the family was last annotated, etc. The reports can be viewed at: http://amigo-sven.princeton.edu/cgi-bin/amigo/phylotree?

Electronic annotation jamborees


Annotation camp

The 3rd Gene Ontology’s Annotation Camp was held from June 16-18 2010 at the Centre Medical Universitaire (CMU) Geneva, Switzerland. The Gene Ontology project (GO) provides a set of controlled vocabularies for use in annotation of gene products (http://geneontology.org/). Members from several model organism databases were represented, for a total of 63 attendees, including 40 from the SIB and 23 external delegates. This annotation camp aimed to update and refine the skills of GO biocurators, including the Swiss-Prot curation team. The major themes of the meeting covered processes difficult to represent in the Gene Ontology such as regulation, responses to stimulus, and protein complexes. The goal is to improve annotation consistency for GO users to have high quality data to support their work.

New GO curators

Swiss-Prot annotators are now doing GO annotation. Emily Dimmer and Rachael Huntley trained 34 annotators during the past year.

New Tree curators

In addition to Mike Livstone, 4 other curators have been trained to do PAINT curation: Rama Balakrishnan, Varsha Khodiyar, and (Nov 19) Li Ni and Dmitry Sitnikov