Reference Genome progress report for 2010 (Archived)

From GO Wiki
Jump to navigation Jump to search

Publications/communications

  • Conference calls are held every month to discuss annotation target prioritization, annotation issues, development of the PAINT software, requirements for uploading PAINT generated GAF files, and other issues relevant to the reference genome group.


Annotation camp

The 3rd Gene Ontology’s Annotation Camp was held from June 16-18 2010 at the Centre Medical Universitaire (CMU) Geneva, Switzerland. The Gene Ontology project (GO) provides a set of controlled vocabularies for use in annotation of gene products (http://geneontology.org/). Members from several model organism databases were represented, for a total of 63 attendees, including 40 from the SIB and 23 external delegates. This annotation camp aimed to update and refine the skills of GO biocurators, including the Swiss-Prot curation team. The major themes of the meeting covered processes difficult to represent in the Gene Ontology such as regulation, responses to stimulus, and protein complexes. The goal is to improve annotation consistency for GO users to have high quality data to support their work.


Topic covered:

  • General description of the GO – Ontology, annotation, tools and technical aspects
  • GO browsers: AmiGO and QuickGO
  • Comparison of methods of annotation propagation: PAINT, Compara, HAMAP
  • Discussion of annotation practices: 'binding', 'response to', 'downstream effects', 'regulation', high throughput data, annotation of protein complexes

Outcome

Curation Targets

As of November 2009, the selection of the curation targets is done from a 'systems' perspective, for example with respect to a biological pathway. The rationale is that if targets encompass a single biological phenomenon the annotation will be more accurate and more complete since the curators will be able to familiarize themselves with the subject. The advantages of this approach are :

  • facilitates coordination with ontology development
  • makes it easier to do the annotations because we're addressing a single general area of biology
  • makes it possible to solicit the help of experts to help review the annotations and ensure that nothing is missing.

The first project done using this approach was lung branching morphogenesis. This project was difficult because available experimental data does not allow to clearly infer how specific proteins individually influence the development of the lung (which is the level at which GO annotations are captured)

In the it next phase we have started to annotate the Wnt_signaling_Pathway. We have annotated 9 families of proteins implicated in this pathway, both at the primary level and using PAINT to propagate annotations across all the proteins from the 48 species currently in Panther.

Annotation progress

461 families have been annotated by MODs.

PAINT: Software for annotation propagation

The software being developed for annotation propagation, PAINT, is now at version beta29. Many improvements have been made in terms of speed and functionality of the software. There are still some improvements to be made, but beta29 is a 'working version', in that it allows to produce valid GAF files that can be uploaded by the GO databases and the Model Organism Databases.


PAINT-based annotations

As of Nov 17, there are 17 families with annotations in the GO cvs directory: http://cvsweb.geneontology.org/cgi-bin/cvsweb.cgi/go/gene-associations/submission/paint/#dirlist

Can someone do stats - how many annotations, species, etc?


Visualizing PAINT annotations with Pantree

Pantree is a new website live since October 2010 that has been developed by Paul Thomas' group and that allows to view the annotations to the families done with PAINT. http://pantree.org/

Visualizing Reference Genome annotations in AmiGO

http://amigo.berkeleybop.org/cgi-bin/amigo/amigo?mode=homolset_summary&session_id=


Annotation reports

We now generate reports on the annotation status of PAINT families. Those reports indicate how many species contain homologs in a given family, how many members of each family exist in every species, how many members have experimental annotations associated with them, the date a member of the family was last annotated, etc. The reports can be viewed at: http://amigo-sven.princeton.edu/cgi-bin/amigo/phylotree?

Electronic annotation jamborees


New annotators

We are very pleased to announce that Swiss-Prot annotators are now doing GO annotation. Emily Dimmer and Rachael Huntley trained 34 annotators during the past year.