RGD December 2016
RGD, The Rat Genome Database, December 2016
1. Staff working on GOC tasks
RGD Admin: Mary Shimoyama
GO Curators: Stan Laulederkind, Tom Hayman, Shur-Jen Wang, Victoria Petri (~1.5 fte, 0 funded by NHGRI GOC grant)
IT staff associated with GO related projects such as the development of the online curation tool and of pipelines, the updates/loads of GO ontologies in the database and the generation and submission of RGD Gene Association files: Omid Ghiasvand, Marek Tutaj, Jeff DePons (1 fte, 0 fte funded by NHGRI GOC grant)
2. Annotation progress
|Gene Products||Annotations 2015||Annotations 2016||% Change|
|39,206||547,064 (314,567 non-IEA)||599,250 (323,853 non-IEA)||+9.5%|
The table above is based on a review of the GOC gene_association.rgd files from December 2015 to December 2016. The number of manual annotations from RGD has increased from 49,791 to 52,136 (+ 2,345 annotations, +4.7%) and the number of genes with manual annotations has increased from 5,998 to 6,260 (+262, +4.4%).
3. Methods and strategies for annotation
Because the pipelines for GO annotations are automated and updated weekly, all of the curators’ efforts are involved in manual annotation. Although RGD curators also annotate to other ontologies, approximately 25% of their curation efforts have been related to GO annotations in the past year.
a. Literature curation: RGD targets gene sets for manual curation and all rat papers published about those genes are curated. In 2016, there have been 2 major types of gene datasets curated:
- disease related: hematologic disease genes and development-related disease genes
- genes involved in targeted metabolic, signaling, regulatory, and disease pathways.
b. Computational annotation strategies:
- Rat genes manually curated by other groups are brought in electronically from GOA with their associated evidence codes and the originating group acknowledged in the source.
- ISO - RGD is not currently doing manual annotation with ISO. ISO annotations are created through our automated pipelines that map GO annotations from mouse genes over to their Rat orthologs. For each mouse gene that has a confirmed rat ortholog, if the GO annotation to the Mouse gene is of evidence type IDA, IMP, IPI, IGI or IEP then the annotation is loaded onto the rat ortholog as an ISO annotation.
- IEA - rat annotations based on GO mapping to InterPro, Enzyme Commission and Swiss-Prot keywords, are brought in electronically with IEA evidence code from GOA. Annotations from GOA for all categories are updated weekly.
c. Priorities for annotation: There are several ways in which RGD assigns priorities for the annotation of genes to GO ontology terms. These include: genes associated with targeted disease categories, genes involved in particular biological pathways, and genes associated with specific QTLs.
4. Presentations and publications
a. Papers with substantial GO content
- Shimoyama M, Laulederkind SJ, De Pons J, Nigam R, Smith JR, Tutaj M, Petri V, Hayman GT, Wang SJ, Ghiasvand O, Thota J, Dwinell MR. Exploring human disease using the Rat Genome Database. Dis Model Mech. 2016 Oct 1;9(10):1089-1095.
b. Presentations including Talks and Tutorials and Teaching
c. Poster presentations with GO content
5. Other Highlights
A. GO terms and related contributions by RGD
RGD has contributed 150 - 160 new terms, new synonyms, or definition/synonym/spelling corrections to GO from December 2015 to December 2016.
B. Annotation outreach and user advocacy efforts
C. Other highlights
Education Video tutorials, available on YouTube, Vimeo.com, and rgd.mcw.edu.
- OLGA (Object List Generator tool) Tutorial- including information on using curated GO data
- Gene Annotator Tutorial- including information on using curated GO data