TAIR, September 2009
Revision as of 19:32, 16 September 2009
TAIR, The Arabidopsis Information Resource, September 2009 / WORKING VERSION, NOT FINAL VERSION
1. Staff working on GOC tasks
Tanya Berardini, Donghui Li
The total number of FTEs working on GOC tasks is 1.4.
2. Annotation progress
Table 1: Number of Annotations to Various GO Aspects (TO BE UPDATED)
Annotations | BP (12/07) | BP (10/08) | change | MF (12/07) | MF (10/08) | change | CC (12/07) | CC (10/08) | change |
---|---|---|---|---|---|---|---|---|---|
non-IEA/non-ND | 11038 | 13757 | + 2719 | 9048 | 9648 | + 600 | 5976 | 18281 | + 12305 |
IEA | 6627 | 6584 | - 43 | 5062 | 5575 | + 513 | 10334 | 8767 | - 1567 |
ND | 9062 | 8027 | - 1035 | 2453 | 2267 | - 186 | 8693 | 7302 | - 1391 |
Table 2: Number of Genes Annotated to Various GO Aspects
Genes | BP (12/07) | BP (10/08) | change | MF (12/07) | MF (10/08) | change | CC (12/07) | CC (10/08) | change |
---|---|---|---|---|---|---|---|---|---|
non-IEA/non-ND | 5897 | 6761 | + 864 | 6247 | 6575 | + 328 | 3787 | 6953 | + 3166 |
IEA | 4516 | 4309 | - 207 | 2833 | 2676 | - 157 | 8085 | 6904 | - 1181 |
ND | 8368 | 7973 | - 395 | 2286 | 2219 | - 67 | 8080 | 7268 | - 812 |
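For readers who want to regenerate counts like those in Tables 1 and 2, the following is a minimal sketch of one way to tally a gene association file by evidence class and GO aspect. It assumes the standard tab-delimited GO annotation file layout (object ID in column 2, evidence code in column 7, aspect in column 9, with aspects coded P, F, and C for BP, MF, and CC). The function names and the sample rows are hypothetical, and this is not necessarily how the figures in this report were produced.

```python
from collections import Counter, defaultdict


def evidence_class(code):
    """Collapse an evidence code into the three classes used in the tables."""
    return code if code in ("IEA", "ND") else "non-IEA/non-ND"


def tally(gaf_lines):
    """Count annotations and distinct genes per (evidence class, aspect).

    Assumes tab-delimited GAF rows; lines starting with '!' are comments.
    """
    annotation_counts = Counter()
    genes = defaultdict(set)
    for line in gaf_lines:
        if not line.strip() or line.startswith("!"):
            continue
        cols = line.rstrip("\n").split("\t")
        gene_id, evidence, aspect = cols[1], cols[6], cols[8]
        key = (evidence_class(evidence), aspect)
        annotation_counts[key] += 1
        genes[key].add(gene_id)
    gene_counts = {key: len(ids) for key, ids in genes.items()}
    return annotation_counts, gene_counts


def make_row(gene_id, go_id, evidence, aspect):
    """Build a hypothetical 15-column GAF row for demonstration only."""
    cols = ["TAIR", gene_id, "sym", "", go_id, "ref", evidence, "", aspect,
            "", "", "gene", "taxon:3702", "20090916", "TAIR"]
    return "\t".join(cols)


# Illustrative rows; gene IDs and annotations are made up for the example.
sample = [
    "!gaf-version: 1.0",
    make_row("gene1", "GO:0000001", "IDA", "P"),
    make_row("gene1", "GO:0000002", "IMP", "P"),
    make_row("gene2", "GO:0000003", "IEA", "C"),
    make_row("gene3", "GO:0000004", "ND", "F"),
]
annotation_counts, gene_counts = tally(sample)
```

Under this scheme a gene with both experimental and IEA annotations in the same aspect is counted once in each row, which is why the gene rows of a table built this way need not sum to the number of annotated genes.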
3. Methods and strategies for annotation
a. Literature curation: We continue to put most of our effort (95%) into annotation of gene products from the literature.
b. Computational annotation strategies: With every genome release, we rerun two computational GO annotation pipelines, one based on the InterPro2GO mapping and the other on a TargetP analysis. The results are integrated into our GO annotation file. Together these pipelines represent roughly 5% of our annotation effort. We also integrate GOA Arabidopsis GO annotations into our gene association file, so that all Arabidopsis annotations, regardless of original source, are now relayed to GO via TAIR with the appropriate source attribution.
c. Priorities for annotation:
(1) literature of any age pertaining to Reference Genome genes,
(2) literature describing the characterization of previously undescribed ('novel') genes,
(3) recent literature from high impact factor journals
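As an illustration of the InterPro-based pipeline mentioned under item b, the sketch below shows how domain hits could be turned into IEA annotations using a mapping file in the interpro2go style. It assumes mapping lines of the form `InterPro:IPR... description > GO:term name ; GO:nnnnnnn` with `!` marking comments. The function names, the gene identifier, and the sample domain hit are hypothetical, and TAIR's actual pipeline may differ in every detail.

```python
import re
from collections import defaultdict

# Matches one mapping line, capturing the InterPro accession and the GO ID.
MAPPING_LINE = re.compile(r"^InterPro:(IPR\d+)\s.*;\s*(GO:\d{7})\s*$")


def parse_interpro2go(lines):
    """Return {InterPro accession: set of GO IDs} from mapping-file lines."""
    mapping = defaultdict(set)
    for line in lines:
        if line.startswith("!"):
            continue  # header or comment line
        match = MAPPING_LINE.match(line.strip())
        if match:
            mapping[match.group(1)].add(match.group(2))
    return mapping


def infer_annotations(gene_domains, mapping):
    """Yield sorted (gene, GO ID, evidence code, source) tuples.

    gene_domains maps a gene ID to the InterPro accessions found in its
    protein. Every inferred annotation carries the IEA evidence code and
    records the InterPro entry that triggered it.
    """
    annotations = set()
    for gene, domains in gene_domains.items():
        for ipr in domains:
            for go_id in mapping.get(ipr, ()):
                annotations.add((gene, go_id, "IEA", f"InterPro:{ipr}"))
    return sorted(annotations)


# Illustrative input only.
mapping_lines = [
    "!example mapping file",
    "InterPro:IPR000003 Retinoid X receptor/HNF4 > GO:DNA binding ; GO:0003677",
]
mapping = parse_interpro2go(mapping_lines)
inferred = infer_annotations({"GENE_A": ["IPR000003", "IPR999999"]}, mapping)
```

Domains with no entry in the mapping (here the made-up `IPR999999`) simply produce no annotation, so rerunning after each genome release only requires a fresh set of domain hits and the current mapping file.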
4. Presentations and publications
a. Papers with substantial GO content - none
b. Presentations including Talks and Tutorials and Teaching
Kate Dreher, Extracting Information from Scientific Papers: Challenges and Opportunities for Researchers and Curators. June 2009. Carnegie Institution, Stanford CA
c. Poster presentations - none
5. Other Highlights
A. Ontology Development Contributions
- GO terms contributed by TAIR
Between October 2008 and September 2009, Donghui Li submitted 101 SourceForge term requests on behalf of TAIR curators (each request may contain multiple terms). Of these 101 requests, 99 have been closed, and 122 new GO terms have been created.
Tanya Berardini, working with David Hill of MGI, continues to work on:
(1) quality control reports generated by OBOL and the reasoner, both within OBO-Edit and in external scripts. This is an ongoing effort; we address issues as they arise. [[1]]
(2) regulation-related SourceForge (SF) items submitted by the GO community.
(3) development-specific ontology work. Both curators attended the annual meeting of the Society for Developmental Biology in San Francisco in July 2009. Ontology improvements from this meeting are detailed here: [[2]]
B. Annotation outreach and user advocacy efforts
- TAIR/Plant Physiology collaboration
The collaboration to collect functional information about Arabidopsis genes from authors at the time of submission to Plant Physiology continues. We have implemented an AJAX auto-complete feature in the webform [[3]] that suggests GO and PO terms pulled from the TAIR database.
- TAIR/Plant Journal collaboration
We have also begun a new collaboration (going live in September 2009) with The Plant Journal (TPJ), similar to the one with Plant Physiology. For TPJ, authors are asked to fill in and submit a spreadsheet containing the functional annotation. This file is considered supplemental data and will be published with the article. TPJ will forward the spreadsheets to TAIR after the manuscripts have been accepted for publication.
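The auto-complete feature in the Plant Physiology webform suggests GO and PO terms as the author types. The report does not describe how suggestions are generated, so the following is only a minimal sketch of one common approach: case-insensitive prefix matching over a sorted list of term names. The `TermSuggester` class, its parameters, and the small term list are hypothetical; a real service would query the TAIR database.

```python
import bisect


class TermSuggester:
    """Suggest ontology terms whose names start with the typed text."""

    def __init__(self, terms):
        # terms: iterable of (term_id, name); sort by lowercased name so
        # that all names sharing a prefix sit next to each other.
        self._entries = sorted(
            (name.lower(), name, term_id) for term_id, name in terms
        )
        self._keys = [entry[0] for entry in self._entries]

    def suggest(self, typed, limit=10):
        prefix = typed.strip().lower()
        if not prefix:
            return []
        # Binary search for the first name that is >= the prefix.
        start = bisect.bisect_left(self._keys, prefix)
        results = []
        for key, name, term_id in self._entries[start:]:
            if not key.startswith(prefix) or len(results) >= limit:
                break
            results.append((term_id, name))
        return results


# Illustrative term list only.
suggester = TermSuggester([
    ("GO:0009507", "chloroplast"),
    ("GO:0009535", "chloroplast thylakoid membrane"),
    ("GO:0005634", "nucleus"),
    ("PO:0009046", "flower"),
])
matches = suggester.suggest("Chlor")
```

Prefix matching keeps the lookup fast and predictable; matching anywhere in the name or against synonyms would catch more terms at the cost of a different index.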
C. Other highlights - none