TAIR March2010: Difference between revisions

From GO Wiki
Jump to navigation Jump to search
Line 88: Line 88:
'''A. Ontology Development Contributions'''
'''A. Ontology Development Contributions'''


Donghui Li has submitted 101 SourceForge term requests on behalf of TAIR curators from October 2008 to September 2009 (each request may contain multiple terms). Of these 101 requests, 99 have been closed.  122 new GO terms have been created.  
Donghui Li has submitted 5 SourceForge term requests on behalf of TAIR curators from October 2009 to March 2010 (each request may contain multiple terms). Of these 5 requests, 5 have been closed.  13 new GO terms have been created.  





Revision as of 19:41, 26 March 2010

TAIR, The Arabidopsis Information Resource, March 2010 (NOT FINAL, STILL IN PROGRESS)

covers the period from September 2009 to March 2010

1. Staff working on GOC tasks

Tanya Berardini, Donghui Li

The total number of FTE working on GOC tasks is 1.4.

2. Annotation progress

Table 1: Number of Annotations to Various GO Aspects

Annotations BP (09/09) BP (03/10) change MF (09/09) MF (03/10) change CC (09/09) CC (03/10) change
non-IEA/non-ND 15584 16557 + 973 10392 10942 + 550 19175 19423 + 248
IEA 10776 10529 - 247 20147 19562 - 585 10490 10382 - 108
ND 14327 14238 - 89 8820 8805 - 15 14519 14470 - 49

Table 2: Number of Genes Annotated to Various GO Aspects

Genes BP (09/09) BP (03/10) change MF (09/09) MF (03/10) change CC (09/09) CC (03/10) change
non-IEA/non-ND 7263 7586 + 323 6904 7125 + 221 7333 7471 + 138
IEA 6857 6719 - 138 8205 8029 - 180 7806 7737 - 69
ND 14327 14238 - 89 8819 8804 - 15 14515 14466 - 49

3. Methods and strategies for annotation

a. Literature curation: We continue to put most of our effort (95%) into annotation of gene products from the literature.

b. Computational annotation strategies: With every genome release, we rerun two computational GO annotation pipelines, one based on INTERPROtoGO mapping and the other based on a TargetP analysis. These results are integrated into our GO annotation file. This represents roughly 5% of our annotation effort. We integrate GOA Arabidopsis GO annotations into our gene association file so that all Arabidopsis annotations, regardless of original source, are now relayed to GO via TAIR with the appropriate source attribution.

c. Priorities for annotation:

(1) literature of any age pertaining to Reference Genome genes,

(2) literature describing the characterization of previously undescribed ('novel') genes,

(3) recent literature from high impact factor journals

4. Presentations and publications

a. Papers with substantial GO content

Hill DP, Berardini TZ, Howe DG, Van Auken KM. (2009). Representing Ontogeny Through Ontology: A Developmental Biologist’s Guide to The Gene Ontology. Mol Reprod. Dev. 77(4):314-29.

Mungall CJ, Bada M, Berardini TZ, Deegan J, Ireland A, Harris MA, Hill DP, Lomax J. (2010). Cross-Product Extensions of the Gene Ontology. J. Biomed. Inform. doi:10.1016/j.jbi.2010.02.002

The Gene Ontology Consortium. (2010). The Gene Ontology in 2010: extensions and refinements. Nucleic Acids Res. 38:D331-5. (TB, corresponding author)

b. Presentations including Talks and Tutorials and Teaching

c. Poster presentations - none

5. Other Highlights

A. Ontology Development Contributions

Donghui Li has submitted 5 SourceForge term requests on behalf of TAIR curators from October 2009 to March 2010 (each request may contain multiple terms). Of these 5 requests, 5 have been closed. 13 new GO terms have been created.


Tanya Berardini continues to work with:

I. David Hill on

  1. quality control reports that are generated by OBOL and reasoner, both within OBO-Edit and in external scripts. This is an ongoing effort that we address as issues arise. [[1]]
  2. regulation related SF items submitted by the GO community.
  3. development specific ontology development. Both curators attended the Annual meeting of the American Society for Cell Biology. Ontology improvements from this meeting are detailed here [[2]]
  4. continuing to add interontology links between MF and BP.

II. Midori Harris, David Hill, Chris Mungall, Jen Deegan and Jane Lomax to develop cross-products within the three GO namespaces. The regulation cross-products have been released in the extended GO. David and Tanya are now quality checking the internal biological process cross-products.

III. David Hill, Varsha Khodiyar, Doug Howe, Susan Tweedie, Ruth Lovering and community experts to expand the heart development portion of the ontology.

IV. Midori Harris, David Hill, Jane Lomax and Harold Drabkin to align the representation of biochemicals in GO with CheBI.

B. Annotation outreach and user advocacy efforts

1. Tanya Berardini coordinated the writing of the 2010 Nucleic Acids Research Database Issue paper for the consortium.

2. The TAIR-Plant Physiology collaboration continues to gather Arabidopsis gene function data directly from authors whose articles have just been accepted for publication. This partnership has resulted in direct author submission of function or expression information for 422 genes drawn from 104 articles published in Plant Physiology over the last 12 months (March 09 - February 10) (574 GO annotations and 35 PO annotations). We reevaluated the submission rate after last year's changes in the wording of the data request. Sampling one month's worth of papers, we found that the base submission rate had increased from 21% to about 50%.

We expanded our journal collaboration to include The Plant Journal in Fall 2009 and have received 2 submissions from Plant Journal containing function information for 3 genes (8 GO annotations and 3 PO annotations) since then.

We are currently in talks with editors and publishers from 6 additional journals: Journal of Integrative Plant Biology, Journal of Experimental Botany, Plant Science, Environmental Botany, Plant Physiology and Biochemistry, and Plant, Cell and Environment, to establish similar collaborations. At the same time, we are developing a web-based data submission tool that will be hosted by TAIR that can be used by authors from any journal and will accelerate the incorporation of this type of data into the TAIR database.


C. Other highlights - none