Cell Ontology Progress Report 2010: Difference between revisions

From GO Wiki
Jump to navigation Jump to search
mNo edit summary
No edit summary
Line 1: Line 1:
<big>STILL UNDER REVISION (12-10-10)</big>
Work on the Cell Ontology is funded by an ARRA Competitive Revision to the GO Consortium Grant.  This work commenced on September 30, 2009.  The project is based at MGI, with sub-contract work at LBNL.
Work on the Cell Ontology is funded by an ARRA Competitive Revision to the GO Consortium Grant.  This work commenced on September 30, 2009.  The project is based at MGI, with sub-contract work at LBNL.


Line 42: Line 40:


*Devise new structure for the Cell Ontology
*Devise new structure for the Cell Ontology
**We have been prototyping the new CL structure using the hematopoeitic cell types as a test case.
**A new high level structure for the CL was approved at the May 2010 Cell Ontology WorkshopParts have been implemented, and more will be in 2011.
**Started month: 1
**Estimated duration:  6 months


*Add formal logical axioms to the CL.
*Add formal logical axioms to the CL.
**We have made the first steps towards adding logical axioms to theCL. The first is the formal definition of the capable_of relation, required for linking cell types to biological processes (for example, Tr1 cells to IL-10 production). The LBL group has created the formal definition in terms of the Basic Formal Ontology (BFO) "disposition" class, and whilst the Jackson group has applied this in the prototype ontology.
**A system of OWL macro expansions has been developed for CL relations such as "capable_of" that allow for easy conversion to OWL versions that can be more fully reasoned over using Protege reasoners.
**Started month:  1
**Estimated duration:  6 months


*Add logical definitions using other OBO ontologies
*Add logical definitions using other OBO ontologies
**Extensive computable definitions (AKA "logical definitions" or "cross-products") have been added to terms in the hematopoietic cell branch of the CL, and to some other terms in the CL.
**We have identified the following ontologies as key to defining cell types:
**We have identified the following ontologies as key to defining cell types:
***PRO - Protein Ontology
***PRO - Protein Ontology
Line 57: Line 52:
*** PATO - Phenotypic Quality Ontology
*** PATO - Phenotypic Quality Ontology
*** Uberon - We are also evaluating the use of a general-purpose gross anatomical ontology (Uberon) for certain cell types defined by location in the body.
*** Uberon - We are also evaluating the use of a general-purpose gross anatomical ontology (Uberon) for certain cell types defined by location in the body.
**Started month:  1
**Estimated duration:  6 months


*Implement automated reasoner strategy
*Implement automated reasoner strategy
**Although we do not estimate commencing this until month 6 we have made some preliminary steps, partially implementing the MIREOT methodology within OBO-Edit.  This allows the CL editors at Jackson to select classes from external ontologies and bring in only the subset of those ontologies that pertain to these referenced classes.  This improves reasoner performance.
**This has been an integral part of our ongoing development of the CL.
**Estimated start month:  6
**Estimated duration:  3 months


* General software support
* General software support
**Support with managing ontology versions in Open Bio-Ontologies repository
**Support with managing ontology versions in Open Bio-Ontologies repository
**OBO-Edit bug fixes
**OBO-Edit bug fixes
**OBO-Edit tutorials for CL curators
<br>
<br>


===Aim 2:  Improving and extending the content of CL===
===Aim 2:  Improving and extending the content of CL===
*We have begun planning for a Cell Ontology workshop, to be held May 18-19, 2009 at the Jackson Laborotory, Bar Harbor, Maine.  The workshop will be used for discussion of general issues involved in the revision of the Cell Ontology.
*We held a [http://obofoundry.org/wiki/index.php/Cell_Ontology_Workshop_2010 Cell Ontology workshop], on May 18-19, 2010 at the Jackson Laborotory, Bar Harbor, Maine, and extensively discussed both ontological and biological aspects of CL development.
*We are implementing the cross-product/logical definition formulation of the hematopoietic terms.  As part of this work, we are improving definitions for a number of cell types and adding new cell types.
*A workshop on neurons and nervous system cell types is scheduled for April 12-14, 2011More information may be found [http://obofoundry.org/wiki/index.php/Cell_Ontology-INCF_Neuron_Workshop here].
*We are working to add is_a parents to all terms in the CL.
*We have implementing the computable defintion (AKA "logical definition" or "cross-product") formulation of the hematopoietic terms.  As part of this work, we have improved definitions for a number of cell types and adding many new cell types.
*We are preparing definitions for current nervous system terms in the CL that lack them.<br><br>
*A paper, "Logical Development of the Cell Ontology" by Meehan, Masci, Abdulla, Cowell, Blake, Mungal, and Diehl, which described the work on the hematopoietic cells in the CL has been accepted (December 2010) for publication in BMC Biouinformatics.
*We have imported a large number of terms from the FMA into the CL.
*We have added is_a parents to all terms in the CL.
*We have prepared textual and logical definitions for current nervous system terms in the CL that lack them, although these are not yet implemented in cell.obo.<br><br>


===Aim 3:  Applying the CL to the GO===
===Aim 3:  Applying the CL to the GO===


* This work is to commence in month 6
* We are using manual and automated methods to ensure interoperability between the GO and CL for cell types and the logical relationships between them.




Line 86: Line 79:
* Based on current progress towards completion of our aims we are on track, with 60% of the work accomplished.
* Based on current progress towards completion of our aims we are on track, with 60% of the work accomplished.


== LBL Portion ==
== LBL Contributions to CL development ==


ONTOLOGY ALIGNMENT AND INTEGRATION: we have supported the JAX ontology
ONTOLOGY ALIGNMENT AND INTEGRATION: we have supported the JAX ontology editors by providing and executing tools to integrate human cell types represented in the FMA (Foundational Model of Anatomy) into CL. These tools identified generic cell types in common between the two ontologies, and used the Obol toolkit (developed at Berkeley) to identify compositional class expressions for specific cell types (e.g. "smooth muscle cell of jejunum"). These compositional classescan then be integrated automatically, using the cross-species gross anatomy ontology UBERON.
editors by providing and executing tools to integrate human cell types
represented in the FMA (Foundational Model of Anatomy) into CL. These
tools identified generic cell types in common between the two
ontologies, and used the Obol toolkit (developed at Berkeley) to
identify compositional class expressions for specific cell types
(e.g. "smooth muscle cell of jejunum"). These compositional classes
can then be integrated automatically, using the cross-species gross
anatomy ontology UBERON.


ONTOLOGY EDITOR SUPPORT: we have provided ontology editor support to
ONTOLOGY EDITOR SUPPORT: we have provided ontology editor support to the JAX editors of the ontology, improving the OBO-Edit ontology editing environment in reponse to curator requests. We have also implemented a variant of the MIREOT (Minimal Information Retrieved for External Ontology Terms) strategy which allows scalable integration across multiple ontologies. We have also devised a build environment for the CL, which allows the easy distribution of multiple versions for different purposes, and uses OWL reasoners to classify and perform logical consistency analysis on the ontology automatically.
the JAX editors of the ontology, improving the oboedit ontology
editing environment in reponse to curator requests. We have also
implemented a variant of the MIREOT (Minimal Information Retrieved for
External Ontology Terms) strategy which allows scalable integration
across multiple ontologies. We have also devised a build environment
for the CL, which allows the easy distribution of multiple versions
for different purposes, and uses OWL reasoners to classify and perform
logical consistency analysis on the ontology automatically.


ONTOLOGY LANGUAGE SUPPORT: we identified key areas in which the
ONTOLOGY LANGUAGE SUPPORT: we identified key areas in which the ontology format used by the Gene Ontology was insufficient for representing cell types. Rather than move wholesale to the more expressive OWL (Ontology Web Langauge) formalism, which would have entailed abandoning tools and interoperation with model organism databases, we enhanced obo format using a macro mechanism and implemented a prototype macro expansion engine (http://precedings.nature.com/documents/5292/version/1). This allowed us to have the best of both worlds, and to be able to transition between the oboedit environment, and the Protege4 editors environment, and to use OWL reasoners to classify the ontology.
ontology format used by the Gene Ontology was insufficient for
representing cell types. Rather than move wholesale to the more
expressive OWL (Ontology Web Langauge) formalism, which would have
entailed abandoning tools and interoperation with model organism
databases, we enhanced obo format using a macro mechanism and
implemented a prototype macro expansion engine
(http://precedings.nature.com/documents/5292/version/1). This allowed
us to have the best of both worlds, and to be able to transition
between the oboedit environment, and the Protege4 editors environment,
and to use OWL reasoners to classify the ontology.


This work also necessitated creation of a formal specification of obo
This work also necessitated creation of a formal specification of obo format and its translation to OWL ([http://berkeleybop.org/~cjm/obo2owl/obo-syntax.html]), which significantly extends on previous efforts made in 2007, before theOWL2 language was finalized.
format and its translation to OWL
(http://berkeleybop.org/~cjm/obo2owl/obo-syntax.html), which
significantly extends on previous efforts made in 2007, before the
OWL2 language was finalized.




Line 133: Line 96:


Mungall CJ, Bada M, Berardini TZ, Deegan J, Ireland A, Harris MA, Hill DP, Lomax J. 2010. Cross-product extensions of the Gene Ontology. J Biomed Inform. [http://dx.doi.org/10.1016/j.jbi.2010.02.002 doi:10.1016/j.jbi.2010.02.002].
Mungall CJ, Bada M, Berardini TZ, Deegan J, Ireland A, Harris MA, Hill DP, Lomax J. 2010. Cross-product extensions of the Gene Ontology. J Biomed Inform. [http://dx.doi.org/10.1016/j.jbi.2010.02.002 doi:10.1016/j.jbi.2010.02.002].
Meehan TF, Masci AM, Abdulla A, Cowell LG, Blake JA, Mungall CJ, Diehl AD, 2010, Logical Development of the Cell Ontology, accepted for publication in BMC Bioinformatics.


<br>
<br>


[[http://wiki.geneontology.org/index.php/Grant_Progress_Reports_December_2010 Return to 2010 Progress Reports]]
[[http://wiki.geneontology.org/index.php/Grant_Progress_Reports_December_2010 Return to 2010 Progress Reports]]

Revision as of 01:59, 18 December 2010

Work on the Cell Ontology is funded by an ARRA Competitive Revision to the GO Consortium Grant. This work commenced on September 30, 2009. The project is based at MGI, with sub-contract work at LBNL.


Personnel

  • Project Leaders
    • Alexander Diehl, The Jackson Laboratory/University at Buffalo (50% effort)
    • Christopher Mungall, LBNL (10% effort)
  • Cell Ontology Curator
    • Terrence Meehan, The Jackson Laboratory (100% effort)
  • Programmers
    • Amina Abdulla and Nomi Harris, LBNL (75% effort, split)
  • Senior Advisors
    • Judith Blake, The Jackson Laboratory
    • Suzanna Lewis, LBNL
  • External Consultants
    • Lindsay Cowell, Duke University
    • Anna Maria Masci, Duke University


Collaborations

  • INCF Program on Ontologies for Neuroscience Task Force for Representation and Deployment, identification of nervous system cell types for addition to the CL.
  • Ceri van Slyke, ZFIN, addition of fish cell types and other revisions to CL.
  • Sirarat Sarntivijai and Oliver He, University of Michigan, development of the Cell Line Ontology.
  • The Plant Ontology Consortium, plant cell types.
  • David Osumi-Sutherland, Flybase, Drosophila cell types.
  • Onard Mejino, Foundation Model of Anatomy, import of FMA cell types
  • David Dougall and Richard Scheuermann, UT Southwestern, import of novel hematopoietic cell types identified via flow cytometry
  • Helen Parkinson, EBI, assistance in making cell types in CL and the Experimental Factor Ontology compatible and in providing support for experimentally derived cell types in OBI.



Item 1 - Activities

Aim 1: Reforming the Structure of the Cell Ontology

  • Devise new structure for the Cell Ontology
    • A new high level structure for the CL was approved at the May 2010 Cell Ontology Workshop. Parts have been implemented, and more will be in 2011.
  • Add formal logical axioms to the CL.
    • A system of OWL macro expansions has been developed for CL relations such as "capable_of" that allow for easy conversion to OWL versions that can be more fully reasoned over using Protege reasoners.
  • Add logical definitions using other OBO ontologies
    • Extensive computable definitions (AKA "logical definitions" or "cross-products") have been added to terms in the hematopoietic cell branch of the CL, and to some other terms in the CL.
    • We have identified the following ontologies as key to defining cell types:
      • PRO - Protein Ontology
      • GO - Gene Ontology (particularly biological processes such as immune response)
      • PATO - Phenotypic Quality Ontology
      • Uberon - We are also evaluating the use of a general-purpose gross anatomical ontology (Uberon) for certain cell types defined by location in the body.
  • Implement automated reasoner strategy
    • This has been an integral part of our ongoing development of the CL.
  • General software support
    • Support with managing ontology versions in Open Bio-Ontologies repository
    • OBO-Edit bug fixes


Aim 2: Improving and extending the content of CL

  • We held a Cell Ontology workshop, on May 18-19, 2010 at the Jackson Laborotory, Bar Harbor, Maine, and extensively discussed both ontological and biological aspects of CL development.
  • A workshop on neurons and nervous system cell types is scheduled for April 12-14, 2011. More information may be found here.
  • We have implementing the computable defintion (AKA "logical definition" or "cross-product") formulation of the hematopoietic terms. As part of this work, we have improved definitions for a number of cell types and adding many new cell types.
  • A paper, "Logical Development of the Cell Ontology" by Meehan, Masci, Abdulla, Cowell, Blake, Mungal, and Diehl, which described the work on the hematopoietic cells in the CL has been accepted (December 2010) for publication in BMC Biouinformatics.
  • We have imported a large number of terms from the FMA into the CL.
  • We have added is_a parents to all terms in the CL.
  • We have prepared textual and logical definitions for current nervous system terms in the CL that lack them, although these are not yet implemented in cell.obo.

Aim 3: Applying the CL to the GO

  • We are using manual and automated methods to ensure interoperability between the GO and CL for cell types and the logical relationships between them.


Item 2 -- Percent Completion

  • Based on current progress towards completion of our aims we are on track, with 60% of the work accomplished.

LBL Contributions to CL development

ONTOLOGY ALIGNMENT AND INTEGRATION: we have supported the JAX ontology editors by providing and executing tools to integrate human cell types represented in the FMA (Foundational Model of Anatomy) into CL. These tools identified generic cell types in common between the two ontologies, and used the Obol toolkit (developed at Berkeley) to identify compositional class expressions for specific cell types (e.g. "smooth muscle cell of jejunum"). These compositional classescan then be integrated automatically, using the cross-species gross anatomy ontology UBERON.

ONTOLOGY EDITOR SUPPORT: we have provided ontology editor support to the JAX editors of the ontology, improving the OBO-Edit ontology editing environment in reponse to curator requests. We have also implemented a variant of the MIREOT (Minimal Information Retrieved for External Ontology Terms) strategy which allows scalable integration across multiple ontologies. We have also devised a build environment for the CL, which allows the easy distribution of multiple versions for different purposes, and uses OWL reasoners to classify and perform logical consistency analysis on the ontology automatically.

ONTOLOGY LANGUAGE SUPPORT: we identified key areas in which the ontology format used by the Gene Ontology was insufficient for representing cell types. Rather than move wholesale to the more expressive OWL (Ontology Web Langauge) formalism, which would have entailed abandoning tools and interoperation with model organism databases, we enhanced obo format using a macro mechanism and implemented a prototype macro expansion engine (http://precedings.nature.com/documents/5292/version/1). This allowed us to have the best of both worlds, and to be able to transition between the oboedit environment, and the Protege4 editors environment, and to use OWL reasoners to classify the ontology.

This work also necessitated creation of a formal specification of obo format and its translation to OWL ([1]), which significantly extends on previous efforts made in 2007, before theOWL2 language was finalized.


Related Publications for 2010

Diehl AD, Augustine AD, Blake JA, Cowell LG, Gold ES, Gondré-Lewis TA, Masci AM, Meehan TF, Morel PA, Nijnik A, Peters B, Pulendran B, Scheuermann RH, Yao QA, Zand MS, Mungall CJ. 2010. Hematopoietic Cell Types: Prototype for a Revised Cell Ontology. J Biomed Inform. doi:10.1016/j.jbi.2010.01.006.

Mungall CJ, Bada M, Berardini TZ, Deegan J, Ireland A, Harris MA, Hill DP, Lomax J. 2010. Cross-product extensions of the Gene Ontology. J Biomed Inform. doi:10.1016/j.jbi.2010.02.002.

Meehan TF, Masci AM, Abdulla A, Cowell LG, Blake JA, Mungall CJ, Diehl AD, 2010, Logical Development of the Cell Ontology, accepted for publication in BMC Bioinformatics.


[Return to 2010 Progress Reports]