Annotation Conf. Call 2015-11-10: Difference between revisions

From GO Wiki
Jump to navigation Jump to search
No edit summary
Line 1: Line 1:
=Bluejeans URL: https://bluejeans.com/993661940=
=Agenda=
=Agenda=
==Deprecated Annotation Extension Relations==
==Deprecated Annotation Extension Relations==

Revision as of 13:11, 9 November 2015

Bluejeans URL: https://bluejeans.com/993661940

Agenda

Deprecated Annotation Extension Relations

Entity IDs in Annotation Extensions

Goals

For this meeting

To understand the scope of the project by discussing what types of objects annotators would like to use in annotation extensions.

Final Goals

  • To agree on a set of ID spaces that will be used in annotation extensions that provides both manual and computational consistency.
  • Ideally a user would be able to query on ANY type of ID that is deemed to be recognized by the GOC and return results of a seamless query of those IDs.
  • To use the results of this discussion for the creation of annotation documentation.

Initial Assumptions

  • We will not be able to mandate the types of IDs that are used by annotation groups, but we should be able to mandate that the IDs used are compatible/translatable to the ID spaces used by the group (MOD) that is primarily responsible for submission of annotations to the GOC.
  • Upon processing of submitted annotations, primary responsible groups (MODs) will translate IDs into the objects used for curation by that group. That group will then provide the translated (normalized) annotations to the GOC.
  • The GOC may then translate the submitted IDs for the purpose of data integration across species.
  • Before submission of annotations, the submitter and the primary responsible group (MOD) will work together to make sure that all ID spaces can be normalized. This should become an SOP for any new group wishing to submit annotations.

Types of IDs used in Annotation Extensions

  • IDs used to represent genes
    • MOD gene identifiers (MGI:MGI:, WB:, ZFIN:ZDB-GENE-, TAIR:locus: etc)
    • Generic UniprotKB Ids (UniProtKB:)
    • ENSEMBL gene IDs (Ensembl:)
    • NCBI gene IDs (NCBI_gene:)
    • RNA central IDs (RNAcentral:)
    • HGNC IDs (HGNC:)
  • IDs used to represent cell types
    • cell ontology IDs
    • wormbase anatomy and cell IDs
    • Plant ontology IDs
  • IDs used to represent chemicals
    • ChEBI
  • IDs used to represent gene products
    • Proteins/Proteoforms
      • Protein ontology IDs (Pro:)
      • UniProt isoform-specific IDs (UniProtKB:######-#)
      • MOD gene identifiers
    • Transcripts
      • EMBL IDs
      • MOD gene identifiers
  • IDs used to represent protein domains
    • InterPro IDs
  • IDs used to represent biological processes
    • GO IDs
  • IDs used to represent molecular functions
    • GO IDs
  • IDs used to represent cellular components
    • GO IDs
  • IDs used to represent anatomical structures
    • EMAPA IDs
    • UBERON IDs
    • Wormbase anatomy and cell IDs
    • Plant ontology IDs

Next Curation Consistency Exercise

  • 2015-11-24
  • TAIR is up next to select a paper
  • Continue with consistency exercises in 2016?
  • Suggestions for changes or improvements?
    • Model each paper in LEGO
  • Groups still to select paper: dictyBase, EBI/UniProt, BBOP (Moni?), NextProt, USC, AgBase, anyone else?

Minutes