Extension of Protein2GO to non-UniProtKB Identifiers

From GO Wiki
Revision as of 16:12, 5 December 2013 by Vanaukenk (talk | contribs) (Created page with "Conference Call Agenda =What types of entity identifiers might be needed?= #Proteins not in UniProtKB #ncRNAs #Orphan genes (variations not associated with a specific gene) #Pro...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Conference Call Agenda

What types of entity identifiers might be needed?

  1. Proteins not in UniProtKB
  2. ncRNAs
  3. Orphan genes (variations not associated with a specific gene)
  4. Protein complexes

See Google spreadsheet:

https://docs.google.com/spreadsheet/ccc?key=0Aiei4RvoiQdqdHBFVEcwXzRvcW94V2JOLVFSNjJaTHc&usp=drive_web#gid=0

Knowledge Representation

  • What kind of biological statements do we want to make?
  • Given these statements, what is the appropriate resource for the entity IDs?
  • How will this be represented in the GAFs/GPADs?

Practical Considerations

  • How many of each type?
  • ID stability - if there is churn, can IDs be mapped forward, not go stale?