Counting gene products
From GO Public
Guidelines for Characterization of Reference Genome Descriptions
All descriptions are based on Sequence Ontology terms.
All counts are necessarily estimates, but some can be estimated to the ones digit, while others only to the nearest thousand. There is therefore no need to state the precision separately: the significant digits of the reported number convey it. It is recognized that different databases can currently provide different subsets of these counts. The goal should be for each database to provide numbers for every category.
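As an illustration of reporting an estimate only to the digits that are meaningful, the following is a minimal sketch. The helper name `round_sig` and the example values are hypothetical and are not part of these guidelines.

```python
from math import floor, log10


def round_sig(count, digits):
    """Round an integer count to the given number of significant digits.

    Hypothetical helper for illustration only.
    """
    if count == 0:
        return 0
    # Position of the leading digit, e.g. 3 for a count in the thousands.
    magnitude = floor(log10(abs(count)))
    # A negative ndigits argument rounds to tens, hundreds, thousands, ...
    return round(count, digits - 1 - magnitude)


# A count known only to the nearest thousand is reported with one
# significant digit; a count known exactly keeps all of its digits.
print(round_sig(6234, 1))
print(round_sig(273, 3))
```

A reader can then infer the precision of each reported number from its trailing zeros, without a separate precision field.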
Numbers to be presented:
- CDS: count one per genomic occurrence (or per mRNA? This might need to be refined if the group is annotating proteins rather than genes). Required.
- snoRNA: count one per genomic occurrence
- rRNA: count one per type
- snRNA: count one per genomic occurrence
- tRNA: count one per genomic occurrence
- ncRNA: count one per genomic occurrence, without double counting (e.g. if a snoRNA count is supplied, do not count those snoRNAs again here)
- transposable_element: count one per genomic occurrence
- transposable_element_gene: count one per unique mRNA occurrence per transposable_element type
- pseudogene: count one per genomic occurrence
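
The counting rules above can be sketched in code. This is a minimal sketch under stated assumptions, not a prescribed implementation: the record layout `(so_term, locus_id, subtype)` and the function name `count_gene_products` are hypothetical, each feature is assumed to carry exactly one Sequence Ontology term (so a feature labelled snoRNA is never also counted as ncRNA), and transposable_element_gene is left out because its rule depends on how a group defines unique mRNAs per element type.

```python
from collections import Counter

# Terms counted once per genomic occurrence (assumed to be identified
# here by a locus identifier).
PER_OCCURRENCE = {
    "CDS", "snoRNA", "snRNA", "tRNA", "ncRNA",
    "transposable_element", "pseudogene",
}


def count_gene_products(features):
    """Tally features following the per-term counting rules.

    `features` is an iterable of hypothetical (so_term, locus_id, subtype)
    tuples; `subtype` is only used for rRNA (e.g. "5S", "18S").
    """
    seen = set()
    counts = Counter()
    for so_term, locus_id, subtype in features:
        if so_term == "rRNA":
            key = (so_term, subtype)    # rRNA: one per type
        elif so_term in PER_OCCURRENCE:
            key = (so_term, locus_id)   # one per genomic occurrence
        else:
            continue                    # term not covered by this sketch
        if key not in seen:
            seen.add(key)
            counts[so_term] += 1
    return dict(counts)


# Made-up example records, for illustration only.
example = [
    ("CDS", "g1", None),
    ("CDS", "g2", None),
    ("rRNA", "r1", "18S"),
    ("rRNA", "r2", "18S"),   # same type as r1, so not counted again
    ("rRNA", "r3", "5S"),
    ("snoRNA", "s1", None),  # counted as snoRNA only, not as ncRNA
    ("ncRNA", "n1", None),
    ("tRNA", "t1", None),
    ("tRNA", "t1", None),    # duplicate record of the same locus
]
print(count_gene_products(example))
```

Keying on the locus identifier keeps duplicate records of the same genomic occurrence from inflating a count, while keying rRNA on its type collapses repeated copies into one.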
