File description: go-annotation-changes no pb: Difference between revisions

From GO Wiki
Jump to navigation Jump to search
 
(3 intermediate revisions by the same user not shown)
Line 1: Line 1:
=Usage=
=Usage=
This file is used for doing QC on annotations for GO releases.
This file is used for doing QC on annotations for GO releases. Direct annotations to <code>GO:0005515 protein binding</code> are excluded from the counts, therefore, the total number of annotations is lower, as well as the number of annotated species (both total and filtered), and the number of annotated references.


=Input data =
=Input data =
Calculated from the [[File_Description:_go-annotation-changes |go-annotation-changes.json]], in which annotations to <code>GO:0005515 protein binding</code> are removed. Therefore, the total number of annotations is lower, as well as the number of annotated species (both total and filtered), and the number of annotated references.
Calculated as the [[File_Description:_go-annotation-changes |go-annotation-changes.json]], but  excluding direct annotations to <code>GO:0005515 protein binding</code>.  


=Format(s)=
=Format(s)=
Line 10: Line 10:


=File description=
=File description=
The <code>go-annotation-changes_no_pb</code> file contains the following information:
See documentation for [[File_Description:_go-annotation-changes|go-annotation-changes]] file.
 
==summary==
===current===
* '''release_date:''' Obtained from <code>release/metadata/release-date.json</code> or <code>snapshot/metadata/release-date.json</code>.
* '''annotations
** '''total''': Total number of annotations.
** '''[[GO_stats-glossary#aspect |by_aspect]]''': P, F, C.
** '''[[GO_stats-glossary#evidence_cluster|by_evidence_cluster]]''': PHYLO, IEA, OTHER, EXP, ND
, HTP.
* '''[[GO_stats-glossary#bioentity_type |bioentities]]:''' Number of bioentities annotated in the GO database.
* '''taxa:''' Number of species with annotations.
* '''taxa_filtered:''' Number of species with at least 1,000 annotations. 
* '''references:''' Number of distinct annotated references (includes PMIDs, GO_REFs, DOIs, internal IDs for Model Organism Databases and Reactome (note that for papers with both a PMID and an internal reference ID, the paper is counted twice).
* '''pmids:''' Number of annotated PMIDs.
 
===previous===
* Same information as for the [[File_Description:_go-annotation-changes#current | current release]].
* Note that the date of the previous release is the most recent release in <code>release.geneontology.org/</code>, using the file <code>release.geneontology.org/YYYY-MM-DD/metadata/release-date.json</code>.
 
===changes===
Differences between the current and the previous release for all the fields above. In addition, for taxa, references and pmids, the number of added and removed items are counted.
====annotations====
* '''total''': Change in the total number of annotations.
* '''[[GO_stats-glossary#aspect |by_aspect]]:''' Changes in the total number of annotations for each aspect: P, F, C.
* '''[[GO_stats-glossary#evidence_cluster|by_evidence_cluster]]''': Changes in the total number of annotations for each [[GO_stats-glossary#evidence_cluster| evidence cluster]] (PHYLO, IEA, OTHER, EXP, ND
, HTP.).
 
====[[GO_stats-glossary#bioentity_type |bioentities]]====
Change in the number of annotated [[GO_stats-glossary#bioentity_type |bioentities]].
 
====taxa====
* '''total:''' Changes in the total number of annotated species.
* '''[[GO_stats-glossary#filtered_taxa |filtered]]:''' Changes in the number of annotated species by [[GO_stats-glossary#filtered_taxa |filtered taxa]].
* '''added:''' Number of new species annotated.
* '''removed:''' Number of removed species having lost all annotations.
 
====references====
* '''total''': Change in the number of annotated references .
* '''added ''': Number of newly annotated references (data not yet available).
* '''removed''': Number of references having lost all annotations (data not yet available).
 
====pmids====
* '''total''': Change in the number of annotated PMIDs.
* '''added ''': Number of newly annotated PMIDs (data not yet available).
* '''removed''': Number of PMIDs having lost all annotations (data not yet available).
 
==detailed_changes==
All data in this section is shown as x/y, where x is the difference in the total number of annotations, y is the total number of annotations in the current release, and the % change in shown in parentheses. 
 
===annotations===
*'''total: '''Change in the total number of annotations.
* '''[[GO_stats-glossary#aspect |by_aspect]]:''' Change in the total number of annotation for each [[GO_stats-glossary#aspect |aspect]]: P, F, C.
* '''[[GO_stats-glossary#bioentity_type| by_bioentity_type]]'''
**'''[[GO_stats-glossary#bioentity_type| all]] :''' Change in the number of annotations for each [[GO_stats-glossary#bioentity_type| bioentity type]].
**'''[[GO_stats-glossary#bioentity_type_cluster| cluster]]:''' Change in the number of annotations by [[GO_stats-glossary#bioentity_type_cluster| bioentity type cluster]]. 
*'''by_taxon''': Change in the number of annotations by species.
*'''by_evidence'''
**'''[http://geneontology.org/docs/guide-go-evidence-codes all]'''
**'''[[GO_stats-glossary#evidence_cluster|by_evidence_cluster]]'''
*'''[[GO_stats-glossary#model_organism |by_model_organism]]'''
**'''[http://geneontology.org/docs/guide-go-evidence-codes by_evidence]:''' For each species, the number of annotations are shown for each individual evidence code, detailed by [[GO_stats-glossary#aspect |aspect]].
**'''[[GO_stats-glossary#evidence_cluster|by_evidence cluster]]:''' For each species, the number of annotations are shown for each [[GO_stats-glossary#evidence_cluster|evidence cluster]], detailed by [[GO_stats-glossary#aspect |aspect]].
*'''by_group:''' Changes in the total number of annotations for each contributing group, obtained using the 'assigned_by' field..
 
===taxa===
*'''added: '''List of added species with current number of annotations.
*'''removed:''' List of removed species with previous number of annotations.
 
===bioentities===
*'''total:''' The difference in the total number of annotated bioentities.
*'''by_type: '''
**'''[[GO_stats-glossary#bioentity_type| all]]:''' Difference in the number of annotated bioentities.
**'''[[GO_stats-glossary#bioentity_type_cluster|cluster]]:''' Difference in the number of annotated bioentities, grouped by [[GO_stats-glossary#bioentity_type_cluster |bioentity type clusters]].
*'''[[GO_stats-glossary#filtered_taxa |by_filtered_taxon]]:''' Difference in the number of annotated bioentities for each [[GO_stats-glossary#filtered_taxa|filtered species]], detailed by aspect (A, P, F, C).
**'''[[GO_stats-glossary#bioentity_type |all]]:''' Difference in the number of annotated bioentities for each species.
**'''[[GO_stats-glossary#bioentity_type_cluster|cluster]]:''' Difference in the number of annotated bioentities grouped by [[GO_stats-glossary#bioentity_type_cluster|bioentity clusters]], for each species.
 
===references===
*'''all'''
**'''total''' Change in annotated references.
**'''[[GO_stats-glossary#filtered_taxa |by_filtered_taxon]]:''' Change in annotated references by species.
**'''by_group:''' Change in annotated references for each contributing group, obtained using the 'assigned_by' field..
*'''pmids'''
**'''total''' Change in annotated PMIDs.
**'''[[GO_stats-glossary#filtered_taxa |by_filtered_taxon]]:''' Change in annotated PMIDs by species.
**'''by_group:''' Change in annotated PMIDs for each contributing group, obtained using the 'assigned_by' field..


=Direct access to files=
=Direct access to files=
Line 107: Line 23:
= Review Status =
= Review Status =


Last reviewed: October 24, 2019
Last reviewed: March 5, 2020




[[Category:Release Pipeline]]
[[Category:Release Pipeline]]

Latest revision as of 17:28, 6 March 2020

Usage

This file is used for doing QC on annotations for GO releases. Direct annotations to GO:0005515 protein binding are excluded from the counts, therefore, the total number of annotations is lower, as well as the number of annotated species (both total and filtered), and the number of annotated references.

Input data

Calculated as the go-annotation-changes.json, but excluding direct annotations to GO:0005515 protein binding.

Format(s)

  • json
  • tsv

File description

See documentation for go-annotation-changes file.

Direct access to files

snapshot

current

Review Status

Last reviewed: March 5, 2020