Phylogenetic Annotation Project: Difference between revisions

From GO Wiki
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
Line 2: Line 2:




----
 


=PAINT : Software for tree annotation=
=PAINT : Software for tree annotation=
The reference project group has developed a software to perform the annotations based on phylogenetic trees, [[PAINT]].  
The reference project group has developed a software to perform the annotations based on phylogenetic trees.  


* [[PAINT]] software


=PAINT Curation guidelines=
=PAINT Curation guidelines=
[[PAINT_SOP]]
* [[PAINT_SOP]]


* PAINT family list based on number of genes, duplication nodes and GO terms: https://docs.google.com/spreadsheets/d/1uHgcaXO7t9__9GuXgBibT0H-YlnU252IsRGWIyhCWN8/edit?usp=sharing. The family is usually easier to PAINT if there are less duplication nodes, but please select ones that have more GO terms and more genes (>100)
* PAINT family list based on number of genes, duplication nodes and GO terms: https://docs.google.com/spreadsheets/d/1uHgcaXO7t9__9GuXgBibT0H-YlnU252IsRGWIyhCWN8/edit?usp=sharing. The family is usually easier to PAINT if there are less duplication nodes, but please select ones that have more GO terms and more genes (>100)
Line 15: Line 16:




----


=[[Scripts to ensure PAINT data integrity]]=
=[[Scripts to ensure PAINT data integrity]]=
Line 99: Line 99:
* [[Reference_Genome_sequence_annotation]]: GFF3 sequence files for reference genome MODs
* [[Reference_Genome_sequence_annotation]]: GFF3 sequence files for reference genome MODs
* [[Reference Genome Database Requirements Discussion]]
* [[Reference Genome Database Requirements Discussion]]
 
* [[Source_Forge_items_for_reference_genomes_(Retired)]]


==Past Annotation targets==
==Past Annotation targets==

Revision as of 06:56, 17 January 2018



PAINT : Software for tree annotation

The reference project group has developed a software to perform the annotations based on phylogenetic trees.

PAINT Curation guidelines

Please contact PAINT team to have access to the document.


Scripts to ensure PAINT data integrity

PAINT annotation SOPs

GAFs for trees-based annotations

Standard Operating Procedure for Tree-based propagation of annotations

Reference Genomes Metrics | Metrics: Discussion on annotation progress measurements

Orthology determination

Data used to make orthology calls

reference proteomes files

At the July 2009 Quest for Orthologs meeting, it was agreed to decide upon a standard set of genomes, and compile "complete" sets of protein coding genes for each genome, and a representative protein sequence for each gene.

New gene2geneproduct file

At the April 2009 Reference Genome meeting it was decided to create a new file to replace the GP2protein file, called 'gene2geneproduct'. Specifications can be found on this page (will be added soon).

GAF file 2.0

The GAF file should contain 17 columns, and the meaning of columns 2, 12 and 17 have been modified. See that page for specifications.




Software/database development


The purpose of this page is to discuss features and requirements that would be desirable in a database used to replace the existing Google Spreadsheet system for managing target genes, their annotations and metrics.


Archived & retired Pages

Preretired pages! To discuss with DH and KVH

Gene pages

http://wiki.geneontology.org/index.php/Category:Reference_Genome_Genes > I forgot how those were created. They can probably all be removed. Can this be done in bulk?


Archived communication

Retired Pages

Those pages are kept as reference but the information in them is not the most current information.


Past Annotation targets