Phylogenetic Annotation Project
Revision as of 06:49, 17 January 2018
PAINT : Software for tree annotation
The reference project group has developed PAINT, a software tool for making annotations based on phylogenetic trees.
PAINT Curation guidelines
Annotation targets
PAINT Annotation target 2014
- DNA repair family list: http://goo.gl/BaQxMC
- PAINT family list based on number of genes, duplication nodes and GO terms: https://docs.google.com/spreadsheets/d/1uHgcaXO7t9__9GuXgBibT0H-YlnU252IsRGWIyhCWN8/edit?usp=sharing (a family is usually easier to annotate with PAINT if it has fewer duplication nodes, but please select families that have more GO terms and more genes, i.e. >100)
Please contact the PAINT team for access to the document.
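The selection advice above can be sketched as a small filter-and-sort. This is only an illustration: the `Family` record, its field names, and the tie-breaking order are assumptions, not the layout of the actual spreadsheet, and the family IDs in the usage example are placeholders.

```python
from dataclasses import dataclass


@dataclass
class Family:
    # Hypothetical record; the real spreadsheet columns may be named differently.
    family_id: str
    genes: int
    duplication_nodes: int
    go_terms: int


MIN_GENES = 100  # threshold taken from the ">100" guidance above


def rank_families(families):
    """Keep families with more than MIN_GENES genes, then order them by
    fewest duplication nodes, breaking ties by more GO terms and more genes.
    The ordering is one possible reading of the guidance, not an official rule."""
    candidates = [f for f in families if f.genes > MIN_GENES]
    return sorted(
        candidates,
        key=lambda f: (f.duplication_nodes, -f.go_terms, -f.genes),
    )
```

For example, with placeholder data:

```python
families = [
    Family("FAM_A", genes=150, duplication_nodes=5, go_terms=40),
    Family("FAM_B", genes=80, duplication_nodes=1, go_terms=60),
    Family("FAM_C", genes=200, duplication_nodes=2, go_terms=30),
]
for family in rank_families(families):
    print(family.family_id)
```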
Selected refG target sets
- PPOD clusters selected since April 2008
- Manually curated target sets selected before April 2008
Target Gene List August 2006-April 2008
- Access requires your email to be added to the system. Email Pascale if you would like to be added.
- This spreadsheet contains links to separate spreadsheets maintained by each of the reference genome groups.
Scripts to ensure PAINT data integrity
Work in progress
PAINT annotation SOPs
GAFs for trees-based annotations
Standard Operating Procedure for Tree-based propagation of annotations
Reference Genomes Metrics | Metrics: Discussion on annotation progress measurements
Branding Ref.Genome Project
Ideas for publicizing Ref.Genome Annotation Data
Orthology determination
Data used to make orthology calls
Reference proteome files
At the July 2009 Quest for Orthologs meeting, it was agreed to decide upon a standard set of genomes, and compile "complete" sets of protein coding genes for each genome, and a representative protein sequence for each gene.
New gene2geneproduct file
At the April 2009 Reference Genome meeting it was decided to create a new file to replace the GP2protein file, called 'gene2geneproduct'. Specifications can be found on this page (will be added soon).
GAF file 2.0
The GAF file should contain 17 columns, and the meanings of columns 2, 12 and 17 have been modified. See that page for specifications.
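A minimal sketch of a column-count check for such files, assuming tab-separated columns and comment/header lines that start with `!`. The function name `malformed_lines` is illustrative and not part of any GO tool, and the check does not validate the contents of individual columns.

```python
GAF2_COLUMN_COUNT = 17  # column count stated above for GAF 2.0


def malformed_lines(lines):
    """Yield (line_number, column_count) for each data line whose number of
    tab-separated columns differs from GAF2_COLUMN_COUNT.
    Blank lines and lines starting with '!' are skipped."""
    for number, line in enumerate(lines, start=1):
        if not line.strip() or line.startswith("!"):
            continue
        # Strip only the line terminator so trailing empty columns still count.
        count = len(line.rstrip("\r\n").split("\t"))
        if count != GAF2_COLUMN_COUNT:
            yield number, count
```

It could be run over a file with `with open(path) as handle: problems = list(malformed_lines(handle))`, where `path` is whichever GAF file is being checked.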
Software/database development
The purpose of this page is to discuss features and requirements that would be desirable in a database used to replace the existing Google Spreadsheet system for managing target genes, their annotations and metrics.
Archived & retired Pages
Preretired pages! To discuss with DH and KVH
Gene pages
http://wiki.geneontology.org/index.php/Category:Reference_Genome_Genes > I forgot how those were created. They can probably all be removed. Can this be done in bulk?
Archived communication
- Reference Genome Mailing list - disabled
- Conference Calls - 2007-2011
- Electronic jamborees
Retired Pages
These pages are kept for reference, but the information in them is no longer current.
- Annotation_pipeline By Judy, Suzi, Michael
- Reference Genome Annotation Project Summary
- Project timeline
- Reference_Genome Contact Persons from each database
- Reference Genome Progress Reports
- Procedure for selection of target genes
- Procedure for filling Genome-Specific spreadsheets
- Tools for orthology determination: A summary of tools available to identify orthologs.
- SOP for determining ortholog (by database): The purpose of this page was to discuss the method by which each group establishes orthology between reference genome genes and human disease genes. We now collaborate with PANTHER to provide that. (Issues are different)
- Reference Genome Web Page Draft: We now have a real web page!
- List of potentially problematic families for all vs. all BLAST methods of orthology determination
- Running P-POD orthology tool on the reference genomes gene set by Kara Dolinski at Princeton - Nov 2007.
- Reference_Genome_sequence_annotation: GFF3 sequence files for reference genome MODs
- Reference Genome Database Requirements Discussion
Past Annotation targets
- RefG annotation priorities of September 2009
- Lung_branching_morphogenesis_genes December 2009
- All PPOD clusters with at least one object from each of the twelve refG organisms
- Target Gene List: May 2008-Jan 2010
- Tree annotation progress 2010-2011
- RefG_Heart_Development_co-curation#Heart_Development_Transcription_Annotation_Targets: May-Sept 2011
- Wnt_signaling_Pathway June-Sept 2010
- Apoptosis Reference Genome Targets February-April 2011
- PAINT - Apoptosis Nov 2013