EcoliWiki December 2015: Difference between revisions

From GO Wiki
Jump to navigation Jump to search
Line 173: Line 173:
=== Methods and strategies for annotation ===
=== Methods and strategies for annotation ===
(please note % effort on literature curation vs. computational annotation methods)
(please note % effort on literature curation vs. computational annotation methods)
a.    Literature curation:   EcoCyc annotates directly from the literature, currently citing 30,224 individual references for 4500 genes.  EcoCyc works to document biochemical pathways in their entirety, and uses a mini-review format in addition to GO to capture as much information as possible.  EcoCyc is dedicated to Escherichia coli K-12 MG1655, whereas EcoliWiki incorporates annotations from EcoCyc but also annotates other Escherichia coli strains as well as numerous bacteriophage proteins.  Annotations from both sites for taxon 8333 are submitted in the gene_association.ecocyc file.
==== Literature curation: ====
EcoCyc annotates directly from the literature, currently citing 30,224 individual references for 4500 genes.  EcoCyc works to document biochemical pathways in their entirety, and uses a mini-review format in addition to GO to capture as much information as possible.  EcoCyc is dedicated to Escherichia coli K-12 MG1655, whereas EcoliWiki incorporates annotations from EcoCyc but also annotates other Escherichia coli strains as well as numerous bacteriophage proteins.  Annotations from both sites for taxon 8333 are submitted in the gene_association.ecocyc file.
 
EcoliWiki supports community manual curation both through EcoliWiki itself and through GONUTS/CACAO.  


Effort: 100%
Effort: 100%


b. Computational annotation strategiesEcoliWiki focuses on manual annotation, so computational annotation is not a priority.  The IEAs produced by UniProtKB are assimilated into the GAF and resubmitted.
==== Computational annotation strategies ====
EcoliWiki focuses on manual annotation, so computational annotation is not a priority.  The IEAs produced by UniProtKB are assimilated into the GAF and resubmitted.


Effort: 0%
Effort: 0%
   
   
c.  Priorities for annotation:  EcoliWiki and EcoCyc work to annotate all literature across the entire genome. EcoCyc emphasises annotation of transcriptional regulation, transporters, and metabolic pathways.
c.  Priorities for annotation:  EcoliWiki and EcoCyc work to annotate all literature across the entire genome.  
 
CACAO continues to prioritize recruiting additional participants. In 2015 we also began working on support for integration of CACAO with Phage hunting courses at Maryland-Baltimore County (w/Ivan Erill) TAMU with Ry Young and Jason Gill.


=== Presentations and Publications ===
=== Presentations and Publications ===

Revision as of 18:31, 18 December 2015

Overview:

Staff:

* TAMU/EcoliWIki
    • Jim Hu (PI)
    • Suzi Aleksander (Curation/CACAO)
    • Debby Siegele (Co-PI/Curation)
  • SRI/EcoCyc
  • Peter Karp (PI)
  • Ingrid Keseler (Curation)
  • Amanda Mackie (Curation)
  • Carol Fulcher (Curation)
  • Socorro Gama-Castro (Curation)
  • Alberto Santos-Zavaleta (Curation)
  • Markus Krummenacker (computer support)

None of the EcoliWiki/EcoCyc staff are funded by GO.

Annotation Progress

  • 09/14 mined on 8 September 2014
  • 12/15 mined on 14 December 2015

<protect>

Annotations Non-IEA/Non-ND IEA ND All ECs

BP (12/14)

4791

10049

0

14840

BP (12/15)

6548

11266

0

17814

change

+1757

+1217

0

+2974

MF (12/14)

6827

13310

1

20138

MF (12/15)

8163

12934

1

21098

change

+1336

-376

0

+960

CC (12/14)

1808

9190

0

10998

CC (12/15)

4661

9377

0

14038

change

+2853

+187

0

+3040

All Aspects(12/14)

13426

32549

1

45976

All Aspects(12/15)

19372

33577

1

52950

change

+5946

+1028

0

+6974

</protect>



Methods and strategies for annotation

(please note % effort on literature curation vs. computational annotation methods)

Literature curation:

EcoCyc annotates directly from the literature, currently citing 30,224 individual references for 4500 genes. EcoCyc works to document biochemical pathways in their entirety, and uses a mini-review format in addition to GO to capture as much information as possible. EcoCyc is dedicated to Escherichia coli K-12 MG1655, whereas EcoliWiki incorporates annotations from EcoCyc but also annotates other Escherichia coli strains as well as numerous bacteriophage proteins. Annotations from both sites for taxon 8333 are submitted in the gene_association.ecocyc file.

EcoliWiki supports community manual curation both through EcoliWiki itself and through GONUTS/CACAO.

Effort: 100%

Computational annotation strategies

EcoliWiki focuses on manual annotation, so computational annotation is not a priority. The IEAs produced by UniProtKB are assimilated into the GAF and resubmitted.

Effort: 0%

c. Priorities for annotation: EcoliWiki and EcoCyc work to annotate all literature across the entire genome.

CACAO continues to prioritize recruiting additional participants. In 2015 we also began working on support for integration of CACAO with Phage hunting courses at Maryland-Baltimore County (w/Ivan Erill) TAMU with Ry Young and Jason Gill.

Presentations and Publications

a. Papers with substantial GO content

none in 2014-15

b. Presentations including Talks and Tutorials and Teaching

CACAO curation was held in both the Fall and Spring Semesters.

c. Poster presentations

5. Other Highlights:

A. Ontology Development Contributions:

EcoliWiki contributes to discussions on the GitHub (previously SourceForge) trackers, as well as bringing attention to potential issues with terms and the ontology as we become aware of them.

B. Annotation Outreach and User Advocacy Efforts:

EcoliWiki operates another wiki, GONUTS (Gene Ontology Normal Usage Tracking System), which is available to the general public as a way to access the ontology and existing annotations for all proteins, regardless of organism, currently in UniProt. GONUTS facilitates the addition of annotations and usage notes to terms, journal articles, and proteins. EcoliWiki also promotes annotation through CACAO, a course hosted at Texas A&M University on the GONUTS website. CACAO (Community Assessment of Community Annotation with Ontologies) teaches mainly undergraduate and graduate students, both at Texas A&M and remotely worldwide, not only how to utilise the Gene Ontology but also to improve the Ontology. To date, over 1200 users have edited the GONUTS wiki, and 3848 manual annotations have been submitted to EBI for propagation. CACAO students have had numerous New Term Requests approved and also have identified errors in term definitions and ontology structure.


C. Other Highlights: