Difference between revisions of "GOC Meeting Minutes September 2009"
m (→Binding Discussion(Ruth))
(Adding action items section)
|Line 499:||Line 499:|
Up to [[Cambridge_GO_Consortium_Meeting]] or [[Consortium Meetings]]
Up to [[Cambridge_GO_Consortium_Meeting]] or [[Consortium Meetings]]
Revision as of 09:04, 29 March 2010
- 1 Weds. AM
- 1.1 review grant status (Judy)
- 1.2 Community Interactions and Outreach
- 1.3 Reference Genomes Progress Report (Pascale)
- 1.4 PAINT demo(Suzi)
- 2 THURSDAY
- 2.1 Infrastructure (Mike/Chris)
- 2.2 Ontology Quality Control Reports(David)
- 2.3 Cross-Products in GO (Midori)
- 2.4 Function-Process Links
- 2.5 Alignment of GO with Pathways Databases (David)
- 2.6 Annotation Relationships (Chris/Jane)
- 2.7 UniPathway (Anne Morgat)
- 2.8 Binding Discussion(Ruth)
- 2.9 GO and external ontologies
- 2.10 Display Issues and Change Management(Chris/Mike)
- 2.11 Binding Discussion Summary (Ruth)
- 2.12 Ontology generation tool for OBO-Edit(Thomas Wachter)
- 2.13 GO Slim Overhaul(Jane/Val)
- 3 Friday
- 4 Action Items
review grant status (Judy)
- 2yr project to fund CL development focusing on immunology and neurons funded!
- Protein ontology development supplemental grant funded to represent complexes.
Discussion of PRO/GO and their relation to each other
- (Judy)Pro very specific representation..mouse proteins in mouse complex...GO component is representing the class (ie not species specific).
- each pro complex should have a xref to the GO complex.
- complexes should be represented by their function, not their protein members
- (David)some complexes defined by the proteins they containe, some by what the complex does.
- (Judy)function of a complex may be cell type specific.
- (Chris)GO should do complexes, PRO should do proteins which are then pointed to by GO components.
- (Jud)this is not how the process was described in the PRO grant.
- (Ben)complexes defined by their function may introduce a naming issue if the same complex has different functions in different places or cell types.
- (Suzi)GO and PRO need to discuss how best to represent these complexes and relate PRO to GO.
ACTION: Primary stake holders in GO and PRO need to agree on how GO components and PRO will relate to each other
- Funding obtained for protein set specification (panther group)
- (Judy)Gene targets may want to be skewed towards genes known involved in Human disease. Important for grant and ongoing funding
- (Rex)It is mportant to balance with genes having unknown function..emerging new information is important too..balance in choosing targets is needed.
- (Ruth)There are human genes which we know a lot about that are not annotated. People want the data and are surprised not to find it.
- (Judy)Increasing concern that we are sorely lacking in annotation depth in areas we know a lot about. Need more resources.
- (Rex)How do we define relevance to Human biology.?
- (Suzi)The original Reference Genomes proposal goes for annotation breadth AND depth. Need ways to prioritize targets, but this does not preclude inclusion of genes outside of a selected target gene set.
- (Judy)We can look at systems, diseases, processes
ACTION: Annotation groups need to get ISS annotations from PAINT into their databases this year.
ACTION: Participating groups need to provide a file of their comprehensively annotated genes
Species to consider outreach to
- daphnia (Indiana)has GO annotations (Michael Lynch)
- Sea Urchin
Facilitation of annotation for new genomes
- (Brenley)GONUTS can support annotation of any alternate species.
- (Rama)We need an annotation camp. It’s been several years.
- We need more ongoing training
- (Judy/Paul)PAINT needs a mechanism of annotation for new genomes, have GO jamboree, bring these new groups in.
- (Emily)Electronic annotation pipelines lack development and improvement efforts which could help new groups.
- (Emily)More effort needed in expanding electronic curation efforts. More sources of electronic annotation are out there.
- (David)Automated annotations are too general, result in clustering at high level (like developmental process) which is not that valuable for new groups.
- (Judy)Need continuing discussion here to focus on utility of GO. What do users want/need? How do we focus our effort here for the next year? Need to address to get continued funding.
- (Rex)This was 'THE REASON' for the Reference Genomes Project. How can it’s role be expanded? Why isn’t the project working for non-reference genenomes species?
- (Micheal)EMO-emerging model organisms
- (Paul)82 species have been chosen to be close to as many species as possible for ortholog prediction and functional annotation to related species.
- SP/UNIPROT group are increasing manual annotation efforts primarily focused on Human proteins
- (Kara)Consider outreach to ourselves as a priority for the next year. It’s not clear there is “MOD buy in”. How is our own work being incorporated into our OWN databases? Peter Good wants to know “how does this work advance biomedical hypothesis generation?”
- (David)How do we get non-NIH funding from sources interested in general biology?
- Perhaps there is a mis-perception of the state of things. PAINT annotations are still under development so not yet fully spreading the resulting ISS annotations to other species...that is coming.
- (Susan)Should we highlight Reference Genomes annotations? Groups with one annotator get behind in curation, this should not be confused with lack of buy in or desire to participate
ACTION: MODs-make an effort to highlight Ref Genome annotation
- (David)Data in Reference Genomes paper highlights progress and effect of ref genome effort on annotations.
- (Mike)do MODs announce or advertise their participation in Reference Genomes project?
- (Ruth)Following up with ISS annotations is difficult.
- (Judy)We are poised to make the next step of propagating annotations. This should highlight the annotations generated from the Reference Genomes project.
- (Rex)For grant progress report and Reference Genomes website, we should provide metrics like the number of Reference Genome annotations and the percent of databases that have those annotations propagated.
- Real value will become apparent from propagation of annotations to other species
- (Judy)If we can provide a set of annotations for one of the non-Reference Genomes species in the next year this will be good demonstration of the power of the effort. This may be a good focus point for the next year.
- (David)It is easy but not obvious how to get a version of the GO. Needs to be easier if we are asking users to do this.
- (Midori)Users are not asking for how to find ontology version numbers
- (David) I see requests for how to find out what version was used in past studies
Community Interactions and Outreach
- (Val)pombe community annotation: http://www.sanger.ac.uk/Projects/S_pombe/community_curation.shtml
- How did Val get 19/20 respondants?
ACTION:Val will make available the form used to elicit responses from authors
- (val)Estimates 1/3 the time to complete the curation of papers with authors filling in this form. Form sent to last author. Results are processed by Val. Results generate questions etc..so much work follows on responses, but does get info direct from authors.
- not traced to who did the annotation though.
- facilitates user understanding of the existing annotations.
- TAIRs Plant Phys submission form can be seen here: http://www.aspb.org/publications/tairsubmission.cfm
- (Tanya)TAIR Plant Phys submission form was improved with autocomplete to help submitters pick GO terms. Also started similar process with the journal Plant Journal using a template spreadsheet. PJ handles this as a supplemental data submission included with the paper when published.
- (Jane)Recommendations from the GO review
- (Judy)What is a function, what is a process? We should define these..what do we mean by these in GO? We need to be clear about what we are doing.
Web Stats (Seth)
- Seth can set up access to Google Analytics if you send him your gmail account. (File:New-analytics.pdf)
- (Mike) Machines were moved at Stanford, disrupting Google Analytics reporting, but usage stats have been very constant over past 12-24 months..what does that mean?
- (Jane) GO News site(http://go.berkeleybop.org/news4go/), twitter, and RSS feed are available off gene ontology web page.
AmiGO status (Chris)
- (Chris)AmiGO status-IEAs included, but not GOA files...species excluded consequently, but IEAs for MODs included.
- (Judy)What are the questions that wet bench biologists want to ask of GO...and are we making that possible for them?
- (Ruth)Labs use Inginuity because Inginuity have a large number of annotators and users get more complete results. They use their own interface, GO+their own ontologies, own annotations. How many groups pay for access to inginuity? Is that money well spent?
- (David)Examine inginuity..why are users using it? What can we learn from that?
- (Jen/Varsha)Incorrect annotation is offputting, so is incomplete annotation
- (David)Users may perceive an annotation as incorrect, but often that is not the case.
- What is the issue at hand?
- interface (ala inginuity..inginuity provides a distinct service than GO)
- do we need to distinguish GO from inginuity, when to use which, for users?
- lack of completeness of annotation or ontology?
- incorrect annotation or ontology structure?
- user perception or understanding of the GO? (dopamine is not always a neurotransmitter for example)
- why are biologists not using GO? If not using GO what alternative is being offered?
- Users/developers of third party GO tools may not keep their ontologies up to date which makes GO seem less useful.
- (Mike)We need to be creative about what GO provides.
- (Judy)We might want to lower the visibility of GO Tools and increase the visibility of using GO from the GO site and point from MODS to GO site.
- (Paul)If we are too far removed from end users it is hard to value our contribution and hard to get feedback from users directly
- (Tanya)Target surveys to MOD users list, but also meeting registrants who may or may not be MOD users. Ask if they use GO? Provide a web page for a survey so all survey respondants can fill out the same form no matter which meeting they are at?
ACTION: Build a survey for grant update (Val, Jane, Jen), determine where to send it. (Mike has subscription to SurveyMonkey)
- (Midori)EBI targets biologists...ask them how they do it!
- (Eva)Send reminder emails to survey recipients to try to reduce self selection of survey responders.
ACTION: provide two survey urls and compare responses from a targeted list like the submitters to GO help list with responses from a random list of biologists
Reference Genomes Progress Report (Pascale)
- (David)Do we have a gold standard set of genes we use to test annotation metrics? Not currently..but we should find a set for this purpose. This will standardize testing of annotation progress.
- (Brenley)Should GONUTS use UniProt IDs? -Yes
- (Brenley)Should GONUTS use UniProt protein names? maybe..but may not be that useful
- (Judy)There is a proposal for Human/Mouse/Rat isofom nomenclature
- (Chris)in GAF2.0 annotations to genes, col. 17 will contain specific isoform reference, typically UniProt.
- need java 1.5 to support drag-and-drop function critical to PAINT annotation process.
- PAINT uses the GOlite DB, updated ~weekly
- Update GAF files no less frequently than once per month!
- Transferring annotations only can happen when manual annotation is complete by all groups for a given gene (QUESTION:How does this get reconciled with the proposal put forth by Pascale later in the day to do away with Google spreadsheet?)
ACTION: Locate code for making font size changes in PAINT(Paul)
- Name column in PAINT table view should say “symbol” perhaps?
- Uniprot IDs should be indicated as ‘Reviewed” or “unreviewed” (UniProt vs. Trembl)?
- (Judy)Make lines in table for genes that have manual annotation more prominent somehow..bold is not visible enough, Michael assures it is plain as day on the computer screen...
- (David)Would like to see filter/highlight by species to examine annotation of paralogs
- (David)We should be sure to have a common mechanism for feeding GAF files back to MODs, could be used for PAINT annotations, GONUTS GAF dumps, and inferred annotations from Function/process cross references.
- Positive annotations should not be propagated to clades that have NOT qualified annotations to related terms..Suzi assured us that won't happen.
- As PAINT tree annotation proceeds, propagation of annotations that can be automated will be. Helps to streamline PAINT curation process.
- (Pascale)GAF files generated from PAINT are already available from wiki and Ref Genome page for MODs to load into their own data.
- (Paul)Propagation of annotations depends upon our ability to interpret annotations and the context in which that annotation was made. *Some of those details lay in col. 16 like process X occurs in cell CL:#### at stage Y.
- (Suzi)If PAINT illustrates problems with protein/gene records (example: gene merge needed), how can that be fed back to sources to get the problem adjusted?
- (Paul)Consider infrastructure to make more conjunctive statements-function X occurs in component A of cell Y for example.
- Consider choosing target genes based on functional systems
- (Rex/Judy)All time taken away from literature curation takes away some experimental annotations needed to support inferential annotations.
- (Paul)The PAINT tree curation process is not highly scalable. Re-reading of papers is sometimes needed to facilitate proper tree curation...can that be mitigated in some way?
- (Judy)Efficiency of tree curation is important, propagated annotations will be examined by MOD curators before they are added to our databases. This will then address some of the complexity that Paul was bringing up.
- (Rex)bottom line...we (all of us) don’t have enough funding to curate all the necessary literature.
- (Judy)We already set the bar lower from “complete” annotation to “comprehensive” to help address the volume issues.
- (Judy)We have a lot of genes named that users reasonably expect to be annotated. Proper experiments to demonstrate the functions won’t ever be done in mouse for example if they have been done in cerivesia. These annotations are needing to be transferred..users expect them to be there..basic functionality like spliceosome.
Proposed method to get away from Google Spreadsheets
- Pascal’s proposed method to stop using the Google spreadsheets for Reference Genomes annotation tracking was generally accepted as worth further exploration.
- Compare relative dates of PAINT tree annotation with the most recent date for experimental annotation from all GAFs.
- (Rex)How will status of ‘comprehensive’ annotation be captured with Pascale’s proposal? Earlier it was stated that 'comprehensive' annotation status is needed before PAINT curation can occur.
- (Pascale)May not need concept of ‘comprehensive’..why not just go with ‘here is what could be propagated as of this date’, and then keep track of which genes need new PAINT tree curation due to newly added experimental annotation?
- (Pascale)In PAINT, the presence of experimental annotations should be emphasized from ANY source..not just Ref. genomes.
- Review Weds. Minutes-results of discussion logged in wiki minutes
ACTION: OBO format 1.3 tags (creation date, etc.) will be added to gene ontology ext
- Everyone agreed to this
ACTION: OBO version number added to all OBO files
- Everyone agreed to this
GAF publishing pipelines
- GAF 2.0 files currently get converted to GAF1.0 during filtering process, 2.0 format file is in submissions directory
- WHEN should switch be made to pure GAF 2.0 format? Need sufficient lead time.
ACTION: GAF files from PAINT will need to be GAF 2.0 format
ACTION: Switch to GAF 2.0 in publishing pipeline with substantially long (3 Months) public notice to this change.
ACTION: add notices like this change to GAF 2.0 to GO News Feeds
Unannotated genes and GAFs
- The problem: we want all genes represented in AmiGO
- Currently not happening b/c GAF files only contain annotated genes
- Uncertainty still exists regarding proper format for gp2protein file
ACTION: post a clear spec of the desired format for gp2protein files for all MODs-include 1 row for every gene? 1 protein ID per row? No protein ID is OK if no protein is available?
ACTION: keep GAF file as is. Provide a new file (gpfile?) to describe gene products. Provide a detailed spec of the contents of this new file. This file may subsume gp2protein
Function->Process inferred annotations by IC
ACTION: MODS please look at the F->P IC annotations proposed for your species in http://geneontology.org/scratch/gaf-inference. If you have issues get back to Chris in the next 2 weeks.
ACTION: MODS will load these F->P IC annotations to their database
ACTION: Taxon constraint checks, report should be sent back to group but not filtered out..still load.
ACTION: chris check that interontology inferred annotations limited to inference from experimental annotations only.
Changes to GO Database management practices
ACTION: use a more relaxed schedule for building GO Full (quarterly perhaps?)
ACTION: reduce load of go lite from 3 to 1 time per week.
- Both of these actions agreed to by all.
XP term request template
- Automation of term requests for regulates terms
- Curator could use ID immediately
- If term is not allowed by ontology editors, it could be obsoleted
- Siimilar mechanism for cellular component cross products
- If is_a parent can’t be found for some reason, don’t allow submission
- Should probably be behind a login or security mechanism to know who did the submission
- This mechanism will hide new term requests from other annotators which is undesirable-find a way to alert everyone to new term requests.
- SF requests provide a history for new term requests which may be desirable for these as well.
- Curators like to be able to determine how many term requests were made..can do this through SF. Would like to retain that functionality.
ACTION: implement a version like shown for curators to try before next GO meeting
Ontology Quality Control Reports(David)
- Now using OBOEdit to provide implied links via regulates.
- Many QC reports have greatly improved ontology quality
Cross-Products in GO (Midori)
- proposal to move corss-product definitions from multiple files into gene_ontology_write.ob
- start with Regulates, then BPxBP using part_of, then CCxCC, CCxMF or BP
- New relations needed to implement CCxBP and CCxMF cross products
- Are there any consequences for software rendering graphs?
- BP->MF now has regulates relationship
- MF->BP starting to get part_of relationship
- Barry concerned about use of part_of between F and P
- Another relation that allows propagation of annotations across the ontologies would be good
- Function ontology represents activities, which can be related to processes since they are both occurrants.
- Use systematic names for these terms
- (Barry)consider renaming Molecular Function to Molecular Activity
- Often enzyme assay shows function, mutant shows process..but the two are not combined in the experimental results. Curators will need to be careful of annotating with P->F compound terms.
Alignment of GO with Pathways Databases (David)
- pilot to take Reactome pathway and coordinate MF in GO with catalytic activities in Reactome as test case.
Annotation Relationships (Chris/Jane)
- There is an implicit relation between an annotation and the gene to which it applies
- explicit relation between annotation and gene would be better
- relations may be like ‘extrinic to’ or ‘acted on during’ to relate annotation to gene
- host-symbiont processes have more than one participant
- this is represented currently in ontology structure via host processes and symbiont processes
- Chris suggested to have a relation between the gene product and the annotation
- This is a long term solution, currently seeking comments
ACTION: Group agreed this makes some sense. Work up a more concrete specification of how this would work.
UniPathway (Anne Morgat)
- linking from UniPathway to GO via EC won’t be good long term solution. EC not represented 1EC->1GO.
- gene-specific GO terms were recently removed, this project will re-instate such terms.
ACTION:Pascale, Anne, Harold, Chris, Peter D’Eustachio, and Mary Dolan will work together to provide cross products b/t MFunction and Chebi to provide and report back to the group in the Spring meeting
- Guidelines for Curation of Binding
- Avoid redundant binding relationships for substrates/products -group agreed
- The purpose of the bind term guidelines is to minimize redundancy and duplication of GO term information-group agreed
- An enzyme MUST bind all of the substrates and products of the reaction it catalyzes-group agreed
- Similarly, a transporter MUST bind the molecules it transports-group agreed
- Therefore, binding is implied by the molecular function GO term describing the activity of an enzyme or transport-group agreed
- Consequently, it is redundant to annotate an enzyme or transporter with GO binding terms for each of its substrate/products, and curators should avoid making such redundant annotations.-tentative agreement by group, Noted that some transporters don’t bind their substrates.
- There will be some cases, however, where it is appropriate to annotate to a binding term-group agrees
- For example, published experiments may show that a gene product binds a non-hydrolyzable ATP analog, without demonstrating that it has ATPase activity-tentative agreement with some lack of understanding perhaps. (pascale)-We don’t create sub reaction terms for other cases, why should we have them for things like “ATP binding” which is a sub step for a reacion?
- In such a case, it would be appropriate to annotate to ATP binding using IDA-Agree
- The GO is committed to ‘annotating to the experiment’-Agree
- The curator should try to capture specifics as much as feasible-Agree...this one encapsulates points 1-10!
- Curators use judgement about when to associate an enzyme or transporter with a binding term for it’s substrates/products and also use their judgement to decide whether the interaction is physiologically relevant-Accepted
- Curators should recognize that GO annotations should capture information relevant to the in vivo situation, not artifical substrates.-Agree tentatively, Rex-what about something like ‘RGD binding’?
- (Paul, Amelia) If ATP binding is shown in an instance of a kinase activity, we should consider having a term like ‘ATP binding involved in kinase activity’.
- Should we distinguish substrate binding from effector (non-catalytic) binding?
- No big push back on this issue from group
- What about single molecule which is a effector in one case and substrate in other case.
- Should IC be used to annotate to binding term where catalytic activity has been shown? -NO-this is not necessary.
- What, if any, binding term annotations can be transferred via ISS/ISO?
- Binding terms should be transferred by ISS, but whether to transfer the thing that was bound is controversial. ‘protein binding’ annotations become vague if we drop the ‘with’ field contents upon transfer, while more specific terms like ‘kinase binding’ are less vague when transferred even if we drop the ‘with’ field contents of the experimental annotation.
ACTION:more work needed to iron out HOW to transfer binding term annotations properly to retain all information
- Paul NEEDS us to be able to do this for PAINT tree annotation
(Rex) ACTION:actin-dependent ATPase activity and tubulin-dependent GTPase activity terms are incorrect
GO and external ontologies
- ChEBI have a direct submission tool for new chemical submissions
- ChEBI are removing plurals where possible, will still have to retain plurals in cases where root is a member of the class (phenol is a member of the class phenols)
- (Amelia)GO have not always made it clear either whether they care discussing the chemical class or the chemical itself in GO terms.
- ChEBI curation tool-
- ChEBI:GO alignment-need to get these aligned ASAP, little work done here yet, it’s on Markus’ desk.
ACTION:Increase effort to get GO and ChEBI aligned
- ChEBI soon to incorporate ~500,000 chemicals (mostly drugs) and other large data sets from LIAI and Sabio-RK
Importing the virus ontology
- conceptually importing external ontologies is fine, need to examine on a case by case basis and evaluate how it will be updated.
GO cellular component and the NIF-subcellular ontology
- Looking at alignments and trying to move forward there
- NIF has high level splits that are not reflected in GO component, like lumens vs physical entities.
- (Alex)Cell behavior ontology-have mentioned creating their own cellular component ontology. They may not yet know exactly where they are going with that..we should follow them and encourage them to use GO
- GOOGLE for ‘cell behavior ontology’ finds their project
- Sounds like they may need a local specific application for thier own modeling strategies...may or may not be supported by GO
Display Issues and Change Management(Chris/Mike)
- Some folks are surprised by ‘has_part’ relation, which points in a different direction from ‘part_of’, ‘is_a’ etc.
- Term enrichment software may not be handling the ‘has_part’ relation properly.
- (Mike)Annotation group may have needed more education before ‘has_part’ went in perhaps to avoid confusion.
- (Mike)When ‘has_part’ was added, ‘part_of’ was removed, so when ‘has_part’ is also stripped out to remain compliant with past OBO formats, we lose the part_of information.
- (Judy)Maybe we need a GO annotation camp for ourselves..bring curators up to speed with some new concepts like the reorg of the component ontology, how to interpret the ‘has_part’ relation, cross product generation and usage, etc.. etc..
ACTION:Have another annotation camp, perhaps just for current GO curators
- (Harold)software engineers also need to understand the rules of GO so they can implement it. Consider inviting developers to a GO meeting to help in that area
- (Emily)cost of travel may be prohibitive for an effective face to face camp. Consider a number of virtual conferences focused on very specific items like implementation of col. 16.
- (Ruth)Be sure to identify all affected parties when new ideas are being implemented
- Use cross-functional groups to examine specifications for new projects so all interested parties are represented (software, curator, DB admin)
- GO satellite meeting at the next biocuration meeting in Japan?
Binding Discussion Summary (Ruth)
Agreed by the group:
- Do not separate substrate binding from effector binding
- Do not use IC to add a binding term annotation if an enzyme/transporter activity is present which covers this statement
ACTION: Current working group will edit the binding term annotation gielines in line with the comments from discussion and distribute to PI’s for approval and then include in guidelines.
ACTION:paul and Emily interested in joining new working group discussion on ISS/IC topic
Ontology generation tool for OBO-Edit(Thomas Wachter)
- Tool found in tools section of OBOedit
- Where do we want to go with this tool?
- Pascale, Suzi and Jen have some ideas to share with Thomas
GO Slim Overhaul(Jane/Val)
- Problems found with high level terms is slowing this process.
- Eventually it would nice for users to be able to expand or collapse the slim to customize it for their own usage.
- EBI has a policy of only retaining staff for 9 years. Midori is leaving next April. We have decided to divide up her work among the staff members remaining.
Jane and Jen will share the work. Jen will take over the SF item triaging. Jane will do more ontology development, and write ontology project reports. We would like input from all of you about management issues.
- The next meeting will be a major SAB meeting. We will try to sort out the new directions for the next grant cycle.
- Next meeting – March or April.
Washington DC is Judy’s suggestion. Who will host it? Also it makes everybody travel. Would it be cheaper in Rockville. Michelle is in Baltimore. Jim has offered Texas. But April might be too hot in texas. Big annuval fly conference is first week of April. CA it is! Please look at your schedule and let Judy know.
Lot of confusion about has_part. Chris presented his slides. Many of the existing terms look biologically illogical not intuitive. If there is difference biologically, then it makes sense to capture it separate terms. Propagating gps in the case of TFIIB complex makes sense. But not in the nucleus and mitochondrion chrmosome example. In my organism, there is not NEFF2 complex. Then this causes a problem. This also relies on the identiy of complexes beased on the members of the complex.
Another relationship type? All_parts_are_part_of
Continue to add has_part where this doesn not involve removing existing links. Solicit Input when making new has_parts Have a protocol similar to obsolete protocol. Send a message to the list etc.
We need a inference task force to make sure these relationships work etc.
Complexes changes subunits based on tissue type and what it does etc. Representation of complexes is very important for future development.
Our existing model answers question- what gps are always there vs gps might be there?
The structure should be irrelevant to the annotator- do they have to know the relationship and the ontology to be able to annotate.
Can you have has_part in addition to part_of? How do we deal with sometimes part_of? Some organisms don’t have NEFF3 part of CoreTFIIH?
Summary- Continue to add has-part in cases where it augments ontology, and in other cases consult the annotation groups before deleting those relationships.
CoreTFIIB haven’t been touched. Spliceosome terms were changed and they were TPVs anyway.
Jen-Signaling First slide - Definition- What about signaling between organisms? Why is Synthesis of the signal not part of signaling?
-Individual signaling process under signaling -signal termination etc under signaling process
There is signal release that happens within the cell- so where definiton says ‘extra cellular medium’ - that needs to be fixed. Get rid of the last sentence for signal transduction (about transcription). Intiation of signal transduction should have a relationship to positive regulation of signal transduction. Information that gets removed from the definition needs to be captured in Comment section. Initiation of signal transduction- definition is bothersome. Steriods may not fit with this (Peter says steroids are fine). Is_a postitive regulation of signal transduction. Including the receptors (did not get that). Remove the last sentence. If it includes the receptors then it would not annotated to the sign_tran process.
Signal transduction begin with activation of a receptor and ends with the change of function or state of the cell. Not all transduction begins with a receptor. Signal can be light.
Intiation of signal transduction and signal synthesis?
Peter- Receptor mediated transduction is distinct from other types of signal transduction. We need to make that distinction. Cell membrane is not necessary to start this process. 2 kinds- Surface receptors and cytosolic receptors. Paul – can you please email the definitions to the working group.
Subsequent email from Paul:
As promised, here are the terms, definitions and relationships (actually, the is_a children are listed for each term) for signal transduction that we developed for PANTHER in conjunction with some experts in 2001.
Name: Signal transduction Definition: The series of receptor-dependent signaling events required for a cell to respond to an extracellular signal, not including transcriptional events. Category ID: BP00102 Child: Cell communication Cell surface receptor mediated signal transduction Intracellular signaling cascade
Name: Cell communication Definition: Part of the signaling process that occurs outside of cells. These events usually involve cell adhesion molecules, signaling molecules, and other molecules that signal by binding receptors. Category ID: BP00274 Child: Cell adhesion-mediated signaling Extracellular matrix protein-mediated signaling Ligand-mediated signaling Steroid hormone-mediated signaling
Name: Cell adhesion-mediated signaling Definition: Signal transduction between cells using cell adhesion molecules as the signal.
Name: Extracellular matrix protein-mediated signaling Definition: Signal transduction between cells using extracellular matrix proteins as the signal.
Name: Ligand-mediated signaling Definition: Signal transduction between cells using a ligand, usually a secreted peptide such as a growth factor or a cytokine as the signal.
Name: Steroid hormone-mediated signaling Definition: Signal transduction between cells using a steroid hormone as the signal.
Name: Cell surface receptor mediated signal transduction Definition: Part of the signal transduction process mediated by a receptor on the cell surface. The process includes all the events upstream of secondary messengers. Category ID: BP00103 Child: Cytokine and chemokine mediated signaling pathway G-protein mediated signaling Receptor protein serine/threonine kinase signaling pathway Receptor protein tyrosine kinase signaling pathway
Name: Cytokine and chemokine mediated signaling pathway Definition: Part of the cell surface receptor mediated signal transduction process involving cytokine or growth factor receptors.
Name: G-protein mediated signaling Definition: Part of the cell surface receptor mediated signal transduction process involving G-protein coupled receptors. Cyclic AMP and IP3 mediated signaling processes are part of this category.
Name: Receptor protein serine/threonine kinase signaling pathway Definition: Part of the cell surface receptor mediated signal transduction process involving serine/threonine protein kinase receptors.
Name: Receptor protein tyrosine kinase signaling pathway Definition: Part of the cell surface receptor mediated signal transduction process involving tyrosine protein kinase receptors.
Name: Intracellular signaling cascade Definition: Part of the signal transduction process occurring in the cytoplasm of the cell. These events are usually downstream of the receptor mediated secondary messengers. Category ID: BP00111 Child: Calcium mediated signaling JAK-STAT cascade JNK cascade MAPKKK cascade NF-kappaB cascade NO mediated signal transduction
Name: Calcium mediated signaling Definition: A signaling process dependent on the intracellular concentration of the calcium.
Name: JAK-STAT cascade Definition: An intracellular signaling process involving mainly two types of kinases: Janus Kinase (JAK1), and the Signal Transducers and Activators of Transcription (STATs), such as STAT1. The process is usually initiated by a cytokine.
Name: JNK cascade Definition: An intracellular signaling process involving the activation of JNK kinase, a specific type of MAP kinase. Its expression is induced by ultraviolet light, causing it to bind the c-JUN transactivation domain, phosphorylating it on serine residues.
Name: MAPKKK cascade Definition: An intracellular signaling process involving Mitogen Activated Protein kinase kinase kinase (MAPKKK). The relevant cascades include MAPK or extracellular signal-regulated kinase (ERK), MAPK kinase (MAP2K, also called MKK or MEK), and MAPK kinase kinase (MAP3K, also called MAPKKK or MEKK). MAPKK kinase/MEKK phosphorylates and activates its downstream protein kinase, MAPK kinase/MEK, which in turn activates MAPK. The kinases of these signaling cascades are highly conserved, and homologs exist in yeast, Drosophila, and mammalian cells.
Name: NF-kappaB cascade Definition: An intracellular signaling process involving the eventual activation of NF-kappaB, a transcription factor originally found to switch on transcription of genes for the kappa class of immunoglobulins in B lymphocytes. NF-kappaB is found in the cytoplasm in an inactive form, bound to the protein I-kappaB. A variety of stimuli, such as tumor necrosis factor, phorbol esters and bacterial lipopolysaccharide activate it, by releasing it from I-kappaB, allowing it to enter the nucleus and bind to DNA.
Name: NO mediated signal transduction Definition: Nitric oxide-dependent signaling.
MikeT- There is no clear and beginning to all these processes. Thinking of modularity is also not good idea. It is a very loaded concept. Because the boundaries are not clear and hence not easy to define.
This can be done with MF-process terms (check with Dave). Use function terms (MAPKK for example ) to capture the part of the process. Describe a module as a collection of molecular functions
What happens to the term- cell communication?
End of signaling discussion----
Heart development- Varsha
One of the advisors was able to map MGI phenotypes to new GO terms and this can be used to annotate human genes (ISO/ISS). Chris said there is a mapping file already available and there is a paper coming out on that. 28 terms already in and 100 more coming. Varsha will pass on the spreadsheet that the advisor gave her to Chris. Chris will try to work with other groups and figure out rules to transfer annotations.
- Primary stake holders in GO and PRO need to agree on how GO components and PRO will relate to each other
- Annotation groups need to get ISS annotations from PAINT into their databases this year.
- Participating groups need to provide a file of their comprehensively annotated genes
- MODs-make an effort to highlight Ref Genome annotation
- Val will make available the form used to elicit responses from authors
- Build a survey for grant update (Val, Jane, Jen), determine where to send it. (Mike has subscription to SurveyMonkey)
- provide two survey urls and compare responses from a targeted list like the submitters to GO help list with responses from a random list of biologists
- Locate code for making font size changes in PAINT(Paul)
- OBO format 1.3 tags (creation date, etc.) will be added to gene ontology ext
- OBO version number added to all OBO files
- GAF files from PAINT will need to be GAF 2.0 format
- Switch to GAF 2.0 in publishing pipeline with substantially long (3 Months) public notice to this change.
- add notices like this change to GAF 2.0 to GO News Feeds
- post a clear spec of the desired format for gp2protein files for all MODs-include 1 row for every gene? 1 protein ID per row? No protein ID is OK if no protein is available?
- keep GAF file as is. Provide a new file (gpfile?) to describe gene products. Provide a detailed spec of the contents of this new file. This file may subsume gp2protein
- MODS please look at the F->P IC annotations proposed for your species in http://geneontology.org/scratch/gaf-inference. If you have issues get back to Chris in the next 2 weeks.
- MODS will load these F->P IC annotations to their database
- Taxon constraint checks, report should be sent back to group but not filtered out..still load.
- chris check that interontology inferred annotations limited to inference from experimental annotations only.
- use a more relaxed schedule for building GO Full (quarterly perhaps?)
- reduce load of go lite from 3 to 1 time per week.
- implement a version like shown for curators to try before next GO meeting
- Group agreed this makes some sense. Work up a more concrete specification of how this would work.
- Pascale, Anne, Harold, Chris, Peter D’Eustachio, and Mary Dolan will work together to provide cross products b/t MFunction and Chebi to provide and report back to the group in the Spring meeting
- more work needed to iron out HOW to transfer binding term annotations properly to retain all information
- actin-dependent ATPase activity and tubulin-dependent GTPase activity terms are incorrect
- Increase effort to get GO and ChEBI aligned
- Have another annotation camp, perhaps just for current GO curators
- Current working group will edit the binding term annotation gielines in line with the comments from discussion and distribute to PI’s for approval and then include in guidelines.
- paul and Emily interested in joining new working group discussion on ISS/IC topic