The CyberCell Database (CCDB) is a freely available, web-accessible database that provides quantitative genomic, proteomic as well metabolomic data on Escherichia coli.[1] Escherichia coli (strain K12) is perhaps the best-studied bacterium on the planet and has been the organism of choice for several international efforts in cell simulation. These cell simulation efforts require up-to-date web-accessible resources that provide comprehensive, non-redundant, and quantitative data on this bacterium. The intent of CCDB is to facilitate the collection, revision, coordination and storage of the key information required for in silico E. coli simulation.[1]

The CyberCell database
Content
DescriptionA database providing quantitative genomic, proteomic, and metabolomic data of E. coli.
Data types
captured
Gene and protein data; functional or ontological information; gene position and protein location; protein, metabolite and RNA expression levels; protein interaction and protein stoichiometry information; enzyme rate constants; metabolite structures, reactions and pathways; lists of cofactors and ligands; other molecular data.
Contact
Research centerUniversity of Alberta
LaboratoryDavid S. Wishart
Primary citation[1]
Access
Websitehttp://ccdb.wishartlab.com/CCDB/
Miscellaneous
Data release
frequency
Last updated on 2006

Content

edit

The CCDB contains four different browsable databases providing gene/protein information (CCDB), 3D protein structure data (CC3D), tRNA and rRNA information (CCRD), and metabolite data (CCMD), respectively. The data has been collected or generated using various sources and tools, including textbooks, published scientific articles, electronic databases, in house software as well as web-based programs. Each database exists as a re-formattable, easily browsed synoptic table which allows users to casually scroll through the different databases. Detailed information about each gene, protein, RNA, 3D structure or metabolite may be obtained by clicking on the ‘ColiCard’ on the left column. Every card contains more than 60 fields concerning all aspects of the sequence, function or structure of a given molecule as well as hyperlinks to other sources of information such as EcoGene [2] and EcoCyc,[3] scientific abstracts, and interactive applets to view structures or chromosomal maps. One of the more attractive characteristics of CCDB is its ability to support database searching and sorting. It offers utilities for local BLAST searches, Boolean text searches, chemical structure searches, and relational database extraction. The results of the latter can be exported in various formats including HTML, Excel and a circular chromosome applet view. One of the most popular features in the CCDB are its “E. coli Statistics” pages (available through the “Stats” link at the top of the CCDB home page). This link provides a rich source of information about the content, dimensions and physical-chemical characteristics of the E. coli cell.

Community annotation

edit

In order to facilitate the correction and submission of data, the CyberCell database allows users to electronically edit or update ‘ColiCards’. These modifications are permanently added only after they have been reviewed and accepted by an archivist. In addition to modifications by users, CCDB also performs automated self-updating operations on a regular basis, which keeps the database up-to-date.

Scope and Access

edit

All data in the CCDB is non-proprietary or is derived from a non-proprietary source. It is freely accessible and available to anyone. In addition, nearly every data item is fully traceable and explicitly referenced to the original source. CCDB data is available through a public web interface and downloads.

See also

edit

References

edit
  1. ^ a b c Sundararaj, S; Guo A; Habibi-Nazhad B; Rouani M; Stothard P; Ellison M; Wishart DS (2004). "The CyberCell Database (CCDB): a comprehensive, self-updating, relational database to coordinate and facilitate in silico modeling of Escherichia coli". Nucleic Acids Res. 32 (Database issue): D293-5. doi:10.1093/nar/gkh108. PMC 308842. PMID 14681416.
  2. ^ Rudd, K.E. (2000). "EcoGene: a genome sequence database for Escherichia coli K-12". Nucleic Acids Res. 28 (1): 60–64. doi:10.1093/nar/28.1.60. PMC 102481. PMID 10592181.
  3. ^ Karp, PD; Riley M; Saier M; Paulsen IT; Collado-Vides J; Paley SM; Pellegrini-Toole A; Bonavides C; Gama-Castro S. (2002). "The EcoCyc Database". Nucleic Acids Res. 30 (1): 56–8. doi:10.1093/nar/30.1.56. PMC 99147. PMID 11752253.