Statistical analysis of the Bacterial Carbohydrate Structure Data Base (BCSDB): Characteristics and diversity of bacterial carbohydrates in comparison with mammalian glycans
- Stephan Herget†1Email author,
- Philip V Toukach†2,
- René Ranzinger1,
- William E Hull1,
- Yuriy A Knirel2 and
- Claus-Wilhelm von der Lieth1
© Herget et al; licensee BioMed Central Ltd. 2008
Received: 26 February 2008
Accepted: 11 August 2008
Published: 11 August 2008
There are considerable differences between bacterial and mammalian glycans. In contrast to most eukaryotic carbohydrates, bacterial glycans are often composed of repeating units with diverse functions ranging from structural reinforcement to adhesion, colonization and camouflage. Since bacterial glycans are typically displayed at the cell surface, they can interact with the environment and, therefore, have significant biomedical importance.
The sequence characteristics of glycans (monosaccharide composition, modifications, and linkage patterns) for the higher bacterial taxonomic classes have been examined and compared with the data for mammals, with both similarities and unique features becoming evident. Compared to mammalian glycans, the bacterial glycans deposited in the current databases have a more than ten-fold greater diversity at the monosaccharide level, and the disaccharide pattern space is approximately nine times larger. Specific bacterial subclasses exhibit characteristic glycans which can be distinguished on the basis of distinctive structural features or sequence properties.
For the first time a systematic database analysis of the bacterial glycome has been performed. This study summarizes the current knowledge of bacterial glycan architecture and diversity and reveals putative targets for the rational design and development of therapeutic intervention strategies by comparing bacterial and mammalian glycans.
Natural glycans are known to take part in many key biological processes such as cell adhesion, recognition, receptor activation or signal transduction, and they also exhibit major structural functions in both bacteria and plants. In addition, bacterial glycans act as virulence, osmoprotection and desiccation protection factors . The diversity of structures within the mammalian glycome seems limited and is well described in the literature . On the other hand, the bacterial glycome exhibits greater diversity, stemming largely from the distinct cell wall architecture of these organisms.
The cell envelope of either Gram-positive or Gram-negative bacteria is based on peptidoglycan, a polymer in which polysaccharide chains are cross-linked with short peptide chains . Gram-negative bacteria possess an additional outer membrane that is composed of a lipopolysaccharide-protein complex. Gram-positive bacteria have no outer membrane, but the peptidoglycan wall is thicker (> 30 nm vs. 10 nm in Gram-negative bacteria) and contains polysaccharides with teichoic acids attached (a carbohydrate polymer containing alditols and phosphodiester linkages).
Both Gram-positive and Gram-negative bacteria produce extracellular polysaccharides, present either as a discrete capsule covalently attached to the cell envelope or as a slime weakly bound to the cell surface. These various glycoconjugates and polysaccharides on the surface of the cell often contain the antigenic determinants that initiate an immunogenic response in a host. In addition, these surface carbohydrates provide recognition elements for pathogens such as bacteriophages.
The lipopolysaccharide of Gram-negative bacteria contains lipid A, a phosphorylated GlcN-GlcN disaccharide moiety, N- and O-acylated with fatty acids which anchor the molecule in the outer leaflet of the outer membrane. Lipid A is covalently linked to a heteropolysaccharide which interacts with the environment and consists of an inner core (commonly containing Kdo (3-deoxy-D-manno-oct-2-ulosonic acid) and manno-heptoses) and an outer O-specific chain, a complex polysaccharide which determines the serological or antigenic properties of the lipopolysaccharide [4, 5]. These so-called O-antigens are mainly heteropolymers containing a large variety of residues (mainly monosaccharides, but also alditols, amino acids, etc.). These components, together with the capsular polysaccharides (K-antigens [6, 7]), can elicit an immune response in higher organisms.
The structures of the various carbohydrate antigens are unique, often being characterized by repeating units in the polymer structure. Indeed, all types of monosaccharides, including L-rhamnose (6-deoxy-L-mannose) and L-fucose (6-deoxy-L-galactose), are found in bacteria, together with rarer, modified sugars, such as 3,6-dideoxyhexoses and Kdo. Knowledge of the structures of surface carbohydrates and their variations is required for understanding how cellular recognition, adhesion, and the immune response operate at the molecular level. This understanding provides a basis for the design of synthetic carbohydrate-based vaccines, diagnostic agents, and immunostimulators. Certain fragments of bacterial polysaccharides, in the form of appropriate glycoconjugates, are known to act as vaccines .
Carbohydrates represent the most diverse class of biopolymers, and there is growing interest in the study and analysis of this diversity and its biomedical significance. For example, vertebrate glycan variability is assumed to act as a barrier that prevents the spread of an infection within a given population . Although it is widely known that the diversity of carbohydrates is much greater in bacteria than in mammals, no systematic attempt has been undertaken to examine the diversity of bacterial carbohydrates in detail. The structures deposited in glycoscience databases have been only sporadically evaluated. However, statistical structure-oriented investigations using carbohydrate databases were proven to be useful for immunochemical research and serotyping . Systematic analysis of all publicly available data will not only expand our general knowledge and understanding of the complexity of glycans in biological systems but will also offer a framework for the design of more comprehensive high-throughput screening methods or devices.
Comprehensive data concerning carbohydrate diversity within the entire bacterial world will be useful for the classification of bacteria according to their glycan structures and facilitate the search for the most widespread carbohydrate markers of various bacterial taxonomic groups. These markers are critical for medical applications, and a simple ranking by abundance is a good starting point for the design of synthetic biologically-active carbohydrates and for corresponding immunological studies. In particular, the statistics of monomer composition reveal potential taxonomic markers and also simplify the creation of carbohydrate microarrays by providing candidates for spotting .
A one-enzyme-class/one-saccharide-linkage paradigm applies for almost all individual steps of glycan biosynthesis. Accordingly, complete information on the diversity of disaccharide fragments allows one to describe the diversity of the glycosyltransferases expressed in individual taxonomic groups, and these enzymes may become potential targets for antimicrobial treatment.
For this study we performed statistical analyses of the Bacterial Carbohydrate Structure Data Bank (BCSDB), the largest database for bacterial glycans containing nearly all known bacterial glycan structures published up to 2007 . For comparison the mammalian glycans documented in the GLYCOSCIENCES.de database  (derived mainly from CarbBank ) have also been examined. The properties analyzed include glycan size, branching, and charge density, as well as the frequency of occurrence of specific monosaccharide residues, residue pairs and their linkage configurations. Precise definitions for the terminology used in this study can be found in the Methods section.
Results and Discussion
Distribution of carbohydrate structures among taxonomic groups
We first examined the number of sequences found in the BCSDB and GLYCOSCIENCES.de for various taxonomic ranks (class, order, family). Where possible, the taxonomic relationships were traced using the NCBI taxonomy database . The GLYCOSCIENCES.de database currently contains a total of 23120 glycan and glycoconjugate records, of which 13704 records for diverse animal, plant, bacteria and fungi classes have some information concerning taxonomy. In the BCSDB there are a total of 8504 records for bacteria only, and 8479 of these contain information concerning taxonomy. These numbers may include multiple records for a given glycan when the same glycan is reported for more than one species. Note that not all taxonomic classes are represented in the databases and that for bacterial glycans there is considerable overlap between the two databases.
The taxonomic class Mammalia is found to have 4739 assigned sequence/taxon pairs, of which 2118 are of human origin (family Hominidae). All other animal or plant classes in the database have less than 350 pairs. The category "unresolved" refers to the 1482 records for which the source is defined but the specific taxonomic class could not be traced automatically using the NCBI. Only about half of the bacterial phyla are represented in the BCSDB with a total of 6098 sequence/taxon pairs, and nine classes have less than 10 records. Note that the number of carbohydrates or glycoconjugates registered for a given taxonomic class does not necessarily reflect its species diversity, but more likely the intensity with which the class has been studied. Thus, the apparent diversity of carbohydrates in the various taxonomic classes reflects to a large part the information bias in the published literature, and this situation must be kept in mind when making conclusions based on the distributions of properties discussed below.
In the combined databases there are a total of 12659 records in the category "no taxonomy" which means that either no information concerning the taxonomy of the source is available or that the carbohydrate is not of purely natural origin. These records were not included in Fig. 1 and were not used in the following analyses.
Choice of taxonomic datasets for statistical comparisons
Definition of taxonomy Set 1.
Number of sequences*
Total number of different carbohydrate (glycan) sequences for Mammalia, as registered in GLYCOSCIENCES.de.
Total number of different repeating units in the polymeric carbohydrate (glycan) sequences for all bacteria registered in the BCSDB.
Total number of different oligosaccharide sequences (nonrepeating units) for all bacteria registered in the BCSDB.
Definition of taxonomy Set 2.
Number of sequences**
other γ-Proteobacteria [orders]
In order to obtain meaningful statistics, only those taxonomic groups are compared for which at least 200 carbohydrate sequences are available. For this reason the classes Chlamydiae, Clostridia, and Bacteroidetes, for example, have not been included in Set 2. Note that in Tables 1 and 2 the total number of unique carbohydrate sequences in each group is listed, and these sets were utilized in all subsequent analyses.
Carbohydrate size, branching and charge density
In Fig. 2B the distributions of the size parameter for the bacterial groups defined in taxonomy Set 2 are found to differ considerably. Narrow distributions with essentially a single prominant peak are found for Actinobacteria (mean: 4.51, median: 3), Bacilli (mean: 5.18, median: 5) and the order Enterobacteria (mean: 5.18, median: 6) with peaks at ca. 2.5, 5.5 and 4.5 residues, respectively. The various other classes of Proteobacteria have broader distributions with more or less pronounced multiple peaks, e.g. at 3, 8 and 11 residues for the δ,ε-Proteobacteria group.
Diversity of monosaccharides, basetypes and basic entities for various taxonomic groups.
To minimize the influence of errors and artifacts on the statistics of Table 3, a threshold for the occurrence of monosaccharides, basetypes and basic entities was set at 10 for mammals and 2 for bacteria. This means that a given residue type was included in the statistics only when its number of occurrences exceeded the defined threshold. A relatively low threshold was chosen for bacteria because, in contrast to mammals, bacteria are known to produce a great variety of unique monosaccharide residues with low occurrence. When the threshold for bacteria was reduced from 2 to 0, the diversity of detected residues increased by about 25%. A complete list of monosaccharide residues and basetypes found for each taxonomic group is available in GlycoCT nomenclature in the additional material [see Additional files 2 (tables a-i) and 3].
For mammals the analysis returned 18 occurrences of D-Fuc as a basic entity. However, this residue was excluded from Table 3 because all carbohydrates in GLYCOSCIENCES.de which are specified to contain D-Fucose originate from old publications in which the absolute configuration of Fuc was not specified. The occurrence of D-Fuc in mammalian carbohydrate records can be regarded as a data translation error since there is no evidence for a mammalian enzyme which synthesizes D-Fuc.
Attention should be paid to the distribution of monosaccharides at the terminal positions of oligomers and side chains of polymers. In higher organisms such residues are optimally positioned to mediate recognition by endogenous carbohydrate-binding proteins . According to our findings bacterial carbohydrates often have glucose residues at the nonreducing ends, in contrast to mammalian glycans (data not shown). This may be the result of the evolutionary adaptation of bacteria since exposed terminal glucose residues are important for the adherence of bacteria and entry into host epithelial cells, as demonstrated for Salmonella and Pseudomonas .
O-methylation is the most frequent modification for Actinobacteria (ca. 23% of all residues, mainly at O6 of glucose) but occurs with a frequency of < 5% in other bacteria classes and is essentially absent in mammals.
O-acetyl, amino, or phosphate substituents are also much more prevalent for bacteria (4–7%) than for mammals (< 1%), and the O-acetylation pattern is often different for different cultures of a single bacteria strain. O-acetyl groups mask the protective epitopes for bacteria through steric hindrance or altered conformations, as shown for Meningococci .
Amino sugars with free aminogroups are present in about 7% of bacterial carbohydrate residues compared to ca. 1% for mammals and feature a positively charged -NH3+ substituent at neutral pH. The occurrence of these residues in the bacterial cell wall affects hydrophobicity and makes bacteria resistant to the lysozyme of the host, as has been demonstrated for glucosamine in Gram-positive bacteria . Several secondary modifications appear to be unique for bacterial carbohydrates (pyruvate, lactate, ethanolamine, O-methyl and formyl) while sulfation or N-glycolyl substitution occurs primarily in mammals. Finally, about 7% of bacterial residues have modifications listed under the category "other" in Fig. 9A, with Actinobacteria and Bacilli showing the highest frequencies (Fig. 9B).
Disaccharide fragment patterns in bacteria and mammals
In order to describe carbohydrate sequences at a higher level of complexity, we need to consider not only the identities of the linked monosaccharides but also the linkage configuration. All free hydroxyl groups on each acceptor monosaccharide are potential sites of glycosyltransferase reactions. Therefore, we define the child (donor) to parent (acceptor) connection in terms of the directed glycosylation linkage pattern, analogous to reaction patterns described elsewhere . Thus, the descriptor "a1–4", for example, indicates that an alpha anomeric O1 of the donor is linked to C4 of the acceptor. The statistics of linkage patterns provide a direct description of the expression and activity of glycosyltransferases and the carbohydrate structure repetoire in an organism or taxonomic group. Such statistics have been employed successfully for a variety of bioinformatic tasks with the glycome, e.g. matrix generation  and pattern detection . This kind of information is also valuable for recognizing both unique and common linkages and can serve as a basis for a deeper understanding of the immunogenicity of bacterial carbohydrates and for designing targeted vaccines.
A complete list of mammalian disaccharide fragments found in GLYCOSCIENCES.de is available [see Additional file 5], where the data are encoded to illustrate the differences between our database findings and the data from Ohtsubo & Marth . Three disaccharides reported in , namely D-Glc(α1–2) D-Gal, D-GlcA(β1–4) D-Gal and D-GlcNAc(α1–6) D-GlcNAc, are absent from our databases. On the other hand, many existing mammalian disaccharides have not been mentioned by Ohtsubo & Marth, among them reasonably abundant ones such as D-GlcNAc(β1–3) D-GalNAc, D-GlcNAc(β1–4) D-Man, D-GlcNAc(β1–6) D-Man, and D-GlcNS(α1–4) L-IdoA. The last disaccharide listed is present in more than 500 records of GLYCOSCIENCES.DE for human carbohydrates and reported in the literature in association with Sandhoff's Disease .
Diversity of disaccharide linkages found in various taxonomic groups.
Identity and abundances of mammalian and bacterial disaccharides.
GlcA, IdoA or ΔGlcA
Neu5Ac or Neu5Gc
a1–3 (0.01) [0.15]
a1–2 (2.68) [0.18]
a1–3 (2.00) [0.32]
a1–4 (0.54) [0.09]
a1–6 (2.84) [0.01]
a1–3 (0.92) [0.60]
a1–3 (0.02) [0.12]
a1–3 (0.00) [0.32]
a1–3 (0.00) [0.26]
b1–4 (0.11) [0.00]
a1–4 (0.22) [0.54]
b1–3 (2.41) [0.54]
a1–6 (0.00) [0.49]
b1–3 (1.86) [0.52]
a1–6 (0.02) [0.25]
b1–4 (0.11) [0.11]
b1–3 (0.04) [0.18]
b1–4 (18.06) [1.68]
b1–3 (0.70) [0.80]
b1–4 (2.51) [2.32]
b1–4 (0.11) [0.34]
b1–6 (0.00) [0.18]
b1–6 (0.05) [0.42]
a1–3 (0.80) [0.05]
a1–3 (0.07) [0.32]
b1–4 (0.68) [0.02]
b1–4 (0.12) [0.08]
b1–3 (0.30) [0.44]
b1–3 (0.03) [0.10]
b1–4 (0.61) [0.36]
b1–3 (0.02) [0.40]
a1–2 (0.06) [0.95]
a1–4 (0.03) [0.27]
a1–3 (0.19) [0.19]
b1–6 (0.01) [0.34]
a1–3 (0.08) [0.83]
b1–4 (0.01) [0.12]
b1–4 (0.00) [0.14]
a1–2 (*) [0.31]
a1–4 (0.15) [0.39]
a1–6 (0.10) [0.57]
b1–3 (0.05) [0.62]
b1–4 (0.00) [1.69]
b1–6 (0.01) [0.90]
b1–3 (5.38) [1.64]
b1–3 (0.68) [0.04]
b1–3 (0.01) [0.34]
a1–4 (0.10) [0.05]
b1–2 (9.46) [0.11]
b1–4 (0.13) [0.02]
b1–6 (1.45) [0.00]
b1–4 (5.61) [0.69]
b1–4 (0.13) [0.13]
b1–4 (2.83) [0.05]
b1–6 (1.17) [0.04]
b1–6 (0.01) [0.22]
b1–6 (1.64) [0.02]
GlcA, IdoA or ΔGlcA
b1–3 (0.13) [0.23]
a1–3 (0.14) [0.01]
a1–4 (0.16) [0.01]
b1–4 (0.01) [0.19]
b1–4 (*) [0.14]
b1–3 (0.13) [0.14]
a1–3 (0.00) [0.27]
b1–4 (6.22) [0.02]
a1–2 (1.60) [0.92]
a1–3 (6.39) [0.41]
a1–4 (0.01) [0.16]
a1–6 (6.35) [0.53]
Neu5Ac or Neu5Gc
a2–3 (4.85) [1.01]
a2–8 (0.29) [0.12]
a2–6 (2.64) [0.02]
b1–4 (0.00) [0.10]
A potential application of the information and methods outlined here is the design and validation of carbohydrate vaccines against bacterial pathogens. Carbohydrate-based vaccines against Haemophilus influenzae Type b, Neisseria meningitidis and Streptococcus pneumoniae have already been licensed, and many similar products are in various stages of development. For example, the disaccharides D-Glc(α1–2)D-Gal and D-Glc(β1–4)D-Gal are not present in mammalian organisms accordingly to our analysis and are both constituents of the capsular polysaccharides of Salmonella pneumonia, which were shown to be target candidates for vaccine development .
In this study we combined the BCSDB, the largest available bacterial carbohydrate database, with the GLYCOSCIENCES.de database to obtain a set of 13775 nonredundant glycan/taxon pairs (carbohydrate sequences with a defined taxonomy), of which 6098 were assigned to Bacteria and 4739 to Mammalia. The representative statistical analyses presented here reveal the basic principles of carbohydrate architecture in bacteria vs. mammals. The major monosaccharides which characterize different branches of the tree of life were extracted from the database and are in accordance with the published literature. Several monosaccharides unique to certain subclasses of bacteria were identified and could prove useful as molecular markers for these classes. Similarly, a variety of structural modifications of monosaccharides have been detected, and many of these are characteristic in that they may be either highly abundant or totally absent in individual taxonomic classes.
A linkage analysis was performed for all disaccharide fragments of bacterial and mammalian glycans and revealed that there are a number of abundant linkages as well as nonexistent linkages which may be useful for characterizing the various taxonomic groups. Through a comparison of the disaccharide linkage ensembles or spaces for bacteria and mammals, one obtains an overview of those glycosyltransferase activities which are common to both classes and those which appear to be unique for mammals or bacteria or even for specific bacteria subclasses. Thus, differential cross-species expression analysis is possible and may ultimately provide a deeper understanding of immunogenic patterns present in pathogenic bacteria.
The analyzed sequences were obtained from the meta-database GlycomeDB , which contains all sequences from the Bacterial Carbohydrate Structure DataBase (BCSDB)  and the GLYCOSCIENCES.de portal  in a harmonized format. With the help of the NCBI taxonomy database , subsets of these databases were taken and further analyzed using routines implemented in JAVA. The results of the analytic routines were stored in a PostgreSQL 8.2 database. Additional analytical procedures were implemented in PHP, which finally generated Microsoft Excel tables used for further analysis and graphical visualization.
Definition of terms
According to IUPAC nomenclature a monosaccharide is a poly(hydroxy) aldehyde or ketone with three or more carbon atoms (triose, tetrose, etc.); the term denotes a single structural component without glycosidic linkages and includes a variety of derivatives such as amino, deoxy or carboxy forms. Oligosaccharides are compounds in which monosaccharides and their derivatives are coupled in a precisely defined manner via glycosidic linkages. The term polysaccharide generally refers to oligosaccharides with a large or undefined number of monosaccharide residues. The term carbohydrate includes all mono-, oligo- or polysaccharides and molecules derived from monosaccharides modified by reduction, oxidation, or substitution. The term glycan is frequently used to refer to any saccharide component of a glycoconjugate, such as a glycoprotein or glycolipid, even when the chain length is short. A glycoconjugate is formed by a covalent linkage between a glycan and a nonglycan entity. Polysaccharide may be used to refer to polymers with glycosidic and/or phosphodiester linkages (such as teichoic acids). The carbohydrate databases used in this study may contain any of the compounds described above but do not contain DNAs or RNAs.
The term sequence will be used here to refer to a specific carbohydrate molecule or a glycan obtained from a larger molecule (glycoconjugate). Sequences may be linear or branched. Each database record may refer to either an individual carbohydrate sequence or to a particular glycoconjugate containing a given glycan. Thus, each unique glycan may have multiple database records, one for each different glycoconjugate.
A residue is a specific building block, e.g. a monosaccharide, within a carbohydrate sequence, analogous to the amino acid residues in proteins.
The term unit will be used to specify the smallest sequence fragment which describes a given carbohydrate molecule or glycan. For a nonrepeating oligomer sequence the unit will be the entire sequence; for polymers built up from repeating subsequences, the unit will be one such subsequence.
A branching point is a particular residue to which two or more carbohydrate residues are attached via nonreducing hydroxy functions or other functional groups.
A monosaccharide is a unique carbohydrate residue according to the IUPAC definition given above and is specified by the number of carbons, the ring type, the anomeric (α,β) and absolute (D, L) configurations, and all primary and secondary modifications. For example, α-D-Glcp N, β-D-Glcp N and α-D-Glcp NAc are three different monosaccharides derived from glucose.
Primary modifications of monosaccharides are those which alter the stereochemical designation or electronic hybridisation state of at least one carbon atom (e.g. deoxy, carboxy, keto, double-bond modifications). Secondary modifications are all modifications which are not primary (substituents such as amino, O-methyl, O-acetyl, sulfate, phosphate, etc.).
We define the basetype of a monosaccharide to include only those characteristics which specify the order and stereochemical designations of its carbon atom skeleton, i.e., the anomeric and absolute configurations, ring type, and primary modifications. Basetype is not altered by secondary modifications. Thus, α-D-Glcp, α-D-Glcp N and α-D-Glcp 6S all have the same basetype (α-D-Glcp) while β-D-Glcp and α-D-Glcp A are different basetypes.
With respect to common historical usage, the term basic entity will be used to specify the following characteristics of a monosaccharide: the stereochemical configuration (D, L), all primary modifications, and only those secondary modifications involving amine groups, including substituted amines, at any position other than the anomeric carbon. The basic entity definition does not include anomeric configuration, ring type, or any secondary modifications. Thus, α-D-Galf N and β-D-Galp NAc have the same basic entity (D-GalN) while β-D-Galp, β-D-Galp A and β-D-Fucp are all different basic entities.
PT thanks the DKFZ for stipends supporting his stay in Heidelberg. The development of GLYCOSCIENCES.de at the DKFZ was supported by a Research Grant from the Deutsche Forschungsgemeinschaft (DFG BIB 46 HDdkz 01-01) within the digital library program (SH). The development of the BCSDB (PT) was supported by the International Science and Technology Center (Project 1197p), the Russian Foundation for Basic Research (Project 05-07-90099) and the Russian President Grant Committee (Project MK-2005.1700.4). The EUROCarbDB project, supported by the EU (6th Research Framework Program, RIDS contract number 011952), has also contributed resources to this analysis (RR).
- Varki A, Cummings R, Esko J, Freeze H, Hart G, Marth J: Essentials of Glycobiology. New York: Cold Spring Harbor Laboratory Press; 1999.Google Scholar
- Ohtsubo K, Marth J: Glycosylation in cellular mechanisms of health and disease. Cell 2006, 126: 855–867. 10.1016/j.cell.2006.08.019View ArticleGoogle Scholar
- Vollmer W, Blanot D, de Pedro MA: Peptidoglycan structure and architecture. FEMS Microbiology Reviews 2008.Google Scholar
- Raetz CR, Reynolds CM, Trent MS, Bishop RE: Lipid A modification systems in gram-negative bacteria. Annual Review of Biochemistry 2007, 76: 295–329. 10.1146/annurev.biochem.76.010307.145803View ArticleGoogle Scholar
- Raetz CR, Whitfield C: Lipopolysaccharide endotoxins. Annual Review of Biochemistry 2002, 71: 635–700. 10.1146/annurev.biochem.71.110601.135414View ArticleGoogle Scholar
- Boulnois GJ, Jann K: Bacterial polysaccharide capsule synthesis, export and evolution of structural diversity. Molecular Microbiology 1989, 3: 1819–1823. 10.1111/j.1365-2958.1989.tb00168.xView ArticleGoogle Scholar
- Whitfield C: Biosynthesis and assembly of capsular polysaccharides in Escherichia coli . Annual Review of Biochemistry 2006, 75: 39–68. 10.1146/annurev.biochem.75.103004.142545View ArticleGoogle Scholar
- Vliegenthart JF: Carbohydrate based vaccines. FEBS Letters 2006, 580: 2945–2950. 10.1016/j.febslet.2006.03.053View ArticleGoogle Scholar
- Gagneux P, Varki A: Evolutionary considerations in relating oligosaccharide diversity to biological function. Glycobiology 1999, 9: 747–755. 10.1093/glycob/9.8.747View ArticleGoogle Scholar
- Stenutz R, Weintraub A, Widmalm G: The structures of Escherichia coli O-polysaccharide antigens. FEMS Microbiology Reviews 2006, 30: 382–403. 10.1111/j.1574-6976.2006.00016.xView ArticleGoogle Scholar
- Seeberger PH, Werz DB: Synthesis and medical applications of oligosaccharides. Nature 2007, 446: 1046–1051. 10.1038/nature05819View ArticleGoogle Scholar
- Toukach F, Knirel Y: New database of bacterial carbohydrate structures. XVIII International Symposium on Glycoconjugates; Florence, Italy 2005, 216–217. [http://www.glyco.ac.ru/bcsdb/]Google Scholar
- Luetteke T, Bohne-Lang A, Loss A, Goetz T, Frank M, von der Lieth CW: GLYCOSCIENCES.de: an Internet portal to support glycomics and glycobiology research. Glycobiology 2006, 16: 71R-81R. 10.1093/glycob/cwj049View ArticleGoogle Scholar
- Doubet S, Bock K, Smith D, Darvill A, Albersheim P: The Complex Carbohydrate Structure Database. Trends in Biochemical Sciences 1989, 14: 475–477. 10.1016/0968-0004(89)90175-8View ArticleGoogle Scholar
- Wheeler DL, Chappey C, Lash AE, Leipe DD, Madden TL, Schuler GD, Tatusova TA, Rapp BA: Database resources of the National Center for Biotechnology Information. Nucleic Acids Research 2000, 28: 10–14. 10.1093/nar/28.1.10View ArticleGoogle Scholar
- Herget S, Ranzinger R, Maass K, Lieth C-Wvd: GlycoCT – a unifying sequence format for carbohydrates. Carbohydrate Research 2008, 343: 2162–2171. 10.1016/j.carres.2008.03.011View ArticleGoogle Scholar
- Hedlund M, Tangvoranuntakul P, Takematsu H, Long JM, Housley GD, Kozutsumi Y, Suzuki A, Wynshaw-Boris A, Ryan AF, Gallo RL, Varki N, Varki A: N-glycolylneuraminic acid deficiency in mice: implications for human biology and evolution. Molecular and Cellular biology 2007, 27: 4340–4346. 10.1128/MCB.00379-07View ArticleGoogle Scholar
- Hoare A, Bittner M, Carter J, Alvarez S, Zaldivar M, Bravo D, Valvano MA, Contreras I: The outer core lipopolysaccharide of Salmonella enterica serovar Typhi is required for bacterial entry into epithelial cells. Infection and Immunity 2006, 74: 1555–1564. 10.1128/IAI.74.3.1555-1564.2006View ArticleGoogle Scholar
- He XM, Liu HW: Formation of unusual sugars: mechanistic studies and biosynthetic applications. Annual Review of Biochemistry 2002, 71: 701–754. 10.1146/annurev.biochem.71.110601.135339View ArticleGoogle Scholar
- Fusco PC, Farley EK, Huang CH, Moore S, Michon F: Protective meningococcal capsular polysaccharide epitopes and the role of O acetylation. Clinical Vaccine Immunol 2007, 14: 577–584. 10.1128/CVI.00009-07View ArticleGoogle Scholar
- Kawagishi S, Araki Y, Ito E: Bacillus cereus autolytic endoglucosaminidase active on cell wall peptidoglycan with N-unsubstituted glucosamine residues. Journal of Bacteriology 1980, 141: 137–143.Google Scholar
- Kawano S, Hashimoto K, Miyama T, Goto S, Kanehisa M: Prediction of glycan structures from gene expression data based on glycosyltransferase reactions. Bioinformatics (Oxford, England) 2005, 21: 3976–3982. 10.1093/bioinformatics/bti666View ArticleGoogle Scholar
- Aoki K, Mamitsuka H, Akutsu T, Kanehisa M: A score matrix to reveal the hidden links in glycans. Bioinformatics 2005, 21(8):1457–1463. 10.1093/bioinformatics/bti193View ArticleGoogle Scholar
- Aoki-Kinoshita K, Ueda N, Mamitsuka H, Kanehisa M: ProfilePSTMM: capturing tree-structure motifs in carbohydrate sugar chains. Bioinformatics (Oxford, England) 2006, 14: e25-e34. 10.1093/bioinformatics/btl244View ArticleGoogle Scholar
- Strecker G, Herlant-Peers MC, Fournet B, Montreul J: Structure of seven oligosaccharides excreted in the urine of a patient with Sandhoff's disease (GM2 gangliosidosis-variant O). European Journal of Biochemistry/FEBS 1977, 81: 165–171. 10.1111/j.1432-1033.1977.tb11937.xView ArticleGoogle Scholar
- Jones C: Vaccines based on the cell surface carbohydrates of pathogenic bacteria. Anais da Academia Brasileira de Ciencias 2005, 77: 293–324.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.