logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000105_03060

You are here: Home > Sequence: MGYG000000105_03060

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Bacteroides clarus
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae; Bacteroides; Bacteroides clarus
CAZyme ID MGYG000000105_03060
CAZy Family GH9
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
842 94964.41 6.2163
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000105 3966085 Isolate Canada North America
Gene Location Start: 531205;  End: 533733  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000105_03060.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH9 115 588 1.2e-77 0.992822966507177
CE4 617 749 2.3e-17 0.9230769230769231

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 6.78e-65 123 588 7 374
Glycosyl hydrolase family 9.
pfam02927 CelD_N 3.47e-28 24 105 2 83
Cellulase N-terminal ig-like domain.
cd02850 E_set_Cellulase_N 5.33e-28 24 110 1 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
cd10917 CE4_NodB_like_6s_7s 2.11e-26 639 821 19 171
Catalytic NodB homology domain of rhizobial NodB-like proteins. This family belongs to the large and functionally diverse carbohydrate esterase 4 (CE4) superfamily, whose members show strong sequence similarity with some variability due to their distinct carbohydrate substrates. It includes many rhizobial NodB chitooligosaccharide N-deacetylase (EC 3.5.1.-)-like proteins, mainly from bacteria and eukaryotes, such as chitin deacetylases (EC 3.5.1.41), bacterial peptidoglycan N-acetylglucosamine deacetylases (EC 3.5.1.-), and acetylxylan esterases (EC 3.1.1.72), which catalyze the N- or O-deacetylation of substrates such as acetylated chitin, peptidoglycan, and acetylated xylan. All members of this family contain a catalytic NodB homology domain with the same overall topology and a deformed (beta/alpha)8 barrel fold with 6- or 7 strands. Their catalytic activity is dependent on the presence of a divalent cation, preferably cobalt or zinc, and they employ a conserved His-His-Asp zinc-binding triad closely associated with the conserved catalytic base (aspartic acid) and acid (histidine) to carry out acid/base catalysis. Several family members show diversity both in metal ion specificities and in the residues that coordinate the metal.
cd10949 CE4_BsPdaB_like 3.88e-22 619 834 2 190
Putative catalytic NodB homology domain of Bacillus subtilis putative polysaccharide deacetylase PdaB, and its bacterial homologs. The Bacillus subtilis genome contains six polysaccharide deacetylase gene homologs: pdaA, pdaB (previously known as ybaN), yheN, yjeA, yxkH and ylxY. This family is represented by the putative polysaccharide deacetylase PdaB encoded by the pdaB gene on sporulation of Bacillus subtilis. Although its biochemical properties remain to be determined, the PdaB (YbaN) protein is essential for maintaining spores after the late stage of sporulation and is highly conserved in spore-forming bacteria. The glycans of the spore cortex may be candidate PdaB substrates. Based on sequence similarity, the family members are classified as carbohydrate esterase 4 (CE4) superfamily members. However, the classical His-His-Asp zinc-binding motif of CE4 esterases is missing in this family.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QUT66756.1 0.0 1 835 1 835
QUT35459.1 0.0 1 835 1 835
QUT98069.1 0.0 1 835 1 835
QBJ19309.1 0.0 1 835 1 835
QMI80662.1 0.0 1 835 1 835

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
3X17_A 4.88e-35 25 586 18 551
Crystalstructure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium],3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
5U2O_A 2.66e-27 33 611 2 553
Crystalstructure of Zn-binding triple mutant of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
5U0H_A 5.86e-24 33 611 2 553
Crystalstructure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
4CJ0_A 2.26e-23 22 589 27 545
ChainA, ENDOGLUCANASE D [Acetivibrio thermocellus],4CJ1_A Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus]
1CLC_A 2.38e-23 22 589 41 559
ChainA, ENDOGLUCANASE CELD; EC: 3.2.1.4 [Acetivibrio thermocellus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P23658 6.34e-23 25 591 4 545
Cellodextrinase OS=Butyrivibrio fibrisolvens OX=831 GN=ced1 PE=1 SV=1
P0C2S4 1.24e-22 22 589 27 545
Endoglucanase D (Fragment) OS=Acetivibrio thermocellus OX=1515 GN=celD PE=1 SV=1
A3DDN1 1.35e-22 22 589 51 569
Endoglucanase D OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celD PE=1 SV=1
A3DCH1 3.77e-22 25 651 214 857
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
P0C2S1 1.14e-21 25 651 214 857
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000624 0.997760 0.001041 0.000187 0.000181 0.000187

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000105_03060.