You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000004471_01462

Basic Information

Species UBA1394 sp900554975
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; UBA1394; UBA1394 sp900554975
CAZyme ID MGYG000004471_01462
CAZy Family CBM4
CAZyme Description Endoglucanase C
CAZyme Property
Protein Length: 383
CGC: (no value listed)
Molecular Weight: 41975.49
Isoelectric Point: 4.2848
Genome Property
Genome Assembly ID: MGYG000004471
Genome Size: 1997651
Genome Type: MAG
Country: Israel
Continent: Asia
Gene Location Start: 7959; End: 9110; Strand: +
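
The gene coordinates and the protein length listed above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the Start and End coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the record itself states neither assumption.

```python
# Values copied from the Basic Information section of this record.
start, end = 7959, 9110
protein_length = 383

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_nt = end - start + 1

# Split the gene into codons; a non-zero remainder would suggest a partial gene.
codons, remainder = divmod(gene_length_nt, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print(f"gene length: {gene_length_nt} nt, codons: {codons}, remainder: {remainder}")
print(f"expected protein length: {expected_protein_length} aa")
print(f"matches record: {expected_protein_length == protein_length}")
```

Under these assumptions the 1152-nucleotide gene gives 384 codons, which agrees with the 383-residue protein length in the record.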

Enzyme Prediction

EC 3.2.1.4 3.2.1.73 3.2.1.-

CAZyme Signature Domains

Family Start End Evalue family coverage
CBM4 26 151 3.8e-32 0.9920634920634921

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam02927 CelD_N 4.69e-24 191 268 2 82
Cellulase N-terminal ig-like domain.
cd02850 E_set_Cellulase_N 2.04e-23 192 275 2 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam00759 Glyco_hydro_9 3.46e-23 282 383 1 101
Glycosyl hydrolase family 9.
pfam02018 CBM_4_9 6.73e-08 25 136 1 117
Carbohydrate binding domain. This family includes diverse carbohydrate binding domains.

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
CBL16782.1 2.03e-190 2 383 11 396
AQR94654.1 4.06e-104 26 383 38 399
AGF55906.1 5.71e-104 26 383 38 399
AAT66046.1 4.51e-98 28 383 36 396
ADL51229.1 4.51e-98 28 383 36 396

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
5U2O_A 4.25e-27 205 383 7 205
Crystal structure of Zn-binding triple mutant of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
3X17_A 7.25e-27 192 383 18 231
Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium], 3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
4CJ0_A 1.01e-25 188 383 26 229
Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus], 4CJ1_A Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus]
1CLC_A 1.04e-25 188 383 40 243
Chain A, ENDOGLUCANASE CELD; EC: 3.2.1.4 [Acetivibrio thermocellus]
5U0H_A 1.32e-24 205 383 7 205
Crystal structure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P14090 2.06e-27 77 383 230 563
Endoglucanase C OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cenC PE=1 SV=2
P0C2S1 3.44e-27 24 383 39 445
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1
A3DCH1 6.24e-27 24 383 39 445
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
P23658 1.27e-25 192 383 4 202
Cellodextrinase OS=Butyrivibrio fibrisolvens OX=831 GN=ced1 PE=1 SV=1
P0C2S4 5.54e-25 188 383 26 229
Endoglucanase D (Fragment) OS=Acetivibrio thermocellus OX=1515 GN=celD PE=1 SV=1

SignalP and LipoP Annotations

This protein is predicted to have a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.260928 0.737857 0.000480 0.000269 0.000222 0.000227
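
The class probabilities above can be read programmatically. The following is a minimal sketch, assuming the reported prediction corresponds to the class with the highest score and that the six scores are probabilities summing to roughly 1; the record does not document how the prediction was chosen.

```python
# Class labels and scores copied from the table above.
labels = [
    "Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
    "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII",
]
scores = [0.260928, 0.737857, 0.000480, 0.000269, 0.000222, 0.000227]

# Assumption: the predicted class is simply the highest-scoring one.
best_label, best_score = max(zip(labels, scores), key=lambda pair: pair[1])

# Sanity check: the scores should sum to roughly 1 if they are probabilities.
total = sum(scores)

print(f"top class: {best_label} ({best_score:.6f})")
print(f"sum of scores: {total:.6f}")
```

Here the top class is SP_Sec_SPI, which is consistent with the signal peptide prediction stated above.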

TMHMM Annotations

There are no transmembrane helices in MGYG000004471_01462.