logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004471_00373

You are here: Home > Sequence: MGYG000004471_00373

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species UBA1394 sp900554975
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; UBA1394; UBA1394 sp900554975
CAZyme ID MGYG000004471_00373
CAZy Family CBM79
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
707 78470.91 4.2516
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004471 1997651 MAG Israel Asia
Gene Location Start: 111981;  End: 114104  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4 3.2.1.-

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 406 675 2.7e-104 0.9891304347826086
CBM79 234 343 2.2e-28 0.9636363636363636
CBM79 94 207 1.5e-23 0.9545454545454546

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 6.79e-67 394 678 4 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.36e-39 378 707 46 394
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam18522 DUF5620 1.98e-26 94 209 2 119
Domain of unknown function (DUF5620). This is a domain of unknown function predicted to be a carbohydrate binding module.
pfam18522 DUF5620 5.38e-23 234 344 1 118
Domain of unknown function (DUF5620). This is a domain of unknown function predicted to be a carbohydrate binding module.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CCZ83818.1 4.53e-239 67 707 72 717
CDE32105.1 1.38e-236 15 705 8 700
CDE11886.1 4.30e-227 69 701 63 704
ERJ92483.1 3.88e-226 9 707 28 734
EWM54675.1 1.37e-183 67 707 56 683

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6MQ4_A 3.54e-112 374 705 10 348
ChainA, cellulase [Acetivibrio cellulolyticus]
4IM4_A 2.99e-108 374 705 5 334
ChainA, Endoglucanase E [Acetivibrio thermocellus],4IM4_B Chain B, Endoglucanase E [Acetivibrio thermocellus],4IM4_C Chain C, Endoglucanase E [Acetivibrio thermocellus],4IM4_D Chain D, Endoglucanase E [Acetivibrio thermocellus],4IM4_E Chain E, Endoglucanase E [Acetivibrio thermocellus],4IM4_F Chain F, Endoglucanase E [Acetivibrio thermocellus]
3NDY_A 6.89e-106 378 706 14 341
Thestructure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDZ_A The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans]
6Q1I_A 2.23e-104 375 707 13 354
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]
6PZ7_A 3.64e-102 374 704 7 335
GH5-4broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P28623 4.93e-104 378 706 45 372
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P54937 2.21e-102 375 707 38 379
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P10477 3.67e-102 374 705 55 384
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2
P28621 2.01e-100 374 705 40 373
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P16216 1.25e-93 370 700 62 392
Endoglucanase 1 OS=Ruminococcus albus OX=1264 GN=Eg I PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000001 0.000076 0.999972 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000004471_00373.