logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004707_01232

You are here: Home > Sequence: MGYG000004707_01232

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Monoglobus sp900554105
Lineage Bacteria; Firmicutes_A; Clostridia; Monoglobales; Monoglobaceae; Monoglobus; Monoglobus sp900554105
CAZyme ID MGYG000004707_01232
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
529 58682.26 4.2944
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004707 1967109 MAG Denmark Europe
Gene Location Start: 10672;  End: 12261  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 251 488 4.4e-88 0.9957805907172996

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 8.16e-67 249 494 1 271
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.77e-12 227 524 33 301
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam00395 SLH 1.84e-11 34 75 1 42
S-layer homology domain.
NF033190 inl_like_NEAT_1 1.88e-05 29 130 638 746
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.
NF033190 inl_like_NEAT_1 2.66e-05 28 204 576 751
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QQR30621.1 8.20e-179 11 527 4 519
ANU55408.1 8.20e-179 11 527 4 519
ASB41357.1 8.20e-179 11 527 4 519
QGT51240.1 4.48e-165 34 527 27 515
AUG58357.1 1.44e-120 235 523 38 328

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6GJF_A 2.90e-117 235 523 7 296
Ancestralendocellulase Cel5A [synthetic construct],6GJF_B Ancestral endocellulase Cel5A [synthetic construct],6GJF_C Ancestral endocellulase Cel5A [synthetic construct],6GJF_D Ancestral endocellulase Cel5A [synthetic construct],6GJF_E Ancestral endocellulase Cel5A [synthetic construct],6GJF_F Ancestral endocellulase Cel5A [synthetic construct]
1A3H_A 2.01e-108 235 526 3 296
EndoglucanaseCel5a From Bacillus Agaradherans At 1.6a Resolution [Salipaludibacillus agaradhaerens],2A3H_A Cellobiose Complex Of The Endoglucanase Cel5a From Bacillus Agaradherans At 2.0 A Resolution [Salipaludibacillus agaradhaerens],3A3H_A Cellotriose Complex Of The Endoglucanase Cel5a From Bacillus Agaradherans At 1.6 A Resolution [Salipaludibacillus agaradhaerens]
1H11_A 2.22e-108 235 526 6 299
2-DEOXY-2-FLURO-B-D-CELLOTRIOSYL/ENZYMEINTERMEDIATE COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION [Salipaludibacillus agaradhaerens],1H2J_A ENDOGLUCANASE CEL5A IN COMPLEX WITH UNHYDROLYSED AND COVALENTLY LINKED 2,4-DINITROPHENYL-2-DEOXY-2-FLUORO-CELLOBIOSIDE AT 1.15 A RESOLUTION [Salipaludibacillus agaradhaerens],1HF6_A ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHAERENS IN THE ORTHORHOMBIC CRYSTAL FORM IN COMPLEX WITH CELLOTRIOSE [Salipaludibacillus agaradhaerens],1OCQ_A COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION with cellobio-derived isofagomine [Salipaludibacillus agaradhaerens],1W3K_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellobio Derived-tetrahydrooxazine [Salipaludibacillus agaradhaerens],1W3L_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellotri Derived-Tetrahydrooxazine [Salipaludibacillus agaradhaerens],4A3H_A 2',4' Dinitrophenyl-2-Deoxy-2-Fluro-B-D-Cellobioside Complex Of The Endoglucanase Cel5a From Bacillus Agaradhaerens At 1.6 A Resolution [Salipaludibacillus agaradhaerens],5A3H_A 2-Deoxy-2-Fluro-B-D-CellobiosylENZYME INTERMEDIATE COMPLEX Of The Endoglucanase Cel5a From Bacillus Agaradhearans At 1.8 Angstroms Resolution [Salipaludibacillus agaradhaerens],6A3H_A 2-Deoxy-2-Fluro-B-D-CellotriosylENZYME INTERMEDIATE COMPLEX OF THE Endoglucanase Cel5a From Bacillus Agaradhearans At 1.6 Angstrom Resolution [Salipaludibacillus agaradhaerens],7A3H_A Native Endoglucanase Cel5a Catalytic Core Domain At 0.95 Angstroms Resolution [Salipaludibacillus agaradhaerens],8A3H_A Cellobiose-derived imidazole complex of the endoglucanase cel5A from Bacillus agaradhaerens at 0.97 A resolution [Salipaludibacillus agaradhaerens]
1H5V_A 2.30e-108 235 526 6 299
Thiopentasaccharidecomplex of the endoglucanase Cel5A from Bacillus agaradharens at 1.1 A resolution in the tetragonal crystal form [Salipaludibacillus agaradhaerens]
1E5J_A 2.38e-108 235 526 6 299
EndoglucanaseCel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Methyl-4ii-S-Alpha-Cellobiosyl-4ii-Thio Beta-Cellobioside [Salipaludibacillus agaradhaerens],1QHZ_A Native Tetragonal Structure Of The Endoglucanase Cel5a From Bacillus Agaradhaerens [Salipaludibacillus agaradhaerens],1QI0_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Cellobiose [Salipaludibacillus agaradhaerens],1QI2_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With 2',4'-Dinitrophenyl 2-Deoxy-2-Fluoro-B- D-Cellotrioside [Salipaludibacillus agaradhaerens],2V38_A Family 5 endoglucanase Cel5A from Bacillus agaradhaerens in complex with cellobio-derived noeuromycin [Salipaludibacillus agaradhaerens]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P15704 1.19e-112 236 523 42 331
Endoglucanase OS=Clostridium saccharobutylicum OX=169679 GN=eglA PE=3 SV=1
O85465 2.94e-106 235 526 32 325
Endoglucanase 5A OS=Salipaludibacillus agaradhaerens OX=76935 GN=cel5A PE=1 SV=1
P07983 1.10e-105 235 523 36 325
Endoglucanase OS=Bacillus subtilis OX=1423 GN=bglC PE=3 SV=2
P06565 2.46e-104 235 526 32 325
Endoglucanase B OS=Evansella cellulosilytica (strain ATCC 21833 / DSM 2522 / FERM P-1141 / JCM 9156 / N-4) OX=649639 GN=celB PE=3 SV=1
P10475 9.59e-104 235 523 36 325
Endoglucanase OS=Bacillus subtilis (strain 168) OX=224308 GN=eglS PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000514 0.998521 0.000382 0.000179 0.000183 0.000181

TMHMM  Annotations      download full data without filtering help

start end
12 34