logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000033_01031

You are here: Home > Sequence: MGYG000000033_01031

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species CAG-41 sp900066215
Lineage Bacteria; Firmicutes_A; Clostridia; Monoglobales_A; UBA1381; CAG-41; CAG-41 sp900066215
CAZyme ID MGYG000000033_01031
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
507 MGYG000000033_5|CGC1 56198.74 4.4254
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000033 3010191 Isolate United Kingdom Europe
Gene Location Start: 69227;  End: 70750  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000033_01031.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 235 468 3.1e-88 0.9957805907172996

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.21e-68 233 475 1 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.04e-17 201 507 27 304
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam00395 SLH 2.12e-09 23 64 1 42
S-layer homology domain.
NF033190 inl_like_NEAT_1 1.62e-06 22 90 642 717
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.
pfam00395 SLH 4.77e-05 152 184 10 42
S-layer homology domain.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QGT51240.1 1.58e-231 2 507 5 515
QQR30621.1 1.03e-203 23 507 32 519
ANU55408.1 1.03e-203 23 507 32 519
ASB41357.1 1.03e-203 23 507 32 519
CAA83942.1 3.28e-107 219 503 32 322

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6GJF_A 1.05e-114 219 503 7 296
Ancestralendocellulase Cel5A [synthetic construct],6GJF_B Ancestral endocellulase Cel5A [synthetic construct],6GJF_C Ancestral endocellulase Cel5A [synthetic construct],6GJF_D Ancestral endocellulase Cel5A [synthetic construct],6GJF_E Ancestral endocellulase Cel5A [synthetic construct],6GJF_F Ancestral endocellulase Cel5A [synthetic construct]
1A3H_A 4.48e-110 219 503 3 293
EndoglucanaseCel5a From Bacillus Agaradherans At 1.6a Resolution [Salipaludibacillus agaradhaerens],2A3H_A Cellobiose Complex Of The Endoglucanase Cel5a From Bacillus Agaradherans At 2.0 A Resolution [Salipaludibacillus agaradhaerens],3A3H_A Cellotriose Complex Of The Endoglucanase Cel5a From Bacillus Agaradherans At 1.6 A Resolution [Salipaludibacillus agaradhaerens]
1H11_A 4.96e-110 219 503 6 296
2-DEOXY-2-FLURO-B-D-CELLOTRIOSYL/ENZYMEINTERMEDIATE COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION [Salipaludibacillus agaradhaerens],1H2J_A ENDOGLUCANASE CEL5A IN COMPLEX WITH UNHYDROLYSED AND COVALENTLY LINKED 2,4-DINITROPHENYL-2-DEOXY-2-FLUORO-CELLOBIOSIDE AT 1.15 A RESOLUTION [Salipaludibacillus agaradhaerens],1HF6_A ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHAERENS IN THE ORTHORHOMBIC CRYSTAL FORM IN COMPLEX WITH CELLOTRIOSE [Salipaludibacillus agaradhaerens],1OCQ_A COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION with cellobio-derived isofagomine [Salipaludibacillus agaradhaerens],1W3K_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellobio Derived-tetrahydrooxazine [Salipaludibacillus agaradhaerens],1W3L_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellotri Derived-Tetrahydrooxazine [Salipaludibacillus agaradhaerens],4A3H_A 2',4' Dinitrophenyl-2-Deoxy-2-Fluro-B-D-Cellobioside Complex Of The Endoglucanase Cel5a From Bacillus Agaradhaerens At 1.6 A Resolution [Salipaludibacillus agaradhaerens],5A3H_A 2-Deoxy-2-Fluro-B-D-CellobiosylENZYME INTERMEDIATE COMPLEX Of The Endoglucanase Cel5a From Bacillus Agaradhearans At 1.8 Angstroms Resolution [Salipaludibacillus agaradhaerens],6A3H_A 2-Deoxy-2-Fluro-B-D-CellotriosylENZYME INTERMEDIATE COMPLEX OF THE Endoglucanase Cel5a From Bacillus Agaradhearans At 1.6 Angstrom Resolution [Salipaludibacillus agaradhaerens],7A3H_A Native Endoglucanase Cel5a Catalytic Core Domain At 0.95 Angstroms Resolution [Salipaludibacillus agaradhaerens],8A3H_A Cellobiose-derived imidazole complex of the endoglucanase cel5A from Bacillus agaradhaerens at 0.97 A resolution [Salipaludibacillus agaradhaerens]
1H5V_A 5.13e-110 219 503 6 296
Thiopentasaccharidecomplex of the endoglucanase Cel5A from Bacillus agaradharens at 1.1 A resolution in the tetragonal crystal form [Salipaludibacillus agaradhaerens]
1E5J_A 5.30e-110 219 503 6 296
EndoglucanaseCel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Methyl-4ii-S-Alpha-Cellobiosyl-4ii-Thio Beta-Cellobioside [Salipaludibacillus agaradhaerens],1QHZ_A Native Tetragonal Structure Of The Endoglucanase Cel5a From Bacillus Agaradhaerens [Salipaludibacillus agaradhaerens],1QI0_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Cellobiose [Salipaludibacillus agaradhaerens],1QI2_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With 2',4'-Dinitrophenyl 2-Deoxy-2-Fluoro-B- D-Cellotrioside [Salipaludibacillus agaradhaerens],2V38_A Family 5 endoglucanase Cel5A from Bacillus agaradhaerens in complex with cellobio-derived noeuromycin [Salipaludibacillus agaradhaerens]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O85465 6.61e-108 219 503 32 322
Endoglucanase 5A OS=Salipaludibacillus agaradhaerens OX=76935 GN=cel5A PE=1 SV=1
P06565 1.11e-105 219 503 32 322
Endoglucanase B OS=Evansella cellulosilytica (strain ATCC 21833 / DSM 2522 / FERM P-1141 / JCM 9156 / N-4) OX=649639 GN=celB PE=3 SV=1
P15704 2.65e-105 219 505 41 333
Endoglucanase OS=Clostridium saccharobutylicum OX=169679 GN=eglA PE=3 SV=1
P06566 1.07e-99 238 503 49 320
Endoglucanase A OS=Evansella cellulosilytica (strain ATCC 21833 / DSM 2522 / FERM P-1141 / JCM 9156 / N-4) OX=649639 GN=celA PE=3 SV=1
P07983 2.54e-97 216 506 33 328
Endoglucanase OS=Bacillus subtilis OX=1423 GN=bglC PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000006 0.002479 0.997562 0.000001 0.000001 0.000001

TMHMM  Annotations      download full data without filtering help

start end
5 27