logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004891_01906

You are here: Home > Sequence: MGYG000004891_01906

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; KM106-2;
CAZyme ID MGYG000004891_01906
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1465 161367.63 9.2158
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004891 3652874 MAG China Asia
Gene Location Start: 35381;  End: 39778  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000004891_01906.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 323 600 2.1e-73 0.9891304347826086

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 2.67e-42 304 598 5 267
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 7.63e-17 302 558 50 322
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam09479 Flg_new 1.66e-14 1301 1368 1 65
Listeria-Bacteroides repeat domain (List_Bact_rpt). This model describes a conserved core region of about 43 residues, which occurs in at least two families of tandem repeats. These include 78-residue repeats which occur from 2 to 15 times in some proteins of Bacteroides forsythus ATCC 43037, and 70-residue repeats found in families of internalins of Listeria species. Single copies are found in proteins of Fibrobacter succinogenes, Geobacter sulfurreducens, and a few other bacteria.
pfam09479 Flg_new 3.08e-12 1232 1295 3 65
Listeria-Bacteroides repeat domain (List_Bact_rpt). This model describes a conserved core region of about 43 residues, which occurs in at least two families of tandem repeats. These include 78-residue repeats which occur from 2 to 15 times in some proteins of Bacteroides forsythus ATCC 43037, and 70-residue repeats found in families of internalins of Listeria species. Single copies are found in proteins of Fibrobacter succinogenes, Geobacter sulfurreducens, and a few other bacteria.
NF033189 internalin_A 4.02e-12 1102 1391 498 778
class 1 internalin InlA. Internalins, as found in the intracellular human pathogen Listeria monocytogenes, are paralogous surface or secreted proteins with an N-terminal signal peptide, leucine-rich repeats, and usually a C-terminal LPXTG processing and cell surface anchoring site. See PMID:17764999 for a general discussion of internalins. Members of this family are internalin A (InlA), a class 1 (LPXTG-type) internalin.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AUO18792.1 7.09e-303 150 1097 108 1061
AUO19859.1 1.12e-183 212 822 30 645
BCN29385.1 3.17e-55 293 978 351 946
AIQ53446.1 1.02e-52 290 636 35 399
AIQ47948.1 2.55e-52 290 636 35 399

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2JEP_A 1.69e-52 300 636 39 393
Nativefamily 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli],2JEP_B Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli],2JEQ_A Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
6WQP_A 1.81e-44 288 578 8 314
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
3AYR_A 1.19e-42 290 636 14 359
GH5endoglucanase EglA from a ruminal fungus [Piromyces rhizinflatus]
4X0V_A 1.88e-42 300 635 44 393
Structureof a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32],4X0V_B Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32],4X0V_C Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32],4X0V_D Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32],4X0V_E Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32],4X0V_F Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32],4X0V_G Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32],4X0V_H Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32]
6Q1I_A 2.55e-42 286 636 1 352
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O08342 2.53e-49 275 636 19 398
Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
P54937 6.19e-41 273 636 13 377
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P23660 1.79e-38 290 634 21 360
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
P17901 3.06e-37 306 669 57 432
Endoglucanase A OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCA PE=1 SV=1
Q12647 3.98e-37 291 636 20 362
Endoglucanase B OS=Neocallimastix patriciarum OX=4758 GN=CELB PE=2 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.001683 0.997131 0.000547 0.000243 0.000195 0.000176

TMHMM  Annotations      download full data without filtering help

start end
21 38