logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001157_00441

You are here: Home > Sequence: MGYG000001157_00441

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Gemmiger sp900540775
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Gemmiger; Gemmiger sp900540775
CAZyme ID MGYG000001157_00441
CAZy Family GH59
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
914 MGYG000001157_21|CGC1 98206.78 4.2642
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001157 2792015 MAG Austria Europe
Gene Location Start: 19789;  End: 22533  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000001157_00441.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH59 376 889 2.7e-119 0.7068145800316957

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam02057 Glyco_hydro_59 4.81e-91 381 711 1 292
Glycosyl hydrolase family 59.
NF033913 fibronec_FbpA 1.46e-05 49 100 203 256
LPXTG-anchored fibronectin-binding protein FbpA. FbpA, a fibronectin-binding protein described in Streptococcus pyogenes, has a YSIRK-type (crosswall-targeting) signal peptide and a C-terminal LPXTG motif for covalent attachment to the cell wall. It is unrelated to the PavA-like protein from Streptococcus gordonii (see BlastRule NBR009716) that was given the identical name, so the phase LPXTG-anchored is added to the protein name for clarity.
pfam04886 PT 1.96e-05 59 97 2 36
PT repeat. This short repeat is composed on the tetrapeptide XPTX. This repeat is found in a variety of proteins, however it is not clear if these repeats are homologous to each other. The alignment represents nine copies of this repeat.
pfam13385 Laminin_G_3 9.42e-05 191 332 2 140
Concanavalin A-like lectin/glucanases superfamily. This domain belongs to the Concanavalin A-like lectin/glucanases superfamily.
pfam04886 PT 1.90e-04 55 89 2 36
PT repeat. This short repeat is composed on the tetrapeptide XPTX. This repeat is found in a variety of proteins, however it is not clear if these repeats are homologous to each other. The alignment represents nine copies of this repeat.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QUW95523.1 2.17e-157 351 881 29 585
AMW11854.1 1.71e-156 362 881 39 584
QLH25411.1 6.47e-156 362 881 39 584
AZM74018.1 1.18e-155 358 881 33 582
QKW59509.1 1.27e-154 358 881 33 582

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4CCC_A 9.47e-20 378 718 22 325
StructureOf Mouse Galactocerebrosidase With 4nbdg: Enzyme-substrate Complex [Mus musculus],4CCD_A Structure Of Mouse Galactocerebrosidase With D-galactal: Enzyme-intermediate Complex [Mus musculus],4CCE_A Structure Of Mouse Galactocerebrosidase With Galactose: Enzyme-product Complex [Mus musculus],4UFH_A Mouse Galactocerebrosidase complexed with iso-galacto-fagomine IGF [Mus musculus],4UFI_A Mouse Galactocerebrosidase complexed with aza-galacto-fagomine AGF [Mus musculus],4UFJ_A Mouse Galactocerebrosidase complexed with iso-galacto-fagomine lactam IGL [Mus musculus],4UFK_A Mouse Galactocerebrosidase complexed with dideoxy-imino-lyxitol DIL [Mus musculus],4UFL_A Mouse Galactocerebrosidase complexed with deoxy-galacto-noeurostegine DGN [Mus musculus],4UFM_A Mouse Galactocerebrosidase complexed with 1-deoxy-galacto-nojirimycin DGJ [Mus musculus],5NXB_A Mouse galactocerebrosidase in complex with saposin A [Mus musculus],5NXB_B Mouse galactocerebrosidase in complex with saposin A [Mus musculus],6Y6S_A Chain A, Galactocerebrosidase [Mus musculus],6Y6T_A Chain A, Galactocerebrosidase [Mus musculus]
3ZR5_A 9.52e-20 378 718 24 327
STRUCTUREOF GALACTOCEREBROSIDASE FROM MOUSE [Mus musculus],3ZR6_A STRUCTURE OF GALACTOCEREBROSIDASE FROM MOUSE IN COMPLEX WITH GALACTOSE [Mus musculus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P54818 5.55e-19 378 718 52 355
Galactocerebrosidase OS=Mus musculus OX=10090 GN=Galc PE=1 SV=2
Q0VA39 7.28e-16 378 718 44 347
Galactocerebrosidase OS=Xenopus tropicalis OX=8364 GN=galc PE=2 SV=1
Q498K0 1.51e-14 378 718 44 346
Galactocerebrosidase OS=Xenopus laevis OX=8355 GN=galc PE=2 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.001165 0.995037 0.002635 0.000597 0.000316 0.000205

TMHMM  Annotations      download full data without filtering help

start end
12 31