logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004314_00011

You are here: Home > Sequence: MGYG000004314_00011

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Varibaculum timonense
Lineage Bacteria; Actinobacteriota; Actinomycetia; Actinomycetales; Actinomycetaceae; Varibaculum; Varibaculum timonense
CAZyme ID MGYG000004314_00011
CAZy Family GH101
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1834 MGYG000004314_1|CGC1 196256.72 6.1453
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004314 2027678 MAG Israel Asia
Gene Location Start: 11460;  End: 16964  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.97 3.2.1.140

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH101 233 912 3.6e-189 0.9929278642149929
CBM51 1396 1534 6.2e-46 0.9850746268656716

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam12905 Glyco_hydro_101 4.72e-133 474 765 1 273
Endo-alpha-N-acetylgalactosaminidase. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by the S. pneumoniae protein Endo-alpha-N-acetylgalactosaminidase, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor.
cd14244 GH_101_like 1.15e-113 490 787 1 297
Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the Cazy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam08305 NPCBM 2.03e-59 1393 1535 1 136
NPCBM/NEW2 domain. This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins. This domain has also been called the NEW2 domain (Naumoff DG. Phylogenetic analysis of alpha-galactosidases of the GH27 family. Molecular Biology (Engl Transl). (2004)38:388-399.)
pfam18080 Gal_mutarotas_3 1.72e-55 228 473 1 243
Galactose mutarotase-like fold domain. This domain is found in endo-alpha-N-acetylgalactosaminidase present in Streptococcus pneumoniae. Endo-alpha-N-acetylgalactosaminidase is a cell surface-anchored glycoside hydrolase involved in the breakdown of mucin type O-linked glycans. The domain, known as domain 2, exhibits strong structural similarlity to the galactose mutarotase-like fold but lacks the active site residues. Domains, found in a number of glycoside hydrolases, structurally similar to domain 2 confer stability to the multidomain architectures.
smart00776 NPCBM 3.01e-52 1392 1535 2 145
This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QXT63218.1 0.0 11 1536 1 1520
AZR00042.1 0.0 16 1390 18 1405
AZR07626.1 0.0 16 1390 18 1405
QIU85950.1 0.0 16 1390 16 1403
AJC69926.1 0.0 16 1390 7 1394

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6QEP_A 1.13e-117 227 1273 11 1114
EngBFDARPin Fusion 4b H14 [Bifidobacterium longum]
6SH9_B 1.18e-117 227 1273 11 1114
EngBFDARPin Fusion 4b D12 [Bifidobacterium longum subsp. longum JCM 1217]
6QFK_A 1.18e-117 227 1273 11 1114
EngBFDARPin Fusion 4b G10 [Bifidobacterium longum]
6QEV_B 1.18e-117 227 1273 11 1114
EngBFDARPin Fusion 4b B6 [Bifidobacterium longum]
2ZXQ_A 1.55e-117 227 1273 26 1129
Crystalstructure of endo-alpha-N-acetylgalactosaminidase from Bifidobacterium longum (EngBF) [Bifidobacterium longum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A9WNA0 5.28e-237 226 1562 50 1332
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2
Q8DR60 9.78e-108 183 1275 290 1418
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae (strain ATCC BAA-255 / R6) OX=171101 GN=spr0328 PE=1 SV=1
Q2MGH6 1.30e-107 183 1275 290 1418
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=SP_0368 PE=1 SV=1
B2UNU8 2.80e-06 1429 1531 97 202
Alpha-1,3-galactosidase B OS=Akkermansia muciniphila (strain ATCC BAA-835 / DSM 22959 / JCM 33894 / BCRC 81048 / CCUG 64013 / CIP 107961 / Muc) OX=349741 GN=glaB PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000249 0.999097 0.000147 0.000205 0.000153 0.000138

TMHMM  Annotations      download full data without filtering help

start end
7 29
1803 1825