logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004799_01180

You are here: Home > Sequence: MGYG000004799_01180

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Schaedlerella sp900765975
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Schaedlerella; Schaedlerella sp900765975
CAZyme ID MGYG000004799_01180
CAZy Family GH101
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1047 MGYG000004799_39|CGC1 117966.41 4.6006
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004799 2888994 MAG China Asia
Gene Location Start: 199;  End: 3342  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000004799_01180.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH101 7 331 2.1e-66 0.4823196605374823

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd14244 GH_101_like 1.05e-72 44 320 4 298
Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the Cazy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam12905 Glyco_hydro_101 4.87e-54 23 294 1 270
Endo-alpha-N-acetylgalactosaminidase. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by the S. pneumoniae protein Endo-alpha-N-acetylgalactosaminidase, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor.
pfam17451 Glyco_hyd_101C 5.10e-18 303 402 1 110
Glycosyl hydrolase 101 beta sandwich domain. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by a S. pneumoniae protein, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor. This domain represents C-terminal the beta sandwich domain.
pfam17974 GalBD_like 3.32e-12 594 682 1 114
Galactose-binding domain-like. Proteins containing a galactose-binding domain-like fold can be found in several different protein families, in both eukaryotes and prokaryotes. The common function of these domains is to bind to specific ligands, such as cell-surface-attached carbohydrate substrates for galactose oxidase and sialidase, phospholipids on the outer side of the mammalian cell membrane for coagulation factor Va, membrane-anchored ephrin for the Eph family of receptor tyrosine kinases, and a complex of broken single-stranded DNA and DNA polymerase beta for XRCC1. The structure of the galactose-binding domain-like members consists of a beta-sandwich, in which the strands making up the sheets exhibit a jellyroll fold.
pfam09479 Flg_new 1.92e-10 909 967 2 64
Listeria-Bacteroides repeat domain (List_Bact_rpt). This model describes a conserved core region of about 43 residues, which occurs in at least two families of tandem repeats. These include 78-residue repeats which occur from 2 to 15 times in some proteins of Bacteroides forsythus ATCC 43037, and 70-residue repeats found in families of internalins of Listeria species. Single copies are found in proteins of Fibrobacter succinogenes, Geobacter sulfurreducens, and a few other bacteria.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
BBK62189.1 3.86e-183 5 854 715 1694
BBK22105.1 1.95e-182 5 854 715 1694
ATD58436.1 1.88e-169 5 855 891 1782
ATD54117.1 1.88e-169 5 855 891 1782
QBJ76357.1 1.88e-169 5 855 891 1782

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6QEP_A 1.72e-75 8 682 263 1028
EngBFDARPin Fusion 4b H14 [Bifidobacterium longum]
6SH9_B 1.74e-75 8 682 263 1028
EngBFDARPin Fusion 4b D12 [Bifidobacterium longum subsp. longum JCM 1217]
6QFK_A 1.74e-75 8 682 263 1028
EngBFDARPin Fusion 4b G10 [Bifidobacterium longum]
6QEV_B 1.74e-75 8 682 263 1028
EngBFDARPin Fusion 4b B6 [Bifidobacterium longum]
6QFO_A 1.77e-75 8 682 263 1028
EngBFDARPin Fusion 9b 3G124 [Bifidobacterium longum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A9WNA0 1.99e-71 2 682 277 963
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2
Q8DR60 2.38e-56 8 723 572 1359
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae (strain ATCC BAA-255 / R6) OX=171101 GN=spr0328 PE=1 SV=1
Q2MGH6 1.71e-55 8 723 572 1359
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=SP_0368 PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000055 0.000010 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      download full data without filtering help

start end
1019 1041