logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001573_00794

You are here: Home > Sequence: MGYG000001573_00794

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Actinobacteriota; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella;
CAZyme ID MGYG000001573_00794
CAZy Family CBM32
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1187 126376.04 4.3252
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001573 1547360 MAG United States North America
Gene Location Start: 37;  End: 3600  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000001573_00794.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CBM32 119 234 7.2e-16 0.8709677419354839

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd08759 Type_III_cohesin_like 5.19e-37 321 488 1 167
Cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. Two specific calcium-dependent interactions between cohesin and dockerin appear to be essential for cellulosome assembly, type I and type II. This subfamily represents type III cohesins and closely related domains.
pfam00754 F5_F8_type_C 4.52e-18 111 227 1 118
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam00754 F5_F8_type_C 3.11e-10 817 954 5 127
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
cd00057 FA58C 7.66e-10 113 227 18 134
Substituted updates: Jan 31, 2002
pfam07554 FIVAR 5.25e-08 970 1035 3 69
FIVAR domain. This domain is found in a wide variety of contexts, but mostly occurring in cell wall associated proteins. A lack of conserved catalytic residues suggests that it is a binding domain. From context, possible substrates are hyaluronate or fibronectin (personal obs: C Yeats). This is further evidenced by. Possibly the exact substrate is N-acetyl glucosamine. Finding it in the same protein as pfam05089 further supports this proposal. It is found in the C-terminal part of Bacillus sp. Gellan lyase, which is removed during maturation. Some of the proteins it is found in are involved in methicillin resistance. The name FIVAR derives from Found In Various Architectures.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QWT17625.1 0.0 12 1125 973 2117
QNM10857.1 1.41e-257 4 1113 924 2019
BCT46261.1 1.06e-214 10 1110 930 2041
QOY60737.1 3.76e-176 4 493 354 848
BBK61154.1 6.80e-164 7 1116 953 2075

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4LPL_A 9.92e-22 83 238 23 182
Structureof CBM32-1 from a family 31 glycoside hydrolase from Clostridium perfringens [Clostridium perfringens ATCC 13124]
2V72_A 3.38e-15 106 238 11 143
Thestructure of the family 32 CBM from C. perfringens NanJ in complex with galactose [Clostridium perfringens]
4LKS_A 1.14e-14 817 958 32 166
Structureof CBM32-3 from a family 31 glycoside hydrolase from Clostridium perfringens in complex with galactose [Clostridium perfringens ATCC 13124],4LKS_C Structure of CBM32-3 from a family 31 glycoside hydrolase from Clostridium perfringens in complex with galactose [Clostridium perfringens ATCC 13124],4LQR_A Structure of CBM32-3 from a family 31 glycoside hydrolase from Clostridium perfringens [Clostridium perfringens ATCC 13124],4P5Y_A Structure of CBM32-3 from a family 31 glycoside hydrolase from Clostridium perfringens in complex with N-acetylgalactosamine [Clostridium perfringens ATCC 13124]
6M76_A 6.25e-12 12 104 871 963
GH31alpha-N-acetylgalactosaminidase from Enterococcus faecalis [Enterococcus faecalis ATCC 10100],6M77_A GH31 alpha-N-acetylgalactosaminidase from Enterococcus faecalis in complex with N-acetylgalactosamine [Enterococcus faecalis ATCC 10100]
7F7Q_A 6.25e-12 12 104 871 963
ChainA, GH31 alpha-N-acetylgalactosaminidase [Enterococcus faecalis ATCC 10100]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q0TR53 1.05e-06 150 237 672 766
O-GlcNAcase NagJ OS=Clostridium perfringens (strain ATCC 13124 / DSM 756 / JCM 1290 / NCIMB 6125 / NCTC 8237 / Type A) OX=195103 GN=nagJ PE=1 SV=1
Q8XL08 1.05e-06 150 237 672 766
O-GlcNAcase NagJ OS=Clostridium perfringens (strain 13 / Type A) OX=195102 GN=nagJ PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000064 0.000001 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      download full data without filtering help

start end
1159 1181