logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002216_01552

You are here: Home > Sequence: MGYG000002216_01552

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; ;
CAZyme ID MGYG000002216_01552
CAZy Family CBM32
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1456 163641.28 4.7453
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002216 3003083 MAG Spain Europe
Gene Location Start: 4355;  End: 8725  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002216_01552.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH95 454 1086 2.3e-49 0.8310249307479224
CBM32 150 277 2.5e-22 0.9193548387096774
CBM32 309 438 1.2e-19 0.9112903225806451

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00754 F5_F8_type_C 9.06e-19 147 276 1 127
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
sd00036 LRR_3 2.49e-16 1322 1422 14 127
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 2.69e-16 1342 1434 2 93
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam00754 F5_F8_type_C 3.88e-16 309 437 6 127
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam13306 LRR_5 3.25e-14 1327 1422 16 122
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QHW32096.1 7.16e-205 26 1134 26 1087
AWS40658.1 1.92e-151 2 1128 4 926
QXE33659.1 2.02e-149 44 1128 70 952
BCB76148.1 1.14e-137 305 1128 89 879
QKW17790.1 1.70e-78 454 1104 134 753

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2J7M_A 3.32e-06 174 274 43 140
Characterizationof a Family 32 CBM [Clostridium perfringens]
2J1A_A 3.38e-06 174 274 44 141
Structureof CBM32 from Clostridium perfringens beta-N- acetylhexosaminidase GH84C in complex with galactose [Clostridium perfringens ATCC 13124],2J1E_A High Resolution Crystal Structure of CBM32 from a N-acetyl-beta- hexosaminidase in complex with lacNAc [Clostridium perfringens ATCC 13124]

Swiss-Prot Hits      help

has no Swissprot hit.

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000125 0.100230 0.899495 0.000058 0.000047 0.000037

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000002216_01552.