logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003693_00345

You are here: Home > Sequence: MGYG000003693_00345

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Phocaeicola plebeius_A
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae; Phocaeicola; Phocaeicola plebeius_A
CAZyme ID MGYG000003693_00345
CAZy Family CBM32
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
452 MGYG000003693_2|CGC5 50129.01 4.244
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003693 3967780 Isolate China Asia
Gene Location Start: 104884;  End: 106242  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000003693_00345.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CBM32 318 447 5.3e-18 0.9596774193548387

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam08522 DUF1735 3.13e-14 177 288 2 117
Domain of unknown function (DUF1735). This domain of unknown function is found in a number of bacterial proteins including acylhydrolases. The structure of this domain has a beta-sandwich fold.
pfam00754 F5_F8_type_C 3.49e-14 318 446 3 125
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam08522 DUF1735 1.35e-10 36 160 1 120
Domain of unknown function (DUF1735). This domain of unknown function is found in a number of bacterial proteins including acylhydrolases. The structure of this domain has a beta-sandwich fold.
smart00231 FA58C 4.92e-05 324 450 15 139
Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
pfam07738 Sad1_UNC 0.002 391 452 62 129
Sad1 / UNC-like C-terminal. The C. elegans UNC-84 protein is a nuclear envelope protein that is involved in nuclear anchoring and migration during development. The S. pombe Sad1 protein localizes at the spindle pole body. UNC-84 and and Sad1 share a common C-terminal region, that is often termed the SUN (Sad1 and UNC) domain. In mammals, the SUN domain is present in two proteins, Sun1 and Sun2. The SUN domain of Sun2 has been demonstrated to be in the periplasm.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QUT88787.1 1.69e-170 4 451 1 445
QUT73146.1 5.81e-161 6 451 4 445
QUT40999.1 4.72e-160 6 451 4 445
BCA52367.1 9.48e-160 6 451 4 445
QGT72760.1 9.74e-153 4 451 1 447

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2KD7_A 4.07e-15 306 433 1 128
ChainA, Putative chitobiase [Bacteroides thetaiotaomicron VPI-5482]
3F2Z_A 5.54e-15 309 433 4 128
Crystalstructure of the C-terminal domain of a chitobiase (BF3579) from Bacteroides fragilis, Northeast Structural Genomics Consortium Target BfR260B [Bacteroides fragilis NCTC 9343]
3GGL_A 9.46e-15 309 433 14 138
X-RayStructure of the C-terminal domain (277-440) of Putative chitobiase from Bacteroides thetaiotaomicron. Northeast Structural Genomics Consortium Target BtR324A. [Bacteroides thetaiotaomicron],6OE2_A X-Ray Structure of the C-terminal domain (277-440) of Putative chitobiase from Bacteroides thetaiotaomicron. Northeast Structural Genomics Consortium Target BtR324A. Re-refinement of 3GGL with correct metal Mn replacing Zn. New metal confirmed with PIXE analysis of original sample. [Bacteroides thetaiotaomicron]
2EIC_A 4.75e-07 310 440 9 135
ChainA, Galactose oxidase [Fusarium graminearum]
2EID_A 4.75e-07 310 440 9 135
ChainA, Galactose oxidase [Fusarium graminearum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P0CS93 2.67e-06 310 440 50 176
Galactose oxidase OS=Gibberella zeae OX=5518 GN=GAOA PE=1 SV=1
I1S2N3 3.52e-06 310 440 50 176
Galactose oxidase OS=Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) OX=229533 GN=GAOA PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000000 1.000039 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000003693_00345.