logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000959_01979

You are here: Home > Sequence: MGYG000000959_01979

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Ruminococcus sp900752785
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus; Ruminococcus sp900752785
CAZyme ID MGYG000000959_01979
CAZy Family CBM4
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
783 MGYG000000959_77|CGC1 86674.85 4.3266
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000959 2365658 MAG Denmark Europe
Gene Location Start: 721;  End: 3072  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4 3.2.1.73 3.2.1.-

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH9 286 714 3.8e-108 0.992822966507177
CBM4 34 159 6.7e-33 0.9920634920634921

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 4.92e-93 289 714 2 374
Glycosyl hydrolase family 9.
cd02850 E_set_Cellulase_N 1.70e-26 197 281 2 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam02927 CelD_N 5.57e-25 195 274 1 82
Cellulase N-terminal ig-like domain.
PLN03009 PLN03009 3.34e-15 278 723 21 495
cellulase
cd14256 Dockerin_I 2.63e-13 725 780 1 56
Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CBL16782.1 0.0 1 783 1 789
AQR94654.1 6.30e-222 10 718 12 725
AGF55906.1 8.92e-222 10 718 12 725
QEH68101.1 7.53e-215 30 715 32 722
ADZ82408.1 3.29e-213 30 715 32 722

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4CJ0_A 1.82e-102 192 747 25 581
ChainA, ENDOGLUCANASE D [Acetivibrio thermocellus],4CJ1_A Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus]
1CLC_A 2.63e-102 192 747 39 595
ChainA, ENDOGLUCANASE CELD; EC: 3.2.1.4 [Acetivibrio thermocellus]
3EZ8_A 2.08e-84 197 714 9 528
CrystalStructure of endoglucanase Cel9A from the thermoacidophilic Alicyclobacillus acidocaldarius [Alicyclobacillus acidocaldarius subsp. acidocaldarius],3GZK_A Structure of A. Acidocaldarius Cellulase CelA [Alicyclobacillus acidocaldarius subsp. acidocaldarius],3H2W_A Structure of A. acidocaldarius cellulase CelA in complex with cellobiose [Alicyclobacillus acidocaldarius subsp. acidocaldarius],3H3K_A Structure of A. acidocaldarius cellulase CelA in complex with cellotetraose [Alicyclobacillus acidocaldarius subsp. acidocaldarius],3RX5_A structure of AaCel9A in complex with cellotriose-like isofagomine [Alicyclobacillus acidocaldarius subsp. acidocaldarius],3RX7_A Structure of AaCel9A in complex with cellotetraose-like isofagomine [Alicyclobacillus acidocaldarius subsp. acidocaldarius],3RX8_A structure of AaCel9A in complex with cellobiose-like isofagomine [Alicyclobacillus acidocaldarius subsp. acidocaldarius]
5E2J_A 6.79e-84 197 714 32 551
Crystalstructure of single mutant thermostable endoglucanase (D468A) from Alicyclobacillus acidocaldarius [Alicyclobacillus acidocaldarius subsp. acidocaldarius],5E2J_B Crystal structure of single mutant thermostable endoglucanase (D468A) from Alicyclobacillus acidocaldarius [Alicyclobacillus acidocaldarius subsp. acidocaldarius]
3X17_A 5.58e-80 197 713 18 552
Crystalstructure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium],3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P0C2S4 9.95e-102 192 747 25 581
Endoglucanase D (Fragment) OS=Acetivibrio thermocellus OX=1515 GN=celD PE=1 SV=1
A3DDN1 2.62e-101 192 747 49 605
Endoglucanase D OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celD PE=1 SV=1
P23658 6.06e-97 197 715 4 543
Cellodextrinase OS=Butyrivibrio fibrisolvens OX=831 GN=ced1 PE=1 SV=1
Q05156 7.46e-82 7 720 3 745
Cellulase 1 OS=Streptomyces reticuli OX=1926 GN=cel1 PE=1 SV=1
A3DCH1 3.89e-80 26 747 35 854
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000387 0.998805 0.000204 0.000216 0.000189 0.000167

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000959_01979.