logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001602_01049

You are here: Home > Sequence: MGYG000001602_01049

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Blautia sp900547685
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Blautia; Blautia sp900547685
CAZyme ID MGYG000001602_01049
CAZy Family GH73
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
863 96954.14 4.5309
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001602 2582611 MAG China Asia
Gene Location Start: 42221;  End: 44812  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000001602_01049.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH73 731 859 4e-17 0.953125

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
NF033930 pneumo_PspA 4.20e-62 415 611 445 641
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033838 PspC_subgroup_1 2.30e-57 415 591 488 665
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033930 pneumo_PspA 4.06e-56 433 630 443 640
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033838 PspC_subgroup_1 2.66e-54 433 611 486 664
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033930 pneumo_PspA 2.11e-51 450 695 440 656
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
ASM68313.1 3.13e-301 1 863 1 1051
QHB24702.1 1.21e-61 673 862 68 258
QEI32209.1 1.21e-61 673 862 68 258
QRT30849.1 1.21e-61 673 862 68 258
CBL26238.1 4.89e-60 614 862 3 254

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7RAG_B 3.18e-14 199 391 15 213
ChainB, Germination-specific N-acetylmuramoyl-L-alanine amidase, Autolysin [Clostridioides difficile]
5J72_A 1.13e-13 173 383 425 633
ChainA, Putative N-acetylmuramoyl-L-alanine amidase,autolysin cwp6 [Clostridioides difficile 630],5J72_B Chain B, Putative N-acetylmuramoyl-L-alanine amidase,autolysin cwp6 [Clostridioides difficile 630]
4BIN_A 2.85e-12 156 387 133 393
Crystalstructure of the E. coli N-acetylmuramoyl-L-alanine amidase AmiC [Escherichia coli K-12]
1JWQ_A 3.04e-11 202 390 3 179
Structureof the catalytic domain of CwlV, N-acetylmuramoyl-L-alanine amidase from Bacillus(Paenibacillus) polymyxa var.colistinus [Paenibacillus polymyxa]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O51481 4.14e-22 727 854 68 188
Uncharacterized protein BB_0531 OS=Borreliella burgdorferi (strain ATCC 35210 / DSM 4680 / CIP 102532 / B31) OX=224326 GN=BB_0531 PE=3 SV=1
P54525 1.06e-15 203 387 32 204
Uncharacterized protein YqiI OS=Bacillus subtilis (strain 168) OX=224308 GN=yqiI PE=3 SV=3
Q02114 2.87e-15 172 387 303 495
N-acetylmuramoyl-L-alanine amidase LytC OS=Bacillus subtilis (strain 168) OX=224308 GN=lytC PE=1 SV=1
Q9RMZ0 5.73e-15 203 390 335 523
Uncharacterized cell wall amidase pXO2-42/BXB0045/GBAA_pXO2_0045 OS=Bacillus anthracis OX=1392 GN=pXO2-42 PE=3 SV=2
O48471 6.59e-15 202 391 4 185
Endolysin OS=Bacillus phage SPP1 OX=10724 GN=25 PE=4 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000278 0.999071 0.000170 0.000160 0.000151 0.000132

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000001602_01049.