logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004699_02939

You are here: Home > Sequence: MGYG000004699_02939

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Clostridiales; Clostridiaceae; Clostridium;
CAZyme ID MGYG000004699_02939
CAZy Family GH19
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
276 31594.52 8.6014
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004699 3820928 MAG Germany Europe
Gene Location Start: 474;  End: 1304  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000004699_02939.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH19 138 238 1.3e-21 0.47619047619047616

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
NF033840 PspC_relate_1 3.59e-27 4 99 512 627
PspC-related protein choline-binding protein 1. Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.
NF033838 PspC_subgroup_1 1.03e-24 6 104 535 631
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033838 PspC_subgroup_1 1.92e-24 4 99 488 582
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033930 pneumo_PspA 3.64e-23 5 104 511 608
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
COG3179 COG3179 1.51e-22 104 274 1 203
Predicted chitinase [General function prediction only].

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QMV40485.1 2.30e-77 94 276 148 327
AET57870.1 1.61e-76 107 275 166 334
AMK74618.1 7.25e-73 107 275 54 225
AOS70187.1 7.25e-73 107 275 54 225
API98400.1 3.11e-71 107 275 176 347

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4OK7_A 8.21e-07 175 274 125 221
ChainA, Endolysin [Salmonella phage SPN1S],4OK7_B Chain B, Endolysin [Salmonella phage SPN1S],4OK7_C Chain C, Endolysin [Salmonella phage SPN1S]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O64203 1.31e-15 140 274 215 360
Endolysin A OS=Mycobacterium phage D29 OX=28369 GN=10 PE=1 SV=1
P44187 3.64e-08 162 268 90 191
Glycosyl hydrolase family 19 domain-containing protein HI_1415 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=HI_1415 PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000067 0.000000 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000004699_02939.