logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002216_00547

You are here: Home > Sequence: MGYG000002216_00547

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; ;
CAZyme ID MGYG000002216_00547
CAZy Family GH142
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1431 160058.72 4.6773
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002216 3003083 MAG Spain Europe
Gene Location Start: 23913;  End: 28208  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002216_00547.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH142 585 1037 1.7e-171 0.9582463465553236

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
sd00036 LRR_3 4.46e-12 1321 1398 60 127
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 3.11e-11 1321 1398 14 81
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam13306 LRR_5 2.54e-10 1322 1395 12 74
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
pfam13306 LRR_5 1.24e-09 1322 1398 57 122
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
pfam07523 Big_3 2.32e-09 1147 1217 1 67
Bacterial Ig-like domain (group 3). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in a variety of bacterial surface proteins.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QRO14849.1 9.14e-130 588 1034 54 502
AST55953.1 9.14e-130 588 1034 54 502
QUT52376.1 9.14e-130 588 1034 54 502
QUT20767.1 9.14e-130 588 1034 54 502
QUT97167.1 9.14e-130 588 1034 54 502

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
5MQS_A 1.68e-110 590 1032 651 1090
SialidaseBT_1020 [Bacteroides thetaiotaomicron]
5MQR_A 2.96e-104 590 1032 651 1090
SialidaseBT_1020 [Bacteroides thetaiotaomicron]
6M5A_A 1.08e-16 468 1067 165 831
Crystalstructure of GH121 beta-L-arabinobiosidase HypBA2 from Bifidobacterium longum [Bifidobacterium longum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
E8MGH9 8.35e-16 468 1067 196 862
Beta-L-arabinobiosidase OS=Bifidobacterium longum subsp. longum (strain ATCC 15707 / DSM 20219 / JCM 1217 / NCTC 11818 / E194b) OX=565042 GN=hypBA2 PE=1 SV=1
A0A401ETL2 1.68e-07 1052 1218 1296 1452
Exo-beta-1,6-galactobiohydrolase OS=Bifidobacterium longum subsp. longum (strain ATCC 15707 / DSM 20219 / JCM 1217 / NCTC 11818 / E194b) OX=565042 GN=bl1,6Gal PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000546 0.998365 0.000472 0.000220 0.000195 0.000174

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000002216_00547.