dbCAN3 server

What is dbCAN3

dbCAN3 server is a web server for automated Carbohydrate-active enzyme ANnotation, funded by the National Science Foundation (DBI-1933521). Similar resources on the web include CAZy, CAT (obsolete), and dbCAN-sub. dbCAN3 server is an updated version of the original dbCAN web server (obsolete) and dbCAN2 meta server (obsolete) , and has the following new features (thanks to dbCAN users all over the world for suggestions):

dbCAN3 server allows users to predict the substrate of CAZyme gene clusters (CGCs) by using two approaches: dbCAN-PUL search and eCAMI subfamily
dbCAN3 server allows submission of nucleotide sequences: prokaryotic genomic sequences (fna file) of draft genomes and metagenomes; for eukaryotic genomes, please still submit protein seqs (faa file)
dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:

HMMER for annotated CAZyme domain boundaries according to the dbCAN CAZyme domain HMM database
DIAMOND for fast blast hits in the CAZy database
HMMER for dbCAN-sub a database of carbohydrate active enzyme subfamilies for substrate annotation

dbCAN3 server can identify transcription factors (TFs), transporters (TCs), and further CAZyme gene clusters (CGCs) using CGC-Finder if users submit faa+gff files or fna file
dbCAN3 server combines the results from the three tools and allows visualization as venn diagram and detailed results as graphs

dbCAN3 server will be updated once a year to use the most updated CAZy database, dbCAN HMM database and dbCAN-sub carbohydrate active enzyme subfamilies

News

8/9/2022: dbCAN HMMdb v11 is released (based on CAZyDB 8/7/2022). Now the HMMdb contains 699 CAZyme HMMs (452 family HMMs + 3 cellulosome HMMs + 244 subfamily HMMs). The CAZyDB for Diamond search is also updated, containing in total 2,428,817 fasta sequences. See readme for details.
06/29/2022: dbCAN-sub (HMMdb from eCAMI subfams and allows EC and substrate inferences) is now deployed on dbCAN meta server and replaces eCAMI (consumes too much RAM and too slow).
12/21/2021: updated run_dbcan python package to V3.0.1. Major updates include: (1) replaced Hotpep with eCAMI (recommended by an evaluation study); (2) added EC number in the overview output file (inferred by eCAMI); (3) formated cgc.out to make it more readable. The web server has been updated accordingly.
10/03/2021: updated CAZyDB for Diamond search. Now this file contains 2,161,786 fasta sequences. The old CAZyDB fasta file CAZyDB.07292021.fa was deleted in the download folder.See readme for details.
8/17/2021: dbCAN HMMdb v10 is released (based on CAZyDB 7/26/2020). Now the HMMdb contains 692 CAZyme HMMs (445 family HMMs + 3 cellulosome HMMs + 244 subfamily HMMs). The CAZyDB for Diamond search is also updated, containing in total 1,776,583 fasta sequences. See readme for details.
04/28/2021: We received an NIH R01 award to continue the development of dbCAN family tools
8/04/2020: dbCAN HMMdb v9 is released (based on CAZyDB 7/30/2020). Now the HMMdb contains 681 CAZyme HMMs (434 family HMMs + 3 cellulosome HMMs + 244 subfamily HMMs). The CAZyDB for Diamond search is also updated, containing in total 1,716,043 fasta sequences. See readme for details.
04/21/2020: dbCAN2 Hotpep PPR patterns updated to most recent release of CAZyDB (2019). Also missing group EC# files for families added in.
10/07/2019 run_dbcan python package is released. You should not only use pip install run-dbcan==2.0.0 to download it, but also install Miniconda or Anaconda as well to install dependencies packages(conda install -c bioconda diamond hmmer=3.1b2 prodigal fraggenescan). And then use only one command to download and compress all the related databases from Download section. Read more on run_dbcan2.
/08/2019: dbCAN HMMdb v8 is released (based on CAZyDB 7/26/2019). Now the HMMdb contains 641 CAZyme HMMs (421 family HMMs + 3 cellulosome HMMs + 217 subfamily HMMs). The CAZyDB for Diamond search is also updated, containing in total 1,386,849 fasta sequences. See readme for details.
4/01/2019: dbCAN2 has a docker version written by Haidong Yi.
3/19/2019: dbCAN2 web server has moved to UNL and has a new URL
1/20/2019: dbCAN2 standalone package is available on github; if you prefer to still use the old hmmscan way, the data are available in the download page
8/25/2018: dbCAN HMMdb v7 is released (based on CAZyDB 7/31/2018): HMMs of 15 new families were added (AA14, AA15, CBM82, CBM83, GH146, GH147, GH148, GH149, GH150, GH151, GH152, GH153, GT105, GT106, PL28), GT2 family HMM now is replaced with 8 Pfam HMMs (GT2_Chitin_synth_1, GT2_Chitin_synth_2, GT2_Glycos_transf_2, GT2_Glyco_tranf_2_2, GT2_Glyco_tranf_2_3, GT2_Glyco_tranf_2_4, GT2_Glyco_tranf_2_5, GT2_Glyco_trans_2_3)
5/2/2018: dbCAN2 meta server paper is accepted to publish at Nucleic Acids Research
8/15/2017: Tanner and Le Huang begin to work on dbCAN2 meta server
7/1/2017: Yanbin is awarded the NSF CAREER grant for CAZyme bioinformatics research