blast数据库含义
程序员文章站
2022-06-14 17:00:12
...
blast的数据库里面有这几个数据库,每一个的具体含义: https://ncisf.org/index.php?q=software-databases/blast-databases A list of the databases available on the cluster, including information about the database, it's source, update method and
blast的数据库里面有这几个数据库,每一个的具体含义:
https://ncisf.org/index.php?q=software-databases/blast-databases
A list of the databases available on the cluster, including information about the database, it's source, update method and description.
All databases are located in /sw/db
Name |
Type |
Update Method |
Source |
Description |
---|---|---|---|---|
nt | nucleic | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/nt.* | nucleotide sequence database, with entries from all traditional divisions of GenBank, EMBL, and DDBJ excluding bulk divisions (gss, sts, pat, est, and htg divisions. wgs entries are also excluded. Not non-redundant. |
nr | protein | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/nr.* | non-redundant protein squence database with entries from GenPept, Swissprot, PIR, PDF, PDB and NCBI RefSeq |
swissprot | protein | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/swissprot.tar.gz | swiss-prot sequence databases (last major update), it's parent database is nr. |
human_genomic | nucleic | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/human_genomic.* | Human RefSeq (NC_######) chromosome records with gap adjusted concatenated NT_ contigs |
est_human | nucleic | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/est_human.* | Alias and mask files for human subset of the est database. These alias and mask files need all volumes of est to function properly. |
pataa | protein | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/pataa.* | Patent protein sequence database. Directly from USPTO or from EU/Japan Patent Agencies via EMBL/DDBJ |
patnt | nucleic | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/patnt.* | Patent nucleotide sequence database. Directly from USPTO or from EU/Japan Patent Agencies via EMBL/DDBJ |
pdbaa | protein | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/pdbaa.* | Protein sequneces from PDB protein structures, it's parent database is nr. |
pdbnt | nucleic | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/pdbnt/* | Nucleotide sequences from pdb nucleic acid structures. It's parent database is nt. They are NOT the protein coding sequences for the corresponding pdbaa entries. |
sts | nucleic | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/sts.* | Sequences from the STS division of GenBank, EMBL, and DDBJ |
vector | nucleic | Automatic - NCBI formatted. | ftp://ftp.ncbi.nih.gov/blast/db/vector.* | Vector sequence database. (Note that for vector screening, NCBI recommend using the UniVec database, please contact support@qfab.org should you require this database). |