- 首先打开IMG/VR数据库地址,注册一个自己的账户;
- 获取自己的账户cookies到当前下载的目录
#把自己的账号密码替换一下
curl 'https://signon.jgi.doe.gov/signon/create' --data-urlencode 'login=【自己的账号】' --data-urlencode 'password=【自己的密码】' -c cookies > $PWD
- 使用自己的cookies进行下载(核心蛋白文件,核酸序列,分类表,宿主信息)
curl -C - -b cookies -o IMGVR_all_proteins-high_confidence.faa.gz 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a2/IMGVR_all_proteins-high_confidence.faa.gz'
curl -C - -b cookies -o IMGVR_all_nucleotides-high_confidence.fna.gz 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a0/IMGVR_all_nucleotides-high_confidence.fna.gz'
curl -C - -b cookies -o IMGVR_all_Sequence_information-high_confidence.tsv 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a4/IMGVR_all_Sequence_information-high_confidence.tsv'
curl -C - -b cookies -o IMGVR_all_Host_information-high_confidence.tsv 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a6/IMGVR_all_Host_information-high_confidence.tsv'
#公共服务器建议删掉cookies,自己的服务器无所谓
rm cookies
4.可以对比一下MD5信息
md5sum *
File_name | MD5 |
---|---|
IMGVR_all_Host_information-high_confidence.tsv | 71b54d0f5c186d813f058bf0379dfd24 |
IMGVR_all_nucleotides-high_confidence.fna.gz | 83301c9c6dfefea3305a53ee2a41bac3 |
IMGVR_all_proteins-high_confidence.faa.gz | 19e266b87ec7ca96fe586aed172438fe |
IMGVR_all_Sequence_information-high_confidence.tsv | 3c516db128082fa29dc2c2f60520da1b |
PS:服务器似乎不支持断点再续,和多线程下载,若网络问题重新下载需要删除源文件,建议白天下载,晚上下载速度较慢。