megablast 参数

2021/07/02

megablast 2.2.25 arguments:

./megablast --help

  • -d Database [String] default = nr
  • -i 输入文件 [File In]
  • -e E值 [Real] default = 10.0
  • -m 比对文件格式:
    (1)0 = pairwise,
MEGABLAST 2.2.25 [Feb-01-2011]


Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), 
"A greedy algorithm for aligning DNA sequences", 
J Comput Biol 2000; 7(1-2):203-14.

Database: /home/user/database/hg19.fa
           93 sequences; 3,137,161,264 total letters

Searching..................................................done

Query= seqname
         (32 letters)



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

5                                                                      64   3e-09

>5 
          Length = 180915260

 Score = 63.9 bits (32), Expect = 3e-09
 Identities = 32/32 (100%)
 Strand = Plus / Plus

                                              
Query: 1      aaaataatgcatttgaaatagagatctagcaa 32
              ||||||||||||||||||||||||||||||||
Sbjct: 233526 aaaataatgcatttgaaatagagatctagcaa 233557


  Database: /home/user/database/hg19.fa
    Posted date:  Jun 27, 2018  11:28 AM
  Number of letters in database: 3,137,161,264
  Number of sequences in database:  93
  
Lambda     K      H
    1.37    0.711     1.31 

Gapped
Lambda     K      H
    1.37    0.711     1.31 


Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 0, Extension:  3.5
Number of Sequences: 93
Number of Hits to DB: 259,093
Number of extensions: 1
Number of successful extensions: 1
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 32
Length of database: 3,137,161,264
Length adjustment: 18
Effective length of query: 14
Effective length of database: 3,137,159,590
Effective search space: 43920234260
Effective search space used: 43920234260
X1: 11 (21.8 bits)
X2: 20 (39.6 bits)
X3: 51 (101.1 bits)
S1: 16 (32.2 bits)
S2: 16 (32.2 bits)

(2)1 = query-anchored showing identities,

MEGABLAST 2.2.25 [Feb-01-2011]


Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), 
"A greedy algorithm for aligning DNA sequences", 
J Comput Biol 2000; 7(1-2):203-14.

Database:/home/user/database/hg19.fa
           93 sequences; 3,137,161,264 total letters

Searching..................................................done

Query= seqname
         (32 letters)



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

5                                                                      64   3e-09

1_0 1      aaaataatgcatttgaaatagagatctagcaa 32
5   233526 ................................ 233557
  Database: /home/user/database/hg19.fa
    Posted date:  Jun 27, 2018  11:28 AM
  Number of letters in database: 3,137,161,264
  Number of sequences in database:  93
  
Lambda     K      H
    1.37    0.711     1.31 

Gapped
Lambda     K      H
    1.37    0.711     1.31 


Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 0, Extension:  3.5
Number of Sequences: 93
Number of Hits to DB: 259,093
Number of extensions: 1
Number of successful extensions: 1
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 32
Length of database: 3,137,161,264
Length adjustment: 18
Effective length of query: 14
Effective length of database: 3,137,159,590
Effective search space: 43920234260
Effective search space used: 43920234260
X1: 11 (21.8 bits)
X2: 20 (39.6 bits)
X3: 51 (101.1 bits)
S1: 16 (32.2 bits)
S2: 16 (32.2 bits)

(3)2 = query-anchored no identities,

MEGABLAST 2.2.25 [Feb-01-2011]


Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), 
"A greedy algorithm for aligning DNA sequences", 
J Comput Biol 2000; 7(1-2):203-14.

Database: /home/user/database/hg19.fa
           93 sequences; 3,137,161,264 total letters

Searching..................................................done

Query= seqname
         (32 letters)



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

5                                                                      64   3e-09

1_0 1      aaaataatgcatttgaaatagagatctagcaa 32
5   233526 aaaataatgcatttgaaatagagatctagcaa 233557
  Database: /home/user/database/hg19.fa
    Posted date:  Jun 27, 2018  11:28 AM
  Number of letters in database: 3,137,161,264
  Number of sequences in database:  93
  
Lambda     K      H
    1.37    0.711     1.31 

Gapped
Lambda     K      H
    1.37    0.711     1.31 


Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 0, Extension:  3.5
Number of Sequences: 93
Number of Hits to DB: 259,093
Number of extensions: 1
Number of successful extensions: 1
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 32
Length of database: 3,137,161,264
Length adjustment: 18
Effective length of query: 14
Effective length of database: 3,137,159,590
Effective search space: 43920234260
Effective search space used: 43920234260
X1: 11 (21.8 bits)
X2: 20 (39.6 bits)
X3: 51 (101.1 bits)
S1: 16 (32.2 bits)
S2: 16 (32.2 bits)

(4)3 = flat query-anchored, show identities,

MEGABLAST 2.2.25 [Feb-01-2011]


Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), 
"A greedy algorithm for aligning DNA sequences", 
J Comput Biol 2000; 7(1-2):203-14.

Database: /home/user/database/hg19.fa 
           93 sequences; 3,137,161,264 total letters

Searching..................................................done

Query= seqname
         (32 letters)



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

5                                                                      64   3e-09

1_0 1      aaaataatgcatttgaaatagagatctagcaa 32
5   233526 ................................ 233557
  Database: /home/user/database/hg19.fa
    Posted date:  Jun 27, 2018  11:28 AM
  Number of letters in database: 3,137,161,264
  Number of sequences in database:  93
  
Lambda     K      H
    1.37    0.711     1.31 

Gapped
Lambda     K      H
    1.37    0.711     1.31 


Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 0, Extension:  3.5
Number of Sequences: 93
Number of Hits to DB: 259,093
Number of extensions: 1
Number of successful extensions: 1
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 32
Length of database: 3,137,161,264
Length adjustment: 18
Effective length of query: 14
Effective length of database: 3,137,159,590
Effective search space: 43920234260
Effective search space used: 43920234260
X1: 11 (21.8 bits)
X2: 20 (39.6 bits)
X3: 51 (101.1 bits)
S1: 16 (32.2 bits)
S2: 16 (32.2 bits)

(5)4 = flat query-anchored, no identities,

MEGABLAST 2.2.25 [Feb-01-2011]


Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), 
"A greedy algorithm for aligning DNA sequences", 
J Comput Biol 2000; 7(1-2):203-14.

Database: /home/user/database/hg19.fa
           93 sequences; 3,137,161,264 total letters

Searching..................................................done

Query= seqname
         (32 letters)



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

5                                                                      64   3e-09

1_0 1      aaaataatgcatttgaaatagagatctagcaa 32
5   233526 aaaataatgcatttgaaatagagatctagcaa 233557
  Database: /home/user/database/hg19.fa
    Posted date:  Jun 27, 2018  11:28 AM
  Number of letters in database: 3,137,161,264
  Number of sequences in database:  93
  
Lambda     K      H
    1.37    0.711     1.31 

Gapped
Lambda     K      H
    1.37    0.711     1.31 


Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 0, Extension:  3.5
Number of Sequences: 93
Number of Hits to DB: 259,093
Number of extensions: 1
Number of successful extensions: 1
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 32
Length of database: 3,137,161,264
Length adjustment: 18
Effective length of query: 14
Effective length of database: 3,137,159,590
Effective search space: 43920234260
Effective search space used: 43920234260
X1: 11 (21.8 bits)
X2: 20 (39.6 bits)
X3: 51 (101.1 bits)
S1: 16 (32.2 bits)
S2: 16 (32.2 bits)

(6)5 = query-anchored no identities and blunt ends,

MEGABLAST 2.2.25 [Feb-01-2011]


Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), 
"A greedy algorithm for aligning DNA sequences", 
J Comput Biol 2000; 7(1-2):203-14.

Database: /home/user/database/hg19.fa
           93 sequences; 3,137,161,264 total letters

Searching..................................................done

Query= seqname
         (32 letters)



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

5                                                                      64   3e-09

1_0 1      aaaataatgcatttgaaatagagatctagcaa 32
5   233526 aaaataatgcatttgaaatagagatctagcaa 233557
  Database: /home/user/database/hg19.fa
    Posted date:  Jun 27, 2018  11:28 AM
  Number of letters in database: 3,137,161,264
  Number of sequences in database:  93
  
Lambda     K      H
    1.37    0.711     1.31 

Gapped
Lambda     K      H
    1.37    0.711     1.31 


Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 0, Extension:  3.5
Number of Sequences: 93
Number of Hits to DB: 259,093
Number of extensions: 1
Number of successful extensions: 1
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 32
Length of database: 3,137,161,264
Length adjustment: 18
Effective length of query: 14
Effective length of database: 3,137,159,590
Effective search space: 43920234260
Effective search space used: 43920234260
X1: 11 (21.8 bits)
X2: 20 (39.6 bits)
X3: 51 (101.1 bits)
S1: 16 (32.2 bits)
S2: 16 (32.2 bits)

(7)6 = flat query-anchored, no identities and blunt ends,

MEGABLAST 2.2.25 [Feb-01-2011]


Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), 
"A greedy algorithm for aligning DNA sequences", 
J Comput Biol 2000; 7(1-2):203-14.

Database: /home/user/database/hg19.fa 
           93 sequences; 3,137,161,264 total letters

Searching..................................................done

Query= seqname
         (32 letters)



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

5                                                                      64   3e-09

1_0 1      aaaataatgcatttgaaatagagatctagcaa 32
5   233526 aaaataatgcatttgaaatagagatctagcaa 233557
  Database: /home/user/database/hg19.fa
    Posted date:  Jun 27, 2018  11:28 AM
  Number of letters in database: 3,137,161,264
  Number of sequences in database:  93
  
Lambda     K      H
    1.37    0.711     1.31 

Gapped
Lambda     K      H
    1.37    0.711     1.31 


Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 0, Extension:  3.5
Number of Sequences: 93
Number of Hits to DB: 259,093
Number of extensions: 1
Number of successful extensions: 1
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 32
Length of database: 3,137,161,264
Length adjustment: 18
Effective length of query: 14
Effective length of database: 3,137,159,590
Effective search space: 43920234260
Effective search space used: 43920234260
X1: 11 (21.8 bits)
X2: 20 (39.6 bits)
X3: 51 (101.1 bits)
S1: 16 (32.2 bits)
S2: 16 (32.2 bits)

(8)7 = XML Blast output,

<?xml version="1.0"?>
<!DOCTYPE BlastOutput PUBLIC "-//NCBI//NCBI BlastOutput/EN" "http://www.ncbi.nlm.nih.gov/dtd/NCBI_BlastOutput.dtd">
<BlastOutput>
  <BlastOutput_program>blastn</BlastOutput_program>
  <BlastOutput_version>blastn 2.2.25 [Feb-01-2011]</BlastOutput_version>
  <BlastOutput_reference>~Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, ~Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), ~&quot;Gapped BLAST and PSI-BLAST: a new generation of protein database search~programs&quot;,  Nucleic Acids Res. 25:3389-3402.</BlastOutput_reference>
  <BlastOutput_db>/home/user/database/hg19.fa</BlastOutput_db>
  <BlastOutput_query-ID>lcl|1_0</BlastOutput_query-ID>
  <BlastOutput_query-def>seqname</BlastOutput_query-def>
  <BlastOutput_query-len>32</BlastOutput_query-len>
  <BlastOutput_param>
    <Parameters>
      <Parameters_expect>10</Parameters_expect>
      <Parameters_sc-match>1</Parameters_sc-match>
      <Parameters_sc-mismatch>-3</Parameters_sc-mismatch>
      <Parameters_gap-open>0</Parameters_gap-open>
      <Parameters_gap-extend>0</Parameters_gap-extend>
      <Parameters_filter>F</Parameters_filter>
    </Parameters>
  </BlastOutput_param>
  <BlastOutput_iterations>
    <Iteration>
      <Iteration_iter-num>1</Iteration_iter-num>
      <Iteration_query-ID>lcl|1_0</Iteration_query-ID>
      <Iteration_query-def>seqname</Iteration_query-def>
      <Iteration_query-len>32</Iteration_query-len>
      <Iteration_hits>
        <Hit>
          <Hit_num>1</Hit_num>
          <Hit_id>lcl|5</Hit_id>
          <Hit_def>No definition line found</Hit_def>
          <Hit_accession>5</Hit_accession>
          <Hit_len>180915260</Hit_len>
          <Hit_hsps>
            <Hsp>
              <Hsp_num>1</Hsp_num>
              <Hsp_bit-score>63.9245</Hsp_bit-score>
              <Hsp_score>32</Hsp_score>
              <Hsp_evalue>2.50885e-09</Hsp_evalue>
              <Hsp_query-from>1</Hsp_query-from>
              <Hsp_query-to>32</Hsp_query-to>
              <Hsp_hit-from>233526</Hsp_hit-from>
              <Hsp_hit-to>233557</Hsp_hit-to>
              <Hsp_query-frame>1</Hsp_query-frame>
              <Hsp_hit-frame>1</Hsp_hit-frame>
              <Hsp_identity>32</Hsp_identity>
              <Hsp_positive>32</Hsp_positive>
              <Hsp_align-len>32</Hsp_align-len>
              <Hsp_qseq>AAAATAATGCATTTGAAATAGAGATCTAGCAA</Hsp_qseq>
              <Hsp_hseq>AAAATAATGCATTTGAAATAGAGATCTAGCAA</Hsp_hseq>
              <Hsp_midline>||||||||||||||||||||||||||||||||</Hsp_midline>
            </Hsp>
          </Hit_hsps>
        </Hit>
      </Iteration_hits>
      <Iteration_stat>
        <Statistics>
          <Statistics_db-num>93</Statistics_db-num>
          <Statistics_db-len>3137161264</Statistics_db-len>
          <Statistics_hsp-len>18</Statistics_hsp-len>
          <Statistics_eff-space>4.39202e+10</Statistics_eff-space>
          <Statistics_kappa>0.711</Statistics_kappa>
          <Statistics_lambda>1.374</Statistics_lambda>
          <Statistics_entropy>1.31</Statistics_entropy>
        </Statistics>
      </Iteration_stat>
    </Iteration>
  </BlastOutput_iterations>
</BlastOutput>

(9)8 = tabular,

seqname 5   100.00  32  0   0   1   32  233526  233557  3e-09   63.9

(10)9 tabular with comment lines,

# BLASTN 2.2.25 [Feb-01-2011]
# Query: seqname
# Database: /home/user/database/hg19.fa
# Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score
seqname 5   100.00  32  0   0   1   32  233526  233557  3e-09   63.9

(11)10 ASN, text
(12)11 ASN, binary [Integer] default = 0 range from 0 to 11

  • -o BLAST报告输出文件名,默认 = stdout
  • -F 过滤输入序列 [String],【T / F】
  • -X 间隙对齐的X衰减值(位)[整数],默认=20 (X dropoff value for gapped alignment (in bits) [Integer] default = 20)
  • -I 显示GI在deflines 【T / F
  • -q 对核苷酸错配的惩罚 【-3】
  • -r 核苷酸匹配奖励 【1】
  • -v 显示(V)的一行描述的数据库序列数 【500】
  • -b 显示(B)比对的数据库序列数 【250】
  • -D 输出类型:
    0 - 对齐终点和分数,
    1 - all ungapped segments endpoints,
    2 - traditional BLAST output,
    3 - tab-delimited one line format,
    4 - incremental text ASN.1,
    5 - incremental binary ASN.1 [Integer] default = 2
  • -a 要使用的处理器数量 【1】
  • -O ASN.1 SeqAlign文件;必须与-D2选项结合使用
  • -J 相信查询定义 【T / F
  • -M 单次搜索的最大查询总长度 【5000000】
  • -W word大小(最佳完美匹配的长度)【28】
  • -z 数据库的有效长度 【0】
  • -Y 搜索空间的有效长度【0】
  • -P 散列值的最大位置数(设置为0以忽略)[整数] 【0】
  • -S 查询链搜索数据库:3是两者,1是顶部,2是底部 【3】
  • -T 输出HTML 【T / F
  • -l 将数据库搜索限制为GI的列表[String]
  • -G 打开gap的成本 【-1】
  • -E 扩展gap的成本 【-1】
  • -s 报告的最小命中分数 【0】
  • -Q 屏蔽查询输出,必须与-D 2选项结合使用
  • -f 在输出中显示完整的ID(默认-仅限GIs或加入)【T / F
  • -U 使用FASTA序列的小写过滤 【T / F
  • -R 在输出结束时报告日志信息 【T / F
  • -p 相似性百分比cut 【0】
  • -L 查询序列上的位置
  • -A 多次点击窗口大小;对于不连续的模板,默认值为0(即单击扩展)或40(负数覆盖此)[整数] 【0】
  • -y 无上限扩展的X dropoff值[整数] 【10】
  • -Z X动态编程间隙扩展的压差值[整数] 【50】
  • -t 不连续单词模板的长度(如果为0,则为连续单词)[整数] 【0】
  • -g 使不连续的megablast为数据库的每个基础生成单词(当前的BLAST引擎是强制性的)【T / F】
  • -n 对亲合差距分数使用非贪婪(动态规划)扩展 【T / F
  • -N 不连续词模板的类型 【0 - 编码 / 1 - 最优 / 2 - 两个同时】
  • -H 每个数据库序列要保存的最大HSP数 【0】
  • -V 强制使用遗留blast引擎 【T/F
©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 211,743评论 6 492
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 90,296评论 3 385
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 157,285评论 0 348
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 56,485评论 1 283
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 65,581评论 6 386
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 49,821评论 1 290
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 38,960评论 3 408
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 37,719评论 0 266
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 44,186评论 1 303
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 36,516评论 2 327
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 38,650评论 1 340
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 34,329评论 4 330
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 39,936评论 3 313
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 30,757评论 0 21
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 31,991评论 1 266
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 46,370评论 2 360
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 43,527评论 2 349

推荐阅读更多精彩内容