1. 下载安装
http://abacus.gene.ucl.ac.uk/software/paml.html
记得把src文件下的*.c文件全部拷贝到bin文件下:
2. 配置codeml.ctl
里面有samples。你可以看着做自己的(model参数哪里算0和2的)
seqfile = F:\YT-biosoft\soft\paml4\paml4.8\examples\Myoglobin.nuc * sequence data filename
treefile = F:\YT-biosoft\soft\paml4\paml4.8\examples\Myoglobin.trees * tree structure file name
outfile = 2Myoglobin.txt * main result file name
noisy = 3 * 0,1,2,3,9: how much rubbish on the screen
verbose = 0 * 0: concise; 1: detailed, 2: too much
runmode = 0 * 0: user tree; 1: semi-automatic; 2: automatic
* 3: StepwiseAddition; (4,5):PerturbationNNI; -2: pairwise
seqtype = 1 * 1:codons; 2:AAs; 3:codons-->AAs
CodonFreq = 2 * 0:1/61 each, 1:F1X4, 2:F3X4, 3:codon table
* ndata = 10
clock = 0 * 0:no clock, 1:clock; 2:local clock; 3:CombinedAnalysis
aaDist = 0 * 0:equal, +:geometric; -:linear, 1-6:G1974,Miyata,c,p,v,a
aaRatefile = dat/jones.dat * only used for aa seqs with model=empirical(_F)
* dayhoff.dat, jones.dat, wag.dat, mtmam.dat, or your own
model = 2
* models for codons:
* 0:one, 1:b, 2:2 or more dN/dS ratios for branches
* models for AAs or codon-translated AAs:
* 0:poisson, 1:proportional, 2:Empirical, 3:Empirical+F
* 6:FromCodon, 7:AAClasses, 8:REVaa_0, 9:REVaa(nr=189)
NSsites = 0 * 0:one w;1:neutral;2:selection; 3:discrete;4:freqs;
* 5:gamma;6:2gamma;7:beta;8:beta&w;9:betaγ
* 10:beta&gamma+1; 11:beta&normal>1; 12:0&2normal>1;
* 13:3normal>0
icode = 1 * 0:universal code; 1:mammalian mt; 2-10:see below
Mgene = 0
* codon: 0:rates, 1:separate; 2:diff pi, 3:diff kapa, 4:all diff
* AA: 0:rates, 1:separate
fix_kappa = 0 * 1: kappa fixed, 0: kappa to be estimated
kappa = 2 * initial or fixed kappa
fix_omega = 0 * 1: omega or omega_1 fixed, 0: estimate
omega = .4 * initial or fixed omega, for codons or codon-based AAs
fix_alpha = 1 * 0: estimate gamma shape parameter; 1: fix it at alpha
alpha = 0. * initial or fixed alpha, 0:infinity (constant rate)
Malpha = 0 * different alphas for genes
ncatG = 8 * # of categories in dG of NSsites models
getSE = 0 * 0: don't want them, 1: want S.E.s of estimates
RateAncestor = 1 * (0,1,2): rates (alpha>0) or ancestral states (1 or 2)
Small_Diff = .5e-6
cleandata = 1 * remove sites with ambiguity data (1:yes, 0:no)?
* fix_blength = -1 * 0: ignore, -1: random, 1: initial, 2: fixed
method = 0 * Optimization method 0: simultaneous; 1: one branch a time
* Genetic codes: 0:universal, 1:mammalian mt., 2:yeast mt., 3:mold mt.,
* 4: invertebrate mt., 5: ciliate nuclear, 6: echinoderm mt.,
* 7: euplotid mt., 8: alternative yeast nu. 9: ascidian mt.,
* 10: blepharisma nu.
* These codes correspond to transl_table 1 to 11 of GENEBANK.
3.
1. 准备文件
自己的文件或者去ncbi下载,我下的肌红蛋白文件。用mega比对,保存比对后的文件(fasta格式是可以的)以及树文件(.nwk文件)。
最后把fasta格式文件改为.nuc格式(paml文件的samples文件下有例子,自己参照):如下
8 8049
Homo sapiens
GGAGCGAGGTGGCAATGTTGGGCACGGCGGCAGACCATGCC--CAACATGGGGGAAGGGAGTGTATGTGGT-TCAGGCTTGTGCGTGTCTCTGTGTGTGTGTGCAGGTGAGTATGGCGTGTGCTGGGATATATTTATG-TGTACCT-CTGTGTGAGTGTGCAGGCCCATA----TGTGAGTGTGCACACACATCTGTGAGAGCCTGCATGCATGT--ACACGT---GTGAGGTGTTGATACATGTCCATGTAGGTATCAGTTTGCCTGTA----------------------AATATCTCTGTGTAAC-AATAACAAAGTCCGAATAGTCATCAAGAGT---ATAGACATGAAGCCAGACTGCCTGGGTTAAATCCCC--AGCTAGTTAGTCTTTCGGTGCCTCTGTTTCTTCCTCTGCAAAATGGCGTCTACCCCATCAGGATT--TTAAAATCAGTTAA-TATGGCTG--GGC--GCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGCGGATCAC--AAGGTCAGGAGATCGAGACCATCCTGCCTAACATGGAGAATTCTGAGAATTGCTCAAACCCAGGAGGTGGAGGTTG-CGGTGAGCAGAGATTGCACCACTGCACTCCAGCCTGGGCAACAG-AGCCAGATTCCATCTCAAAAAAGAAAAACAAAAAACAAAAAAACCATGAACTCATTTT-CAGGTTGAGGAGCTCAGCATCCTGGTTGTGAAATACCCTCC--TCAT--AAAACCCTG-GGATGGA-GACTACGGGG-AT-CAGGTGCTT-CCTTGTGACAACTTCTGGGCATGGTGGCTCAGG-GCGC--AAACTGGAGTGT---GGCCACAATACATACTGTGTACTTTTACAAGGATGTCACAGAGCCTGGGTATCATAAAAGAG----GAGCTTTTCAAGGAACTGAAACCATTAGACAGGAGAGAG----------AGCCCTGGGCAGACAGGGTTGCCCGTGCCAAACATTTCAGC----TGTGGCACAAGGGAAAGGGTGGGAGTTATGAAACTGTTCCATTTTGGGTTTAGGTCTGGGCTCTGCCGCTAGCTAGCCAAGTGACCTT-----GGCCACTTATCTCTGTGGTCTTCC---------ATGAGTAAAAGGCGGAAACTCACTCCTACC--CAGAGGGCAGGTCTGACTCCCTTTAACCAGCACCCACCTGCTCACAGCAGGA--AGGACTGAGGTCT-----AAAGCTGGAGGTGGGCAGGAAGGA-CTGAGGTCTAAAGCTGGAGGTGGGCAGGAAGGACC--AAGGTCTAAAGTT--GGAGGTGGG---CAGGAAGGACC-------------------------------GAGGTCTAAAGCTGGA------------GGTGGCTGC-----TCAGAGTC--CCAGCAGAGGCCTCTGGGGCACCTCACTGAG----------TGCCTGGCAGGAGTG-GGTGCCTGTCTC------------AGGGCTGGGTTGAGTTGCTCCCACC--------------------AGGACCCTTCGTCATC--TGCACAGTGAGGG-------------------------------GACTGGGAGGTTCAGAGAGTCAC-------AGCTTGGGCTCAAAACAAGCAAG----AGGTTTCTGAGTGTGAGGATTGCTC-TGGA------------GTGGAATGGCCCTCACA----GGTAGGAGTG-----------AGCCTCCTGT-AGC----TAGAGGTAT-------TTAAGCAGCTGAAGGACAA-----TCCCTGGGCA----GGAAGCTGCAGAGATGGTCG-----CAGCGTGGACTA-----GAACTGCTGTT----TTGGTCACTCAGACCTCATTCCAGCCTGGCTTCTCTGGACAGCACCCCTGCAATAGTGAGCTGGT--GACTTTACGCCTCAGAACCTCGGTTTCCACATCTGTAAAATGGGAATTATATGACACTCACTATGTGCCAGACACCCTGTTGGT-----ACATAGCACACACTATCTCACTTAATCCTTCAAGTAGGGACAAGTTATCCCCATCCCTTATATGAGGAAGCT----GAGGCACAGAGAGGTGAAGTGA---ATGGCCCAAGGTCACACAGCTGGG-A--AGACAGG---GAGC-TAAACTTGAAC--------TCTAGTCTGGCTGCCCCCAGACCTCACACCGCACCTCCCATGCCGACTCCAGCCTTCCCTGTGCCCA-CAGGCTCTTTAAGGGTCACCCAGAGACTC-----TGGAGAAGTTTGACAAGTTCAAGCACCTGAAGTCAGAGGACGAGATGAAGGCGTCTGAGGACTTAAAGAAGCATGGTGCCACCGTGCTCACCGCCCTGGGTGGCATCCTTAAGAAGAAGGGGCATCATGAGGCAGAGATTAAGCCCCTGGCACAGTCGCATGCCACCAAGCACAAGATCCCCGTGAAGTACCTGGAGGTAGGAGCA-----------------GAGCCTGGGCAGGTGGGAGGATGCGGGGAAGGCCTCGGGTGGGGCAATGGGATCTGGGTTCGAGTCCAAGCTCAGCCACTAACTTGTGGGATGACCTATGCCACTCTTCTCTGTGCCCCAGGTTTCTCATTTGTAAAGGGGACTGCCACCCACTTTGCCTTCCTCCTGGGATTGTTGAGAATGAACACACTTAGCATTTTTAATTTAGTATGCCAAATTCACATCTTATTACCAAAGAGGAAA-GGGAGAGGGGATATTGG-GTGCAAAATTTGCATCCTCTCCATGGGTAGGTACCATTATCATATCCACTTGATAGATGGGGAAACTGAGGCTCACAGAGGTTAAGCAGCT-TGTCCACGGTCACAGGAG----GTGGATAATGGCAGAGCCAAGATTCAAACGCAGGTCTCTATTACTACAGAACCCC--AGCCCCTAACTGCTGTGCCACTGGGAGTCTGGTACATGC--AG--GACTTATGTGGCAGGAGCTCA-GCAAGTGGGGCTCAATTTGGGGTGGGGGTGACCAGCAGGTT----GGCTCTATTGGTTCCAGCATCTTCACAGATGAAGAGACAGGACCTCGGTTTCCAGCACAAGCAATT---GGTTTGGACCTCCTGAGATGGGTTGGAAAGTTG-GGTGGATCAGGGTTGGGGGCAGGAGCCTGGGCTTCAGGTTGTGTGTCTATAACTGGTGGGAGGAGGCGATTTGGGG-------------------------------------------------------------------AGAGGAGGGAGCTGGGGATGAAGGACCACAG--G-GACAGGTGCATCCCCCGAGGGTAGAA-ACAGCAGGAAGTCTGGTGCAGCCATGAGGATTAGGATGTGGTGATA-GCTACCCGCTGGGATGGGCCACAGTGAGCAT--TTGCT--------------------------------------------------------------------------------------------GCCAT--GCCTAGCACA----TGGCATCCATCCTCAAAGTTGCCTCATGG-CCAAAATGACTGCAAGAGCTCCAGCCACCTCTTCTATATTCCCAACTGG--AAGCAGGAGAAAGAGAGGAATGCTCTCTTTTGAGGAGTTTAAGGAGTCCCAGAAATCTCATCCAACAATTTTATTTACATCTCATTGGCCAGAAGTTAGTTACGGAGCCAT-CCTGTCTGCAC--GGGAAGCTGGGAAAGGTAGTCTATCA---CCCCTCCAAATACAA-CTAGTGTTCTGTTACCAAGGAAGAAAG-GGAAATGGATATTGTGGACGTAA---TTAGCAGTCTCTG--CCCCAGGCAAGTACCCTTCTCATCCCCATTTTG---CTGGTTTGGAAACTGAGAGCTCAGAGGGTTTAAGTAGCTTGCCCAAGGTCACACAGCTGATAAGCCAGGAGGTACAGTCAGATCCATGGTACTCTGGAA-CCAG---GCTCTGAATCCACTGTGGCACAATATCACCCATTGACAACCACCGCCACCC-----------------------------------------------------------------------------------------------CTCTTTA------------ACTTCGACCTT----------------------------------------------------------------------------------TGCAC--------------------------------------CCCACCCCAC--------AATTGCGCAGAGTCCTGCCTGCCCAACTGCTCCACATCACCAGCGT-GAACAGACAACCCTGCATGTGAG----------GC-CGTCC--CTGCCT----GCCCATCTTC-TCTCT--GCAAATCCC-TGTTCATCCACTAAGGCCCAGTGC-GAAGCCATCTCCCCTGCTTGGAACCCCAGG--CCCACGAGGCTTTCAGCGGTTCCATCTGCCAGCCCCTCCCTGCACCA----TGGCACTTTATCTTCATCTCCGGTCATGGCACT-GCCAGGCTGTGTGATAATTTGTCTCTTTGCCCAGCGGTCTC-CCAGCAGACACTC-AGCTCCCTAGGGGC--------------TACCATTTCTGTGCCCCACTGGCACGTGTTTG-GCTCACC-ATACACTTCCAATGACTGAGT-----------------------TCTCTCTCTAGAGAGGAAGAAGCCGAGT------------------------------------GTTGGATAGAAAAAACCATGC------------------------------------------------------AGCCAAGATAT---------------------------------------------------------CACAACCAGTTGG--------------------------------------------------CCACTA----------------------------------------------------------------------------------GAACCCAGGTCTCCCTAAAACATGCTTAATCTGATGCATAACCT-CTGCAAAAAATTGAGACAGAGCCCAGATGTCTGCTTTTCCC-CCTCACGGTGGACCTTCGCTCTGCGCTGTTCATAAGACAGT-CTCCACCTCACCTGTGGACAGATTGTCTCTCTGAGGGACTGAGGGGCTGTACAGGGTGGTGGGTGCCGAGAGTGTTT-CCATTTCTCAATTATGCAGTGGCTCCGTTCCA-CTGTTTTCTAGATATGAGTGGCTCTGCCATGGACTCCAGAAAGTCTGAGCTGGGGGCCCAACCAGG-CCATCTGGGTG-----------GGTGAAGTCACACATGGAGT-CGGGGCATAGGTGGCCCTGGAA----CTTGATTCTCTC-AAGTTCTCTAAGGACAGGGCTTGCTCCATGACACCCTGAGCCCCCACTTTGTTTTGTGGCATTCATGAAATGGAAGGTGAAGT----CAAGCATCCACCTTGATGCCCAAGAGTAGGCGTTTGTGGAGCTGCCCCTCTTTCCCAGCAGAGGGGC--TTATGGGGTGTTGATGCT----TC--TTGAGGCCTCCCTTTGCATGTCCCACC--------CGCGACT-ATCCCTGCTTTATGTCACTTGCACCCTGGGGCCCT-------------------GAGAAAACGAG-AAACCCAGCCTTGGTTC---------AGCC--ACTGCCTCCGTTCCCAGCTCTGGCAGTG--GCCTGCCCACAGCCTCCTGGGCGCAA-----ATCCATCTCTGCTCTCTGCTTCCTGCGTGACCT--CGGTCTGGCCACTTGA----TTTTCTCCTCTGTAAAA--CGACACTCCTTGGTGCA--CCAGCCAGGGCT------GTGGTTGGGATCAAATC-TGGTTAAC--TTGAGAAAGCAC-AGCCGCATTTCACATCCATGAG-TCTTTCCATCCCT-AAAGCAGCACCA-TGCTACATCTCCGTTTTCCCATGCCCATCTCTGTTATCCGGGCAGTGAGACTGTGGGTACTAAAGCAAATGGCAATGCTGAGGCTGATAGCAGACATTCTCCATCCTGGGAGCCAGCCGCGGGCCTCATCCCTGTCTTCTTCCTCTCTCTCCTTCCCTCCCACCAGTGTTTCTG---------------TGCTTGGTATAAAAAATAGGAGGCTGTGCCC------------------CCAAAATAGGGTCTTTAAAACAATAACCATACCAAGTCATTAA-GTATGCAAAAATTGCATACACACAAATAGAAA-TAGTTCCT-------------------TTCTAGACT--------------------------------------------------------------TTCTGATTGCAAAATCCTGAATACAA----------------------------------------------------------------------------------------------------------------------------------------------TAATGAAAT--------CTGAG-----------------------------------------------------------------------------------------------------------------------------------------------------CATTTCCCTTCTTTTCTGCTGCCCCCAAGCGGGTGGTGCTC------------------TGAGCTCTCACCTGGTTTCAGTGGGG-TCTACATCCTGA-TGGAGTGGAGG---GGGCTGTGAGTAAGAGCGTGGGCTCCGGAGCCGGCCCTCCTGG--GTCCAAATGTCCCTTCCA---TTCAACCTCCCCTCGCCTCAGTT-TCTGCATCTGTAAATCGAGGGCAGTTGTAGTATCTATCTCACAGTGG--TTGTGGGGATCAAAGGGGTTCATCCGTGGAGA-TCACACAGACTCTCACCTGGTGCCTAGCAAGTGCTCAATACACGGTCCTGGAATAAAGAGAAGGTAGGAGGACAACTGACTCCCATCTGGCCCCTGGCTTGTCCCACCCTGGTGACCATTTTCTCTCC--TCACCCTCCCTGCAGTTCATCTCGGAATGCATCATCCAGGTTCTGCAGAGCAAGCATCCCGGGGACTTTGGTGCTGATGCCCAGGGGGCCATGAACAAGGCCCTGGAGCTGTTCCGGAAGGACATGGCCTCCAACTACAAGGAGCTGGGCTTCCAGGGCTAGGCCCCTGCCGC----TCCCACCCCCACCCATCTGGGCCCCGGGTTCAAGAGAGAGCGGGGTCTGATCTCGTGTAGCCATATAGAGTTTGCTTCTGAGTGTCTGCTTTGTTTAGTAGAGGTGGGCAGGAGGAGCTGAGGGGCT--GGGGCTGGGGTGTTGAAGTTGGCTTTGCATGCCCAGCGATGCGCCTCCCTGTGG--GATGTCATCACCCTGGGAACCGGGAGT-----GGCCCTTGGCTCACTGTGTTCTGCATGGTTTGGATCTGAATTAATTGTCCTTTCTTCTAAATCCCAACCGAACTTCTTCCAACCTCCAAACT-------------------GGCTGTAACCCCAAATCCAAGCCATTAACTACA-CCTGACAGTAGCAATTGTCTGATTA-ATCACTGGCCCCTTGAA-GACAGCAGAATGTCCC-TTTGCAATGAGGAGGAGATCTGGGC------------------TGGGCGGGCCAGC----TGGGGAAGCATTTGACTATCTGGAACTTGTGTGTGCCTCCTCAGGTATGGCAGTGACTCACCTGGTTTTAATAAAACAACCTGCAACATCTCA
Mus musculus
----------CAAGCTTGAGCATCAAACAGTCCACCAGATGAATCCTATTACGGCAGAGAGTCAGAATAAT--GAGGGCAGTGATAGATCTCGTCTACTAGTGTGGACTGAAAGCTTGGGTAATGTGGTTGCCTTCGGCTGCAAAGGATTCCTAGACATGCCCAGAAAAATTGCAGATTCTGAGCCTGGGCAAGGGGGTAGGGGTGGAGGGGTGGCGAGGCTCATACACAGGGCCTCACTGCTCACCCAGAGGCTGGAACTCTTCCCGGGGTCCTGGGCTTGGTTGCCTCAGAAGCCTGATGGGATATGCATGATTGATAATTCTTGGCTCTGGGGAGG-AGATTGCTCTGTTGCTAGCAGCCCCTGCACTGTCTGTGAAGCTGTAGTCTGGAAATGAATTCCTTGCTATTTATGATCTAGGGTGA-GTTCCCCTCCTGATCTTCACATTCTTGCATACTCCCTTAGTCTGTTTCTAACCCCTCTGCTTGGGAGCAAGGTCATCTAAGGCCAGAGCCCTTT---AGGTAGCTGAGCTGGAAGGAGGCTCACCAGCTTCTGGAACACCTTGGTCTCTCAGCTTCTTCCTGCCTGTCTTTCTCCCCCCAATCCT--TAAGTGCTGGGCCTACAAGCC-ATCTAACTGGACTAACTCCCACTCCAAATTAGCATAAATTCCTAGGCCACAGGTTTCCTATGTGACCTTGGACAAGTTGTCTCTCCTCTCTGG----GCCCTATTTCTGCTTTGG--ACTGTTTGTCATGAGCT--GAGACCACCTGTCCACTCTCGTGCCCTGAGATTTGGAGCCCGTACTCTCTTTTTGTTGTGG--ACAGCAGATTATTGAGGGCAAGACTGGTAGGACGGAGTCTTGCAAGCCTCCAAGGCTGCCAGGGCCTGGGCAGTGACTT--GGGGCTTGAAGGAGACTCAGAGTATTGTGCCATTAGAGAGATACTCTTCCCTTCCCAGGGGAGTAATTGAGGCCTGCCAGAACCTGGTGCCAGGTGGGGCACAATGCCAAGGCTGCTGACCCTGGGAGG-CCCAA----GGGTCTAACACCCTATCATGTCTCAGGCTTGGGTCATGGCGCC----ACACCAGCTGCTCCAGCCCTCTCC----------ACTGGGGGCACAGGGAAGTGGTCTTCCAGG----GTGGCATGCCAACTCTCAGGGTAGCAAGTGTCCCATGGCACTAGGCTCCCAGCCAGTCCCAGACAC--CGACCCCAGCCTCACCCGCTGACCTC-AGCAAGGTCATTTCTGCAAGTACCTTAGCAGAACTCTGAGGTCTGAGCTGGTAAGCACAGAATTCTAGAACGAGGAGCAAGCAAAAGATCTTTGCACTCCAGGTGGTAGGGGCTGTTCTCT----CACCACTTCATGGTCAGATAGTTTCTGGTTTATTTATACCTGCATTT----------------------------TTTTTAAAA------------------------------------AAAGGATGTAC-----------------------------TTAATTTACTTTGTGAGTGTTTTGT--CCACATATCTGGATGTGCATCATATGCATGCAATACCCAGGAG-CCAGAGGAGGGCATTGGATCCTCTGGAACTGGAGTTACAGACAGTTGTGAGCTGTTGTGAGGTTGCTGGGAAGTGAAC-TCAGATCCTCTGGGAGAGCAGCAGATGTTCTTACTGAACCACCTTCCCAACCCTCATGATGACATTTTT-GCTGGGGAAAGGATTTTGGAGCTGTGAAGGTTTGGAAATGACTG--GTATTGACAGCAACCTTTCACACAATGAGA--ACTGCCTCAGAGAGTCACAAACAG--------------CCTGTG----CCAGTGATGATCCGCCCCTAAAAAAA-GGCTCCCGGTGGTGATAATGGAGGTGAAGGTGG---AGAGG--AAGGGGA--------------GGAAGTCTCA---------CAGAGCAAGCCCTGTGTTGGATCAC-------CTTAATGAACTTAAGTATGAACTCACGTATCACAGCTGACAG---------AGAGGTT---------------------------------TAGCAA---CTTGACCCAAATCACACAGCTAAT-ACGACTGTGGATAGGATTCAAACTCAGGCCACTTGGCTGTAGCCCATGCTCTTGGTGACCCTTCTATCTACCACCCCGTGTGCTTCTAAGATGGAGTCTGATTGGGCAACTGTTTAGAAGACAAGCAGAGAGGCTTATCTTCCTTTCCCTGACCAGACAGACTG--TATATGCTTTGCCTGGTGTCCTGGCAACATGATGAAGAAAGGAACCCTGGGAAACTATCCTTGGTCTCTTCAACAGGGACCGAATCAGTTGTGCTCAGGGGAATCCA-GGATACGGGCTCTGGACCTCTCCCGGGACTCCATCTGTGGAAC-----TCAGGGTTCAGGGAGCAAGTGACTGTGTGGAGCCTGTAAACCTGTACCTCTCCTCCTATAATGTGAGCAAGCCTAATGCTGTTCTGCCCAGTGCTGGGCAAAGGTGAGGGTAGGGTAATCCTTCTAGAGGTTGG--GAAATCAGTTTGGTTTACA-CTCAGTGATTAAGAC-GAAGAAAGGCCGAGTGTCTGTTCTGGCATCCCACAGGA-CCCTGGAGACCAACCCAGAACCACAGTGTA------GTATCTCTAA-----AGCCTACAGCCTTGAGTCCCT----CTTCTGGCCC-CATCCAGC-CTGGTGACCTTGGGCTCACCTGTGGGTGTCA---GGCCTCCCGTCTGTCAGC--AGGAACTCAAGGGTCATTGTCGATAGATAATAATAGTCAGGAGCACAGGGACCCTGGACAAAGTGCCAGAGACAAGCTGTCAGTGACGCACACAACTCTGGATAGGCTCCTGGGGTGAT-TCTGTAATACAACCAGGACC-GGGGACAGGCCCAGCCAATCTTCATGGCACCAGCCTGTGTGCATGGCTCTTCACCCACGTGAAGAACAGACTTGCATGTTTGTGTGTGTGTGCTTCTTAGAATACACACATATGTGTGTG-TGCTTCTGAGAACATGTGTTGGGGGGTGGAGGGTCTGCATACTATCATTTACAATGGGATGCCATTGTGTGACAGGGACAGAGCCCAAGTGGTCACCCAGAGGATGAGGGGGAAGCCATAGGGCCTAGTTAAACCCCTGCAGGACACTCAAGCTTCCTGTGCCTCAGTTTCTCCAGCTGTCACATGGAGTCTGCTGCATGAGGCTCTTTCAGCAGGGTTCTTTTCAGTGATGGTTGCCTCGGAGTGGCTTGGAATCACAGAGCAAGAAGTAGCCCATGTATGAACCTGAGTCCGTGTAGTTGGAATGGGTACATGGTGGGGTGTTTGGGAGGGAGCTGGTGTCAACAGAACTGATAACAGACACACAGTGAAGTGCTTGAGAGGTGGGAAGGCTTGTCTGTGATGAGTGACCATGGACGTGGAAGAGGGGCGTAAAGCAGCTGACCTGAGAGG-CTGTATCAAG---CACCCACCCACTGGTGAGGTGAGACCATCC-CCACTGTAGGGC-------CTTTTTTGTCCTGCCCTGTCTGGAGACTAATCTAGTCGTGCGCAGAGAGACAGGGCTCCC-----------ACACTCCCTCTGCTGGATGCACACTAGACCTCTTAGTTCCTCCTCTGAGACATGATGCAATCACAGAACTTGCTTCAGGTGTACTGTGGGGATTCAGTAAATTACCCTGCAAA--CTGCTTTAAGGACATGCATGTCCTTTGTGAATACTCTATAGGTGACAGTTCATGCTCTCTCATGGAAAACTCAGAGGCCACCTGTCCCTCAGTGTGGATCCTAATCAGGCTGGTTTTAGAGTGGGGATGGAACAGGGAAGGAACACCAGACTACCACATGAACACAGGCTCCATTGGCCCCACAAAGGGATGGGACAGCTCGAGAGAAG-AGTTGAAGACATCAGCACATCACAAACTCAAGGGATTGTAGAGGAAGCAGCCAGATAATTCAGCAGCCAGACCTAGCTCTTGGATTGGATGAAGCAGCTTAGTGTCCTGGTCATAGGATGTCCTCCCCATGGGACTGTGGGGCAGAGGCCAGGGGGACCAAGTGCTTCCC--------------AGACAGCTTGGTACA------CAAATGTATATGCTCATAGTGATGCCATAAGGATCACGTAGAGTCTAGATAGCTGTGGTCACCCTGATAAACATGTCAGAGTGGGG---------CACTATGGGGCATCCGGTGTAAATTCTAGTTCTGTCCAGGCTC-CAGTTCTGAGCTCTGACATCACATGCTGTGGCCTAGGCC-AAGCAACTGCCTTTGATGGTGTTTTATCTGTCCACAAGTGGATCAGGGACTGGCCTCATCTGGGGAAGTTGGTGGTTCTTGTAGCCCTCATTATCTTCCCAGAGCAGGACTTGGGTCCCTGGTTGGATAAAGCCAGAGCTCATGAGGTGACAGGCAGAAGCAGCTGT-GGCCTTTGGTT-----G-----GTCCATTTACCATTGCCAGTGGAAAGTGGTTGGGGT---TAGATCTGAAGTTTATGCTCATGTAGAGGGGTCTCAACAACAGCCACAGATAGTCAGGAGGTCTAAAACTTCAGGTTTCAGCATCTTCAGGCAGGAAGCAATAGAGGCATCCCTGACACAGATGTCTTGAGGAATAGAGG----------------------------------GTAC-----------------------------TTCTGAGTC-------------------------------------------------------------------------ATATAGAT--------CCAATG------GCTGCCTGC-------------------------------------------------CTTCTCTGCA---------------------CACTAGCCCT--------------------------------------------------------------------------------------------------------------GAGATCC------ACGGG-------------------------------------------------------------------------TCTCACTCCATCTCTTCTTGTCCCACAGTCTGTTTAAGACTCACCCTGAGACC--------CTGGATAAGTTTGACAAGTTCAAGAACTTGAAGTCAGAGGAAGATATGAAGGGCTCAGAGGACCTGAAGAAGCATGGTTGCA-CCGTGCTCACAGCCCTGGGTACCATCCTGAAGAAGAAGGGACAACATGCTGCCGAGATCCAGCCTCTAGCCCAATCACACGCCACCAAGCACAAGATCCCGGTCAAGTACCTGGAGGTAGGCGGCCACAGCA--------AGTCTCCA--------------------------GGGCAGAGATATAAATCCCAGCTTAGCCACTCAATACGAGTGGCCT---GCTTTCTCCCCACTAAGCTTTCTCCCCAGTTCTCCTCTATAACCTACCCCAGCCCTGTTCCCCT--GAGGGTGCTGAGGAGCCACACAGCATCCAATTTAATGGCTTGC-CCAAATGCTGGGCCAGCACCTTAGTGTGAAATATGACT-AATACACAACTTGCCTTCTCCTCAGAAGCGGGAGCTCTTGTTGGACCCACTGGACTGGACAGATGTGGAAACTGAGGCACAGGCAGCTTGGGCAAGTCACTGAGGCAGGATTCACACTCAGGGCGAATCAGTTCTGTCAGAGAGCGGTAGACGCT------TTGTTTCTCTGATGCTCTGCTGCCCCAGTCCATGATGGTTCAGACACCAGTTGGAAAGTGGCCTAAAGAGAGGCCAGCAGCCCTCAGTGGGGTCCTGTCTGTCTTCATTTTGCAACTGGAGTCACAGGGCAGAGCAGGGCAGAGCAGGGCAGAGCAGGGCAGGGC-CAGACACATACCCAGCCTCCCTGAAGCTGGGGCTGGTTCCGCAGA-------------CCCAGGCTCCACAGACGTTTC--TGTATCTGTGAATTGTGAGGCAGATTTCCCAGAGGTGTATTTGCAAAGATGTCCTGTGCCTCTGGGTGCACTTGTGTCCCTCGGTGTCCCCTTCTGCTGTGTCCCACTCTCATCTCTGCTGTGTCACTTGCACATAAGGGCCCCAGCCCAAAACAGCAGCTGTACCCCCACTCCTGAGCTAGCA-----GGGGTGTGTCCACAGTCCCT-GGGTCTAATCTCAGCTTACTATCTACCTGCTACATGACCTCAGCCTGGTCACTTGGTAG-TCTCTTCAGTA------------------CAGCAGCCAAAGC--------------------------------------------------------TGTACGGGACCAAACTCTAAGTCAATG-----------------------------------------------------------------------------------------------------------------ACGATAAGTAAGTGTTATGATAGTTTCATCGTGAGCCTTTGGATTCTGGGTGGTCCCGATCCCCATCTAGGCAATGTCGGTACCTGATCTCCCCTGGCCCCTCTCCCCTTGTT-------------------AGTACCTGTTCCTCAGTCCCAGGTTTAGGGATAAAATCACATGATGGGAAAAGCCTCCTCCCAAGTGTCCTGATTGACAGTCCTGGTGGGCTAGCCGATGCTGAGCAAG------GTATCATGGGCCGGCTGCTGGGCCTGCATC-CTGAAGTAGTGTGGGCACACGAGAGGTGAGAAAGGGGCCCCAGAGGGGCTCCCTGAGTT-TGAGTGACCCTCCCTGACTGCATGACTTCATGCACACAG-CTATACCTGTCTTTGCCTCAGTT-TCCCCATTTACAGAGCAGGGATGGTGGTGGTGCCTGCTTCCAGGTA-----GTCAGGTTTAAAGGAGTTGGTTCATGGTAA-TTGCATGGAATGGCACTTGGTACATCGTAAGTGTGCAAGGAACAGTCCTATAGTAAGGAGAAGGTCAG---TGAGTACACACCCCCTTAGCTCATGGCTTGCTCCGTCC-CCTGACCTACAACCTCTT--GTCCCTTTCTTGCAGTTTATCTCAGAAATTATCATTGAAGTCCTGAAGAAGAGACATTCCGGGGACTTTGGAGCAGATGCTCAGGGCGCCATGAGCAAGGCCCTGGAGCTCTTCCGGAATGACATTGCCGCCAAGTACAAGGAGCTAGGCTTCCAGGGCTGAGCCATGG------------GCTCCCACTGTCCAGCCCACC-----------AAGCTGGGACCCAGTGTTGTGTAGC-AAGTAGCGTGTGCA-------GTGTTCTAGGTTAGCAGAGAACAGAAGAGGGGAGCATAGTGTGGCATCCACCCACACCCCTGGGG-------------------ACAGGGCTCTGGGCAG-------TGTTACCCTGGAGCCCAGAGGTGCAAAG----TGGCCTTCG-----TCAGCTCTGCC-GGGTCATGCTCAGGTCTCCT-------AAGTCCCAGTCCATTTTCTTC-TGGTTTT------------------------GGGAAAA-TCTCTTTTCCA--CTGTCACATTTGACCC--CAAATCCAAGTCACTGACTAGCAGACCCTGACCTTTGGGCGAGATGGAGGGTTGC--TTAGAGGGAGTGGAGGGTGAAAAC---------------------------------------GGGGC-GGTGAGCATC--AAGTCTC----CCACTGCTCAGCT-TCCCGTTGACCCACCTTGTCTCAATAAAATATCCTGCGAGTCCTCA
Rattus norvegicus
----------CAAGCTTAAGCATCAAACAGTCCACCAGATGAATGTTATCATGGCAGAGAGGCAGAATAGT--GAGGGCAGTGATAGATCTCATCTACGAGCGTGGACTGAAAGCTTAGATAAGGCGGTTGCCTTTGGTTGCAAGGGATTCCTAGACATGCCCAGAAAAATTGCAGATTCTGAGCCTGGGCAAGGGGAAGGGGCTG-----------AGGTTCATACACAGGGCCTCACTGCTCACCCAGGGGCTGCGGCTCTTCCTGG-----------------------AAGACTGATGGGATATGCATGAGTGGTAATTTTTGGCTCTGGGGGGGGAGGGTGCCCTGTTGCTAGCAGCCCCTGCCATGTCTGTGTAGCTGTAGTTTGGAAATGAATTCCTTGCTATTTATGATCTAGGGTGATAGATCCCTCCTGATCTCAATGTTCTTGGATACTCCCTTTGTCCGTTTCTAACCCCTCTGCTTGGGAGCAAAGTTGTCTGAGGTCAGAGTCCTTT---AGGTGGCCATGCTGGAAGGAGGTTCTCTGGCTGCTGGAACACCTGGGTACCTCAGTTTCTTCCTACCTGTCTTTCTCCCCCAACCCCTCTTAAATGCTGGGCCTACAAGTCTGCCCAACTGGACCAACTCCCACAGCAAGTTGGCATTAATTCT--------GGATTTCTTATGTGACCTTGGGCAAGTTGTTTCTGCTTTCTAG----GTCCTATTTCTGCCTCGG--ACTGTATGTCACAGTCTCTGAGACC-CCTGCCCACTCTCATGCCCTGGGATTCGGAGTCC-TACCCTGTATTT-CTGTGG--ACAGCAGATTATCGAGGGCAGGATTGGTAGAACAGAGTTTTGCAAGACTCCAAGGCTGCCAGGGCCTGGGCAGAAACTT--GGGGCTTGAAGGAGACTCAGGCTATTATGCCATTAGAGAGATACTCTTCCCTTCCCAGGGGAGTAATTGAGGCCTACCAGGACCCAGGGCCAGGTGGGGCACAAAGCCAAGGCTGCTGACCCTGGGAGGTCCCAA----GGGTCTAACACCTTGTCATTTTTCAGGCTTGGGTCATGGGGCC----ACACCAGCTGCTCCTGCCCTCTCCT---------GTTGGGGGCACAGGGAAGTGGCCTTCCA------------------CTTTCAGGGTAACAAGTGTCCTATGGCACTGGGCTCCTGGCCTGTCCCAAATCC--CGACCTCAGCCTTGTCTGCTGCCCTC-AGCAAAGTCATTTCTGTAAGTACCTTAGCAGAACTCTGAGGCCTGGTCTGGTAAGCACAGAGTTCTAGAACAAGAAGCAGGCAGCAGAGCTGTGGGCTCCAGATGGTCAGGGCTGTTCCCTACCTCACCTCTCCATGGTCAGATATTTTCTGGTCCATTTATACCTGCATTTAAAAAAAGATGCATTATATTTTTTTTTTTTTTTGGAGCTGGGGACCGAACCCAGGGCCTTGCGCTTGCTAGGCAAGCGCTCTACCACTGAGCCAAATCCCCAATCAAGATGCATTATTTTACTTTCTAAGTATTTTGC--CTGCATATCTGGATGTGAACCATATGCATGCAATACCCAGGAGACCAGAAGAGGGCGTTGGATTCCCTGGAACTGGAGTTCCAGACTGTTGTGAGCTGTCGTAAGGTTCCTGGGAATTGAAC-TTGGATCCTCTGGAAGAGCAGCAGATGCTCTTACTGAGCCATCCTCCCAACCCCTGTGATGACATTTTTTGTTGGGGAAAGGATTTTGGAGCCAAGAAGGTTTGGAAATGACTGTGACAGTGACAGCAACCTTTCACATAATGAGAGAACCATCTCACAGAGTCACAACAAGGGGAG----GGAGTCTCATAGAGCCCTGTGTCGGATCACCTTTATGAACTCAGGTAACAGAGGGGTTTACGGCTGGAGAGATGGCTCAGTGGTTAAGAGCACTGTCTGCTCTTCCAGAGGTCCTAAGTTCAATTCCCAGCAA-CCACATGGTGGCTCACAACCATCTGTAATGGGATCTGATGCCTTCTTCTGGTGCGTCAGTGACAGGATACTCATAAACATTAAATAAATAAATCTTTAAAAAAAACCACACAGATAGCAA---CTTGATCCAAATCACACAGCTAGC-ATGACTGTGGATGGGATTCTAACTTGG----------TGTAGCCCATGCTCTCAGTGACCCTTCTATCTACCAACCCATGTGCTCCTAAGATGGGGTCTGACTGGGCAACTGTTTAGAGGACAAGTAGAGAGGC--AGCTTCCCTTCCCTGACCAGCCAGACTGCCTGTATGCTGAACCTGGCCTCCTAGCAGCAGGATGAAGAAAGGAACCCTGGGAAACCATCCCTGGTCTCTTCAGCAGGAGCTGAATCAAGTGTGCTCACGGGAATCCAAGGGTGTGGGCTCTGCACCTCTCCCAGGACTCTATCTGTGGGAC-----TCAGGGTTCAGGAAGCAAGTGGCTGTGTGGGGCCTGTAGACCTGTACCTCTCCTCCTAGAATGTGAGCAAGCCTAAGTCTATTCCGCCTAGTGCTGGGCAAAGGTGAGGCTGGGGTAAGCCTCCTAGAAGCTAG--GAAGTCAGTTTGGTTTACA-CTCAGTGAGCTAA---GGAAAAAGGTTCGGTGCCTGTTTTAGCATCCCACAGGA-CACTGGAGACTGAGCCAGAGCCAGAGTATA------GTGTCTCTAG-----AGCCTACAGCCCCAAGACCCT----GGTCTTGCCCGCCTCTGGCTCCATTCAGCCTGG--TGACCT-TGGGTT--------CGCCCCATCTGTCAGT--AGGAACTCAAGGGTCATTGTCGCTAGATAATAATAGCCAGGAGTACAGGGACCCTGGACAAGGTGCCAGAGACAAGCTGTCAGTGACGCACACAGCTCTGGATGGGCTCCTAGGGTGAT-GCTGTG-TGCAACTAGGGCCTGGGGACAGGCCCAACCAGTCGTCATGGCACTAGCTTGTGCACATGGCTCTTCACCCACGTGAAGAACAGACTT----------------------CTGAGAATACACAC-CGTGTGTGTG-TGTGCGTGCGAGGATGCTTCTGGGAGTG-------------------------------------TGTGTG-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------TGTGTG------------------------------------------------------------CAACAGAACTGATGACTGACACACACTGAAGTGCTTGAGAGGTGGGAGG-----------ATGAGTGACCATGGACCTGAAAGAGGGGCATAGATTAGCTGACCTGAGAGG-CTGTATCGAG---CACCCATCCACTGGTGAGGTAGGATCATTC-CCACTGTGGAACACTGTGGCCTTTTTGTCCTGCCCTTTCTGGAGACTAATCCAGTCGTG--CAGAGAGGCAGGGCTCCC-----------AGACTCCCTCTGCTGGATGCACACTACACCTCTTAGTTCCTCCTCTGTGACATGATGCAGTCATGGAACTTGCTTCAGGTGTACTGTGGGATTTCAGTCAATTACCCTGCAAA--GTGCTTTAAGGACACGCATGTCCTTTGTGAATACTCTGTGGGAGTCAGTTCATGCTCTCTCACGGA--------------------------------TCCTAACCAGGCTGGTCTTAGAGTGGGGATGGAACAGGGAAGGAACACCAGGCTCCCACATGAACACAGGCTCCAGTGGCCCCTCAAGGGAATGGGACAGCTCAAGAGAAG-AGTTGGGGGCATCAG---ATCACAAACTCAAGGGGTCATAGAGGAAG-------AGGACTC---------------TTCATAGA--GGATGAAGCAGCTTGGT--CCTGGTCATAGGAGGTCCTCCCCATGAGACTGTGGGGCGGAGGCCAGGGGACC--AGTGCTTCC---------------ATATAGCTTGGTACAAAAAGACAAACGTATGTCCTCATAGTGGTGCTGTAAGGGTAATGTAGAGTCTGGGTAGCTATGGTCACCCTGATAAACATGTCACAGTGGGG---------CCCTAGGGGGCATCCAGTGTTAATTCTAGTGCTGCCCAGGCTC-CAGATCTTAGCTCTGACACCACCTGCTGTGACCCAGGCC-AGGCAACTGCCTTTGATGGTGTTT-ATTTGCCCCCAAGTGGATCAGGGACAGGCCTCATCTGGGGAGGTTGGTGGCTCTTGTAGCTCTCATTATCTTCCCAGAGCAGGATTTGGGTCTCTGGTTGGGTAAAACCAGGGCTCATGAGGTGGCAGGCAGGAGCAGCTGT-GACCTTTGCTG-----G-----GTCCCCTTGCTATCGCCAGTGGAAAATGGATAAGGT---TGGATCTGAAGTATATACCAGTATAGAGGGGTCTCATCGGCAGCCACAGATAGTCAGGAGGTCTAGAACTTCAGGTTCTGGCATCTTCAGTCAGGGAGTGTTAGAGGTATCCCTGACAGGAATGGGGTGAGGAGTAGAGGAATAGAGCCACTCCGATGGCTGCCTGCCTTCTCTGCACACTAGCCCTGAGATCCTCCTGGGGACATGTCCTGAGTCCTGTCCCTGTCTATAAAACGGGACGAGTGTGGATGACACTGGAGGTAGGTGAGAGCCCAGCATAGAATCGGACACACAGATGCTTCGCCCCAGTACCCCTCACTGTCTGCTCTCCAGATAACAGAGAAAACGGGGTGAAGATGGAGAGACACCACTGCCCTTCTCTGTAACCAAGAGAAGAGGGTGATGGCATTGATCCTTATGGAAGAAGAGCAGGCGTTTGTCCTGCACGTGCTGTGCCAGACACCGGACGGGTTACCTCGTTTAATCCTTTCCACATCCCATAAGGTCATGTCACCCCCATCCACCAGAGATCCTGAAGCACAGAGGGAGGTGTGGTGCGCTGACCCAGGCTGCACCGCTGGGAGGAGACGCAGCTGCAGTCTAGTGGCCTCCATACCTCTCACTCCATCCCTTCTT-TCCTGTAGTCTATTTAAGGCTCACCCCGAGACC--------CTGGAAAAGTTCGACAAGTTCAAGAACCTGAAATCCGAGGAAGAGATGAAGAGTTCAGAGGACCTGAAGAAGCACGGCTGCA-CCGTGCTCACAGCCCTGGGTACCATCCTGAAGAAGAAGGGACAACATGCTGCTGAGATCCAGCCTCTGGCCCAGTCCCACGCCACCAAGCACAAGATCCCGGTCAAGTACCTGGAGGTAGGGGGCCACTGCAGGAGGCCAAGTCTCCATTTCGATAAGGTCATGTCACGCCCATGGGCAGAGATAGAAATCCCAGCTCAGCTATTCAGTGTGAATGACCC---GCTTTCTCCCCACTAAGC-------CCAGGTCTCCTCAACAACCTGCCCCAGCCCAGCTCCCCTT-GAGGGCACTGAGGAGCCA----GCTTTTGATTTAGTGTCTTGCTCCAAATGCTGGGCCAGCACCTCAGTGTGAAATGTGACCCGATACACGACTTGCCTTCTCCTCGTGAGCAGGAGCA--------------------GGACAGATGAGGAAACTGAGGCACAGG------------GTCACTGAGGCAGGACTCCCACTCAGG--------GTTCTGTCAGAGAATGGTGGCAGCTGAGGGCTCCATTTGTTTCCCCTCAGCTGCCCCACTCCATGACGGTTCAGACACCAGCTGGACAGTGGCCTGAGGGGAGCCCGGCAGCCCTCAG-GGGGTCCTGTCAGTCT-------GTAACTGGAGTCACAGGGCAGGGCAGGGCAGGGC----CAGA-------------CACATAACTCTCCTCACTGCCTGAAGGTGGGGCTGGTCCTGCAGA-------------CCCAGGCTCCACAGACTTTGC--TGTATCTGTGAATTGTGAGGCAGATTTCCCAGAGGTCTGTTTGCAAAGATGTCCTGTGCCTCTGGAGGTACCTGTGTCCCTCGGTGTCCCCTTCTGCTGTGTCCCGCTCTCATCTCTGTCGTGTCACTTGCAAAAGAGG--------CTGAAACAGCAGCTGCACCGTCACTCCTGAGCTAGCA-----GGGGTGTGTCCACTGTCCCCAGGGTCTAACCTCAGCTAATTATCTACCTGCTTCATGACCTCAACCTGGTCACTTGGTAG-TCTCTTCGGTAAGATGCCACCCCTTGGTACAGCAACCGAAGC--------------------------------------------------------TGTATGGGACCAGAGTCTAAGTCAACA-----------------------------------------------------------------------------------------------------------------ATGATAAATTAGCAATATGAGAGTTTCATCAGGAGCCTTTAGGTTCTGGGTGGCCTCAGTCGACATCTGGGTAATGGGGATACCTGATCTCACCTGGCCCCTCTTCCCTTGT--------------------AGTACCTGTTCCTCAGTCCCAGGTACAG-------TCCTGATGGGCGAGCGGGTACTGAGCAAGGTATCTTG---------TCTGCTGTGCCAAGCTTTCATGGGCT----------------GGCTGGATGCTGGGCCTGTGTC-CTGCAGTAACGTGGGCACACGAGAGGTGAGAAAGGGGCTCCAGAGAGGCCCCCTGGGTT-CGAGTGACCCTCCCTGACTACGTGACTTCA-GCACACAGGCTCTACCTGTTTCTGCCTCAGTT-TCCCCATTTACAAAGCGGGGATGGTGTTGGTGCCTGCCTCCAGGTA-----GTCAGGTTTAAAGGAGTTGGTTCAGAGTCA-TTGCACAGAATGGCACCTGGGACATCGTAAGTGTGCAGGAAACGGTCCTATAATAAAGAGAAGGGCAG---TGGGTACACACCCTC--AGCTCATGGCTTGCTCAGTCC-GGTGACCTACAACCTCTT--GTCCCCTTCTTGCAGTTTATCTCAGAAGTCATCATCCAAGTCCTGAAGAAGAGATATTCCGGGGACTTTGGAGCAGATGCTCAGGGCGCCATGAGCAAGGCCCTGGAGCTGTTCCGGAATGACATTGCTGCCAAGTACAAGGAGCTGGGCTTCCAGGGCTGAGCCATGG------------GCTCCCACTGTCCAGCCCAGC-----------AAGCTGGGACCCAATGTTGTGTAGA-AGGTAGAGTGTGCT-------GTGCCCTAGGTTAGCAGAGAACAGAAGAGGGGAGCATAGTGTGGCATCTACCCACACCCCTGGGG-------------------ACAGAGCTCTGGGCAG-------CATTGTCCTGGAACCCAGAGGTGCAAAG----TGGCCTCTGCTTCCTCAGCTCTGCT-GGGTCATGCTCAGGTCTCCTGTCACCTAAGTCCCAACCCACTTTCCTCCTGGTTTT------------------------GGGAAAAATCTCTTTTCCA--CTGTCACATTTGACCC--CAAATGCAAGTCACCAGCTAGCAGACACTGACCTTTGAAGGAGACAGAGGGTTAC--TTAGAGGGAGTGGAGGGTGGAAAG---------------------------------------TGGGCAGGTGAGCATCGGAAGTCTC----CAGCTGCTTAGCT-TCCCCCTGACCCACCTGGTTTCAATAAAACATCCTGCAACTCCTCA
Physeter catodon
------------------------------TGAGCCCAATT--TTTCTGCCCTGTTTAAAGCGAACCTGG--CTCTCTCTCTGGCTGGCAGGTGCTGAGAGTGGAGGGTGAGGT---------GGGGGTCAAGCCTTA-GAACTTGGAGGCTGAAAAATCAGCTAGACCA----ACCCTTTTGGTTTACAGATA---GTTACGCCA-----------AGAC-C---CAGAGAAACCGAG---TGACTTCTTCAATGTCACACAGCACATCG---------------------GAGGCAGAGCTGGGGC-CAGAACTCAGCTTCCATAATTCTCAGCGCC---GAAGTCTTCCTCCCAGACATCCAGG---GAACTTCCAAAATGTTTGAGATCCTTGGCCGCCTTGGGTCCCAGGTCCTTGGCCCTCTTCTGCCCCAGACAGTCTTCCTCCAGTGTCCCGACCTCAGGCC--TTC--GCCTGTGGGTCCTGGGTCTTGCATCTGTCAGGCAGGAGT-TCA---AGGATCATTGTCACCAGATAA-------TAATAGTCAGGAGCTCAGGGACCCTTGCGTGAAAGGTGCCAGGGATGA-GCTGTCAGTGATGTACAAGGTTTGGCCGGGGCCCCAG------ATGAGGTGGTGACCTT------GGGCATGGCTGTGGATAACCCTGG-GTGTTCTGGGGGAGTGGGAGGGGTA---CGCCCCCTGGGAAAGCGAGTGCTTCCCGTCCGT--GCTACGTC----GTGCA-TGCATGTGCGTGTGTGTGCACGC-GCCTGTGACTATGGGTGTGTGCCATGAGATACTTGTGC--ACACTGGAGTGT---GTTCACA-----------GTTAGGTT-TAAGCGTGTCGCAAAGCCAGGAAGCTACAAAAGAG----GAACCTTTAAAAGAACTGAAGCCATTAGACAGAAGAAGGGG--------GAGTCAGAGCAGACAGGGTCACCCCTGC-AGAGGTTTCAGG---GTGCCACACAAGGGAGAGAGCGTGAGTTATGGAACCATTCA--TGTGGGTTTGTGTCTGGGCTCTGCCACTGACCAGACATGTGACCCTGGGCAAGTCACCTACCTCTGTGGTCTTCCGTT-TGCCCACGAGTGAAATGGGGAAAATCACTCCCGCC--CAGGTGGCAGGTATGACTCCCTTTAACTAACTTCCACCTTTCCGCAGCAGGA--AAGACTGAGGTCC--AAGGCGACAGGTGGCTGCTCAGCTCA---CTGTGGCCAGCTCACTGGGACCCTGGCAGGAGCG--GGGACCTGGTCTTTAGCGG-TGGCTCCCACCAGGGACCCTCATCCTCGGTACTGTGAGGGGCCAGGGAGGTGCAGGGAGTCAGAG--------CAAAATAAGCAAGAGCTTTCTGAGGT--TTCTGA--GCTC-AGACTGGAAGGGCTC-------------TCACAGGTAGCAGTGAGTCTCCTGGAGCCAGATGTATTGAAGAAGCCAAAGGGCCATCCCCAGGC--------------------AGGAAACTGCAGAGACGGTCCCCTGC-ATGGGCTACTTTTTGAGTCACTCAGACCTGATTTCAGTCTGGCTCTTCTGCACCCCGGCCGTGTGATATTGAGCTAGGACTTCACATTTCT-TAGCCTCTTTTCTCTCATCTGTAAAGTGGGAATAATATGTCCCTAAAGGGAAGCACAAG----TGCAGAGCCTAGCACGGGCCTGGCACACAGCGGGTGCTCAAGAAATCCAGG-TCCTTTGGCCCCAGT-GCTCCCCCTTTTCCCTGCTCCACTGTGACGCCATTTAGTCACCTA-----TGCTCCCAACAACTTAGAGGAAATGGGG----GTGA--AGCCAGG-TGCAACGCAGAGATGTCTTTCTT--TCCTGCCTTTCACTGTAACAA-TGAA-----GAAGAAGATGACGATAAAAATATT--TAATAATAACAAGCATT-TATA---CACTTCTTGTGT------CAGCATATACTG-----TCTCATT-----TAATCCTCACG-CTTCTATAGGGAGGGACTTGTTATCTCCATTCCTTGGATGAGGAGACT----GAGGCACAGAGAGGTGAAGTGA---GTTGCCCAAGGTCACATAGCTGGG-A--ACTGGGA---GAGCCTGGGCTTGAAC--------TCGGGTCTGGCCACCTCCAGAC----CCCTGCCTCT--------GACTCCAGGCCTCCTTGTGCCCA-CAGGCTCTTTAAGAGTCATCCCGAGACCC-----TGGAGAAATTTGACAGGTTCAAGCACCTGAAGACAGAGGCTGAGATGAAGGCCTCAGAGGACCTGAAGAAGCATGGCGTCACCGTGCTCACTGCCCTGGGGGCCATCCTCAAGAAGAAGGGGCATCATGAGGCGGAGCTGAAGCCCCTGGCCCAGTCGCATGCTACCAAGCACAAGATCCCCATCAAGTACCTGGAGGTGGGTGGCAGGGACGGGGGCGGCAGGGACGGGGGCGGCTGGGAGGATGGAAGGAGAGGCCT---CAAGGGCATGAGATTTGGGTTCAAGTCCCAGCTCAGCCACTAACTTGCTGGGTGACCTATGTCGTTCCTCTCTGTTCCCCCATTTCCTCATCTGTAAAGTGGGCTGCCACCCACTTTGCCTTCCTCCTGGG-TCATTGAGAGTCAACA-------TATTTC--------TCCCCCAAATTAGTATCTTGTTACCAAGGAGGAAG--------TGGATATGGG-ATGCACAACTTGCATTCTCTCCACAGGTAGGTACCATTATCACACCCACTCGACAGATGAGGAAACTGAGGCACAGAAAGGTTAAGTAGCT-TGTCCCTGGTCACAG-AA----GCAGCAAATGGCAGAACCAGGATTCAGACT-AGGTCTCTATTCCTCCAGAACCCC--GGCTCCTAATCCCTGTGACACTGGGAGTCGGAGATATAC--AG--GTCTCGT-TGACAGGAGTTCA-GCAACTGGGGCTCAATTCCAGGTGGGAA-------------------T--TATGAGTTCCTACATCTTTACAGGTGAAGAGAGAAAGCCTCAGGTGTCAGAGGCATAAGCAATTGGTTTGAACCCCCTGTGATGGGCTGAGAAGTTG-GGTGGGCTGGGG---------AGAGCCTGAGCTTCGAGTCCTATGTCTA-------------------------------------------------------------------------------------------------CAGGTTGGTGTAGCAGTGAAGGTAAGGTTACTTATGATAGCTACCAGTTAGGATGGGCCATGATAAGCTATTTGGCTGGGTCACT-GCATGGAATTCTCACGATT-TCCCCATGAACTTAGGAATTGCATTCAGTGT--TTTCTTTCCCTCTCACATAAAAGAAGTTCAGAGGTAGGGAGCCCAGAGCTGGTATGGAAGCCCCATAGTCACCAGGGATGCAGGTTCTTTCTGTGTTGCCAT--CTTTGGCATG----TGGTGTCCACCCTCAAAGTTGACTCGTGGTCCAAGATAGCTGCTGGAGCTCCAGCCATCTGGTCCATATTCCAAACAGA--G--CAAGAGGAAGAGAGGAAGTCT-CCT---------TTTAAGGAGCCCTGGAAATCTCATCCAACAATGTTTTTTACATTTCATTGGCCAGAAGTTAGTTACAAGACCACATCTATCTGCAT--GGAAAGGTGGGAAATGTAGCCTATTA---CCACTCCAAATACAG-CTAGTGTTCTCTTAC-AAGGAAGAAGGAGAAAATGGACATTGGGCAGGCAG---TTAGCATTCTCTT--CCCCAGGTAGGTACTCTTCTCATCCCCATTTTA---CTGATTTGGAAACTGAAGCCTCAGAGAGGTTAAGTAGCTTGCCCCAGGTCACCCAGCTTATAAGCCAGGAGGTACAATCTGATCGATGGTACTCTGGAACCCAA---GCTCTTAATCGCTTGTGGCACAGCATTTCCCATGAACAACCACCACCACCA------------------------------------CCTCC---------------------------------------------------ACCCTCT------------------TCAGCCTC-------------------------------------------ACCAC--------------------------------CCTGCAC--------------------------------------CCCTACCTGC-TC-CCCAAGTCACACACAAACAATCCTGCCTGCCCATCTCAGCACCTCAGACATGAGAAGTGGGTCCTGCCTCTGAGA-----TTCTGCACGAGCGGTGGTCCCTGCCCTCCCCTTCTTCTCTTGGCAAATCTC-TGTTCATCCTCCAAGGCCCAGCTCAGAAGCCACCGCCTCTGTTCTGG---CCCA-------TGAGGCAGTCAGCTGCTCTATCTGCCAGCCCTCCGCT--GCGA----AGGCA-TTTATCCTCACCGCT-GCCATGGCGCCTGCCCGGCTGTATTGTAATTTGTCTCTTTACCCAGCTGTCTC-CTGGCTGGGACACGAGCTCCTCAAGGGCAGACCCTGGCTC-CTCCTCTGTCTGAATCCCAGTGGCATGTGGAGGTGTTCACA-ACACAACTTGAATGAT-GGAT-----------------------CCCCTCTCTAGGGATGAGAAGAC--CGTC--------------------------------T----CAGAGAGGGAAAATGATATAGCCAAGGTAGCA--------------------CTACCAGCTAGCAG--ACTAGAACCCAGGTCT--------------------------ACTTAAAACAGGTTTGA--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------TTTACTGCATAATCCCTGCCAAAGAGTTAAGACAGAGCCCC-ATGCCTGCTCTTTC--CCTCACCATGGAACCTTCGTCTGTGTTCTTCCTAAGAAATGTCTCCCTCTCACCTGTGAGTG--------------------AGGGGGCAC-AGAGGGCGCTGGGTGCTAAAAGCAACTATCTTTACTCAATTTTCCAGCTGCCCCAGCCCAGCAATTTTCTAAGTATTTGTGGCTCTGCCTCTGACTTAGGAAAA-CTGAGTGGGGGGACCTAATGGGGTCAC--GGGCCCCTATTTTACAGGTGAGGTCACACATGGAGGTCAGGG---AGCTGGGCCTGGAA----CTTGGCTCTCT--AAGTTCTCCATGGGCAGGGCTTGCTG-ACGTCAGTCCCTCCCCCAGCTTCGTTTTGTTGTGTTTATGGCATTAAAAGGGAA-TTAAACACATGTCCAAATCTATGCCCAGCAGCAGGTGTTTGTGGAGCTGTCCCTGTTTCTGAGCCAAGCAGGGTTTACTAGATGTTGACATTTGTGCT-TCTGAGGCTGCCCTCAGTACATGCTCCCCCGACTCACCCAACCCCACCCTGCTTTTGGCCACTTGCACCCCGGGCGCCG-------------------GAGAAAATGGGCAAACCCAACCGCGGT-----------GCCCTCTCTGTCTCTGTCCCTGGCTCTGGCTGTC--ATTTGTCCACGATCCCCTGGGT------------------------CCACTTGCTACGTGACCT--CAGTTTGGG-----------------TCTCTGTAAAA--TGACACC-CTTTGACCA--CAGGCCAGGGTA------GTGATTGGCACCAAATCTTTCTGAACCGTTGAAAAAGCAC-AGTAGCTTCTTACAGACACAAGATCTTTCAGTCTCT-AAATCAGCCCCG-TGCTGTAGA----------GGTGCCTGCGTGCATTTTCCTGGTAATGACATTGTAGACGGTAAAGCAAACAGTGAAGGTGAGGCTGGTGTAAGATACAGTTGGTCCTGGGGCTGAGTTGTAGGCCTGATCCCTTTCCTCTTCCCTTCTCTCCTTCCCTCCCACCTATACCTCTGAGTACCAGCT--CTGTGCTTGGCACAGAAAATATGATGTTCGGACCCCCTGCCC------AAACCCCAAAAAGGCTATTTAAAGCAATCACCATCCTAAGCCATT---AAATGCAAAAATTAAATACACACAAATGGAAA-TAGTTTCT-------------------TTCTAGATT--------------------------------------------------------------TTCGGAGTGCCAAATCCTGGATAAAA----------------------------------------------------------------------------------------------------------------------------------------------CCATGGAGT---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------AGGAGTGTTTCCC--------------------------------------------------TTCCTT--TCCGAG------------------GCTGGGT--------------CCAGCCCACCTAGCCACTTGACCGTGGTT------CCACGGCCTGACCTTTCCTCATCTGTACCAAGATGGGCCTCCGTTTCCCCATCTGTACCTGTCTCACGGGG---TTGTGGGGATTAAAGGAGTTAATCCATGGAGGGCCGCTCAGCAGCGTGCCTGATATATAGTAAGTGCTCAATAAACGACCCTGGAATAAAGAGAAGATGGG-TTCGGGCTGACTCCCATCTGGCCCCTGGCTTGTCCTGCCC--GTGACCAGTACCCCTCT-GCCCCTGTCTTTGCAGTTCATCTCGGAAGCCATCATCCACGTTCTGCACAGCAGGCACCCTGGAGACTTTGGTGCCGACGCCCAGGGAGCCATGAACAAGGCCCTGGAACTGTTCCGGAAGGACATCGCTGCCAAGTACAAGGAGCTGGGCTACCAGGGCTAAGCCCCACGTCCCACGCCCCACGCCCCTCTCCCCACCCACCTGGG-CAGGGAGGGAGGGGCCTTAATCTTGTGTAGC--TGTAGGGTTTGCTTCTGAGTGCCTGCTTTGTTTGTGAGAGGCCGGTGGGAGATGCCGGAGGGGCTGGGGGCAGGGAAGGGATGAGGGTTTGTTCAAGTGGTTTCACGTGCCTTCCTGTGGGGAGGGTCGTCAGCTTGGAAGCCAGGAGAAGCTTGGCTCTC--CCCACTGTGGCCCGCAGAGTTTGGGTCACACTCAACTCCCCTTTCACCTAAATTCCAACCCAATTCCTTCCCACCTC-------------------------ACCTTAAATCCCCAATCCAAACCGTCAAGCTAAACTC--CAGCCATGATTCTCTGATCC-ATCACCTCACCCCGTGAAGAAAGCAGAATGTCCC-CTTGCACCGAGAAGG---TCTGGGCTGGGTTGGACAG-----CCACACCCAGCCCCTGGGGCGCGTGG--------TGTCTGGAGCATAAGTGTGCCTTCTCAGGTGATGGAGCGACTCACCTGGCTT-AATAAAAGATT-CGCA-TGCCAT-
Equus caballus
------------------------------TTAGCCAGGTT--TTCCTGTACTGTTTAAAGTAAGTCTGGG--CCCTTCTCTGGCTGACTGTTG-TGAGAATGGCAGATGAGGC---------GGAGGTCAAGCCTTA-GAATCTCGAGGCTGAGAAATCAGCTGGACCA----ACCCTTTTGGTTTAGAGATG---GGTGAACCAGAGAGGCTG--AGCC-C---CAGAGAGGCTGAG---CAACTTCTTCAATGTCACACAGCACATTG---------------------GAGACAGAGCTGGGGC-CAGAACTCAGCTTCTGTAATTCTCAGT-CC---CATGTTCTTTCCCCAGACATCTGAG---GTGCTCCCAAAATGTCTGAGATCCCTGGCTCCCTGGGGTCCCAGGTCCTTGGCTCTCTACTGCCTAAGATAGTCT-CCTTCAGCGTCCCAACCTCAGGCC--TTT--GCCTGTGGGTGCTAGGTCTCAGAGCTGTCAGGCAGGAGT-TTG---AGGATCATTGTCACCAGATAA-------TAATAGTCAGGAGCTCAGGGACCCTTGGGTGGA-GGTGCCGGTGAGGA-GCTGTCAGTGACACACGAGGCTTGGCTGGGGCCCCGG------GCAAGGTGGTGACATT------GGACGTGGCTGTGGACAGGCCTGG-GTGTGCTGGGAA-GCGGGATGGACG---TGCTCCTGGGGAA--GGAGTGCTCCCTGTCCGG--GCTGCTTGGCGCGTGCT-TGTGTATGAGTCTGAGTGCACAT-GCACACAAGTGTGGCTGTGCGCTGCGAGACACTTACGC--AAACTGGAGCGC---GTCCCCA-----------GTCCG-TT-TAAGGGTGTGGCAAAGGCCAGGAGCTGCAGGAGAG----GGACCTC--------CCGAGGCC-CTAGACAGAAGGAGAGA--------AACCTCGGGTAGACGGGGTCAGTCCTGC-GAACATTTCAGG---GTGCGACCCAAGGGAGAGAGCGTGAGTTATGGACCCATCCAATCTTGAGTTTGGGTCTGGGCTCTGCCACCAACCGGCCGTGTGACCCTGGGCTGGTCACCTCCCTCTGTGGTCCTCTGTT-TGCCCACGAGTGAGATGGGG---ATCCCTCCCTCC-------------CAGGACCCCCTGTAACCAACACCCAGCAGGAGAGATGAGGTCCAAGGCTGGAGGTG--GCCGCTCAGGCTTGCGGGCAGAGTCCG-TCTTCGGCCAGCTCACTGGGTCCCTGGCAGGAGCG--GGGGCCTGATCTTTGGCGGGCGGCTCCCACCAGGGCCCCTCACCCTCTGTACCGAGTGGGGCT-GGGAGGTGCAGGGAGTCAGAGTGTGTGCTCAAAATAAGCAAG-ACTTTCTGAGCC--AGAGAGCTGCTCTGGAGCGAAATGGCTC-------------CCACAGGGAGTAGTGAGCCTCCTGTCGCTGGGAGCATGTAAGCAGCCAAAGGATGATTCCTGGGC--------------------AGGAAGCTGCAGACATGGTCCCTGGC-ATGG---------------------ACCTGATCTCAGTCTGGCTCTTCTGCACATCAGCCCTGTAATACAGAGCTAGGACATCATACCTAC-GAGCCTCTGTTCCCTCGTCTGTGAAATGGG---AATATGACCCTACAGGGAAGTGCAGG----AGCAGAGCCT-----------GGCACACAGTAGGTGCTCAACAAATCCAGGGTCCTTCAGCTCCTAT-GCTCTCCCCT-CCCCTGCTCCATCCTCAAGGCCACT-GTCCCCTA-----CTGTCCAAATAATGTACAGAAAGTGGGG----GTGA--GGCCAGGGCACAATGCAGAGACGTCTTTCCT--TCCTGCCTCTCACAGTAAGAG-TGAAAAGAGGAAGGTGATGATGCTGATGACAAGAATAATACTAGCCAGCGTTCTACCACCTGCCGGTGGGCTGGCACTCAGCATACACCA-----CCTCACT-----TAATCCTCACAGCCTCCAAAGGCAGGGACTGGTCATCCCCACTCCTTGGACGAGGAAGCG----GAGGCACAGAGAGGGGAAGTGA---ATTGCCCAAGGTCACACAGCTGGG-A--ATGGGGGATTGAACTCGAGTCCGG----------TCGCCTCCAGACCCCACCGCACGT--CCCTGCATCT--------GACTCCAGCCCTCCTTGTGCCCA-CAGGCTCTTTACCGGCCACCCTGAGACCC-----TGGAGAAGTTTGACAAGTTCAAGCATCTGAAGACAGAGGCCGAGATGAAGGCCTCCGAGGACCTGAAGAAGCACGGCACTGTGGTGCTCACCGCCCTGGGCGGCATCCTCAAGAAGAAGGGCCACCATGAGGCGGAGCTGAAGCCGCTGGCCCAGTCACACGCCACCAAGCACAAGATCCCCATCAAGTACCTGGAGGTGGGAGGC----------------AGGGCCGGGGCGGGCGGGGGCCTGGAAGGACAGGCCT---CGGAGGGAGGAGATTTGGGTTCGAGTCCCAGCTTAGCCACCAACTTGCTGGGTGACCTAGGTCGCTCCTCTCTGTGACCCAGTTCCCTT-CCTGTAAAGTGGGCTGTCAGCAGCTTTGCCTTCTTCTTGGGGTCACTGAGAGGAAACA-------TATCTC--------TCCCCCAAATTGGTGTCTCCTTACCAAGGAAGAAGGGGGAGAGTGGATATGG--GTGCACAACTTGCATCCTCTCCCTAGGTAGGTATCATTATCACACCCATTTGATGGACGGGGAAACTGAGGCACAGAGCGGTTAAGTAGTT-TGTCCGTGGTCACAG-AG----AGAGCAAATGGCAGAGCCAGGATTCGAACT-AGGTCTCAACTGCTCCAGC-CCCT--AGATCCTAACCTCCGTGCAAC-GGGAGTCTGAGACGGGC--AGC-AGTTCAGGTGGCAGGAAATCA-GCTGCCGGG-CTCAGTTCCGGGCGGAGG-------------------G--TGGCCAGCAGAATGGTTCTATTGCTTTCAGTA--TCTTTACAGATGA-AGAGACGGAAAC-------TCAGGTGTCCTA--GTGGAG-GAGGGGATG-TG-GGGACGGGG---------AGCACTAAGGGTCTGAGGGACACAGCAGG------------------------------------------------------------------------------------------------CACGCTGGTGCAGCAATGAGGATAAGGACAATGGTGACAGC----------------------------ATTTGCCTGAGTCACC-ACATGGAATTCTCACAACT--ACCCATGAGGTTAGGAATTGTGTTCAGTGT--CTTCTTTTCCTCTCACA-----CGAGTTCAGAGGTAGGGAGTCCCGAGCTGGTGTGGGACCCCCAGGGACACAAGCAAT--------TTTCTGTTCTGCCATGTCCTTGACATG----TGGTGTCCATCCTCAAAGTTGACGCATGGTCCAAGATGGCTGCTAAAGCTCCAGCCATCTGGTCCATCTTCCAAACAGG--CAGC-GGAGGAAGAGGGGAAAGCTCTCTCTTAAGGAGTTTAAGGAGTCCCAGAAATCTCATCCAACAATTTTGTTTACATCTCATTGGCCAGAAGTTAGTTACAAGACCACCCCTATCTGCAC--AGGAAACTGGGAAATGCGGTCTATTG---CCACTCCAAATACAA-CTAGTGTTCGGTTACCGAGAAAGAAGGAGAAAATAGACATTGGGCAGACAG---TTAGCGGTCTCTG--CCCCAGGCAGGTACCCTTCTCACCCCCATTTTA---CAGATTTGGAAACTGAGGGCTCAGAGAGGTTAAATAACTTGCCCAAGGTCACCCATTTAATAAGCCAGAAGGTACGGTCCCATTGACGGCACTCCGGAACCCAA---ATTCTTAATCCCT-GTAGCACAATATCTCCCAT---CAACCACAACCCCCC--------------------------------------------------------------------------------------------AGCCCCT------------------TCTGTCTT----------------------------------------------ACTC------------------------------CCTGCAC--------------------------------------CCGCGCCCCCATCACACACA-CACAGTCTTGCACGTCCCCCCGTCCCACTCAGCATCACAGATGCGAGGAGCCAGTACTGCCTCAGAAA-----TTCTGCATGCAC--CGGTCCCTGCCCTCCTAGTCCTCTCTTGGTAAATTCC-CCTTCATCCCTCAAGGCCCAGCTCAGAAGGCACCTCCTCTGTTCCAG---CCCAG--GCCCCGGGGCCACCAGCTGCTCC--CCGCCAGCCCTCCCCCC-ACCG----AGGCACTTTGTCCTCGTTCCC-GTCGTGGCGCCCGCC-GGCTGT-CTGTAATTTATCTCTGTGCCCAGCTGTCTT-CCGGCTGGGATGC-AGCTCCTCAAGGGCAGACGCTGGCTCACCCCCATTTCTGAACCCCAGGGGCATGCGACTG-GCTCACC-ACACAATCTGAGTGAC-CGGT-----------------------CCCCTCTCTAGGGATGGGGAATCTGAGGC--------------------------------T----CAGCAAGAGAGAACGATGTTGCTGAGGTGGCA--------------------CTACCAGCTAGTGG--ACGAGCACCCAGCTTT--------------------------AAC-AGAACAAGTGTAA--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------TTTACTGTCAAATCC-TTGCAAAACCTTAAGACCGAGCC---ATGCCTGCT-CGCC--CCTCGCTGTGGAACCGCCTCCCGTGCCCTTTATAAAAAATTGCTCCTTCTCCCCTGGGGGTG--------------------AGGGGCCCC----GGGGGCTGGG-GCTAAGGGCGA----CTCTTCTCGGTCACCCAGCTGCCTCTTCCCA-CAGTTTTCAAGGGACTAGTGGCTCTGCCGTAGGCTCAGGACAG-CTGAGCGGGGGGCCTGATGTGGTCCTC--GAAGCCCCGTCTGACAGGTGGAGTCCCACACTGGGGTC-----------GGGCCTC-------CCAGGTTCCCC--AAG-----GGCAGG-----------------CACCCCGCGCCCCCACCTGGTTTTGTAGCATTTGCGAAATTATAGGTCATCTTAAGCACATGTCCACATCGGTGCCCGAAGGTGGGTGTTCGTGGAGCTGTGCCTGTCTCTGAGCCCAGTGGTGTTTCCTGGGTGTCGACACTTGTGCTTCCTGATGGCTCCCTCAGCACAGGCCCCC----------------CAACCTGCCTTCAGTCCCTTGAACCCCAGGCGCCCT------------------GAGAAAATGGG-AAACCCAGCCGCGGT-----------GCAGTCACAACCTCTGTCCCTGGCTCTGGCCGAC--ATTTGCCCACAGTCCCCTGGGTGCAC-----TGCCAGCTCTGCCGTCCACTTGTAACATGACCT--CAGTCTGGCTGCTTTCCC-ATCTGGGCCTCTGCAAAA--TGTCACCACCTGGCCCA--CTGGCCAGGCAT------GTGGTTGGCATCGAATCCTGATGCACCGTTGAAAAAACCCCAGCAGCTTGTTACATCCACGAGCTCTTTCGGTCTCT-AAGGCAGCCCCA-TGTCGTGGC----------TATTCCTGTCTCCGTTTTCCTGGGAGTGAAATTGGGCATGTAAGGGCAAACTGCGCTGGTGGGGCCGAGGACAGACGTTATT-GTCCCGGGGGCAAGCCCGAGGCCTCGTCCCTTTCCTCTTCCCCTCTCACCTCCCCTCCCACTGATATTTCTGAGCGTCCACT--CTGTGCTTGGCACAGAAGCTATGATGCTCTGCTCCCCT----------AACTCCTAATTAGGCTGTTTAAAGCACGAACCATACTAAACCATT---AAATGCAAAAATCAAGTACACACACATGGAAA-TAGTTTCT-------------------TTCTAGATT--------------------------------------------------------------TCCTGGTCGAAAAATCCTGGATAAAA----------------------------------------------------------------------------------------------------------------------------------------------CCGCGGAATGGGACATTCTCT-----------------------------------------------------------------------------------------------------------------------------------------------------------------TCCTTTCCGCTGCCCGTGGCGGCTGTGCC-------------------------CTGAGCTCGTGCCGGGCTGTACTAGGTCCTCGTTTAGAGCAGCAG--------CTGAGGAGGGCAGTGGG----TGAGCCAGCCCACCTGTCTGCGTGACCTCGGGCA------CACGGCTTAACCTT-CCTCATCTGTACCAAGATGGGCCTCAGTTTCCCCATCTGCACCTGTCTCACGAGG---TCGTGAGGATTAAAGGAGTTAACCCATGGTG--------------------------------------------------------------------------------------ATCAAGCCCC-----------------ATGACCGGTGCCCCTCTTGTCCTTGTCCCCACAGTTCATCTCAGATGCCATCATCCACGTTCTGCACAGCAAGCATCCCGGGGACTTTGGCGCTGATGCCCAGGGCGCCATGACAAAGGCCCTGGAGCTGTTCCGGAACGACATCGCGGCCAAGTACAAGGAGCTGGGCTTCCAGGGCTAA---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Bos taurus
------------------------------GAAGCCAGGTT--TTTCTGCAGTGTT-AAAGTAAATCCGGGGTTCCCTCTCTGGCTGACAAGTGCTAAGAGTGTAGGGTGAGGTCCGGGTTGGAGGGGTCAAGCCTTA-GAACTTGGAAGCTGAGAAATCAACT--------------TTTTGGTTTA-AGATA---GTTGATCCA-----------AGAC-C---CAGAGGAGCTGAG---TGACTTTTTCAACAT-------CACAGCA---------------------GAGGCAGAACTGGGGC-CAGAACTCAGCTTTCATGATTCTCAGTGCT---TA-GTCTCTTCCCCAGACATCCGGG---GGTCTCCCAAGATGTCTCAGACCCCTGGTGGCCTTGGATCCCAGGTCCTCGGTCCTCTTCTTCCCCAGAGAGTCTCCCTCCAGTGTCCCGACCT---------TC--ACCTGTGGGTTCTGGGTCTTGCATCTGTCAGGCAGGAGT-TCA---AGGATCATTGTCGCCAGATAA-------TAATAGTCAGGATCTCAGGGACCCTGAGGTGAAAAGTGCTGGAGATGA-GCTGTCAGTGATGTAGAAGGGTTGGCCGGGGCCCCAG------ATGAGGTAGTGACATT------GAGCATGGCTGTGGGCAGCACTGG-GTGTTCTGGGAAGGCAGGAGGGACA---TGCCCTTGGGGGAAGGGGGCGCTCCCTGTCTGT--GCTATGCCC-GTGTGTG-TGTGTGTGTGTGTGCATGAGCTT-ACGTGTGATTATGGGTATGTGCCGTGAGACACTTGTGC--ACACTGGAGTGT---GTCTGCA-----------GTTAGGTT-TAAGGGTGTTCTAGAGCCAGGAAATCACAAAAGAG----AAAGCTTTCAAGGAACTGAAGCCATCAGACAAGAGGAGGGA--------AAGTCC-AGCAGACAGGGTCACTCTGGC-AGAGGTTTCCGG---GTGCAACACAAAAGAGAGAGCATGAGTTATGGAACCATTTG--TGTGGGTTCGTGTCTGGGCTCTGCTACTGACCAGACATGTGACCCTGGAAAAGTCACCTACCTCTGTGGTCTTCCCTT-TGCCCATGAGTGAAATGGGAAAAATCTCTCCCTCCTCCCCGTGGCAGGTATGACTCCCTTTAAC-GATCCCCACCTTTCTGCAGAATGA--AAGACTGAAGTCT--GAGGCAACAGGTGACTCCTTGGATCA---CTGCGGCCAGCTCACTGGGTCCCTGGCAAGAGTG--GGGGCCTGGTCTTTGGCAGATGGCTCCCATTAGAGATTCTCACCGTCAGTACTG-----------GGAAAAACAGGGAGGCAGAG--------CAAAATAAGCAAGCGCTTTCTGAGCC--TGAGAGCTGTTC-AGACTAGAAAAGCTG-------------TCACATGGAGTAGTGAGCCTCCTGAAGCTGGAAGTATTTAAGCGGCCAAAGGATGATCCTCGGGC--------------------AGGAAGCTGCAGAGATGGTCCCCTGC-ATGGACTACATTTTGAGTCACTCAGACCTGATTTCAGTCTGGCTCATCTGCAGCCCAGCTATGTGATATTGAGCTAGGACCTCATACTCCT-TAGCCTCTGTTCCCTCATATATAAAATGGGGATAATACATCACCAAAGGGAAGTGCAAG----TGGAGAGCCTAGCGTGGGCCTGGCGCACAGCACATACTCCACAAATCC-------TTTAGCCCCCGT-GCTCC------TCCCTGCTCCACGGTGACAC-GTCTAGTCATCTT-----TGCTCCCAACAGCTTAGAGCAAATGGGG----GCGA--GGCCAGG-TACCATGCAG--ATGTTTTTCCT--TCCAGCCTTTCACAGTAACAA-TGAA-----GCAGATGATGATGCTAAAAATA------GTAATAGCAAGCATT-TATAGGACACTTCCTGTGTGACCCTCAGCCTATACCA-----TCTCATTCAATTTAATCCTCACA-CTTCTATAGGGAGGGACTCGTTATCCCTATTCCTTGGATGGGGAGACT----GAAGCACAGAGAGGTGAAGCGA---ATTGCC-ACAGTCACACAGCTAGC-A--ACTGGGA---GGGTGTGGATTTGAAC--------TTGAGTCTGGCCACCTCCTGAC----CCCTGTGTCT--------AACTCAAGCCCTCCTTGTACCCCTCAGGCTCTTCACAGGTCATCCCGAGACCC-----TGGAGAAATTTGACAAGTTCAAGCACCTGAAGACAGAGGCTGAGATGAAGGCCTCCGAGGACCTGAAGAAGCATGGCAACACGGTGCTCACGGCCCTGGGGGGTATCCTGAAGAAAAAGGGTCACCATGAGGCAGAGGTGAAGCACCTGGCCGAGTCACATGCCAACAAGCACAAGATCCCTGTCAAGTACCTGGAGGTGAGTGGC----------------GGGTCCCGGGTGGCTGGGAGGATGGAAGGGGAGGCCT---CGAAGACCTGAGATTTGGGTTCAAGTCCCAGCTCAGTTA-TAACTTGCTGAGTGACCTATGCTGTTCCTCTTTATCCCCCAGTTTTCTCATCTGTAAAGTGGGCTGCTACCCACTTGGCCTTCCTCCTGGG-TCATTGAGAGTAAACA-------TATTTT--------TCCCCCAAATGAGTATCTTATTACCAAGAGAGAAG--------TGACTGTGAA-GTACACAGCTTGCCTTCTGTCTACGGGTAGGCACCACTATCCCACCCTCTCGACAGATGAGGAAACTGAGGCACAGAGAGGTTAAGCAGCT-TGCCCCTGGTCACAG-AG----GCACCAAGTAGCAGAACCAGGATTCAGACT-AGGTCTCTATTCCTCCAGAAACCC--AGCTCCTAGTCTCTGTGCCACTGCAAGTTGGACATACAC--AG--GACTCATGTGGCAGGAGTTCA-GTAGCTGGGGCTTAATTCTGGGTGGGAA-------------------T--TATTAGTTCCAGCATCTTTACGGATGAAGAGAGGGCGCCTCAGGTGTCCAAGGCATAAGCAATTGGCTTGAATCTCCTGTGATGGACTGAGAAGTTG-GGTGGGCTGGGG---------AGAGTCTGAGTTTCAAGTCCTGTGTCTAGACCCAGTGGGTGGAGGGATATGGCAAGAAGGA--GAACTAGGAAACTGGGGGATAAAAGGCCCTGAGGCTATCACCCCCGAGGATGGGGCCACAGGCAGTTTGGTGCAGCAGTGAAGGTAAGGTTAGTCATGAAAACTTCTGGTTAGGATGGGCTATGAGAAGCAGTTTGCATGGGTCACT-GCATGGAACTTTCACGATTACCCCCATGAGCTTAGGAACTGCATTCAGTGT--TGTCTTTTTCTCTTACA--AAAGAAGTTCAGAGGTCAGGAGCCCAGAGCTGGTGTGGGAGCCCCATAGGTACCGGGGATGCAGGCTCTTTCTGTGTTGGCAT--CCTTGGCATAAATATGGTGTCTACCCTCAAAGTTGACTCGTGGTCCAAGATAGCTGCTGGAGCTCCAGCCTTCTGGTCCATATTCCAAACAGA--GAGCAGGAGGAAGAGACAAAGGCTGTCT---------TTTGAGAAACCCCAGGCATCTCACCTAACAGTGTTATTTACATCTCACTGTCCAAAAGTTAGTCATGAGATCATGTCTATCTGTGT--GGAAAGCTGGGAAATGTAGTCTATTG---CCACTCCAAATACAAACTAGTGTTCTCTTAC-AAGGAAGAAGGTGAAAATGGCCATTGGGCAGGTAA---ACAGCATTCTCTT--CCCCAGGCAGGGACTCTTCTCATCCCCACTTTA---CTGATTTGGAAACTGAAGGCTCAGAGAGGTTGAGTAGCTTGCTCCCAGTCACACAGCATATAAGCCAGGAGGGACAGTCTGATCTATGTTATTCTGGAACTTGC---GCTCTTAATTCCTCATGGTGCAACGTCTCCTGTGAACAACCACCATCACCATTGTCTTCAGCCCCACCATCCAGCTCCCCTGCTTGCCCTCCCAAAGGTTACTGTGAAACTGTTAGCTACTCAGTTGTGTCTGACTCTTTGTGACCCTCTGGACTGTAGCCTGTCAGTCCAGCCTCCTCTGTCTATGGAATTCTCCAGGCAAGAATACTGGAGTGGGTTGCCATGCCCTCCTCCAGGGGATCTCGAACCCAGGTCTCCTGCATTGTAAGCAGATTCTTTACCCTCTGAACTACCGGAGAAGCCCCACCTGC-TCTCCCGCATCACACACAAACAATTCTGCCTGCCCATCTCAGCATCACAGACATGAGGAACAGTTCCTGCCCCTGGGA-----TTCTGCACAAATGGTGCTTTC--GCCTCCCCATCTTCTCTCGGCAAATCCC-TGTTCATTCTCCAAGGCCCAGCGCAGAAGCCACCTCTTCAGTTCTGG---CCCA-------CGAGGCAGTCAGCTGCTCCACCTGCCAGCCCTCCCCA--GCAA----AGGCA-TTTATT-TCAGCCCT-GCCGTGGCGCCTGCC-GGCTGTATTGTAATTTATCTCTTTACCCAGCTGTCTC-CTGGCTGGGACAGGAGCTCCTCAAGGTCAGACCCTGGCTT-CTTCCATATTTGAACCCCAGTGACAGGCAGGGG-GCTCACA-AACCAGCTTGAATGAT-GGAT-----------------------CCCATCTCCAGGGAGGAGCAGAC--AGGC--------------------------------T----TAGAGAGAAACCAGTGCGGTCTAGAAATAGGA--------------------CTGCCAGCTAGCAG--ACTAGAACC-GGGTCT--------------------------ACTTAAAACAAGTTTGAAAAGAAAGTGGAAGTGAAGTCACTCAGTCGTGTCCAATTCTTTGCGACCCCATGGACTGTAGCCCATCAGGCTTCTCCATCTATGGAATTTTCCAGGCAAGAGTACTGGAGTGGTTGCCATTTCCTTCTCCAGGGGATCTTCCTGACCCAGGGATCAAA------CCCAGGTCTCCCGCATTGCAGGTAGACGCTTTACCATCTGAGCTACTGCAAAAAATTAAGACAGCGCCCC-GCGTCTGCTGTTCC--CTTCACTGTGGAACCTTTGTCTATGTTCTTCATGAGAAATTGCTCCTTCTCATCCATTAGCA--------------------AGGGGGCCCCAGAGGGTGCTGGGTGCTGAAG-------TCTTTTCTCAGTCTTCTAGCTGCCCCAGCCCA-CAATTTTCCTAGTATTAGTGACTCAGAAGTTGACTCAGGAAAGGCTGAATTGAGGGACCCAAAGGAGTCA---GGATCTCTGTCTTACATGTGAGGTCATCCGTGGAGGTCAGGGTGGAACTGGGCCTGGAA----CCTGGGTCTCC--AAGTTCTCGCTAGG----------------TCAGCCCATCCCCTCACTTGGCTTTGTTACGTTTATGACATTAAAAGGGAA-TTAAACACATGTCCCCCCCGATGCCCAGCAGTGGGTGTCTGTGGCGCTGTCCTGGCTTCTG-GCCAAGCGGGGTTTACTAGGTGCTGACATTTGTGCT-CCTGAGGCTGCCCT--GCACGTGCTCCC----CTCACCCCCACCCATCA-GCTTTTGGCCACTTGTACCCTGGATGCCG-------------------GAGAATATGAACAAACCCACCAGTGGT-----------GGCCTCT--GTCTC----CCTGGCTCTGGCTGTT--GTTTGTCCACAGTCCCCTGGGTGCAGA----TTCCAGCTCTCCCATCCACTTGCAACATGACCT--CAGTCTGG------------------CCTCTGTAAAA--TGACACC-CCTTGTCCA--CAGGCCACGGCT------GAAATTGGCATCAAATCTTTCTGAACTGTTGAAAAAGCA--AGTAGCTTCTTACT-ATATAAGGTTGCTCAGTCCCTTAAAGCAGCCCTG-TGCTGTGGA----------GATGTTTGCACCCATCTTTCTGGGCATGACAATGTGGATGGTAGAGCCAACAGTGATGGTGAGGCTGCTGTAACATATGATCAGTCTTGGGCCCAAGC-ATAGGGCTGGTCCCTTTCCTCTTCCTCTCTCTCCTTCTCTTCCACCAATGCTTCTGAGGACCAGCT--CCGTGCTTGGCGCAGAAAATAAGATGTTCTGATCACCCTCCCGCCACTATCCCTCCAAAAGGCTACTGAAAGAGATCACCATCCTAAGCCATT---AAATGCAAAAATTAAATCCACACACATGGAAG-TATTTTCTAGACTTTCTGATTGCAAAATCCTGGATTAGAACCATGTAATAGGTTGGGTCCTCATCCTGAGCTGAACCTCCAATACCTTGGCTATCTGATGTGGAGAGCCAACTCATGGGAAAAGATCTTGATGCTGGGAAAGATTGAGGGCAGGAGAAGGGGCAACAGAGGATGAAATGGTTAGATGGCATCATTGACTCAATGGACAGGAGTTTGAGCACATACAGGGAGATAGTGAAGGACAGGGAAGCCTTGCGTGCTGCCGTCCATGGGGTTGCAAAGAGTCTGACAGGACTGAGTGACTGAACAATAAAACATCCTGAACATAGTAGAGGGGGGTAGCGTGTATGCATGCTAAGTCGCTTCAGTCATGTCCAATTCTGTGACCCTAGGGACTGTAGCCCATCTCTGTCCATGGGATTCTCCAAGCAAGAATCCTGGAGTGGGTTGCCATGCTCTCCTCCAGGGGATTTTCCCAACCCAGCGACTGAACCCATGTCTTCTGCATCTCCAGCACTGCAGGTGGATTCTTTACTCTGAGCCACCTGGGAAGCCCCTGGTTGGGGGTGGGG----GTAGCCAACCCACCAAGCCATGTGACTGTGACCACAAGCCCAAGGCCTGACCTTTCCTCCTCTGTATTAAGATGGGCCTCTATTATCCCATCTGTACTTATCTCATGGG----TTGTGGGGATTAGAGGAGTTAATCCATGGAGG-CCGCTCAGAAGTGTGCCTGATATATAATAAATACTCAGTAAATGGCCCTGCAATAAAGCAAAGCTGGGGCTGGGGTTGATTCCCATCTGGCCCCTGGCTTGTCCTGCCA--GTGACCAGCACCCCTCT-GCTCCTGTCCCTGCAGTTCATCTCGGACGCCATCATCCATGTTCTACATGCCAAGCATCCTTCAGACTTCGGTGCTGATGCCCAGGCTGCCATGAGCAAGGCCCTGGAACTGTTCCGGAATGACATGGCTGCCCAGTACAAGGTGCTGGGCTTCCATGGCTAAGCCCCAC--------CCCTGTGCCCCTCACCCCACCCACCTGGG-CAGGGTGGGCGGGGACTGAATCCCAAGTAGT--TATAGGGTTTGCTTCTGAGTGTGTGCTTTGTTTAGGAGAGGTGGGTGGAAGAGGT-GGATGGGTTAGGGGTGGAGGGAGCCTTGGGAGAGGCCTGGGG------ACCAGGCTTTCAGTGG--AGGGTCATCAACTTGGGAACCATGAGAAGCTTG-----------ACTGTGGCTGGCTGAGTCTGGGTCAAACTCAACTTTCCTTTCACCTCAATGCCAACCCAATTCCTACCAACCTCTAAACTGACCTGCACCTTTACCCTCACCTTAAATCCCCAATCCGAGCTGTCAACATAAACTC--CAGCC-TAATTCTCTGACCCCATCACCCAGCCCCTTGAAGACAGCAGAGTGTCTTGCTTGCCCTGAGAAGGAAGTGTGGGCCGGGTGGGACGG-----CCACACCCAGCCCTAGGGAGGCATGGAGGCATGGTGTCTGCAACATAAATGTCCCTTCTCAGGTAGGGGAGTGA--CACCTGGTTT-AATAAAGGATTTCTCA-CATCACA
Carlito syrichta
--------------------------GTAATAATACCTACC--TACCCGCACTGCAGTGAGTCGATACATT-TAAAACCCGTAAAACCCTCACATCCTTAGTGGGTGCTCTGCAAGTGTGAGCTCGTGTCAGTCCA-G-CGTGCTCACTGCTAAACCCCCCGGGGTCCTA----ACCGTG-GTGGTCAGCCAGA-ATGGGGAACTGAAAACAGGG--AG-------GGGAAATGCAGGTTC-TGCCCGTGGGCACAAGTCATTAACTGCCC---------------------AATGATAGGACGAGAA-GACATGGCAGTTCACATGG--AAAAGAGCT---AGAGGCACTGTGGGAGGCCGAGGCGAGTGGATTGCCTGAGCCCGCGGGTTT---GAGACCCGCCTGGGCAACTTGGCGAGACCGCATCTCGACAATAA--ATC--AAAAAATGAGCCGGGCGTGGTAG--TGCATGCCTGTAGTTCCGGCTACTTGGAAGGCTGAGGCGGAAGGATCGCCGGAGTCCAGCAGGTCCACGCTG----GGATGGCCGGGAGCGTGCCA--CAGCACTCCAGCCTGGGCGACAGAGTGAG-ACTCCAACTCAAAAAAAAAAAGTGCTGGGGACTTTAGTGGACCGTAAGCTAAACAGATCATTGAGGGCAGTGGCTCTTAAAAACGCAAACTCATCCTCAGCTTGTATGAAGAACAGCTTGGCATCCCGGCTATGGGACAGTCTCC--CCAT--AAAACCCTA-GGAAGGT-CGAGACTGGGCAT-CAGGTGCTT-CCCTGTGACGTCTTCTGGGCATGGTGGCTCAGG-GCTC--CAACTGGCATGC---ATCCACG-----------GTGGCGTTACAAGGACACGAGGGAGCCTGGGAACCATAAAAGAG----GAACATTGCAAAGAACTCAAGACATTAGACTCGGGGGAGGGGACGGGACAGCCCTGGGCAGCCGGGGTCACCCCTGCCACAAGCCCCAGGCA-GTGTGGCATGAGGGAAGGGGAGTAGATTATGGGACTGTCTGATTTGGGGTTTGGGTCTG-----TGAAACGGAGAAACCCATT---CCC-----ACCCGGGTGCATCTGCCTCCCTTT---------AC----------CCAGCACCCGCCTGCTCA--CAGCAGGGAGGCCCGAGGCCCGA-GGCTGGCAGC-AGCTGCACACGG-GTCA--TGGGCAGCAGTCTTTGTGGGGTCTGGCTGTGTGCCTGGTGGG-AGTGGGCGGTGGTCCGTGCATGTCTGCTCAGTGAG--GGGACTGGGACAT--GCAGGGCGT---CGTCAGAGCTT-------------------------------GGGCTGGAAGCCAGTG------------AGAGGCT-------TCTGAGCC--CGAGAAGTGTCCTGGAGAGCAGTGGCCC-------------TCACAGGGAGCAGAG-GCCTCCTGACAT------------AGC-CTAAG--GGCCGTCTCTAGGC--------------------AAGAAGCTGTGGAGAC--TCCCCGGC-ATGG-------------------------------ACCCAGACTCCTTTTTGAGTCACTC-----AGCTGAGCCCCCAACTTCACACCTCT-AGCTCTGTTTCCTCATCCTTTCCTCATGGC------------CCGCAAAGGAACGCACG----TGTGGAGCCT-----------GGCATGGGACCAGT----CCAGGGTCC-------TTCAGCCTCCGT-GCTCCC-----GCCCCACCCC----TGATACCACCCGGCCACCTA-----CTCCCCAGGCCACTCGGGGAAAACCGGA----GTGAGAGGCCAGGCAGCCACGCAGAGAAGTCTTTTTCCCTTCTGCCCCTCCCAGGAACAG-TGGA--GAGGAGGAGGATGTTGGTGATAATAAAAACAGCAATAGCA-GGTGTTTAGACAACACTCACCGTGCACCAGACAGCCAGCAGGT-----CCCTAGCACACACTATCTCATTTAATCCTTTAAGTAGGGACAAGCCATCCATGTTCCTCGCATGACGACACT----GAGGTGCAGAGAGGTGGAGTGA---CTTGTCCAAAGTCACACAGCGGGG-A--AGAGGGT---GAGC-TGAACTTGACC--------TGGAGCCTGGCGGCCTTGAG--------CCGTGCCT--------GGCTCCAGCCCCCCTTGTGCCCA-CAGGCTCTTCAAGGGTCACCCTGAGACCC-----TGGAGAAGTTCGACAAATTCAAGAACCTGAAGTCGGAGGATGAGATGAAGGCGTCGGAAGACCTGAAGAAGCACGGCGTCACCGTGCTCACAGCCCTGGGGGGCATCCTCAAGAAGAAGGGGCAGCATGAGGCCGAGATCAAGCCCCTGGCGCAGTCGCACGCCACCAAGCACAAGATCCCCGTCAAGTACCTGGAGGTGGGAGAT-----------------GGG----GGGAGGGGAGGGGAGGAAAGGAGGA-----GGGAGGGGAGGGCCTCCAGGACAGGCCTCCTGGCCCGACTGCCAGTGTGGGCGGTGACCCACGTCACTCTCCTCTGG--CCCGGTTCTCTCATTTGTAAAGAGGACCGACACC-ACGCGGCCTTCCTCCTGGAATGGCCGAGAACGGACA-------CATTTCAAACCTTGTACCCCCCGA--GGGCCTCGCGACCCAGGAGGAG--GGGAGAGTGGATGCCAGTAGGCACAGCCCGCACACCCCCCAGAGAGGGCTGCCACTGTCACATCCGCTCTATAGCTGGGGAAACTGAGGCACGGGGAGGTTAAGGAGCT-TGTCCGTGGTCACAG-AG----GCAGCGGATGGCAGAGCCAGGGTCCAAACTCAGGTCTCTGCCATCCCGGAGCCCCCCAGCCCCTCACCGCGGTGCCACCGGGAATCTGTAACACGC--AG--GCCTGGCACGACCCCAACTCA-GCCTCTGGCGTTCCGTTCGGGGTGCGGG-------------------CTCTGCCGGATCCAGCGTCG----------------AGAACCTTGGGTGTTGCAGCACGAAAC------------CCTCCTGGGATGGGCTGGAAAGGCA-GGTGGGCCGGGGCAGGGT---GGGACCTGAGTCTCAGGTTCTGTCTCTGTAACTGCCGGGATGAGGGAGTGTGGAG-------------------------------------------------------------------AGAGAAGCTTCTGGAGAGCTGGGGATGAGAGCAG-GAGAGGTGCATCCTAGGAGGGCGGGA-ACAGCAGGAAGGCTGCGCC--CCATGAGAACGAGGGTGTGGCAAAGCGCTACCAGTTGGGATGGGC-ACGGTGGGCGC--TCGCCTGTGCCACCCCAT----GGCCTTCTCAGGATTGTCCCAGGAGGCTAGGGACTGCAGCCAGCACCGTCTTCTCTCACGTAAAAGAACCTCAGGGGCAT--CCTTAGCATG----TGGTGTCCAAGCTCAGAGTTGCCTCATGG-CCAGGAAGGCTGCAAGAGCACCAGTTGCCTCGTCCATACTCGAAACAGG--AAGCAGGGGGAGGTGAGGAAGGGTCTCTCCTGAGAAGTTTAAGGAGTTCCAGAAATCTCACGCAACAGTTTTACCTGCACCTCATTGGCCAGAA---AGCTACAAGGCCAC-CCCGTCTGCA---GGGAAGCTGAGAAGTGCGGTCTATTA---TAC-TCCAAATATAC-TGAGTGTTCTTTTGCCAAGGAATAA-G-GGGGGTGGATATGGTGTAGGGAG---TCAGCAATCTCTG--CCCCAGGCAAGTGCCCTTCTCATTCCCATTTTC---CTGATTTGGAAACTGGGGTCTCGAAGAGGCTTGCCAAAGT-CACGCAGCTACTAGACCAGGAGGTAAGGAGGCACAGTCAGATCTCTG-TGCTCTGGACTCCAA---GCTCTTAATTCGTTGTGGCATAATATCGTCCATCAACAACTGCCTCGCCCC-----------------------------------------------------------------------------------------------CTACA----------------------CTC----------------------------------------------------------------------------------TCCAC--------------------------------------CC--CCCCA---------AGTCACGCACAGTCCTGCCGACCCAACGGGCTCAGAACCCCCGACG-GGGCGGCTGGCCCTGCACGTGGG----------CCATGTCC--CACCCCTCTGACCCCTCATC-TCTCTTGGCAAACCCT-TACTCATCCTCCAAGGCCCAGCGCAGAAGCCACCTCTTCGGTTTGGG-CCCCTGG--CCCACGAGGCCGCCGGCTGGCCCAGTCGCCAGCCCTCCCCCCCACCGCCGGTGGCACTTGATTCTTACTTC--ATCATGGCACC-GCCAGGTGGGGTTGGAATCCACCTCTTTGCCCAGCGATCTC-CCAGCTGGAACCG-GGCTCC-TAGCTGC--------------CACCATCTCTG-GCCCCAGGTGCACGTGGTCA-GTGCCCA-ACACAACCCGAATGACAGGGT-----------------------CTGCTGTCTAGGTGTGAGGAAGCCGAGGCTGTGGTCCACACTCATCTCCATGGTTTGCTCACGAGTGGAATGGGGAAACCCACGCCCACCCGGGGGCATGTCTCCCTTTACCTGGCACCCGCCTGCTCACAGCAGAAAGGACCAGGGCATGAGGCCGGCGGCGGCTACATGGGGTCACTGGCAGCGGTCCTCCAAGGCAGCTCGCTGCATGACTGACTGGAGCAGGCAGTGGTCCATGCAGGGCCACGTCAGGAGGGTCCCCAGGGCCCTCCACCATCTGCTCAGTGAGGGGACTGGGAGGTGTGGGGAGTCAGAAAACAATGTCCCCGAGGTGGCACAACCGTTAGTGACT------GGAGCCAGGTCTACCTAGAGCAAGTTTAACTTAATGCCTCATCT-CTGCAGTGAA----GACAGAGCCAGGACATCAGCTCTTCTG-CTGTACAG-GGACC----CTGTGTGCTCTTCCTGGGACATC-CTCCATCTCCCCTGTGGACGGATGAGCTC-CCAGGGACCCAGGGGGCCGCAGAGGGTGGTG-----CCAGGGTGC---CTGTTTCTCAGTGGTGCAAATGCCCCATTCCA---GTTTTCTAGATATGAGAGGCTCTGCCGTAAACCCTGGGAAGGCTGAGCTGGGGGCCCCACTGGGGTCATCTAAGCCCCCATCTTGCTGGCAGAGTCACA-ATGGGGG-CAGGGCAGAGGTGACCCCAGAA----GCTGAGTCTCCCCAAATTCCCTAAGTGCAGGGCTTGCTGCACAGCGCTCTGTATCCCCACTTCACCTGGTGGCACTTGTGAAATTGAAGGTGGCTTGGCACCTGCATCTAAGTCGACACCCAG--GTGGTCACATGTGGAGCTGGCTGTGTTCCCAAGCAGAAAGGTGTCTACCGGGGGTGGACGCT-GTGCT--TCTGAGCCCCTCTGCGGGTCTGTCTCC--------CCCTCCCCATCCTCACTTTGTGTCTCCTGCACCCGGGGCCCCC-------------------GAGAAAATGAG-AAACCCATCTGCGGTTT---------GGCCTCACGGCCTCTGTTCCCGGTTTTCG-AGAG--GCTCGTCCA--GCCTCCTGGGCTCAA-----AGTCAGCCCTGCCACCCACTCACTGTGTGACCT--CGGTCTTGCCACTTG--------TCTCCTCTGTAAAG--GGACACCCCTCGGTGCA--CTGGCCAGGGCC------GTGGTTGGGAT-GAAGC-TGAGAAAC--TTGAGAGAGCAC-AGTGGCATTGTCCACCTACGGG-CCTCTCGATCCCT-CAAGCAGTCCCT-TGCTGCAGA----------CGCTCCCGTCTCAGGTTCCCCGGTGATGAGACTGTGGATATTCGAGCAAGGGG------------TGACTGCAG---TTGTCTAGGTCCAGGGCCAGCGGGGAGCCTCATCCCTTTCCTCTTCCTCTCTCTCCTTCCCTCTCACCCGTGTTTCTG---------------GGTTTCACACTGAATGCATGAGGCTGTGCCC------------------CCCAAAACAGGTCTTTAAAGCAGCAACCATGCCAAGTCACT---GCATG-AAAAACGGAATGCACACACATGGAACCTAGCTTCT-------------------CTCCAGACC--------------------------------------------------------------TGCTGATTGCAAAAGCCTGGATAAAA----------------------------------------------------------------------------------------------------------------------------------------------CGCTGGACTGCTAGATTCTGAG-----------------------------------------------------------------------------------------------------------------------------------------------------CCTTTCTCCTCTCTTCTGCTGCCGCCACGTGGGGGATGC---------------------AAGTCCTCACGTGCTGCAGCTGGGGGTTCACATCCCCAGCAGAGTGGAGGA--GGGCTGTGAGG------------------------CCCCCTGG--GTTCAAATGCCTCTTCCGGGCTTCACCAGCCCTGTGCCTCAGTT-TCCCCATCTGTAAACAGAGGGTGGTAGTAGTAACCACCTCACAGTGGAGTTGTGGGGACAAGGAGAGTCCAGACACGGAGG-TCAAGCAGAACCTCTCCTGGTGCACAGCAGGTGTTCAATGAACAGCCCTGGAACAAAGAGCTGGTGGGACGCCATCCGCCGCCCACCCGGCTCCTGGCTCGTCGGGCAC-GGTGACCAGCCTCCCTCT--CC-CGCTCCCTGCAGTTCATCTCGGAAGCCATCATCCAGGTTCTGCAGAGCAAGCACTCCGGGGACTTCGGCGCCGACGCCCAGGGAGCCATGAAGAAGGCCCTGGAGCTGTTCCGGAACGACATGGCCGCCAAGTACAAAGAGCTGGGCTTCCAGGGCTGAGCCCCGGCCGC----CCG---CCCCGTCCGCCTGCCCGCCGGG--------AGGGCGGGGCCTCATCTCGCGTAGC--TGTAGAGTTTGCTTTT--GCGTCCGCTTTGTGGAGGGGGGGTGGGCGGGAGAGGCCGAGCGGCT--GGGGCTGGGG--CTGGGGCCGG-----GAGGCT---CCACACGGCTGTCCACGG--CCTGCTGTCACCCTGGGAGCCCGGAGTCGCGCGGCCCTTTGCGCACCGTGGCCTGCAGGGTTGGGGTCCTACTTCGTTCTCCT-----CTAAGTCCCAACCCAATTCTTCCCAGCGGCCAGAAG-------------------GACCTTAGCCCCACATCCAAGCTATCAACTACGTCCCGGCCATCATCATTTTCTGATCC-ACCACCCACCTTTGAAAAGACAGCAGAAGGTCCC-TTTGCGATGAGGAGGAGGTCTGGGCAGGGTTGAGGAGGGGGTCTGGGCAGGGCAGGACAGCCAGGAAGCCTTCGGGGATCTGGAACATG-GTGTGTCTTCTCGGGTATTGGAATGACTAGCATGGTTTTAATAAAACACCTTGCAACATCTCA
Red Jungle Fowl
----------GAGGCAACCGCCATAGGCAGCACTTGAGACCTATCTATCTGTCCCAGCCACCGGTGAACCATGGGGCTCAGCGACCAGGAGTGGCAACAAGTCCTCACCATCTGGGGAAAAGTGGAGGCCGACATTGCTGGCCATGGACACGAGGTTCTGATGAGGTAGGCAACGAGCATCGGTCCTGAGCTGAGAGCTTTGGGTGAGAGACCACGGAATGCTGAGTACTGGGGCTCAGCCGCGTGCGTTTCAACAGGCTGTTTTGTTTCGGT------------------GTAGGGTGGTGCTTTTC-CATTGCTGTGGTTTCTGGGTTAGAAGTGGA---GAGGCTGTGCTTCTGCCACTCCATGTGTCAGATCTCAGGGCTTTGGAGTGGAA-AATGGCACAGTGTCTTTGCCTGTC------CTTACGGACCAGCAGGGCTGGAGACAGGGACACCCTCTCGAGAG------TGCCCTCGGGCT---GCACTCAGCACTCAGTGCCAGGAGGGTAC----GGACACCCAGGACAGCAGTGGG-TATCTGGGGGTTCCTATCTCAAGCACAGCTAGA-GCAAGAAGCCGACAGCACTGGGCAAAGCCACGGCCCATTCAAAGCTTCTTCCAC-----------ATCATTTCTCATTCCTTCAGGGGGTTTCTGAG--------GGAGAGAAGAAGAGGGAGTGGAGCAGATTTCTCTCGGTTTAGAAATGTGCCTGCTTGTGCCTCGGTAAAGACTTTTTAGGTAGGTAACATATCAGGGTATACGTTTAGAAAATATTGATCCAACTGGACGTTTTCCAATGGAAACAACAACAACAAAAAAATACTCCTTGAAAGAATTTTCAAGAGTCTTGGGAATATCTCATCATGG-AGGACACGAAGAGAGACCTTTGAAAGCAGATGCCAGCAAAGCCTGGTCACTCGTTTGCAGAATG------TTTTTGCTGCAAGACAGCTTGTGTTTTCCATGCAGCTGAGGT--ATTTGTTAATGCAGCCTGACCTACAATTAC---ATCTTATAATGGAGAGTCTGGCCCTCATCCTAATGGCCACCTATATGGGAGGAATGTTTGATTTGAAATGGATGTAAATACTGCCATTACTCCTGCTGGTGCTGCAGCGACTTGGGTTATGCCCTTCAAAGGGCA--TAATACCCGCAAATATACTTGAGCAACGGAGCTGACGTCTCTCAACACCCAATTATC--AGACCGCTCCTAATCACCAGGACAGGCAATGCTCCCTTCTCACCCCGGCTCTGGACGAGGAAAAGAGTGAAATTTCCTTACAGTCTGGTTGCTAGTGGTGTCAGCATCCTGTCTCCTGGGAAGT-----GCAGTGGAGGGGAAGCACGAGACAATACCAGCCTTCCCAATTTCATTTCCACTTTGGAATCAGAGCATCTGTATATAGCACTGAATGGCAC----TGGTCAATGCGGAGATGGGCTGG--GAAGAAGGTCTTAGTGAGAGCCTGGGGTCTGCAGTATCTGACAGGGTGAC------AGGCTGGGGGAAGTAGAGGATGGATCCTTAGTGAGGCAGTGTAGATAGG-------TGGTTTTGTTTGGTTCTGTTTTTCCTTCAGGTTAATGTAAAAATTCACCTCCCAGTGCCACCACCAGCCTTGGCTCCCATCTCTGTCCTGATACAGTTGC--CGTAACTCCGGGTACTGATGCCAAAGCGTTAGCAGAGCTATTCACA-AATAAGGATGGCTGCTCTGTTGCTCCGGCTGGACATGGTTATGGCAGTGT--CTCTGTCTCAGCCGGAGTGAATGTTAGGACATGTAAGAGCATAGGATTGAAGTAGATACAGTACTCAAAGGGAGGTACATCATGAGCTTCTGCACTCCAAAGACAAGAGGTATTTTATCAGTTGCTAATGATAAAAAGCAGCCTGGATAAAACAGATGGTGAAGTAAACTTCTGAACACAGAGTGTCATCAGGTTGTGGAACAGCCTCCCAGAGGAAGGATGAAAAACACAGATTCAGAGTAGGGTCAGATGCTTTTTTGGCTTATATACGTGCTAAATGCACTGATTGTGAGTCTGAAGGATCAGAGTTCTCAAGTGAAAGCAGGCTTTACTAAGAGATTTCTTTGGGGAGCGATGTAAGTTATCACATATTTTCCTGTCCGTATGCATCCAGATCAGCAGGACCTTGGCTCTGCTCTGGCTGACAGCTCCCTAATGTCTCACACATCC--CTCTGCTTCTGTCCA-CAGACTTTTCCATGACCACCCTGAGACTT-----TGGATCGCTTTGATAAGTTCAAAGGCCTGAAGACCCCTGATCAGATGAAGGGCTCTGAAGATCTGAAGAAACATGGAGCTACTGTCCTCACCCAGCTTGGCAAAATCCTGAAGCAGAAGGGTAATCATGAGTCAGAGCTGAAGCCCCTGGCTCAAACCCATGCCACGAAGCACAAAATCCCAGTCAAATATCTGGAGGTATGGAAAAGGACAGGGAAACTT--GGTGTCTGATATGTGATAG-ATGTGTGCAAGACAGACACATTGAGAATTGCACTTCTATTTCAGGCTGGGTATTAGGAAAAATTTCTTCACCGA--AAGAGCGGTGCTG-CAGTGGCACAGGCGGCCCGG--GGAGGTGGTGGGGTCACAGTCCCTGGAGGTGTTCCAGAGCCGTGTGGATGTGGCACTGA--GGGACATGGTCAGTGGGCATGGTGGGGATGGGCTGACGTTTGGACCAGGTGATCTTAGAGGTCTTTCCAACATTAAAAGTTCTCTGATTCTATTTAGTGATGTCTAACTGGACTTCAGTTAGCACTTTCTCAAGCCAAAGGTCTCTGTGCACAGGCAAGA------GGAGGCACAGAAG-----GATTCAGCTCCTGTG----AATCTCCATT-AGATTTCCATGCC----AAACACT--ATACGCTGTCTGAGACACTCCTAACAATAATAAACCAGCCCATGCAGTCCCT---GCAT--GCCTG-GGAAATGACATTACATAAGAATTGGCAGTG---------------------GCCATACACAAGGGATGTACAATTTTAGAGT--AAGCCCTGGAGGA-AGAGCTGGAGAC--------CAAAACCCATGAAACTAAGCGCTTTACAAGTGCAGGACATGG-----------ATTTGCAGCCTTAGTGGATGGGCAGACAAAGAGTGGGAAGGACCACGGTCTACTCA----------------------------------------------------------------AGGTCACCGAGTATATCAGTGA-GCTAGGAACAGAGCCCAGCTCCTCTGCCCAGC-----------CAAGGCTCCTTGCATGCTCGACCACAGTGTTTCTCTGGAGACTGCCAGAAGGGCTACTGAATTAGCCTGGAGATATTTCCAGGCTCTGCAACCT------ATTTTTCAGTCTGGATATTCTGAAAATAAACCCTGAGAGTCCCTTCTGTTGTTTTTTTTTTTTTTTTTTTTTCTTTCTTTCTCTCTCCTTC-CTCACAGTTCATTTCTGAAGTCATTATCAAGGTCATTGCTGAAAAACATGCCGCAGACTTTGGGGCCGATTCCCAGGCTGC-----CATGAAGAAGGCTCTGGAGTTGTTC-----------CGAAATGACATGGCCAGCAAGTACAAGGAGTTTG-----GTTTCCAGGGTTAGCATGTACGCAGAAGGACAC---------CACGACGGAGGCCTTGCCATGCATCTCAGAGTAGCTTCCCCAGCACTATTCTGGACATCTTCCAACTGACCATCCACGACGTGCGGGAATGCTTCAGTGAGAGGCCAAGAGTC------------------ACCCTAGGCAGCGCAG---------AGGCACTCAGAGTGATGTCCGTGATAAATTGTTGTCCTTTCCTGGCTTCATGT--------ATGCAAAAGAAATAATCTGTTTTCTTAGAATAAATAATGTAAGGGGTTATCTGTCACCTTCTGCTTA-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
记得model(2)的树文件要有分支标记:
model=0
8 1
(((((Physeter catodon,Bos taurus),Equus caballus),Homo sapiens),Carlito syrichta),(Mus musculus,Rattus norvegicus),Red Jungle Fowl);
model=2
8 1
(((((Physeter catodon,Bos taurus)$1,Equus caballus),Homo sapiens),Carlito syrichta),(Mus musculus,Rattus norvegicus),Red Jungle Fowl);
准备好了.nuc,.trees以及codeml.ctl文件之后就可以运行程序了
1.
3. 运行程序
打开cmd,到paml的bin文件下,输入codeml,回车,最后就得到了文件:
我们看看model0和model2的Inl值:
model0: lnL(ntime: 13 np: 15): -11421.714463(q值) +0.000000 (model只有1个omega)omega (dN/dS) = 0.94008
model2: lnL(ntime: 13 np: 16): -11419.006785(q值) +0.000000 (model有2个omega)w (dN/dS) for branches: 0.98309 0.75139
可以看到model2的两个omega都是小于1的,说明受到负选择。
用R算p值:pchisq(q,df)
> pchisq(-11419.006785+11421.714463,16-15,lower.tail=FALSE)
[1] 0.09986626
p值可信。
最后用到的文件再这里:
https://pan.baidu.com/s/1lzmNbHZrLZwb8KHu2DNDrw
提取码:lc6k