Gene BAS1502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1502 
Symbol 
ID2852170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1523014 
End bp1538067 
Gene Length15054 bp 
Protein Length5017 aa 
Translation table11 
GC content38% 
IMG OID637504756 
Producthypothetical protein 
Protein accessionYP_027769 
Protein GI49184517 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATTA CGAATCGATT TTCTACCACC ACTAACGGCG CACTTGCGAT TACAGGAAAC 
ACACTCGGTT TAAGTAAAAT CAGTAATCAA AATCGTGCTG GTACAATCGG GGCAATTGGC
GCATTTATAA CTACGAATAC CGCTTTACAA GTTCCCACTT TTCCTGCCGG CACAACTTTA
AACTATACAC AAAATAGTTC TACCGCTATT TTAAATATTC CTGCTGGTAG TACGATTCTT
TACGCAGAAC TCATTTGGGG CGGCAACTAC TTATCTCGTG ATCAAAACAT TACAAGTGTT
TTAGGAAACC CTGTTTCTTT TACAACACCT GTTTCAACAT ACTCGATTAC TCCTTCTGCT
GTTACAGCTT CCAATCAAAC ATTCGTTTCT GGATCTATCA CATTTGGATT CTATACACGT
TCTGCAGATG TAACATCCCT CATTCAAGCG GGAGGATCTG GCTCTTATAC AACCGGCTCT
GTCCCTGGAC TTGTAGATCC TATAGATGCT TCTAATGGAA CAATTAATTC GGCTGGATGG
ACACTTATCG TCGCTTACCA AAATGGAACA TTACCTGCAA GAAACTTAAC CATTTATGTA
GCAGGTAACC GGGTTTCTGC AGAAACTGGT AGTGCCGATG TATCTGTTTC AGGATTTTTA
ACACCTTCAG GAGGGCCTGT AAGCGGTAGA TTATTTTTAA GTTCTACCGA AGGAGATGCT
GATTTAATTG GGGATCAGGC TCTATTCGGG CCAAATTTCA GTTCATTAAA TGCCTTATCT
GGACCTAACA ATGCTGTAAA TAATTTCTTC GGTTCTCAAA TTAATAATGC CGCTGGAAAC
TTAGATACAA CCGGGACATT TGGAACGCGA AATCAAAGTG CTTCTACAGG TACAAACATC
TCCGCTGGAA GACAGGGCTG GGACATTACT TCCATTGATA TTTCTCCTTA TTTAACAAAT
TCTCAAGTGT CCGCCGCAAT CCGTTTAACA ACTAACGGAG ACGCATATAT GTTGAATACA
GTCGGTTTAC AAATCAACAT AAATTCACCT AACATACAAG CAACAAAAAG CGTAAATAAA
AGTGTTGCAG CAATTGGAGA CGTTCTCACT TATACAGTTA CTATCCCTAA TACGGGGCTT
CTTCCCGCCA ATAACGTTAT TTTTACAGAC ATTCTCCCTA ACGGTACTTC CTTTATACCT
GGAACTGTAA CAGTAGATAA TGTCCCGCAA ACGAATGCAA ATCCGGCCGC TGGTATATCT
CTTGGAACCA TTAATAACAG CGCTTCTCGT ACAGTAACTT TCCAAGCTAC TGTCGTTTCT
TTTCCAAGTC AAAATCCTAT CTCCAACACT GCTAATATTA CATTTCAATA TACACCAATC
GCCGGAGGAA CGACTTTTAA CGGTCTTGCA ACAAGCAACT CTGCTGGAAC ACAAGTTAAC
CTCGCAGATA TTAATGGCAC AAAATCAGTT AACAAACTTT TTACCGATAT TGGTGAAACG
TTAACTTACA GTATCGCCTT AGCTAATATA GGGAATATTG CTGCAACTAA CGTAATATAT
ACGGATCCGA TTCCTAGCGG GACTACTTTC GTTCCGGGGA GTGTAACTGT TAACGGAGTT
ACTCAGGCTG GAGCAAATCC CGCTACTGGT ATATCAATTG GATCTATTGC TGCTAATTCT
ACGACTACTG TTGCATTTCA AGTATTTGTT CCTTCTATTC CCCAAACAAA TCCAATATTA
AATAGTGGTA CAACAACATA CCAATACATT CCTGTTCCAA ATCAACCGGC AGTAAGTGGG
ACTGATACGA CGAATATTGT CTCCACTCAA GTGAATAACG CTACTGTAAC TATGGCAAAA
GCAGTAGATA AAAATTTTGC TGATATTGGT GATACACTAA CGTACACCGT TTCCTTTACA
AGTACAGGTA ATACAAATGC GAACAACGTT ATTTTTACAG ATGTTATTCC TACTGGAACA
ACTTTTGTTC TAAACAGTTT AACAATAGAT GGCACGACAC AAGGTGGAGC AAATCCCGCT
AACGGTGTGA ACATTGGATC AATCCCAACT GGCACAACAA AAAATGTTTC ATTTCAAGTA
GTTGTAAATA CAATACCCGC GCTAAATGTC GTATCTAATG GATCAAGCGC TTCTTATCAG
TACACTGTCA ACCCAAGCCA ATCACCCGTT ACAAAAAACA TTTCTTCTAA TCTCGTTTCC
ACTCAAATTA ACAATGCGAA TTTAGCATTA ACAAAATCAA CAAATAAACA ATTTGCCACA
ATTGGTGAAA CGATAAGTTA TACAATTCTT ATTACAAACA GCGGAAATAC AGCTGCAACT
AACGTGCAAC TAACAGACCC ACTTCCAAAC GGAACGATAT TGACCCCTGG TTCTGTAACA
CTCAACGGCG TTTTGCAAAA TGTAGATTCC CTCGTTGCTT TACCTATCGG CACAATTCCT
GGCGGGGCGA CTTTTACACT TTCTTTCCAA GTAACAGTCA TCAATATTAC CACCCAAAAT
CCTATCATTA ATAATGCTTT CGCCTCTTAT CTATATACTG TAAATCCAAG TCTGCCACCA
ACTTCAAAAA CAGCAAATTC TAATTCTGTT ACATCAACAA TTAGACTAGC AAACCTTCAA
GCTACTAAAT CTGTAGATAA AACGTTTGCG GAAGTTGGGG ATGTATTAAC TTATACCTTT
TCTCTTACAA ACGATGGAAA TGTTGCGGCG AACAATATAG TACTATCCGA TTCAATCGCG
AATGGTACTG CCTTTGTACC AAACAGTGTT ACGATTAACA ATGTTACTCA ACCAGGCGTT
ACACCAGCTA GCATCAATAT CGGCAGTACC ACTGCTGGTA CTACAATTAC AGCTTCATTC
AAAGTATTAA TAACTAGTAT TCCAAACCCA AATCCTATTT CAAATAGCGC TTCTATTTCC
TATAACTTTA TCGTTGATCC AAACGCTTTC CCTATAAGTA AGAACACAAC TTCAACCACT
ACATTTACTC AAGTAAATGA CGCAAATATT ATTTCAGCAA AAACAGTGGA TCGAGCGTCC
GCTACTGTTG GGGATGTATT AACTTATACC GTTGTTTTAA CGAATGCAGG AAGTGTTTCT
GCTGATAGTC CTACTTTCGT AGATACGAAT CCAGACGGTA CTACCTTTAT CCCAAACACT
TTCCTTATTA ATGGTGTACT CCAAAATAAC GCAGATCCAA ATGTCGGTGT TCCCTTACCT
TCCATTCCTG CGAACGGTTC ACTTACCGTC TCTTATCAAG TAACTGTCAC CTCTTTACCA
ACACAAAATC CAACAATAAA TTCATCTAGT ACACAGTATA GTTTTATTTT AAATCCGGGC
GATCCACCAA CTATAGAAAC ATCTTTAAGT AATACTGTAA GTACACAAAT TAATTTAGCA
AATGTAGTTA TTGTCAAACA GGTAGATTTA ACTATTGCTG ACGTTGGGCA ACCAATCACA
TATACAATTG CTTTAGCTAA CCCTGGGAAT ACTCCCGCAA ATAATGTAGT TGTTACCGAT
ATACTCCCTC CTGGTACAAC TCTCGTACCA AATAGTATTT TTATAGGCGG GGCTTTACAA
CTTGGTGCGG ATCCAAGTGC TGGTCTTCAA GTTGGTACGA TTCCAGCTGG TGGTTTTACA
ACAATTGTCT TCCAAATTGG TGCAAATAGT TTACCTTCAC CAAATCCAGT TCAAAACAGT
GCTGTACTTC AATATAACTT TATCGCAGAT CCAAATTCAC CTCCCGTTGT AAGAAACTCT
GCTAGTAATA TAGTAACTAC ACAAATTAAC ACTGCTAATA TTGTTGCTAC GAAACTAACA
AGCACAAATT TTGCTGATGT TGGCGATGTC ATAACTTATG CAACGATTTT AACGAATAAC
GGCAATATCC CTGCCTCGAA TGTAACGTTT ACAGATATCA TTCCAGCTGG TACCATCTTC
CTCCCTAATA CTGTAACGAT TAACGGTGTC CCTATAGCTA ATGCAAACCC TGCTAACGGC
ATTTTAATTG GTACGATAGG AGCGAATTCA TCACGTACTG TTGCATTCCA AGTTTTTGTA
CCAACCATTC CTAGTGCAAA TCCGATTGCG AATCAATCGA GCACTACATT CCAATACACG
TACGACCCAT CCAAACCAGC TGTCATGCAG ATGGTTGCTT CTAACACTGT ACAGACAACT
ATTAATAATG CTACGATTAC CTCTGTAAAA TCTGCAGATA AACAGTTTGC GAACGTAAAT
GATATCATTA CGTACACGAC TACTTTAACG AATAATGGAA ACACACTTGC ATCAAATATA
GTATTTACAG ATGCTATTCC AAGCGGGACA TCTTTTATTC CAAATAGTGT AACAGTAAAC
GGCACTACAC TCTCTAATGC AAATCCAGCA AATGGCATAG CAATTGATCC GATAAATCCG
AATGCAAATA CGATAATTTC ATTCCAAGTG CAAGTAAATT CTATTCCAAA TCCGAATCCT
ATCCCGAACC AAAGTAACAC AACGTACCAA TATGTCGTAA ATCCTAACTT ACCTCCAGCG
TCCTCTAATA CGCTAAGTAA CGTAATAACA ACTCAAATTA ATAACGCTAC AATCATCGCT
ACGAAGTCAG TAAATACACC GAATGCTGCA ATTGGAGATA TCGTTACTTA TACGATTGCA
GTTACGAACA CAGGGAATAT TCCTGCTAGT GCTACAGTTT TAACAGATGG ACTTGGACCA
GGTGCATCCT TCATCCCAAA TTCCGTTACG ATAAATAACG TATCTCAACC TGGATTAGAT
CCTTCACTAG GTATTCATTT AGACGATATT TCACCAGAAG GAACTACATT TATTACATTC
CAAGTGAAAA TCCTTGCTAT TCCACCTAGT GGCACTTTAA CGAATAATGC TCTTGTAAAC
TACGAATATA CAGTGAATCC AACTGAAACG CCAGCTGTTG GAAGTACCGT TACAAACACG
ACAGTTACAC CTATCGTTGA CGCTACCTTA GTAATAAATA AAAATGCTAG TACAACTTTC
GCTACAATTG GAGATACGAT TACATTCACC TCAGTTGTTA CAAACACAGG AAATACTACC
GCGAATAACA TTGTTTTTAC AGATTCAATT CCAAATGGTA CTACCTTTGT CCCAAATAGT
TTTAAAATAA ACGGTGTAAC GGTCCCGAAT ACAAATCCAC AAAACAGTAT CAATATTGGT
AACTTGAATG CAAATGCATC GGTTACACTT AGTTTTCAAG TAAACATTAC AACTCTTCCA
AATCCTAATC CAATTCCGAA CCAATCATCG CTTCAATATA GATTTATTGT TGATATAAAT
GAACCGCCTG TTTCACGAAC CGTTCAATCC AATAAAACTT TTACACAAGT AAACTCTGCT
TCCGTTATCG CCACGAAAAC TGCTAGCAGT GCATTTGCCG CTGTTGGAGA TACAATTACG
TATACAACGA CTCTCACTAA TAGTGGAAAT ACTACTGCAA ACACACCTGT TTTTATCGAT
ATATTACCAG CTGAACTGTC ATTCGTTCCT GATAGCGTAC AAATTAATAC CATCCCACAA
CTTGGATTTA GGCCTGATAC TGGTGTTCCT TTAGACTCAA TTCCAGTTGG AGGAACGATA
ACAATTAGCT TTCAAGCTAT CGTTGGTTCA ATACCAGCTA TAAATCCAAC ATTGAACCAA
TCTAGCACAA CATACTCTAT CATCGTTGAC CCTACCCAGC CACCGGTGAC AGAAATAGCT
ACAAGCAATC CAACTTTAAT TCAAATTAAC GAAGCGATTA TTCAAGCAAC GAAAAGTGTG
GATCGACTAT TCTCTGACGT CGCACCTGGA AATTCATTTT TAACGTACAC TGTTTTATTA
GAAAATATAG GGAATACAAC TGCTACGAAT ATCATTTTTA CAGATCCGAT TCCAAATCAT
ACAGTATTTA TAGAAGATAG CGTTCGAGTA GGCGGGATTT TATTACCTGG AGTAAATCCA
GCAAACGGAA TACCAATTGG GGATATTATT GCAGGAGATT TTATAAACAT TACCTTCCGC
GTACAAGTAG TTAGCATTCC AAATCCAATT TTCACAATTG GACCTGGGGG GCCAAATTCA
CCGGTTGTAA ATGGCGCTTC TATTAATTAC CAATTTATGA CAGGACCTAA TTTACCACTC
GCTTCAAGAA GTACGACATC CAATCCTGTT TCAACACAAA TAAATTCTGG GGAAATCGCA
CTTGTTAAAT CTGTAGATAA AACTTTCGTA ACGATCGGGG ATACACTTTC TTATTCAATT
TCATTAAGTA ACCCTGGAAA TGTCACTTCA CAAAATATAA TTTTCACGGA TGTTTTACCT
GAAGGAATAA CTTTTATTTC TGGAACACTT ACAAACGATT CTGGTACACA GCAAATTGGA
AATCCAGCTA CCGGGATTCA AATTGGAAAT ATAAATCCTG GTAGTACGGC TACTATTGTG
ATAAACGCAC TTGTTACAAA TATTCCAAGT ATAAATCCAA TTTCGAACTT TAGTTCTGTA
CAATTTGCAC ATGTGGTCGA TCCAAGCCAA CCTTCCGTAT CACAAACGAA TCTATCTAAT
ACTGTTTCGA CAACTATTAA GAGTGCTATA TTAACGACTA CAAAAAGTGC TGATAAATCC
GTTATTTCTG TCGGTGATAC AATTACGTAT ACAACTACTA TTACAAATAC AGGAAATACG
GCAGCAGCGA ATATAAAGTT CACGAGTGCA ATTCCAGCTA ACACTACCTT TATACCAAAC
TCAGTCACAA TAAATGGGGT TCAGCAATCT GGTGTGCAAC CAGCACTTGG AGTAAACATA
CCAAATATTG CTCCTGGTGA AACAGTAACT GTTACTTTCC AAGTAAATGT TCTTTCCGTT
CCCTCTTCAA GTTCAATTAT GGGGAATGAT ACCATTTTAT ATTCGTATAC TGTCGATCCA
AACGGAACTC CTGTTACAAC TTCTACTTCA ACGAATATCG TTACAAACCC TGTATTAGAT
GCTATCATTA CGATGGTAAA ATCCGTCGAT CAAACACTTG TAACACTAGG TGATACCATT
ACCTATACGA TACTTTTGAC AAATACCGGT AATACAAATG CTACTAATAT CACTTTCACT
GATCTTATAC CAAATGGAAC AACGTTTATT ACTGATAGCG TTACAATAGA TGGCATCACG
CAAATCGGGC TCAATCCTAA TACAGGTATA ACAATTGGAG CAATTGCTCC TAACAGCTCA
ATATCTATAG CATTTCAAGT TACCGCTACT TCTACACCTG TTCAAAATCC TATTGCCAAT
TCCGCTACTG CTTCTTACAC ATTTATCGCT GATCCTAATG CCCCTATTGT TTCAAGGACC
GTTACTTCAA ACACAGTGTT CACTACGATT AATACAGCTA CCATTCTTTC ATTAAAACAA
GTCGATAAAT CCTTTAGTCG TATTGGAGAC ACACTCACTT ATACTGTCGC TTTAACAAAC
AATGGAAACT CATCTGCGCA AAATGTTATA TTCACAGATA CCGTACCGAG CGGAACAGCA
TTTATTGCAG ACACATTTTC TATTAATGGA ATTCCTCAAA GTGGCGCAAA TCCAGTGAAC
GGTGTAAATA TCGGATCTAT AACAGCTGGG ACTACAGTAA CAGTTTCTTT CCAAGTTACT
GTAACGTCAT TACCCACGGA AAACCCCATT GTAAATTTCT CATCAACATC GTACCAATTA
GTCTCACCGC CTGATGCAGA AACTTCAATT AGCAATCCTG TTTCAACGCA AATTAAAGAA
GCCATATTAT CCATGGCGAA AAATGAAAGT TTATCCTTTG CAAATATCGG GCAAACTGCT
TTTTACACTA CTTCTATTTC GAATATAGGA AATACAGATG CAACTAATAT TGTATTCACA
GATGTATTAC CAAATGGAGT CACATTTGTT CCTAACACAT TAACTGTCGA TGGTGTTTTA
CAACCTGACG CGAATCCAAA TACAGGTGTA TTACTTGCAA CACTTCCGCC TAATGAAATA
TATAGTATCG TCTTTCAAGT TACAGTGAAC AGCATTCCCC CTATTAATCC AGCACCAAAT
ACAGCATCAA CAACATATGA GTTTACTGTT GATCCTGTTA ATCCCCCAGT ATTAAGTGCG
GCTACTTCCA ACACTACGCT TCTTCAAATA AACAACGCAA ATATTATAAG TACAAAAACA
ACAGATCTTA CTTTTGCGGA TGTTGGTAAT ACAATAACAT TTACACTTAA CCTCCCGAAT
ACAGGGAATG TGACTGCAAC TGATGTTACA GTTATCGATA CGCTTGATAG TAATTTAACT
TTTGTTCCAA ACAGTTTCAC AGTTAATGGG CAAACCATTC TAAATGCTGA TTTATCTACT
GGTGTAAATA TCGGTTCAAT TAACGGTGGT ACTGCGGCAA TTGTCACATT CCAAGCTACC
GTTACAACCC TACCAATCAA CAATCCTATT TCTAATTCAG CTCTTACAAC TTATCGTTAC
ATTGTTGATC CAGACCAGTC ACCTATTACA ACTTCCAATC AATCTAATAC AACGACAACA
CAAATTAATA GTGCTATTCT TACTGCACAA AAAAGCACAA ATGTATTTAC GGTAGATATC
GGGCAAGATA TTGTCTACTC CGTTACAATT ACAAATAGCG GAAATGTTAA TGCAACGAAT
GTTATTTTTA CCGATGTTAT TCCAGACGGA ACTTCCTTTG AACCAAATAG TTTTACACTT
AATGGAACTA TTATCGAAAA TGCAAACATC ATTACAGGCG TCCCAATTGG TGATATCGCG
CCAAACGAAT CTGCCATTGT AGAATTTCAT ATTACTTCAA ATGAAATCCC GGCTATTAAT
CCAATTACTA ATCAAGCTAG CGTTAGCTTT CAGCATATCG TTAATCCAGC TAATCCTCCT
GTTTCAAAAA ACATTACTTC AAATAGTGTT ACAACAACAA TTGAAAGTGC TATTTTAACT
ACTACTAAAA TCGGTGATAA AGCTTTTGCA ACGATTGGTG ATACAATTAC GTATACAACT
ACGATTACGA ATATTGGAAA TATCCCTGCC AATAACGTTA TTTTCTCAGA CCCGATACCA
TCGTGGACAC AATTTGTTGC AGGATCCGTT ATTGTTGATG GCACTCCATT ACCATCCGCT
TCTATCACCA GCGGGATTGG CATAAATACG ATCATTCCAA ATCAAACTGT AACAATCATA
TTCCAAGTTC AAATCGTAAG CAATCCACCA ACATTCACAC CTGAACTCCA AAACTTAGCA
TTTGTTAACT TCCAATATAA CGTAGGCAAT GCATTACAGG CTCAGCCTGG CAATGTGGAA
ACGAACGTCT TCGTTACTGC TATTCATTCA GCAATACTTT CAGCTGTAAA AACTGCTAGT
ACAGCCTTTG CGAATATTGG AGACACAATC ACTTATACAG TTTTAATACA AAATAGCGGC
AATACAAACG CTACGAATGT AAATTTCTCA GACCTCATTC CAGGAGGAAC GACCTTTGTT
GAAAATAGTT TTGCTGTAAA TGGAAATACC ATTCCAGGTG CAAATCCAAA TAGCGGCGTT
AATATCGGGA CCGTTAGCGC GGGTAGTTCC TTAACCGTTA CTTTCCAAGT CATAGTTACA
TCTACTCCTC CTTCCAACCC AATTACAAAC GTTGCATCTA TTCAATTCGC ATTCATCGTT
GATCCGGCCG CTCCTCCTGT TACAGGCACA GTAACTTCTA ATAGTGCTTC TACACAAATT
AATAACGCTA CTGTTACAAC GCTTTTAGAA GCAGATCGAA CAATCGTATC TATTGGAGAT
ATAATTACGT ACACTGCAAC ATTAACAAAC ACTGGAAACT TCCCTGCAAA CTCGGTATTA
CTCATTAACG GTGTTCCTGA AGGGGCATTA TTTGTTCCAA ATAGTGTCAC GCTTAACGGG
ATTTCACTTC CAGATGCAAG TCCAACTCTC GGTATTCCAG TTGGTATTAT CGCACCAGGT
AATTCTGCTA CAATTACGTT CCAATTTCTT GCAAACTCTA TTCCGCCGCA AGGAGCAATT
ATAAATCAAG CACTTACAAG TTACACATAT ATTGTCGATC CAAGTCAACC TCCAGTTACA
GCAACATCTT CATCTAATAC AGTTACTACA GCTGTCGTTG ATGCATCGCT ATCTGTAATT
AAAAATACAG ATTCCATCGT ACAATCTACT GACGGTACAA TCACTTACAC TGTCGTCATT
CAAAACAACG GGAATACAAC TGCAAATACA GTTACTTTAA CAGATTTGGT CCCAGAAGGA
ACTGCATTGA TTCCAAATAG CGTGACCATT AATAGCATCT CAATTCCAGG TGCCGATCCA
AACGTAGGAA TACCATTAAA CTCCATTGCG CCGTCAGAAA TCGTCACCGT CACATTCCAA
GTTATCGTTC AATCTATTCC AAGCGTGAAT CCAATTTCTA ATATAGCCCG TATTGACTAT
ACTTTTATTG CGGATCCAAC TGCTCCTATC GTCTCTCGAA CAATTACTTC GAATCCAGCT
TTCACACAAA TTTCAGATGC GAATGTTCTT TCTTTAAAAG CCGTCAATGC ACAACAAGCA
ACAACTGGCG ACATTTTAAC CTACACGATA ACACTAGAAA ATACCGGAAA TATTCCAGCT
ACAAATCTCA TATTTTCAGA TACGATTCCA GAAGGGACTA CATTCGTAGA AAATAGTTTT
ACACTTAACG GAACAGCTAT ACTGGGTGCA AATCCAAATG TAGGTGTTAC TTTGCCTAAC
CTAGCAGCAA ACGCTACTCA CCTTATTGCT TTCCAAGTTC TTATTAACGA TCCATTCTCG
CAACAATCGA TTACAAATCA ATCTAATACA ACATATACAA TTCAACCAGA TCCAGGGCAA
CCGCCTATTA CTGAAACATC TACAAGTAAT ATTGTCATTA CAAATTTCGT GCAAGCACAA
TTAACAATTA CAAAAACGTC CAATCCAATA ACTGTAGATA TTGGCGGAAC TATACTTTAT
ATTTCTGAAG TGAAAAATAG CGGCAATGTT GACGCAATAA ATATTATTTT TACAGATTCG
ATTCCAGTTG GGACTACATT CGTTCTCGAC AGTGTCACAA TTAACGGTGT ACTTCAGCCT
GATGCAAATC CCGAAAACGG AATACCAATT GGAACGATTC CACCAAACAG TTCCAAAACA
ATACTATTTC AAGTACAAAC AAATAATCCA CCTACTGAAA CCGAAATTGT AAATCAATCT
TCAGTAACTT ACCAATATGT AAGTATTCCT ACAGCTCCAC CAGTGAATCG CTCTGCAAAT
TCTAACATTG TTACAACATC ACTTCAAAAT GCGAATATTA TTTCCGTTAA AAGCGCAGAT
GTAACTTTCG TCTCCATCGG GCAATTTATT ACCTACACAA ATACACTACA AAATATCGGA
ACGGTTCCAG CTAACAATAC GGTGTTCATT GACAACATTC CAGAAGGGAC AATATTCATT
GAAGATAGCT TATCAATAAA TAATGTCATT CAGCCTGGTA CGAATCCTGA AAATGGAGTA
ACTCTCGGCA CGATACAACC AGATGAAACA GTCACTATTT CATTCCAAGT ACAACTTACA
AATATACCTG AAGACAATAC AGTCATTAAC ATCTCAGACA CTTCATATGA ATACCAAATT
GACCCTAGTT CTCCAATTAT TCAGCGTAGA TCATTATCAA ATACAGTAAA CACGGAAGTC
CGTACAGCAA ATGTTAGTGC AATTAAATCT GCTAACAGAT CCATTACACG CATCGGTCAA
ATTATCACAT ATACCATCGC AGTTACAAAT GCTGGTACAG TACCTATTAC AAATACTCTC
CTAATTGACG CAATCGCTGC TGGCACCACA TTCGTTCCAG ATAGCATTCT TGTAGATGGC
ATACCAAGAC CTAACGAAAA TCCAAGTACC GGAATCTCCC TTAATATTAT CCTTCCAAAC
AATACAATTA TCGTTACATT CCAAGTAAAT GTAGACTCGA TACCTTCTCA AAATAACATG
AATAATATCG CCGTCATCCA CTATGAATAT CAGCCAGACC AAAGCTTACC ACCAATTTCA
GAAACGACAG CTTCCAACAG TACAAATATA CAATTTATTG AAGCTATTCT TTTCGCTACA
AAATCCGCTA ATACAGTATT AGCTAATATT GATGAAACTA TTGAATATAC AGTACTCATT
CAAAATAATG GATCCACTAC AACTAACTCC ATCTTTTTTA CAGATACGAT AGAGGATGGA
TCAATATTTA TCCCGGGAAG TGTAATAGTT AACAATACCG TACTTCCTGC AGCAGATCCG
AATATCGGCT TTTCCATCCC GAATATCGCA TCAGGTCAAG TGGCTACAAT AACATTCCAA
GTTTCGGTTA CGAATTTACC TGTTGCAAAC CCAACACCTA ATACCGCAAA CATCGTCTAC
GACTTTATTT TCAACCCTGA CTTTGCACCA ATTCAAAAAT CGACTACTTC CAATACTACT
TTCGTTCAAA TTAATGATGC TGATATCGTT TCACTTAAAA CTGTTGATTT GACTTCTGTA
ACAATTGGTG ATGTTTTAAC TTATACAACA ACTTTAACAA ATACAGGGAA TACGGATGCC
ACTGCCGTTG TATTTACAGA CAATATTCCT GGTGGAACAA CCTTTATAGA CGGTAGCGTT
TTAGTAAATA ACATTCCGCA GCTTAATGCC AATCCAAGTA CCGGTATATT GGTAGGAACG
ATTGCTCCTA ACATTTCTAT CCCAGTCACA TTTTCTGTTA CTGTCGTAGC GCTTCCAACT
AGCGGCCATG TTCAAAATCA AGCAACTTCT CGTTATACAA TAAATGGAGA AGAACAAATA
TCGACTAGTA ACATTACTTT CACTGAAGTT ATTTCTGCTA ATATAATCGC AGTAAAAACA
ACACCTATCC AATATGCTGA CCTACAAACC ATTATCCCTT ACACAATTTC CATCACAAAC
AATGGGAATA TACAAGTGGA AAACATTATC GTTACAGATA TCATCCCAGC AAATACAAAC
TTTATAGAGA ATAGTGTTAT TGTGAATGGA AACACTCGTC CAAATGACAA TCCACTTAGT
GGGATACCAA TTGATAACAT TCTGCCTAAT ACGACAGCAA CTGTTCTATT CCAAGTACGG
GTTACTTCGA TACCTCAAAC AAATCCAATC TCTAACACAA GTACAATTGA ATATGAATAC
ACGGTAGGAG ATCAACCACC TATTACCAAA ACTATTATTT CATCAGCTGC TTTAACAGAA
ATTAATCATG CGAATTTGAA TAGTAATAAA GCTGTTGACC TTGCATTTGC AATGGTCGGT
GATACGTTAA CGTATACGAT TACACTCAAT CAAACTGGTA ATGTTGCAGC AAATGATGTA
ATCATTCAAG ATATGATTCC TCAAGGTACT ACATTTATAG AAAATAGCGT TATTGTAAAC
GGAGAAACTC TTCCGGGAGT AAATCCAGCA AATGGCATAC CAATTGGTAC TATAATTGTA
GATGGAGACG CTATCGCTTC ATTCCAAGTA ACTGTGACTT CTATTCCAAT ACGAAACGAA
CTCACCAACC AAGCAATCTC TACTTTTAAC TATATAGTCA ATCCAAATAA CGTACCTGTT
ACAAATACGA CGACAACAAA TACAGTCACA ACAACCGTCC AAAATGATAA TGTCATTGCG
ATAAAAGCTG TTGATTTCAC GAGTGCCTTA CCTGGTCAAA CTTTAACGTA TACCATTACG
ATTACTAATA ATGGTAATAT CACTATTGAA GATCTTCTTC TAGTAGACAT GGCACCTGTA
GATACAACAT TCGTTATTGG TAGTGTTACG ATTAACGGAA TCAATCAGCC TAATGCGAAT
CCTGAAAATG GTATTACGTT AGGAACTCTT GCTCCTAATG ACTCTGTTAT TATTACATTC
CAAGTGACAA TATCTTCTTC TACTCTTCAA TCTACAATCA ATAACGATGC TACTATTTTC
TATACGCCTA TTGTCGGTCT AATCGAACCA CCTATTACAA TTACAAGGCA AATAGATATC
GTCACAAAGC AAACAAATAC TGTTACAACG ACAATCATTG ATCCAATGGT TCATATTGAA
AAAACGGCTG ACAAATCTAT TGTCGTTTTA GGAGATATTC TTACTTTCAC GTTAGAGATA
TTTAATGATT CTCCAATCCC AACAGTAAGT ACTGCCGTTA TAGATACCAT TCCAGCTGGT
ACAACATTTA TAGAAAACAG TGTTACGCTT AACGGTACTC CGGTTCCAAA TGTCCGTCCA
GACACAAGTA TGAATATCGG ATCTTTACCT GCAGATGCAG TAGCAATACT AACATTTAAA
GTGCTCGTAA CTTCTATTCC TTCAAACAGT ACAATTATTA ATTCTGCTAC CGTCACAGCC
ACCTTCCAAT TGACACCCCA GGACCCAATT ATTACTTTTA TTGTTAATTC AAATATCGTT
CGTATACCAG TTCAATTTGT AACTGCGACA GTAGTGAAGA ACGCTTCCGT CACCTCAGCT
TATTTAAATC AATATTTTGA TTATACGGTG CGTATTACGA ATACTTCCGA GATTTCACTC
TCAAATATTT CTTTACAGGA TACTATTCCA GTAGGTTTAC AATTTATAAA CGGCACTGTC
TTCATTAACG AAGAACGCTC TCCACTAGCG AATCCGAATA TCGGTTTCCT AGTCGCTACT
AATTTGGAAC CAAACGAAAC AATTATCGTC TTATTCACAG TACAAGTAAT AAGTCCACCT
GTTAATAATG AATTTAAAAA TACGGCCAAT ATTTCATTAC AACTTCAAGT CTCGCCTACC
GACCCACCAA TTACAGAAAC TGTTACAAGT AACGAAAACA TCGTCATCTT TGTTCCAGAA
AACCAAGATG AAATAGTTCC AAATTTAAAT TGCTTCTTTG ACGGGGAACG TTTTGTACGC
ATTACTCCAC GGAATGTAGG AAATTACCTT TGGACTTGGA TTTGGTGGCG TTAA
 
Protein sequence
MPITNRFSTT TNGALAITGN TLGLSKISNQ NRAGTIGAIG AFITTNTALQ VPTFPAGTTL 
NYTQNSSTAI LNIPAGSTIL YAELIWGGNY LSRDQNITSV LGNPVSFTTP VSTYSITPSA
VTASNQTFVS GSITFGFYTR SADVTSLIQA GGSGSYTTGS VPGLVDPIDA SNGTINSAGW
TLIVAYQNGT LPARNLTIYV AGNRVSAETG SADVSVSGFL TPSGGPVSGR LFLSSTEGDA
DLIGDQALFG PNFSSLNALS GPNNAVNNFF GSQINNAAGN LDTTGTFGTR NQSASTGTNI
SAGRQGWDIT SIDISPYLTN SQVSAAIRLT TNGDAYMLNT VGLQININSP NIQATKSVNK
SVAAIGDVLT YTVTIPNTGL LPANNVIFTD ILPNGTSFIP GTVTVDNVPQ TNANPAAGIS
LGTINNSASR TVTFQATVVS FPSQNPISNT ANITFQYTPI AGGTTFNGLA TSNSAGTQVN
LADINGTKSV NKLFTDIGET LTYSIALANI GNIAATNVIY TDPIPSGTTF VPGSVTVNGV
TQAGANPATG ISIGSIAANS TTTVAFQVFV PSIPQTNPIL NSGTTTYQYI PVPNQPAVSG
TDTTNIVSTQ VNNATVTMAK AVDKNFADIG DTLTYTVSFT STGNTNANNV IFTDVIPTGT
TFVLNSLTID GTTQGGANPA NGVNIGSIPT GTTKNVSFQV VVNTIPALNV VSNGSSASYQ
YTVNPSQSPV TKNISSNLVS TQINNANLAL TKSTNKQFAT IGETISYTIL ITNSGNTAAT
NVQLTDPLPN GTILTPGSVT LNGVLQNVDS LVALPIGTIP GGATFTLSFQ VTVINITTQN
PIINNAFASY LYTVNPSLPP TSKTANSNSV TSTIRLANLQ ATKSVDKTFA EVGDVLTYTF
SLTNDGNVAA NNIVLSDSIA NGTAFVPNSV TINNVTQPGV TPASINIGST TAGTTITASF
KVLITSIPNP NPISNSASIS YNFIVDPNAF PISKNTTSTT TFTQVNDANI ISAKTVDRAS
ATVGDVLTYT VVLTNAGSVS ADSPTFVDTN PDGTTFIPNT FLINGVLQNN ADPNVGVPLP
SIPANGSLTV SYQVTVTSLP TQNPTINSSS TQYSFILNPG DPPTIETSLS NTVSTQINLA
NVVIVKQVDL TIADVGQPIT YTIALANPGN TPANNVVVTD ILPPGTTLVP NSIFIGGALQ
LGADPSAGLQ VGTIPAGGFT TIVFQIGANS LPSPNPVQNS AVLQYNFIAD PNSPPVVRNS
ASNIVTTQIN TANIVATKLT STNFADVGDV ITYATILTNN GNIPASNVTF TDIIPAGTIF
LPNTVTINGV PIANANPANG ILIGTIGANS SRTVAFQVFV PTIPSANPIA NQSSTTFQYT
YDPSKPAVMQ MVASNTVQTT INNATITSVK SADKQFANVN DIITYTTTLT NNGNTLASNI
VFTDAIPSGT SFIPNSVTVN GTTLSNANPA NGIAIDPINP NANTIISFQV QVNSIPNPNP
IPNQSNTTYQ YVVNPNLPPA SSNTLSNVIT TQINNATIIA TKSVNTPNAA IGDIVTYTIA
VTNTGNIPAS ATVLTDGLGP GASFIPNSVT INNVSQPGLD PSLGIHLDDI SPEGTTFITF
QVKILAIPPS GTLTNNALVN YEYTVNPTET PAVGSTVTNT TVTPIVDATL VINKNASTTF
ATIGDTITFT SVVTNTGNTT ANNIVFTDSI PNGTTFVPNS FKINGVTVPN TNPQNSINIG
NLNANASVTL SFQVNITTLP NPNPIPNQSS LQYRFIVDIN EPPVSRTVQS NKTFTQVNSA
SVIATKTASS AFAAVGDTIT YTTTLTNSGN TTANTPVFID ILPAELSFVP DSVQINTIPQ
LGFRPDTGVP LDSIPVGGTI TISFQAIVGS IPAINPTLNQ SSTTYSIIVD PTQPPVTEIA
TSNPTLIQIN EAIIQATKSV DRLFSDVAPG NSFLTYTVLL ENIGNTTATN IIFTDPIPNH
TVFIEDSVRV GGILLPGVNP ANGIPIGDII AGDFINITFR VQVVSIPNPI FTIGPGGPNS
PVVNGASINY QFMTGPNLPL ASRSTTSNPV STQINSGEIA LVKSVDKTFV TIGDTLSYSI
SLSNPGNVTS QNIIFTDVLP EGITFISGTL TNDSGTQQIG NPATGIQIGN INPGSTATIV
INALVTNIPS INPISNFSSV QFAHVVDPSQ PSVSQTNLSN TVSTTIKSAI LTTTKSADKS
VISVGDTITY TTTITNTGNT AAANIKFTSA IPANTTFIPN SVTINGVQQS GVQPALGVNI
PNIAPGETVT VTFQVNVLSV PSSSSIMGND TILYSYTVDP NGTPVTTSTS TNIVTNPVLD
AIITMVKSVD QTLVTLGDTI TYTILLTNTG NTNATNITFT DLIPNGTTFI TDSVTIDGIT
QIGLNPNTGI TIGAIAPNSS ISIAFQVTAT STPVQNPIAN SATASYTFIA DPNAPIVSRT
VTSNTVFTTI NTATILSLKQ VDKSFSRIGD TLTYTVALTN NGNSSAQNVI FTDTVPSGTA
FIADTFSING IPQSGANPVN GVNIGSITAG TTVTVSFQVT VTSLPTENPI VNFSSTSYQL
VSPPDAETSI SNPVSTQIKE AILSMAKNES LSFANIGQTA FYTTSISNIG NTDATNIVFT
DVLPNGVTFV PNTLTVDGVL QPDANPNTGV LLATLPPNEI YSIVFQVTVN SIPPINPAPN
TASTTYEFTV DPVNPPVLSA ATSNTTLLQI NNANIISTKT TDLTFADVGN TITFTLNLPN
TGNVTATDVT VIDTLDSNLT FVPNSFTVNG QTILNADLST GVNIGSINGG TAAIVTFQAT
VTTLPINNPI SNSALTTYRY IVDPDQSPIT TSNQSNTTTT QINSAILTAQ KSTNVFTVDI
GQDIVYSVTI TNSGNVNATN VIFTDVIPDG TSFEPNSFTL NGTIIENANI ITGVPIGDIA
PNESAIVEFH ITSNEIPAIN PITNQASVSF QHIVNPANPP VSKNITSNSV TTTIESAILT
TTKIGDKAFA TIGDTITYTT TITNIGNIPA NNVIFSDPIP SWTQFVAGSV IVDGTPLPSA
SITSGIGINT IIPNQTVTII FQVQIVSNPP TFTPELQNLA FVNFQYNVGN ALQAQPGNVE
TNVFVTAIHS AILSAVKTAS TAFANIGDTI TYTVLIQNSG NTNATNVNFS DLIPGGTTFV
ENSFAVNGNT IPGANPNSGV NIGTVSAGSS LTVTFQVIVT STPPSNPITN VASIQFAFIV
DPAAPPVTGT VTSNSASTQI NNATVTTLLE ADRTIVSIGD IITYTATLTN TGNFPANSVL
LINGVPEGAL FVPNSVTLNG ISLPDASPTL GIPVGIIAPG NSATITFQFL ANSIPPQGAI
INQALTSYTY IVDPSQPPVT ATSSSNTVTT AVVDASLSVI KNTDSIVQST DGTITYTVVI
QNNGNTTANT VTLTDLVPEG TALIPNSVTI NSISIPGADP NVGIPLNSIA PSEIVTVTFQ
VIVQSIPSVN PISNIARIDY TFIADPTAPI VSRTITSNPA FTQISDANVL SLKAVNAQQA
TTGDILTYTI TLENTGNIPA TNLIFSDTIP EGTTFVENSF TLNGTAILGA NPNVGVTLPN
LAANATHLIA FQVLINDPFS QQSITNQSNT TYTIQPDPGQ PPITETSTSN IVITNFVQAQ
LTITKTSNPI TVDIGGTILY ISEVKNSGNV DAINIIFTDS IPVGTTFVLD SVTINGVLQP
DANPENGIPI GTIPPNSSKT ILFQVQTNNP PTETEIVNQS SVTYQYVSIP TAPPVNRSAN
SNIVTTSLQN ANIISVKSAD VTFVSIGQFI TYTNTLQNIG TVPANNTVFI DNIPEGTIFI
EDSLSINNVI QPGTNPENGV TLGTIQPDET VTISFQVQLT NIPEDNTVIN ISDTSYEYQI
DPSSPIIQRR SLSNTVNTEV RTANVSAIKS ANRSITRIGQ IITYTIAVTN AGTVPITNTL
LIDAIAAGTT FVPDSILVDG IPRPNENPST GISLNIILPN NTIIVTFQVN VDSIPSQNNM
NNIAVIHYEY QPDQSLPPIS ETTASNSTNI QFIEAILFAT KSANTVLANI DETIEYTVLI
QNNGSTTTNS IFFTDTIEDG SIFIPGSVIV NNTVLPAADP NIGFSIPNIA SGQVATITFQ
VSVTNLPVAN PTPNTANIVY DFIFNPDFAP IQKSTTSNTT FVQINDADIV SLKTVDLTSV
TIGDVLTYTT TLTNTGNTDA TAVVFTDNIP GGTTFIDGSV LVNNIPQLNA NPSTGILVGT
IAPNISIPVT FSVTVVALPT SGHVQNQATS RYTINGEEQI STSNITFTEV ISANIIAVKT
TPIQYADLQT IIPYTISITN NGNIQVENII VTDIIPANTN FIENSVIVNG NTRPNDNPLS
GIPIDNILPN TTATVLFQVR VTSIPQTNPI SNTSTIEYEY TVGDQPPITK TIISSAALTE
INHANLNSNK AVDLAFAMVG DTLTYTITLN QTGNVAANDV IIQDMIPQGT TFIENSVIVN
GETLPGVNPA NGIPIGTIIV DGDAIASFQV TVTSIPIRNE LTNQAISTFN YIVNPNNVPV
TNTTTTNTVT TTVQNDNVIA IKAVDFTSAL PGQTLTYTIT ITNNGNITIE DLLLVDMAPV
DTTFVIGSVT INGINQPNAN PENGITLGTL APNDSVIITF QVTISSSTLQ STINNDATIF
YTPIVGLIEP PITITRQIDI VTKQTNTVTT TIIDPMVHIE KTADKSIVVL GDILTFTLEI
FNDSPIPTVS TAVIDTIPAG TTFIENSVTL NGTPVPNVRP DTSMNIGSLP ADAVAILTFK
VLVTSIPSNS TIINSATVTA TFQLTPQDPI ITFIVNSNIV RIPVQFVTAT VVKNASVTSA
YLNQYFDYTV RITNTSEISL SNISLQDTIP VGLQFINGTV FINEERSPLA NPNIGFLVAT
NLEPNETIIV LFTVQVISPP VNNEFKNTAN ISLQLQVSPT DPPITETVTS NENIVIFVPE
NQDEIVPNLN CFFDGERFVR ITPRNVGNYL WTWIWWR