Gene BCZK1464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK1464 
Symbol 
ID3024800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp1547819 
End bp1562872 
Gene Length15054 bp 
Protein Length5017 aa 
Translation table11 
GC content38% 
IMG OID637545693 
Productcell surface protein 
Protein accessionYP_083059 
Protein GI52143769 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATTA CGAATCGATT TTCTACCACC ACTAACGGCG CACTTGCGAT TACAGGAAAC 
ACACTCGGTT TAAGTAAAAT CAGTAATCAA AACCGTGCTG GTACAATCGG GGCAATTGGC
GCATTTATAA CTACGAATAC CGCTTTACAA GTTCCCACTT TTCCTGCCGG CACAACTTTA
AACTATACAC AAAATAGTTC TACCGCTCTT TTAAATATTC CTGCTGGTAG TACGATTCTT
TACGCAGAAC TCATCTGGGG CGGCAACTAC TTATCTCGTG ATCAAAACAT TACAAGTGTT
TTAGGAAACC CCGTTTCTTT TACAACACCT GTTTCAACAT ACTCGATTAC TCCTTCAGCT
GTTACAGCTT CCAATCAAAC ATTCGTTTCT GGATCTATCA CATTTGGATT CTATACACGT
TCTGCAGATG TAACACCCCT CATTCAAGCG GGAGGATCTG GCTCTTATAC AATCGGCTCT
GTCCCTGGAC TTGTAGATCC TATAGATGCT TCTAACGGAA CAATTAATTC AGCTGGGTGG
ACGCTTATCG TCGCTTACCA AAATGGAACA TTACCTGCAA GAAACTTAAC CATTTATGTA
GCAGGCAACC GGGTTTCTGC AGAAACTGGT AGTGCCGATG TATCTGTTTC AGGATTTTTA
ACACCTTCAG GAGGGCCTGT AAGCGGTAGA TTATTTTTAA GTTCTACCGA AGGAGATGCT
GATTTAATTG GGGATCAGGC TCTATTCGGG CCAAATTTCA GTTCATTAAA TGCCTTATCT
GGACCTAACA ATGCTGCAAA TAATTTCTTC GGTTCTCAAA TTAATAATGC TGCTGGAAAC
TTAGATACAA CCGGGACATT TGGAACGCGA AATCAAAGTG CTTCCACAGG TACAAACATC
TCCGCTGGAA GACAGGGCTG GGACATTACT TCCATTGATA TTTCTCCTTA TTTAACAAAT
TCTCAAGTGT CCGCCGCAAT CCGTTTAACA ACTAACGGAG ACGCATATAT GTTGAATACA
GTCGGTTTAC AAATCAACAT AAATTCACCT AACATACAAG CAACAAAAAG CGTGAATAAA
AGTGTTGCAG CAATTGGAGA CGTTCTCACT TATACAGTTA CTATCCCTAA TACGGGGCTT
CTTCCCGCCA ATAACGTTAT TTTTACAGAC ATTCTCCCTA ACGGTACTTC CTTTATACCT
GGAACTGTAA CAGTAGATAA TGTCCCGCAA ACGAATGCAA ATCCGGCCGC TGGTATATCT
CTTGGAACCG TTAATAACAG CGCTTCTCGT ACAGTAACTT TCCAAGCTAC TGTCGTTTCT
TTTCCAAGTC AAAATCCTAT CTCCAACACT GCTAATATTA CATTTCAATA CACACCAATC
GCCGGAGGAA CGACTTTTAA CGGTCTTGCA ACAAGCAACT CTGCTGGAAC ACAAGTTAAC
CTCGCAGATA TTAATGGCAC AAAATCAGTT AACAAACTTT TTACCGATAT TGGGGAAACG
TTAACTTACA GTATCGCCTT AGCTAATATA GGGAATATTG CTGCAACTAA CGTAATATAT
ACGGATCCGA TTCCTAGCGG GACTACTTTC GTTCCGGGAA GTGTAACTGT TAACGGAGTT
ACTCAGACTG GAGCAAATCC CGCTACTGGT ATATCAATTG GCGCTATTGC TGCTAATTCT
ACGACTACTG TTTCATTTCA AGTACTCGTT CCTTCTATTC CCCAAACAAA TCCAGTTTTA
AATAGCGGAA CAACAACATA TCAATACATT CCTGTGCCAA ATCAACCGGC AATAAGTGGG
ACTGATACGA CCAATATCGT ATCTACTCAA GTGAATAACG CTACTGTAAC TATGGCAAAA
TCAGTAGATA AAAATTTTGC AGATATTGGT GATACACTAA CGTACACCGT TTCCTTTACA
AGTACAGGTA ATACAAATGC GAACAACGTT ATTTTTACAG ATGTCATTCC TACTGGAACA
ACTTTTGTTC TAAACAGTTT AACAATAGAT GGCACGACAC AAGGTGGAGC AAATCCCGCT
AACGGTGTGA ACATTGGATC AATCTCAACT GGCACAACAA AAAATGTTTC ATTTCAAGTA
GTTGTAAATA CAATACCCGC GTTAAATGTC GTATCTAATG GATCAAGCGC TTCTTATCAG
TACACTGTCA ATCCAAGCCA ATCACCCGTT ACAAAAAACC TTTCTTCTAA TCTCGTTTCC
ACTCAAATTA ACAATGCGAA TGTAGCATTA ACAAAATCAA CAAATAAACA ATTTGCCACA
ATTGGGGAAA CGATAAGTTA TACAATTCTT ATTACAAACA GCGGAAATAC AGCTGCAACT
AATGTACAAC TAACAGACCC ACTTCCAAAC GGAACAATAT TGACCCCTGG TTCTGTAACA
CTCAACGGCA TTTTGCAAAA TGTAGATTCT CTCGTCGCTT TACCTATCGG CACAATTCCT
GGCGGAGCTA CTTTTACACT TTCTTTCCAA GTAACAGTCA TCAATATTAC CACCCAAAAT
CCTATCATTA ATAATGCTTT CGCCTCTTAT CTATATACTG TAAATCCAAA TCTGCCACCA
ACTTCAAAAA CAGTAAATTC TAATTCTGTT ACATCAACAA TTAGACTAGC AAACCTTCAA
GCTACTAAAT CTGTAGATAA AACGTTTGCG GAAGTTGGGG ATGTATTAAC TTATACCTTT
TCTCTTACAA ACGATGGAAA TGTTGCGGCA AACAATGTAG TACTATCCGA TTCAATTGCG
AATGGTACTG CCTTTGTACC AAACAGTGTT ACGATTAACA ATGTTACTCA ACCAGGCGTT
ACACCAGCTA GCATCAATAT CGGTAGTATC ACTGCTGGTA CTACAATTAC AGCTTCATTC
AAATTTTTAA TAACTAGTAT TCCAAACCCA AATCCTATTT CAAATAGCGC TTCTATTTCC
TATAACTTTA TCGTTGATCC AAACGCTTCC CCTATAAGTA AGAACACAAC TTCAACCACT
ACATTTACTC AAGTAAATGA CGCAAATGTC ATTTCAGCAA AAACAGTGGA TCGAGCGTTC
GCTACTGTTG GGGATGTATT AACTTATACC GTCGTTTTAA CGAATGCAGG AAGTGTTTCT
GCTGATAGTC CTACTTTCGT AGATACGAAT CCAGACGGTA CTACCTTTAT CCCAAACACT
TTCCTTATTA ATGGTGTACT CCAAAATAAC GCAGATCCAA ATGTCGGTGT TCCCTTATCT
TCCATTCCTG CGAACGGTTC ACTTACCGTC TCTTATCAAG TAACTGTCAC CTCTTTACCA
ACACAAAATC CAACAATAAA TTCATCTAGT ACACAGTATA GTTTTATTTT AAATCCGGGC
GATCCACCAA CTATAGAAAC ATCTTTAAGT AATACTGTAA GTACACAAAT TAATTTAGCA
AATGTAGTTA TTGTCAAACA GGTAGATTTA ACTATTGCTG ACGTTGGGCA ACCAATCACA
TATACAATTG CTTTAGCTAA CCCTGGGAAT ACTCCCGCAA ATAATGTAGT TGTTACCGAT
ATACTCCCTC CTGGTACGAC TCTCGTACCA AATAGTATTT TTATAGGCGG GGCTTTACAA
CTTGGTGCGG ATCCAAGTGC TGGTCTTCAA GTTGGTACGA TTCCAGCTGG TGGTTTTACA
ACAATTGTCT TCCAAATTGG TGCAAATAGT TTACCTTCAC CAAACCCAGT TCAAAACAGT
GCTGTACTTC AATATAACTT TATCGCAGAT CCAAATTCAC CTCCCGTTGT AAGAAACTCT
GCTAGTAATA TAGTAACTAC ACAAATTAAC ACAGCTAATA TTGTTGCTAC GAAACTAACA
AGCACAAACT TTGCTGATGT TGGCGACGTC ATAACTTATG CAACGATTTT AACGAATAAC
GGCAATATCC CTGCCTCTAA TGTAACGTTT ACAGATATTA TTCCAGCTGG TACTATCTTC
CTGCCTAACA CTGTAACGAT TAACGGTGTC CCTATCGCTA ATGCAAACCC TACTAACGGC
ATTTTAATTG GTACGATAGG AGCGAATTCA TCACGTACTG TTTCATTCCA AGTTTCTGTA
CCAACTATTC CTATTCCAAA TCCGATTACG AATCAATCGA GCACTACATT CCAATACACG
TACGACGCAT CCAAACCAGT TGTAACGCAG ATGGTAGCCT CTAACACTGT ACAGACAACT
ATTAATAATG CTACGATTGC CGCTGTAAAA TCGGCAGATA AACAGTTTGC TAACGTAAAC
GATATTATTA CGTACACAAC TACTTTAACG AATAACGGGA ACACACTTGC ATCAAATGTA
ATTTTTACAG ATGTAATTCC AAACGGAACA TCATTTATCC CTAATAGTGT AACAGTAAAT
GGAAATACAC TTCCTAATAC AAATCCAGCA AGTGGAATTG CAATTGATCC AATAAACCCT
AATACAAGTG CAACAATCTC ATTTCAAGTA ATAGTAAATT CCATTCCTAG TCCAAATCCA
ATCCCGAATC AAAGTAATAC AACATACCAA TATATCATAA ATCCTAACTT ATCTCCTGCT
TCCGCCAATG CTCTAAGCAA TCTAGTAACA ACTCAAATTA ATAACGCTAC AATCACTGCT
ACGAAGTCAG TGAATACACC GACTGCTGCA ATTGGAGATA TCGTTACTTA TACGATTGCC
GTTACGAACA CAGGAAATAT CCCTGCTAGT GCTACAGTTT TAACAGATGG ACTTGGACCA
GGTGCCTCCT TCATACCGAA TTCCGTTACG ATAAATAACG TTTCCCAACC TGGATTAGAT
CCTTCATTAG GTATTCATTT AGACGATATT TCACCCGGCG GCATTACTTT CATTACATTC
CAAGTGAAAA TCCTCGCCAT TCCACCGAGT GGAACTTTAA CGAATAATGC TCTTGTAAAC
TACGAATACG CGGTGAATCC AACTGAAACA CAAGCTGTTG GCAGTACCGT TACAAATACG
ACAGTTACAC CAATTGTTGA CGCTACTTTA GTAATAAATA AAAATGCTAG TACAACTTTC
GCTACAATTG GAGATACGAT TACATTCACC TCAGTTATTA CAAACATAGG AAATACTACT
GCGAACAACA TTGTTTTTAC AGATTCGATT CCAACTGGTA CTACCTTTGT CCCCAATACC
TTAAAAATAA ACGGTGTAAC GGTTCCGAAT ACAAATCCAC AAAACGGTAT TAATATTGGT
AACTTGAATG CAAATGCATC AGTTACACTT AGTTTTCAAG TAAACATTAC AACTCTTCCA
AATCCTAATC CAATTCCGAA CCAATCATCA CTTCAATACA GCTTTATTGT TGATATAAAT
GAACCGCCTG TTTCACGAAC CGTTCAATCC AATAAAACTT TTACACAAGT AAACTCTGCT
TCCGTTATCG CAACAAAAAC TGCAAGCAGC GCATTTGCTG CTGTTGGAGA TACAATTACG
TATACAACTA CCCTCACTAA TAGCGGTAAT ACTATTGCAA ATACACCTGT TTTTATCGAT
ATATTGCCAC CTGAACTGTC ATTCGTTCCT GATAGCGTAC AAATTAATAC CATCCCACAA
CTTGGATTTA GGCCTGATAC TGGTGTTCCT TTAGACTCGA TTCCAGTTGG AGGAACGATA
ACAATTAGCT TTCAAGCTAT CGTTGGTTCG ATACCAGCTA TAAATCCAAC ATTGAACCAA
TCTAGCACAA CATACTCTAT CATTGTTGAC CCTACCCAGC CGCCGGTGAC AGAGACAGCT
ATAAGTAATC CAACTTTAGT TCAAATTAAC GAAGCCATTA TTCAAGCAAC GAAAAGTGTG
GATCGAATAT TTTCTGACAT CGCACCTGGA AATTCATTTT TAACGTATAC TGTTTTATTA
GAAAATATAG GGAACACGAC TGCTACGAAT ATCATTTTTA CAGATCCGAT TCCACATAAT
ACAGTATTTA TAGAGGATAG CGTTCGAGTA GGCGGGATTT TATTACCTGG AGTAAATCCA
GCAAACGGAA TACCAATTGG GGATATTATT GCAGGGGATT TTATAAACGT CACCTTCCGC
GTGCAAGTAG TTAGTATTCC AAATCCAATT TTCACAATTG GACCTGGGGG GCCAAATTCA
CCGGTTGTAA ATGGAGCTTC CATTGATTAT CAATTTATAA CAGGACCTAA TTTACCACTC
GCTTCAAGAA GTACGACATC CAATCCTGTT TCAACACAAA TAAATTCTGG GGAAATCGCA
CTTGTTAAAT CTGTAGATAA AACTTTCGCA ACGATCGGGG ATACACTTTC TTATACAATT
TCATTAAGTA ACCCTGGAAA TGTCACTTCA CAAAATATAA TTTTCACGGA TGTTTTACCT
GAAGGAACAA CTTTTATTTC CGGAACACTT ACAAACGATT CTGGTACACA GCAAATTGGA
AATCCAGCTA CCGGGATTCA AATTGGAAAT ATAAATCCTG GTAGTACGGC TACTATTGTG
ATAAACACAC TTGTTACAAA TATTCCAAGT ATAAATCCAA TTTCGAACTT TAGTTCTGTA
CAATTTGCAC ATGTGGTCGA TCCAAGCCAA CCTTCCGTAT CACAAACGAA TCTATCTAAT
ACTGTTTCGA CAACTATTAA GAGTGCTATA TTAACGACTA CAAAAAGTGC TGATAAATCC
GTTATTTCTG TCGGTGATAC AATTACGTAT ACAACTACTA TTACAAATAC AGGAAATACG
GCAGCAACGA ATATAAAGTT CACGAGTGCA ATTCCAGCTA ACACTACCTT TATACCAAAC
TCAGTCACAA TAAATGGGGT TCAGCAATCT GGTGTGCAAC CAGCACTTGG AGTAAACATA
CCAAATATTG CTCCTGGTGA AACAGTAACT GTTACTTTCC AAGTAAATGT TCTTTCCGTT
CCCTCTTCAA GTTCAATTAT GGGGAATGAT ACCATTTTAT ATTCGTATAC TGTCGATCCA
AACGGAACTC CTGTTACAAC TTCTACTTCA ACGAATATCG TTACAAACCC TGTATTAGAT
GCTATCATTA CGATGGTAAA ATCCGTCGAT CAAACACTTG TAACACTAGG TGATACCATT
ACCTATACGA TACTTTTGAC AAATACCGGT AATACAAATG CTACTAATAT CACTTTCACT
GATTTTATAC CAAATGGTAC TACGTTTATT ACTGATAGCG TTACAATAGA TGGCATCACG
CAAATCGGGC TCAATCCTAC TACAGGTATA ACGATTGGAG CAATTGCTCC TAACAGCTCA
ATATCTATAG CATTTCAAGT TACCGCTACT TCTACACCTG TTCAAAATCC TATTGCCAAT
TCCGCTACTG CTTCTTACAC ATTTATCGCT GATCCTAATG CCCCTATTGT TTCAAGGACC
GTTACTTCAA ACACAGTGTT CACTACGATT AATACAGCTA CCATTCTTTC ATTAAAACAA
GTCGATAAAT CCTTTAGTCG TATTGGAGAC ACACTCACTT ATACTGTCGC TTTAACAAAC
AATGGAAACT CATCCACACA AAATGTTATA TTTACAGATA CTATACCGAG CGGAACAGCA
TTTATTGCAG ACACATTTTC TATTAATGGA ATTCCTCAAA GTGGCGCAAA TCCAGTGAAC
GGTGTAAATA TCGGATCTAT AACAGCTGGA ACTACAGTAA CAGTTTCTTT CCAAGTTACT
GTAACGTCAT TACCCACGGA AAACCCCATT GTAAATTTCT CATCAACATC GTACCAATTA
GTCTCACCGC CTGATGCAGA AACTTCAATT AGCAATCCTG TTTCAACGCA AATTAAAGAA
GCACTATTAT CCATGACGAA AAATGAAAGT GTATCCGTTG CAGATATCGG GCAAACTGCT
TTTTACACTA CTTCTATTTC GAATATAGGA AATACAGATG CTACTAATAT TGTATTCACA
GATGTATTAC CAAATGGAGT CACATTTGTT CCTAACACAT TAACTGTCGA TGGTGTTTTA
CAACCGAATG CGAATCCAAA TACAGGTGTA TTACTTGCAA CACTTCCGCC TAATGAGATA
TATAGTATCG CCTTTCAAGC TGCAGTGAAC AGCATTCCCT CTATTAATCC AGCACCGAAT
ACAGCATCAA CAACATATGA ATTTACTGTT GATCCTGCTA ATCCTCCAGT ATTAAGTGCC
GCTACTTCCA ACACTACTCT TCTTCAAATA AATAACGCAA ATATTATAAG TACAAAAATA
ACAGACCTTA CTTTTGCGGA TGTTGGTAAT ACAATAACAT TTACACTTAA CCTCCCGAAT
ACAGGGAATG TGACTGCAAC TGATGTTACC GTTATCGATA CGCTTGATAG CAATTTAACT
TTTGTTCCAA ACAGTTTCAC AGTTAATGGG CAAACTATCC CAAACGCTGA TTTATCTACT
GGTGTAAATA TCGGTTCCAT TAATGGTGGA AGCACATCAA TTGTCACATT CCAAGCTATC
GTTTCAACCC TCCCAATCAA TAATCCTATT TCTAATTCGG CTCTTACAAC TTATCGTTAC
ATTGTTGATC CAGATCAGCC GTTCATTTCA ACTTCTAATC AATCTAATAC AACGATGACA
CAAATTAATA GCGCTATCCT TACTGCACAA AAAAATTCAA ATGTGTCAAC AGTAGATATT
GGGCAAGATA TTATCTACTC CGTTACGATT ACAAATAGCG GAAATGTTAG TGCAACGAAT
GTTATTTTTA CCGACGTTAT TCCAGACGGA ACTTCCTTTG AACCAAATAG TTTTACACTT
AATGGAACTA TTATCGAAAA TGCAAACATC ATTACAGGCG TCCCAATTGG TGATATCGCG
CCAAACGAAT CTGCCATTGT AGAATTTCAT ATTACTTCAA ATGAAATCCC GGCTATTAAT
CCAATTACTA ATCAAGCTAG CGTTAGCTAT CAACATATCG TCAATCCAGC TAATCCTCCT
GTTTCAAAAA ACATTACTTC AAATAGTGTT ACAACAACAA TTGAAAGTGC TATTTTAACT
ACTACTAAAA TCGGTGATAA AGCTTTTGCA ACGATTGGTG ATACAATTAC GTATACAACT
ACGATTACGA ATACTGGAAA TATTTCTGCC AATAACATTA TTTTCTCAGA CCCAATACCA
TCGTGGACAC AATTTGTTGC AGGATCCGTT ATTGTTGATG GCACTCCATT ACCATCCGCT
TCTATCATTG ACGGTGTTGG CATAAATACA GTCAATCCAA ATCAAACTGT AACAATCATA
TTCCAAGTTC AAATCGTAAG TAGCCCAACA ACGTTCACAC CTGAACTCCA AAACTTAGCA
TTTGTTAACT TCCAATATAA CGTAGGCAAT GCATTACAGG CTCAGCCTGG CAATGTGGAA
ACAAACATCT TCGTTACTTC TATTCATTCA GCAATACTTT CTGCTGTAAA AACTGCTAGT
ACAGCCTTTG CGAATATTGG AGACACGATC ACTTATACCG TTTTAATTCA AAATAGCGGC
AATACAAACG CTACGAATGT AAATTTCTCA GACCTCATTC CAGCAGGAAC GACCTTTATT
GAAAATAGTT TTGCTGTAAA TGGAAGTACC ATTCCAGGTG CAACTCCAAA TAACGGAGTT
AATATTGGAA CCGTTAGTAC GAACAGTTCC TTAACAGTTA CTTTCCAAGT CATAGTTACA
TCTACTCCGC CTTCAAACCC AATTACAAAC GTTGCATCTA TTCAATACGA ATTTATTGTT
GATCCGACTT CTCCTCCTGT TACAGGCACA ATTACTTCTA ATAGTGCTTC CACACAAATT
AATAACGCTA CTGTTACAAC GGTTTTAGAA GCAAATCGAA CAATTGTATC TATCGGAGAT
ATAATTACGT ATACAGCAAC ATTAACAAAC ACTGGAAACT TCCCTGCAAA CTCGGTATTA
CTCATTAACG GTGTCCCTGA AGGGGCATTA TTTGCTCCCA ATAGTGTTAC GTTTAACGGA
ATTTCACTTC CAGATGCAAG TCCAACTCTC GGTATTCCAG TTGGCATTAT CGCACCAGGT
GATTCTGCTA CAATTACGTT CCAATTTCTT GCAAACTCTA TTCCGCCACA AGGAGCAATT
ATAAATCAAG CACTTACAAG TTACACATAT ATTGTCGATC CAAATCAACC TCCAGTTACA
GCAACATCTT CATCTAATAC AGTTACTACA GCTGTCGTTG ATGCATCGCT ATCTGTAATT
AAAAATACAG ATTCCCTCGT ACAATCTACT GACGGTACAA TCACTTACAC TGTCGTCATT
CAAAACAACG GGAATACAAC TGCAAATACA GTTACTTTAA CAGATTTGGT CCCAGAAGGA
ACTGCATTGA TTCCAAATAG CGTGACCATT AATAGCATCT CAATTCCAGG TGCCGATCCA
AACGTAGGAA TACCAATAAA CTCCATGGCG CCGTCAGAAA TCGTCACCGT CACATTCCAA
GTTATCGTTC AATCTATCCC AAGCGTGAAT CCAATTTCTA ATATAGCCCG TATTGACTAT
ACTTTTATCG CGGATCCAAC TGCTCCTATC GTCTCTCGAA CAATTACTTC GAATCCAGCT
TTCACACAAA TTTCAGATGC GAATGTGCTT TCTTTAAAAG CCGTCAATGC ACAACAAGCA
ACAACTGGCG ACATTTTAAC GTACACGATA ACATTAGAAA ATACCGGAAA TATTCCAGCT
ACAAATCTCA TATTTTCAGA TTCGATTCCA GCTGGGACTA CATTCGTAGA AAATAGTTTT
ACACTTAACG GAGCAGCTAT ACTGGGTGCA AATCCAAATA TAGGTGTTAC TTTGCCTAAC
CTAGCAGCAA ACGCTACTCA CCTTATTGCG TTCCAAATTC TTATTAACGA TCCATTCTCG
CAACAATCGA TTACAAATCA ATCTAATACA ACATATACAA TTCAACCAGA CCCAGGGCAA
CCGCCTATTA CTGAAACATC TACAAGTAAT ATTGTCATTA CAAATTTCGT GCAAGCACAA
TTGACAATTA CAAAAACGTC CAATCCAATA ACTGTAGATA TTGGCGGAAC TATACTTTAT
ATTTCTGAAT TGAAAAATAG CGGCAATGTT GACGCAATAA ATATTATTTT TACAGATTCG
ATTCCAGCTG GGACTACATT CGTTCTCGAC AGTGTCACAA TTAACGGTGT ACTTCAGCTT
GATGTAAATC CTGAAAACGG AATACCAACT GGAACGATTC CACCAAACAG TTCCAAAACA
ATACTATTTC AAGTACAAAC AAATAATCCA CCTACTGAAA CCGAAATTGT AAATCAATCT
TCAGCAACTT ACCAATATGT AAGTATCCCT ACAGCTCCAC CAGTGAATCG CTCTGCAAAT
TCTAACATTG TTACAACATC ACTTCAAAAT GCGAATATTA TTTCTGTTAA AAGCGCAGAT
GTAACTTCCG TATCCATCGG GCAATTTATT ACCTACACAA ATACACTACA AAATACCGGA
ACGGTTCCAG CTAACAATAC GGTGTTCATT GACAACATTC CAGAAGGGAC TATATTCATT
GAAGATAGCT TATCAATAAA TAATGTCATT CAGCCTGGTA CGAATCCTGA AAATGGAGTA
ACTCTCGGCA CGATACAACC AGATGAAACA GTCACTATTT CATTCCAGGT ACAACTTACA
AATATACCTG AGGACAATAC AGTCATTAAC ATCTCAGACA CTTCGTATGA ATACCAAATT
GACTCTAGTT CTCCAATTAT TCAGCGTAGA TCATTATCAA ATGCAGTAAA CACGGAAGTC
CGTACAGCAA ATGTTAGTGC AATTAAATCT GCTAACAGAT CCATTACACG CATCGGTCAA
ATTATCACAT ATACCATCGC AGTTACAAAT GCTGGTACAG TACCTATTAC AAATACTCTC
CTACTTGACG CAATTGCTGC TGGCACCACA TTCGTTCCAA ATAGCATTCT TGTAGATGGC
ATACCAAGAC CTAACGAAAA TCCAAGTACC GGAATCTCCC TTAATATTAT TCTTCCAAAC
AATACGATTA TTGTTACGTT CCAAGTAAAT GTAGACTCGA TACCTTCTCA AAATAACATG
AATAATATCG CCGTCATCCA CTATGAATAT CAGCCAGACC AAAGCTTACC ACCAATTTCA
GAAACGACAT CTTCCAACAG TACAAATATA CAATTTATTG ATGCTATTCT TATCGCTACA
AAATCCGCTA ATACAGTATT AGCTAATATT GATGAAACTA TTGAATATAC AGTACTCATT
CAAAATAACG GATCCACTAC AACTAACTCC ATCTTTTTTA CAGATACGAT AGAGGATGGA
TCTGTATTTA TCCCGGGAAG TGTAATAGTT AACAATACCG TACTTCCTGC AGCGGATCCG
AATATCGGCT TTTTCATCCC GAATATCGCA TCAGGTCAAG TGGCTACAAT AACATTCCAA
GTTTCGGTTA CGAATTTACC TGTTGCAAAC CCAACACCTA ATACCGCAAA CATCGTCTAC
GACTTTATTT TCAACCCTGA CTTTGCACCA ATTCAAAAAT CGACTACTTC CAACACTACT
TTCGTTCAAA TTAATGATGC TGATATCGTT TCACTTAAAA CTGTTGATTT GACTTCTGTA
ACAATTGGTG ATGTTTTAAC TTATATAACA ACTTTAACAA ATACAGGGAA TACGGATGTT
ACTGCTCTTG TATTTACAGA CAATATTCCT GGTGGAACAA CCTTTATAGA CGGTAGCGTT
TTAGTAAATA ACATTCCGCA GCTTAATGCC AATCCAAGTA CCGGTATATT GGTAGGAACG
ATTGCTCCTA ACATTTCTAT CCCAGTCACA TTTTCTGTAA CTGTCGTAGC GCTTCCAACT
AGCGGCCATG TTCAAAATCA AGCAACTTCT CGTTATACGA TAAACGGAGA AGAACAAATA
TCGACTAGTA ACTTTACTTT CACTGAAGTT ATTTCTGCTA ATATAATCGC AGTAAAAACA
ACTCCTATCC AATATGCTGA CCTACAAACC ATTATCCCTT ACACAATTTC CATCACAAAC
AATGGGAATA TACAAGTGGA AAACATTATC GTTACAGATA TCATCCCAGC AAATACAAAC
TTTATAGAGA ATAGTGTTAT TGTGAATGGA AACACTCGTC CAAATGACAA TCCACTTAGC
GGGATACCAA TTGATAACAT TCTGCCTAAT ACGACAGCAA CTGTTCTATT CCAAGTACGA
GTTACTTCGA TACCTCAAAC CAATCCAATC TCTAACACAA GTACAATTGA ATATGAATAC
ACGGTACAAG ATCAACCACC TATTACCAAA ACTATTATTT CATCAGCTGC TTTAACAGAA
ATTAATCATG CGAATTTGAA TAGTAATAAA GCTGTTGACC TTGCATTTGC AATGGTCGAT
GATACGTTAA CGTATACGAT TACACTCAAT CAAACTGGTA ATGTTGCAGC AAATGATGTA
ATCATTCAAG ATATGATTCC TCAAGGGACT ACATTTATAG AAAATAGCGT TATTGTAAAC
GGAGAGGCTC TTCCGGGAGT GGATCCAGCA AGCGGCATAC CAATTGGTAC TATAATTGTA
GATGGGGACG CTATCGCTTC ATTCCAAGTA ACTGTGACTT CTATTCCAAT ACGAAACGAG
CTCAACAACC AAGCAATCAC TACTTTTAAC TATATAGTCA ACCCAAATAA CGTGCCTGTT
ACAAATAAGA CGACAACAAA TACAGTCACA ACAACCGTTC AAAATGATAA TATCATTGCG
ATAAAAGCTG TTGATTTCAC GAGTGCCTTA CCTGGTCAAA CTTTAACGTA TACCATTACG
ATTACTAATA ATGGTAATAT CACTATTGAA GATCTACTTC TAGTAGACAC GGCACCTGTA
GATACGACAT TTGTTATTGG TAGTGTTACG ATTAACGGAA TCAATCAGCC TAATACGAAT
CCTGAAAATG GTATTACGTT AGGAACTCTT GCTCCTAATG ACTCTGTTAT TATTACATTC
CAAGTGACAA TATCTTCTTC TACTCTTCAA TCTACAATCA ATAACGATGC TACTATTTTC
TATACGCCTA TTGTCGGTCT AATCGAACCA CCTATTACAA TTACAAGGCA AATAGATATC
GTCACAAAGC AAACAAATAC TGTTACAACA ATAGTAGTTG ATCCAATGGT TAGTATTGAA
AAAACAGCCG ATAAATCTAT CGTGGCGACA GGGGACATCA TAACTTTTAC ATTAAAAGTG
ATAAATCATT CACCAATTCC TACAATCAGT ACTTCTGTGC TAGATATAAT TCCAGCAGGT
ACAATATTTA TAGAAGACAG TGTAACAATT AACGGTACTC TAGTTTCAAA TATTCGTCCA
GACACTGGTA TGAATATTGG CTCTTTATCT GCAGATTCCA TAGCAACTAT AACATTTAAA
GTTCTCGTCA CTTCTATCCC CTCAAACAGT ACAATTATTA ATTCTGCTAC CGTCACAGCC
GCCTTCCAAT TGACACCCCA GGACCCAATT ATTACTTTTA TTGTTAATTC AAATATCGTT
CGTATACCAG CTCAATTTAT AACTGCGACA GTAGTGAAGA ACGCTTCCGT CACCTCAGCT
TATTTAAATC AATATTTTGA TTACACGGTG CGTATTACGA ATACTTCCGA GATTTCACTC
TCAAATATTT CTTTACAGGA TACTATTCCA GTAGGTTTAC AATTTATAAA CGGCACTGTC
TTCATTAACG AAGAACGCTC TCCACTAGCG AATCCGAATA TTGGTTTTCT AGTCTCTACT
AATTTGGAAC CAAACGAAAC AATTATCGTG TTATTCACAG TACAAGTAAT AAGTCCACCT
GTTAATAATG AATTTAAAAA CAGCGCAAAT ATTTCTTTAC AACTTCAAGT CTCACCTACC
GATTCACCAA TTACAGTAAC AGTTACAAGT AACGAAAACA TCGTCATCTT TGTTCCAGAA
AATCCAGATG AAACACTTCC AAATTTTAAT TGCTTCTTTG ACGGTGAACG TTTTATACGC
ATTCCTCCTC GAAATATACG AAATTACTTT TGGGCTTGGA TTTGGTGGCA GTAA
 
Protein sequence
MPITNRFSTT TNGALAITGN TLGLSKISNQ NRAGTIGAIG AFITTNTALQ VPTFPAGTTL 
NYTQNSSTAL LNIPAGSTIL YAELIWGGNY LSRDQNITSV LGNPVSFTTP VSTYSITPSA
VTASNQTFVS GSITFGFYTR SADVTPLIQA GGSGSYTIGS VPGLVDPIDA SNGTINSAGW
TLIVAYQNGT LPARNLTIYV AGNRVSAETG SADVSVSGFL TPSGGPVSGR LFLSSTEGDA
DLIGDQALFG PNFSSLNALS GPNNAANNFF GSQINNAAGN LDTTGTFGTR NQSASTGTNI
SAGRQGWDIT SIDISPYLTN SQVSAAIRLT TNGDAYMLNT VGLQININSP NIQATKSVNK
SVAAIGDVLT YTVTIPNTGL LPANNVIFTD ILPNGTSFIP GTVTVDNVPQ TNANPAAGIS
LGTVNNSASR TVTFQATVVS FPSQNPISNT ANITFQYTPI AGGTTFNGLA TSNSAGTQVN
LADINGTKSV NKLFTDIGET LTYSIALANI GNIAATNVIY TDPIPSGTTF VPGSVTVNGV
TQTGANPATG ISIGAIAANS TTTVSFQVLV PSIPQTNPVL NSGTTTYQYI PVPNQPAISG
TDTTNIVSTQ VNNATVTMAK SVDKNFADIG DTLTYTVSFT STGNTNANNV IFTDVIPTGT
TFVLNSLTID GTTQGGANPA NGVNIGSIST GTTKNVSFQV VVNTIPALNV VSNGSSASYQ
YTVNPSQSPV TKNLSSNLVS TQINNANVAL TKSTNKQFAT IGETISYTIL ITNSGNTAAT
NVQLTDPLPN GTILTPGSVT LNGILQNVDS LVALPIGTIP GGATFTLSFQ VTVINITTQN
PIINNAFASY LYTVNPNLPP TSKTVNSNSV TSTIRLANLQ ATKSVDKTFA EVGDVLTYTF
SLTNDGNVAA NNVVLSDSIA NGTAFVPNSV TINNVTQPGV TPASINIGSI TAGTTITASF
KFLITSIPNP NPISNSASIS YNFIVDPNAS PISKNTTSTT TFTQVNDANV ISAKTVDRAF
ATVGDVLTYT VVLTNAGSVS ADSPTFVDTN PDGTTFIPNT FLINGVLQNN ADPNVGVPLS
SIPANGSLTV SYQVTVTSLP TQNPTINSSS TQYSFILNPG DPPTIETSLS NTVSTQINLA
NVVIVKQVDL TIADVGQPIT YTIALANPGN TPANNVVVTD ILPPGTTLVP NSIFIGGALQ
LGADPSAGLQ VGTIPAGGFT TIVFQIGANS LPSPNPVQNS AVLQYNFIAD PNSPPVVRNS
ASNIVTTQIN TANIVATKLT STNFADVGDV ITYATILTNN GNIPASNVTF TDIIPAGTIF
LPNTVTINGV PIANANPTNG ILIGTIGANS SRTVSFQVSV PTIPIPNPIT NQSSTTFQYT
YDASKPVVTQ MVASNTVQTT INNATIAAVK SADKQFANVN DIITYTTTLT NNGNTLASNV
IFTDVIPNGT SFIPNSVTVN GNTLPNTNPA SGIAIDPINP NTSATISFQV IVNSIPSPNP
IPNQSNTTYQ YIINPNLSPA SANALSNLVT TQINNATITA TKSVNTPTAA IGDIVTYTIA
VTNTGNIPAS ATVLTDGLGP GASFIPNSVT INNVSQPGLD PSLGIHLDDI SPGGITFITF
QVKILAIPPS GTLTNNALVN YEYAVNPTET QAVGSTVTNT TVTPIVDATL VINKNASTTF
ATIGDTITFT SVITNIGNTT ANNIVFTDSI PTGTTFVPNT LKINGVTVPN TNPQNGINIG
NLNANASVTL SFQVNITTLP NPNPIPNQSS LQYSFIVDIN EPPVSRTVQS NKTFTQVNSA
SVIATKTASS AFAAVGDTIT YTTTLTNSGN TIANTPVFID ILPPELSFVP DSVQINTIPQ
LGFRPDTGVP LDSIPVGGTI TISFQAIVGS IPAINPTLNQ SSTTYSIIVD PTQPPVTETA
ISNPTLVQIN EAIIQATKSV DRIFSDIAPG NSFLTYTVLL ENIGNTTATN IIFTDPIPHN
TVFIEDSVRV GGILLPGVNP ANGIPIGDII AGDFINVTFR VQVVSIPNPI FTIGPGGPNS
PVVNGASIDY QFITGPNLPL ASRSTTSNPV STQINSGEIA LVKSVDKTFA TIGDTLSYTI
SLSNPGNVTS QNIIFTDVLP EGTTFISGTL TNDSGTQQIG NPATGIQIGN INPGSTATIV
INTLVTNIPS INPISNFSSV QFAHVVDPSQ PSVSQTNLSN TVSTTIKSAI LTTTKSADKS
VISVGDTITY TTTITNTGNT AATNIKFTSA IPANTTFIPN SVTINGVQQS GVQPALGVNI
PNIAPGETVT VTFQVNVLSV PSSSSIMGND TILYSYTVDP NGTPVTTSTS TNIVTNPVLD
AIITMVKSVD QTLVTLGDTI TYTILLTNTG NTNATNITFT DFIPNGTTFI TDSVTIDGIT
QIGLNPTTGI TIGAIAPNSS ISIAFQVTAT STPVQNPIAN SATASYTFIA DPNAPIVSRT
VTSNTVFTTI NTATILSLKQ VDKSFSRIGD TLTYTVALTN NGNSSTQNVI FTDTIPSGTA
FIADTFSING IPQSGANPVN GVNIGSITAG TTVTVSFQVT VTSLPTENPI VNFSSTSYQL
VSPPDAETSI SNPVSTQIKE ALLSMTKNES VSVADIGQTA FYTTSISNIG NTDATNIVFT
DVLPNGVTFV PNTLTVDGVL QPNANPNTGV LLATLPPNEI YSIAFQAAVN SIPSINPAPN
TASTTYEFTV DPANPPVLSA ATSNTTLLQI NNANIISTKI TDLTFADVGN TITFTLNLPN
TGNVTATDVT VIDTLDSNLT FVPNSFTVNG QTIPNADLST GVNIGSINGG STSIVTFQAI
VSTLPINNPI SNSALTTYRY IVDPDQPFIS TSNQSNTTMT QINSAILTAQ KNSNVSTVDI
GQDIIYSVTI TNSGNVSATN VIFTDVIPDG TSFEPNSFTL NGTIIENANI ITGVPIGDIA
PNESAIVEFH ITSNEIPAIN PITNQASVSY QHIVNPANPP VSKNITSNSV TTTIESAILT
TTKIGDKAFA TIGDTITYTT TITNTGNISA NNIIFSDPIP SWTQFVAGSV IVDGTPLPSA
SIIDGVGINT VNPNQTVTII FQVQIVSSPT TFTPELQNLA FVNFQYNVGN ALQAQPGNVE
TNIFVTSIHS AILSAVKTAS TAFANIGDTI TYTVLIQNSG NTNATNVNFS DLIPAGTTFI
ENSFAVNGST IPGATPNNGV NIGTVSTNSS LTVTFQVIVT STPPSNPITN VASIQYEFIV
DPTSPPVTGT ITSNSASTQI NNATVTTVLE ANRTIVSIGD IITYTATLTN TGNFPANSVL
LINGVPEGAL FAPNSVTFNG ISLPDASPTL GIPVGIIAPG DSATITFQFL ANSIPPQGAI
INQALTSYTY IVDPNQPPVT ATSSSNTVTT AVVDASLSVI KNTDSLVQST DGTITYTVVI
QNNGNTTANT VTLTDLVPEG TALIPNSVTI NSISIPGADP NVGIPINSMA PSEIVTVTFQ
VIVQSIPSVN PISNIARIDY TFIADPTAPI VSRTITSNPA FTQISDANVL SLKAVNAQQA
TTGDILTYTI TLENTGNIPA TNLIFSDSIP AGTTFVENSF TLNGAAILGA NPNIGVTLPN
LAANATHLIA FQILINDPFS QQSITNQSNT TYTIQPDPGQ PPITETSTSN IVITNFVQAQ
LTITKTSNPI TVDIGGTILY ISELKNSGNV DAINIIFTDS IPAGTTFVLD SVTINGVLQL
DVNPENGIPT GTIPPNSSKT ILFQVQTNNP PTETEIVNQS SATYQYVSIP TAPPVNRSAN
SNIVTTSLQN ANIISVKSAD VTSVSIGQFI TYTNTLQNTG TVPANNTVFI DNIPEGTIFI
EDSLSINNVI QPGTNPENGV TLGTIQPDET VTISFQVQLT NIPEDNTVIN ISDTSYEYQI
DSSSPIIQRR SLSNAVNTEV RTANVSAIKS ANRSITRIGQ IITYTIAVTN AGTVPITNTL
LLDAIAAGTT FVPNSILVDG IPRPNENPST GISLNIILPN NTIIVTFQVN VDSIPSQNNM
NNIAVIHYEY QPDQSLPPIS ETTSSNSTNI QFIDAILIAT KSANTVLANI DETIEYTVLI
QNNGSTTTNS IFFTDTIEDG SVFIPGSVIV NNTVLPAADP NIGFFIPNIA SGQVATITFQ
VSVTNLPVAN PTPNTANIVY DFIFNPDFAP IQKSTTSNTT FVQINDADIV SLKTVDLTSV
TIGDVLTYIT TLTNTGNTDV TALVFTDNIP GGTTFIDGSV LVNNIPQLNA NPSTGILVGT
IAPNISIPVT FSVTVVALPT SGHVQNQATS RYTINGEEQI STSNFTFTEV ISANIIAVKT
TPIQYADLQT IIPYTISITN NGNIQVENII VTDIIPANTN FIENSVIVNG NTRPNDNPLS
GIPIDNILPN TTATVLFQVR VTSIPQTNPI SNTSTIEYEY TVQDQPPITK TIISSAALTE
INHANLNSNK AVDLAFAMVD DTLTYTITLN QTGNVAANDV IIQDMIPQGT TFIENSVIVN
GEALPGVDPA SGIPIGTIIV DGDAIASFQV TVTSIPIRNE LNNQAITTFN YIVNPNNVPV
TNKTTTNTVT TTVQNDNIIA IKAVDFTSAL PGQTLTYTIT ITNNGNITIE DLLLVDTAPV
DTTFVIGSVT INGINQPNTN PENGITLGTL APNDSVIITF QVTISSSTLQ STINNDATIF
YTPIVGLIEP PITITRQIDI VTKQTNTVTT IVVDPMVSIE KTADKSIVAT GDIITFTLKV
INHSPIPTIS TSVLDIIPAG TIFIEDSVTI NGTLVSNIRP DTGMNIGSLS ADSIATITFK
VLVTSIPSNS TIINSATVTA AFQLTPQDPI ITFIVNSNIV RIPAQFITAT VVKNASVTSA
YLNQYFDYTV RITNTSEISL SNISLQDTIP VGLQFINGTV FINEERSPLA NPNIGFLVST
NLEPNETIIV LFTVQVISPP VNNEFKNSAN ISLQLQVSPT DSPITVTVTS NENIVIFVPE
NPDETLPNFN CFFDGERFIR IPPRNIRNYF WAWIWWQ