Gene BCAH187_A1759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH187_A1759 
Symbol 
ID7074172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH187 
KingdomBacteria 
Replicon accessionNC_011658 
Strand
Start bp1620121 
End bp1635153 
Gene Length15033 bp 
Protein Length5010 aa 
Translation table11 
GC content38% 
IMG OID643450225 
Productconserved repeat domain protein 
Protein accessionYP_002337716 
Protein GI217959168 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATTA CAAATCGATT TTCTACCACC ACTAACGGCG CACTTGCGAT TACAGGAAAC 
ACACTCGGTT TAAGTAAAAT CAGTAATCAA AACCGTGCTG GTACAATCGG GGCAATTGGC
GCATTTATAA CTACCAATAC CGCTTTACAA GTAACTTCTT TTCCTGCTGG TACAACATTA
AACTATACAC AAAATAGTTC TACCGCTCTT TTAAATATAC CTGCTGGTAG TACGATTCTT
TACGCAGAAC TCGTTTGGGG CGGAAACTAT TTATCTCGTG ATCAAAACAT TACGAACGTA
TTAGGCAATC CTGTTTCTTT TACAACACCT GTTTCAACAT ACTCGATTAC TCCATCGGCT
GTTACAGCTT CCAATCAAAC ATTCGTTTCT GGATCTATCA CATTTGGATT CTATACACGT
TCTGCAGATG TAACCTCCCT CATTCAAGCA GGAGGATCTG GCTCTTATAC AACCGGCTCT
GTCCCTGGAC TTGTAGATCC TATAGATGCT TCTAACGGAA CAATTAATTC AGCTGGGTGG
ACGCTTATCG TCGCTTACCA AAATGGAACA TTACCTGCAA GAAATTTAAC CATTTATGTA
GCAGGCAACC GGGTTTCTGC AGAAACTGGT AGTGCCGATG TATCTGTTTC AGGATTTTTA
ACACCTTCAG GAGGGCCTGT AAGCGGGAGA TTGTTTTTAA GTTCTACCGA AGGAGATGCT
GATTTAATTG GGGATCAGGC TCTATTCGGG CCAAATTTCA GTTCATTAAA TGCCTTATCT
GGACCTAACA ATGCTGTAAA TAATTTCTTC GGTTCTCAAA TTAATAATGC CGCTGGAAAC
TTAGATACAA CCGGGACATT TGGAACGCGA AATCAAAGTG CTTCTACAGG TACAAACATC
TCCGCTGGAA GACAGGGCTG GGACATTACT TCCATTGATA TTTCTCCTTA TTTAACAAAT
TCTCAAGTGT CCGCCGCAAT CCGTTTAACA ACTAACGGAG ACGCATATAT GTTGAATACA
GTCGGTTTAC AAATCAACAT AAATTCACCT AACATACAAG CAACAAAAAG CGTGAATAAA
AGTGTTGCAG CAATTGGAGA CGTTCTCACT TATACAGTTA CTATCCCTAA TACAGGGCTT
CTCCCCGCCA ATAACGTTAT TTTTACAGAC ATTCTTCCTA ACGGTACTTC CTTTATACCT
GGAACTGTAA CAGTAGATAA TGTCCCGCAA ACGAATGCAA ATCCGGCCGC TGGTATATCT
CTTGGAACCA TTAATAACAG CGCTTCTCGT ACAGTAACTT TCCAAGCTAC TGTCGTTTCT
TTTCCAAATC AAAATCCTAT CTCCAACACT GCTAATATTA CATTTCAATA TACACCAATC
GCTGGAGGAA CGACTTTTAA CGGTCTTGCA ACAAGCAACT CTGCTGGAAC ACAAGTTAAC
CTCGCAGATA TTAATGGCAC AAAATCAGTT AACAAACTTT TTACCGATAT TGGTGAAACG
TTAACTTACA GTATCGCCTT AGCTAATATA GGGAATATTG CTGCAACGAA CGTTATATAT
ACGGATCCGA TTCCTAGCGG GACTACTTTC GTTCCAGGAA GTGTAACTGT TAACGGAATT
ACGCAAGCTG GAGCGAATCC CGCAAATGGT ATATCAATTG GGTCTATTGC CGCTAATTCT
ACGACTACTA TTTCATTCCA AGTATCCGTT CCATCTATCC CCCAAACAAA CCCCATATTA
AATAGCGGGA CTACAACTTA TCAATACATT CCTGTACCAA ATCAACCGGC AGTAAGCGGG
ACTGATACGA CGAATATTGT CTCCACTCAA GTGAATAATG CTACTGTAAC AATGACAAAA
GCGGTAGATA AAAATTTTGC CGATATTGGT GATACACTAA CGTACACCGT TTCCTTTACA
GGTACAGGTA ATACAAGTGC AAACAATGTT ATGTTTACAG ATGCCATTCC TACTGGAACC
ACTTTTGTCC TTAACAGTTT AACAATAGAT GGCACAACGC AAGTGGGAGC AAATCCCGCT
AACGGTGTGA ACATTGGAGC GATCCCAACT GGTACAACAA AAAATGTTTC ATTTCAAGTA
GTTGTAAATA CGATACCCGC GCTAAATGTC GTATCTAATG GATCAAGCGC TTCTTATCAG
TACACTGTCA ATCCAAGCCA ATCACCCGTT ACAAAAAACA TTTCTTCTAA TCTCGTTTCC
ACTCAAATTA ACAATGCGAA TTTAGCATTA ACAAAATCAA CAAATAAACA GTTTGCCACA
ATTGGGGAAA CGATAAGTTA TACAATTCTT ATTACAAACA ACGGAAATAC AGCTGCAACT
AATGTACAAC TAACAGACCC ACTTCCAAAC GGAACAATAT TGACCCCTGG TTCTGTAACA
CTCAACGGCA TTTTGCAAAA TGTAGATTCT CTCGTTGCTT TACCTATCGG CACAATTCCT
GGCGGAGCTA CTTTTACACT TTCTTTCCAA GTAACAGTCA TCAATATTAC CGCTCAAAAT
CCTATCATTA ATAATGCTTT CGCTTCTTAT TTATACACTG TAAATCCTAG TCTGCCACCA
AATTCAAAAA CAGCAAATTC TAATTCTGTT ACATCAACAA TTAGACTAGC AAACCTTCAA
GCTACTAAAT CTGTAGATAA AACGTTTGCA GAAGTTGGAG ATATATTAAC CTATACCTTC
GCTCTTACAA ACAACGGAAA TGTTACGGCA AACAATGTAT TACTATCCGA TTCAATAGCG
AACGGGACTT CCTTTGTACC GAACAGTGTT ATCGTTAACG GCGTTAATCA GCCGGGCGCA
ACACCAGCTA GCATCAATAT CGGTAGCATC AATGCTAATA CTACGATTAC AGCTTCATTC
CAAGTATTAA TCACTAGCAT TCCAAACCCA AACCCTATTT CGAATAGTGC CTCCATTTCT
TATAACTTTA TTGTTGATCC AAACGCTTCG CCTGTAAGTA AGAACACTAC TTCAACTACT
ACATTTACGC AAGTAAACGA TGCAAATGTC ATTTCAGCTA AAACAGTAGA TAAAGGGTTC
GCTACTGTAG GGGATGTATT AACTTATACC GTCGTTTTAA CGAATGCAGG AAGTGTTTCT
GCTGATAGTC CTACTTTCGT AGATACAAAT CCAGACGGCA CTACCTTTAT CCAAAACACT
TTCCTGATTA ATGGTGTACT CCAAAATAAC GCAGATCCAA ATGTCGGTGT TCCGTTACCT
TCCATTCCTG CGAACGGTTC ACTTACCGTC TCTTATCAAG TAACTGTCAC CTCTTTACCA
ACACAAAATC CAACAATAAA TTCATCTAGT ACACAGTATA GTTTTATTTT AAATCCGGGC
GATCCACCAA CTATAGAAAC ATCTTTAAGT AATACTGTAA GTACACAAAT TAATGTAGCA
AATGTAGTTA TTGTTAAACA GGTAGATTTA ACTATTGCTG ATGTTGGGCA ACCAATCACA
TATACCATTG CCCTAGCTAA CCCAGGAAAT ACTCCAGCGA ATAATGTAGT TGTTACCGAT
ATACTCCCTT CCGGCACAAC TCTCGTACCA AATAGTACTT TTATAGGCGG TGCTTTACAG
CTAGGTGCAG ACCCAAGTAC CGGTCTTCAA GTCGGGACAA TTCCTGCTGG CGGATTTACA
ACGATTGTTT TTCAAATAAG CGCAAGTAGT TTGCCTACAC AAAATCCAAT TCAAAACAGT
GCTGCACTGC AATATAACTT TATTGCTGAC CCAAATTTAC CTGCTGTTGT GAGAAATGCT
ACTAGTAATA TAGTAACGAC ACAAATTAAT ACTGCTAATA TTGTCGCTAC GAAACTAACA
AGTACAAATT TCGCTGATGT TGGTGATGTC ATAACTTATG CAACGATTTT AACGAATAAC
GGCAATATCC CTGCCTCTAA TGTAACGTTT ACAGATATCA TTCCAGCTGG TACCATCTTC
CTTCCTAACA CTGTAACGAT TAACGGCGTC CCTATAGCTA ATGCAAACCC GGCTAATGGC
ATTTTAATTG GTACGATAGG AGCGAATTCA TCACGTACTG TTTCATTTCA AGTTTCTGTA
CCAACTATTC CTAGTCCAAA TCCGATTACG AATCAATCGA GCACTACATT CCAATACACG
TACGATGCAT CCAAACCAGT TGTAACGCAG ATGTTGGCCT CTAACACCGT ACAGACAACT
ATTAATAATG CTACAATTGC CGCTGTGAAA TCGGCAGATA AACAGTTTGC TAACGTAAAT
GATATGATTA CGTACACGAC TACTTTAACG AATAACGGGA ACACACTTGC ATCAAATGTA
ATTTTTACAG ATGTAATTCC AAACGGAACA TCATTTATCC CTAATAGTGT AACAGTAAAT
GGAAATACAC TCCCTAATAC AAATCCAGCA AGTGGAATTG CAATTGATCC AATAAACCCT
AATACAAGTG CAACAATCTC ATTTCAAATA ATAGTAAATT CCATTCCTAG TCCAAATCCA
ATCCCGAACC AAAGCAATAC AGCGTACCAA TATGTCATAG ATCCTAACTT ACCTCCAGCA
TCCGCTAATG CGTTAAGTAA CGTAATAACA ACTCAAATTA ATAACGCTAC AATCGTCGCT
ACGAAGTCAG TAAATACACC GACAGCTGCA ATTGGGGATA TCGTTACTTA TACGATTGCA
GTTACGAACA CAGGAAATAT TCCTGCTAGT GCTACAGTTT TAACAGATGG ACTCGGCGCA
GGCGCTTCAT TTGTCCAAAA CTCTGTAACT ATCAATAATA TTCCGCAGCC GGGGTTAGAT
CCATCACTTG GTATTCACTT AGCAGATATT CCCCCAGGAG ATACTGTATT CATTACTTTT
CAAGCACAAA TTTTAGCGAT ACCACCTAGT GGCACATTAA CAAATAATGC TCTTGTAAAT
TATGAATATA CCGTGAATCC AAACCAATCA CCTGCTGTTA ATAGTACAAT TACAAATACA
ACGGTTACTC CAATTATTGA TGCTACATTA GTTTTAAATA AAAATGCGAG TACAACTTTC
GCTACAATTG GTGATACGAT TACATTCACT TCATCTGTTA CAAACACAGG AAATACTACT
GCGAACAACA TTGTGTTTAC AGATTCCATT CCAAATGGTA CTGCCTTTGT CCCAAATAGC
TTTAAGATAA ATGGTGTAAC CGTCCCGAAT GCAAATCCAC AAAACGGTAT CAATATTGGT
AACTTGAATG CAAATGCATC GATTACACTC AGTTTTCAAG TAAACATTAC AACACTACCA
AACCCGAATC CAATTCCTAA CAAATCATCG CTTCAATATA GCTTTATTGT TGATATAAAT
GAGCCGCCTG TTTCACGAAC AGTTCAATCC AATAAAACCT TTACACAAGT AAATACAGCT
TCCGTTATCG CAACAAAAAC TGCAAGCAGC GCATTTGCTG CTGTTGGAGA TACAATTACG
TATACAACTA CCCTCACTAA TAGCGGAAAT ACTACCGCAA ACACACCTGT TTTTATCGAT
ATATTACCAC CCGAACTTTC ATTCGTTCCT GATAGTGTAC AAATTAACTC CATCCCACAA
CTTGGATTTA GACCAGATAG CGGAATCTCT TTAGACTCAA TTCCAGTTGG AGGAACGATA
ACAATTAGCT TTCAAGCTAT TGTTAGCTCA ATACCAGCTA CAAATCCAAC ATTGAACCAA
TCTAGCACAA CATACTCTAT CATTGTTGAT CCTACCCAGC CGCCGGTGAC AGAAACAGCT
ACAAGTAATC CAACTTTAGT TCAAATTAAC GAAGCGATTA TTCAAGCAAC GAAAAGTGTG
GATCGACTAT TTTCTGACAT CGCACCTGGA AATTCATTTT TAACGTATAC TGTCTTATTA
GAAAATATAG GGAACACAAC TGCTACGAAT ATCATTTTTA CAGATCCGAC TCCAAATAAT
ACAATATTTA TAGAAGATAG CGTTCGAGTA GGTGGGGTTT TATTACCTGG AGTAAATCCA
GCGAACGGGA TACCAATTGG AGATATTATT GCAGGAGATT TTATAAATGT TACCTTCCGT
GTGCAAGTAG TTAGCATCCC AAATCCTATT TTTACAATCG GACCTGGAGG ACCAAACTCA
CCAGTTGTAA ACGGTGCTTC TATTGATTAT CAATTTATGA CAGGACCTAA TTTACCACTT
GTTTCACGAA ATACGACATC TAATTCTGTT TCGACACAAA TAAATTCTGG GGAAATTGTG
GCAATCAAAT CTGTAGATAA AACTTTCGCA ACGATTGGCG ATACAATTTC TTATACAATT
ACATTAAGCA ACCCTGGAAA TGTCACTTCA CAAAATATAA TTTTCACGGA TACTTTACCT
GATGGAACGG CATTCATATC TGGCACTCTT ACAAACGATT CTGGTACACA GCAAATCGGA
AATCCATCTA ACGGGATTCA GATTGGAAAT ATAAACCCAA ATGGAACGGC CGTGATTACT
CTTAATGTAC TTGTTACAAA CATTCCAAGT ATAAATCCAA TTTCTAATTT TAGTTCAGTA
CAATTTGCAC ATGTGGTTGA TCCTAGCCAA CCTGCCGTAA CACAAACGAA TGTATCTAAT
TCTGTTTCGA CAACAATTAA TAGTGCAATA TTAACGACCC AAAAAAGCGT TGATAAATCT
ATCATTTCTG TTGGAGATAC GATTACGTAT ACAACAACTA TTACAAATAC AGGAAATACG
ACTGCAACGA ATATAACGTT CACGAGCGCA ATTCCAGCTA ACACTACCTT TATACCAAAC
TCAGTCAAAA TAAATGGGGT TCAGCAATCT GGTGCGCAAC CAGCACTTGG AGTAACTATA
CCAAATATCG CTCCTGGTGA AACAGTAACC GTTACTTTCC AAGCAACTGT TCTTTCTGTT
CCCCCTTCAA GTTCAATTAT GGGTAGTGAC ACAATTTTAT ATTCTTATAC TGTCGATCCA
AACGGAACTC CTATTACAAC TTCTACTTCA ACGAATATCG TTACAAACCC GGTACTAGAT
GCTATGATAA CGATGGTGAA ATCTGTAGAT CAAACAATTG TAACACTAGG TGATACCATT
ACCTATACGA CACTTTTGAC AAATATCGGT AATACAAACG CTACGAATAT TACTTTCACT
GATCTTATAC CAAATGGTAC AACGTTTATT ACTGATAGCG TTACAATAGG TGGCATTACA
CAAATCGGGC TCAATCCTAA TACCGGCATA ACAATTGGAT CAATTGCTCC TAACAGCTCA
ATATCTATAG CATTTCAAGT TACCGCTACT TCTACACCAG CCCAAAATCC TATTGCCAAT
TCCGCTACTG CTTCTTACAC ATTTATCGCT GATCCAAATG CACCGATTGT TTCAAGAAAT
GTTACTTCAA ACACAACATT TACTACAATT AATACAGCTA CTATTCTTTC ATCTAAACAA
GTTGATAAAG CATTTAGCCA TATCGGAGAT ACACTCACTT ATACTGTCAC TTTAACAAAC
AATGGAAATT CATCTGCACA AAATGTTATT TTCACAGATA CTATGCCGAG CGGAACTACA
TTTATTGCAA ACACATTTTC CATTAATGGA GTTCCTCAAA GTGGTGCGAA TCCCGTAAAC
GGTGTAAATA TTGGACCGAT AACAGCGGGG GCTACAGTAA ATGTTTCCTT CCAAGTAAAC
GTGACCTCAT TACCGACCGA AAACCCAATT GTAAATTTCT CATCAACATC TTATCAATTA
GTCTCACCGC CTGATACAGA AACTTCCATT AGCAATCCTG TTTCAACGCA AATTAAAGAA
GCCATATTAT CTATGACGAA AAATGAAAGT GCATCCTTTG CAGATATCGG GCAAACTGCC
TTTTATACTA CTTCTATTTC CAATATAGGA AATACGGATG CAACTAATAT AGTATTCACA
GATGTATTAC CAAGTGGACT CACATTTGTT CCTAACACAT TAACCGTCGA TGGCGTTTTA
CAACCTAACG CGAATCCAAA TACAGGTGTA TTACTTGCAG CACTTCCACC AAATGAAATA
TATAGTATCG TCTTTCAAGT TACAGTTAGC AGCATTCCAC CTATTAATCC AGCACCGAAT
ATAGCATCAA CGACATATGA GTTTACTGTT GATCCTGTTA ACCCTCCAGT GTCGAGTTCA
GCTACTTCCA ACACTACGCT TCTTCAAATA AATAACGCAA ATATTATAAG TACAAAAACG
GCAGACCTTA CCTTTGCGGA TGTTAGTAAT ACAATAACAT TTACACTTAA CCTCCCTAAT
ACCGGGAATG TGGCTGCAAC TGATGTTACC GTCATCGATA TACTTGATAG CAATTTAAGT
TTCGTTCCAA ATAGTTTCAC AGTTAATGGG CAAACCATTC CAAATGCTGA TTTATCTACT
GGTGTAAATA TCGGTTCCAT TAATGGTGGT AATACGGCAA TTGTCACATT CCAAGCAACT
GTTACTACAC TTCCTACACT TAATCCTATT TCTAATTCTG CTTCTATCAC ATATCATTAT
GTCGTTGACC CTAGCCAGCC ATCTATTACA ACTTCTAATC AATCTAATAC AACAACCACA
CAAATTAATA GCGCTATCCT TACTGCACAA AAAAATTCAA ATGTATCTAC GGTTGACATT
GGGCAAGATA TTGTCTACTC CGTTACAATT ACAAATAGCG GAAATGTTAG TGCAACAAAT
GTTATTTTTA CCGATCTTAT TCCAGATGGA ACTTCATTTG AACCGAATAG TTTTACACTT
AACGGAACTA GCATCCCAAC TGCAGATATC GTTACAGGGG TTCCAATTGG TGATATCGCG
CCAAACCAAT CTGTCATTGT GGCATTTAAT ATTATCGCAC ATGAAATCCC ACCTATAAAT
CCAATTACGA ATCAAGCTAG CGTTAACTTT CAACATATCG TTAATCCAAA AAATCCTCCA
GTTTCAAAAA ATATTACTTC TAATAACGTT ACAACAAAAA TTGAAAGTGC AATTTTAAAC
ACGATTAAAA TCGGAGATAA AGCTTTTGCA ACGATTGGTG ATACGATTAC GTATACAACT
ACGATTACGA ATACAGGAAA TATTCCAGCT AACAACGTTG TTTTCTCAGA CCCTTTACCT
TCATGGGCAC AACTTGTTGC AGGATCAGTT GTTGTTGACG GTACTTCATT ACCATCCGCT
TCTATCACCA GCGGTATTGG CATAAATACA GTCAATCCAG ATCAAACTGT AACAATCATA
TTCCAAGTTC AAATTGTAAG CAATCCAACA ACATTCACTC CTGAACTCCA AAACTTAGCA
TTTGTGAACT TCCAATATAA CGTAGGCAGT GCATTACAGG CTCAGCCTGG CAACGTGGAA
ACGAACGTCT TCGTTACTTC TATTCATTCA GCAATACTTT CAGCTGTAAA AACTGCTAGT
ACAGCCTTTG CAAATATTGG CGACACGATC ACTTATACCG TTTTAATTCA AAATAGCGGC
AATACAAATG CTACGAATGT AAATTTCTCA GACCTCATTC CAGCAGGAAC GACCTTTGTT
GAAAATAGTT TTACTGTAAA TGGAAATACC ATTCCAGGTG CAAATCCAAA TAGTGGAGTT
ACTATCGGGA CCGTTAGCGC GGGTAGTTCC TTAACTGTTA CTTTCCAAGT CATAGTTACA
TCTACTCCAC CTTCAAATCC AATTACAAAC GTTGCATCTA TTCAATACGA ATTCATCGTT
GATCCAGCAT CTCCTCCTGT TACCGGCACA ATAACTTCTA ATAGCGCTTC TACACAAATT
AATAACGCTA CTGTTACAAC GCTTTTAGAA GCAAATCGAA CAATCGTATC TATTGGAGAT
ATAATTACGT ACACTGCAAC ATTAACAAAC ACTGGAAACT TCCCTGCAAA CTCTGTATTA
CTCATTAACG GTGTTCCTGA AGGGGCATTA TTTGTTCCAA ATAGTGTCAC GCTTAACGGG
ATTTCACTTC CAGATACAAG TCCAACTCTC GGTATTCCAG TTGGTATTAT CGCACCAGGT
GATTCTGCTA CGATTACGTT CCAATTTCTT GCAAACTCTA TTCCACCGCA AGGAGCAATT
ATAAATCAAG CACTTACAAG TTACACGTAT ATTGTCGATC CGAGTCAACC TCCAGTTACA
TCAACCTCCT CATCTAATAC GGTTAGTACA GCTGTCGTTG ATGCATCGCT GTCTGTAATT
AAAAATACAG ATTCTCTCGT ACAATCTACT AACGGTACAA TCACTTACAC TGTAGTCGTT
CAAAATAACG GGAATACAAC TGCAAATACA GTTACTTTAA CAGATTTGGT CCCAGAAGGA
ACTGCATTTA TTCCGGATAG CGTAACCATT AATGGCGTTT CAGCTCCAGG TGCCGATCCA
AACGTAGGGA TACCATTAAA CTCCATAGCG CCTTCAGACA TTATCACCAT AACATTCCAA
GTGATCGTTC AATCCATTCC AAGCGTGAAT CCAATTTCTA ATATAGCCCG TATTGCCTAT
ACTTTTATCG CCGATCCAAC CGCTCCTATC ATCTCTCGAA CAATAACGTC TAATCCAGCA
GTCACACAAA TTTCGGATGC GAATGTTCTT TCTTTAAAAG CAGTCAATGC ACAACAAGCA
ACAACAGGTG ATATTTTGAC CTACACGATA GCGCTAGAAA ATACCGGAAA TATTCCAGCT
ACGAATCTCA TATTTTCAGA TACTATTTCA GAAGAGACTA CATTCATAGA AAATAGTTTT
ACACTTAACG GAACAGCTAT ACTTGGTGCA AATCCAAACG CAGGGGTTAC TTTACCTAAC
CTAGCAGCAA ACGCTACTCA CCTTATTTCG TTCCAAATTC TTATTAACGA TCCATTCTCG
CAACAATCGA TTACAAATCA ATCTAATACA ACATATACAA TTCAACCGGA TCCAGGGAAA
CCGCCTATTA CGGAAACATC GACTAGTAAT ATCGTCATTA CAAATTTCGT GCAAGCACAA
TTAACAATTA CAAAAACATC CAATCCAACA ACTGTAGATA TTGGTGGAAC GATACTTTAT
ATTTCTGAAG TGAAAAATAT CGGCAATGTT GACGCAATAA ATATTATTTT TACAGATTCG
ATTCCAGCTG GGACTACATT CGTTCCCGAC AGTGTCACAA TTAACGGTGT ACTTCAGCCT
GATACAAATC CAGAAAACGG AATATCAATT GGAACGATCC CATCAAATAG TTCCAAAACA
ATACTATTTC AAGTACAAAC AAACAATCCA CCTACTGAAA CCGAAATTGT AAATCAATCT
TCAGCAACGT ACCAATATGT AAGCATTCCT ACAGCTCCCC CAGTGAATCG CTCTGCAACT
TCTAATATCG TTACAACATC ACTTCAAAAT GCAAATATTA TTTCTGTTAA GCAGGCAGAT
GTTACTTTCG TATCCATCGG GCAAAATATT ACCTACACAA ATACACTACA AAATATAGGA
ACTGTTCCGG CTAACAATAC ATTGTTCATT GACAATATTC CAGAAGGTAC TATATTCATT
GAAGATAGCT TATCAATAAA TAATGTCATT CAGCCTGGTG CGAACCCTGA AAATGGAATA
ACTCTTAGCA CGATACAACC AAATGAAACA GTCACTATTT CATTCCAAGT ACAACTTACA
AATATACCCG AGGGCAACAC AGTCACTAAC ATTTCAGACA CTTCGTATGA ATACCAAATT
GACTCTAGTT CTCCAATTAT TCAGCGTAGA TCGTTATCAA ATGCAGTAAC TACTGAAGTC
CGTACAGCAA ATGTTAGTGC ACTTAAATCT GCTAATAGAT CTATTACACG CATTGGTCAA
ATCATCACAT ATACAGTCGC AGTTACAAAC GCTGGTACAG TACCTATTAC AAATACTCTC
CTAATTGACG CAATTTCTGC TGGCACCACA TTCATTCCAA ATAGCATTCT TGTAGATGGT
ATACCAAGAT CTAACGAGAA TCCAAGTACC GGAATCACCC TTAATATTAT CCTTCCAAAC
AATACAATTA TCGTTACATT CCAAGTAAAT GTAGTCTCTA TACCTCCTCA AAATAACATT
AATAATATCG CCGTCATCCA CTATGAGTAT CAGCCGGACC CAAGCTTACC ACCAATTTCA
GAAACGACAT CTTCCAATAC TACAAATATA CAATTTATTG ATGCTATTCT TATCGCTACA
AAATCTGCTA ATACAATATT AGCTAATATT GATGAAACTA TTGAATATAC AGTACTCATT
CAAAATAACG GATCCACTAC AACTAACTCC ATCTTTTTTA CAGATACGAT AGAAGATGGA
ACAGTCTTTA TTCCAGGAAG TGTAATAGTT AACAACACGG TACTTCCTGC AGCAGATCCA
AATATCGGTT TTTCTATTCC TAATATTGCG TCAAGTCAAG CGACTACAAT AACGTTCCAA
GTTTCCGTTA CGAATTTACC TGCTGTAAAC CCAACACCTA ATACTGCAAA CCTCGTCTAC
GACTTTATTT TCAACCCTGA TTTTGCACCA ATTCAAAAAT CGACTACTTC CAACACTACT
TTCGTTCAAA TTAATGATGC TGATATCGTT TCACTTAAAA CTGTTGATTT GACTTCTGTA
ACAATTGGTG ACATTTTAAC TTATACAACA ACTTTAACGA ATACAGGGAA TACAGATGCC
ACTGCTGTTG TATTTACAGA CAATATTCCT GGTGGAACTA CCTTTATTGA CGGTAGCGTT
TTAGTAAATA ACATTCCGCA GCTTAACGCC AATCCAAGTA CCGGTATACT TGTAGGAACG
ATTGCTCCTA ACATTTCTAT CCCAGTCACA TTTTCTGTTA CCGTCATAGC GCTTCCAGCT
AGCGGCCATG TTCAAAATCA ATCAACTTCT CGTTATACGA TAAATGGAGC AGAACAAATA
TCGACTAGTA ATATTACCTT CACTGAAGTT ATTATTGCTA CTATAGTCGC AACAAAAACA
ACGCCTATCC AATACGCTGA CCTACAAACG ATTATCCCTT ACACAATTTC CATCACAAAC
AATAGCAATA TACAAGTAGA AAACATTAAC GTTACAGATA TCATCCCAGC AAATACGAGC
TTTATAGAGA ACAGTGTTAT TGTGAATGGA AACGCTCGTC CAAATGACAA TCCACTTAGC
GGGATACAAA TTGATAACAT TCCGCCTAAT ACAACAGCAA CTATTCTATT CCAAGTACGG
GTTACTTCGA TTCCGCAAAC AAATCCAATC TCTAACACGA GTACAATTGA ATACCAATAC
ACGTTACCAG ATCGGCCACC TATTACCGAA ACTATTATTT CATCAGCTGC CGTAACAGAA
ATTAATCACG CGACTTTAAA TAGTAATAAA GCTGTTGACC TTGCATTTGC AACTGTCGGT
GATACGTTAA CGTATACGAT TATACTCAAT CAAACTGGTA ATGTTGCAGC AAATGATGTA
GTCATTCAAG ATATGATTCC TCAAGGTACC ACATTTATAG AAAATAGCGT GATTGTAAAC
GGAGAAACTC TTCCGGGAGT TAATCCAGTA AGTGGCATAC CAATTGGTAC TATAATTGTT
GGTGGAGACG CTATTACTTC ATTCCAAGTA ACTGTGACTT CTATTCCAAT ACAAAATGAA
CTCACCAACC AAGCAATCAC TACTTTTAAC TATATAGTCA ACCCAAATAA CATACCTGTT
ACAAATACGA CTACAACAAA TACTGTTACA ACAACCGTCC AAAATGATAA TGTCATTGCG
ATAAAAGCTG TTGATTTCAC AAGTGCCTTA CCTGGTCAAA CTTTAACGTA CACCATTACG
ATTACTAATA GTGGTAATAT CACTATTGAA GATCTTCTTC TAGTAGACAC GGCACCAGTA
GATACGACAT TCGTTATTGG TAGTGTTACG ATTAACGGAA TCAATCAACC TAATGAAAAT
CCTGAAAATG GTATTACGTT AGGAACTCTT GCTCCTAGTG AATCTGTTAT TATTACGTTC
CAAATAACAA TATCTTCTTC TACTCTTCAA CCTACAATTA ATAATGATGC TTCTGTTTCC
TATACCGTCA TCATTGATCC AACAAAACCA CCTATTACAA TTACAAAACA AACAAATATC
GTTACAACGA CAGTCATTGA TCCGATGGTT CGTATTGAAA AAACAGCTGA CAAATCTATT
GTCGTTATTG GAGATATAAT TACATTCACA TTAGCAGTAT TTAATCACTC CCCAATTCCG
ACAATCAATA CTTCTGTTAT AGACACAATT CCGGCTGGTA CAACATTTAT AGAAAATAGC
GTAACAATTA ACGGTATTCT GGTACCAACT GTCCGTCCAG ACACTGGTAT GAATATCGGC
GCTTTACCTG CTGGCTCAGT AGCAACGATA ACATTTCAAG TGCTCGTAAC TTCTATTCCT
TCAAAAAATA CAATTATAAA TTCTGCAACA GTTACAGCCG CTTTCCAATT GACACCACAG
GATCCGATTA TTACTTTCGT TGTTAATTCG AATATTGTTC GTATACCAGT TCAATTTGTA
ACTGCGACAG TCACGAAAAA CGCTTCCGTC AGCTCAGCTT ATTTAAATCA ATATTTTGAT
TACACGGTGC GTATTACGAA TACTTCCGAG ATTTCACTCT TAAATATTTC TTTACAGGAT
ACTATTCCAG TAGGTTTACA ATTTATGAGC GGCACCGTCT CCATTAACGG AGAACGCTCT
CCACTAGCGA ATCCGAATAT CGGTTTCCTA GTTGCTACTA ATTTAGAACC AAGCGAAACA
ATTATCGTGT TATTCACCGT ACAAGTGATA AGTCCACCTG TTACTAATGA GTTTAAAAAT
ACGGCTAATA TTTCGTTACA ACTTCAAGCC TCACCTACCG ATCCACCAAT TACAGTAACC
GTTACAAGTA ACGAAAACAT CGTCACCTTT GTTCCAGAAA ATCCGGATAA AACACTTCCA
AATTTCAATT GCTTCTTTGA CGGTGAACGC TTCATACGGA TTACTCCTCG AAATGTAGGC
AATTACCTTT GGACTTGGAT TTGGTGGAAT TAA
 
Protein sequence
MPITNRFSTT TNGALAITGN TLGLSKISNQ NRAGTIGAIG AFITTNTALQ VTSFPAGTTL 
NYTQNSSTAL LNIPAGSTIL YAELVWGGNY LSRDQNITNV LGNPVSFTTP VSTYSITPSA
VTASNQTFVS GSITFGFYTR SADVTSLIQA GGSGSYTTGS VPGLVDPIDA SNGTINSAGW
TLIVAYQNGT LPARNLTIYV AGNRVSAETG SADVSVSGFL TPSGGPVSGR LFLSSTEGDA
DLIGDQALFG PNFSSLNALS GPNNAVNNFF GSQINNAAGN LDTTGTFGTR NQSASTGTNI
SAGRQGWDIT SIDISPYLTN SQVSAAIRLT TNGDAYMLNT VGLQININSP NIQATKSVNK
SVAAIGDVLT YTVTIPNTGL LPANNVIFTD ILPNGTSFIP GTVTVDNVPQ TNANPAAGIS
LGTINNSASR TVTFQATVVS FPNQNPISNT ANITFQYTPI AGGTTFNGLA TSNSAGTQVN
LADINGTKSV NKLFTDIGET LTYSIALANI GNIAATNVIY TDPIPSGTTF VPGSVTVNGI
TQAGANPANG ISIGSIAANS TTTISFQVSV PSIPQTNPIL NSGTTTYQYI PVPNQPAVSG
TDTTNIVSTQ VNNATVTMTK AVDKNFADIG DTLTYTVSFT GTGNTSANNV MFTDAIPTGT
TFVLNSLTID GTTQVGANPA NGVNIGAIPT GTTKNVSFQV VVNTIPALNV VSNGSSASYQ
YTVNPSQSPV TKNISSNLVS TQINNANLAL TKSTNKQFAT IGETISYTIL ITNNGNTAAT
NVQLTDPLPN GTILTPGSVT LNGILQNVDS LVALPIGTIP GGATFTLSFQ VTVINITAQN
PIINNAFASY LYTVNPSLPP NSKTANSNSV TSTIRLANLQ ATKSVDKTFA EVGDILTYTF
ALTNNGNVTA NNVLLSDSIA NGTSFVPNSV IVNGVNQPGA TPASINIGSI NANTTITASF
QVLITSIPNP NPISNSASIS YNFIVDPNAS PVSKNTTSTT TFTQVNDANV ISAKTVDKGF
ATVGDVLTYT VVLTNAGSVS ADSPTFVDTN PDGTTFIQNT FLINGVLQNN ADPNVGVPLP
SIPANGSLTV SYQVTVTSLP TQNPTINSSS TQYSFILNPG DPPTIETSLS NTVSTQINVA
NVVIVKQVDL TIADVGQPIT YTIALANPGN TPANNVVVTD ILPSGTTLVP NSTFIGGALQ
LGADPSTGLQ VGTIPAGGFT TIVFQISASS LPTQNPIQNS AALQYNFIAD PNLPAVVRNA
TSNIVTTQIN TANIVATKLT STNFADVGDV ITYATILTNN GNIPASNVTF TDIIPAGTIF
LPNTVTINGV PIANANPANG ILIGTIGANS SRTVSFQVSV PTIPSPNPIT NQSSTTFQYT
YDASKPVVTQ MLASNTVQTT INNATIAAVK SADKQFANVN DMITYTTTLT NNGNTLASNV
IFTDVIPNGT SFIPNSVTVN GNTLPNTNPA SGIAIDPINP NTSATISFQI IVNSIPSPNP
IPNQSNTAYQ YVIDPNLPPA SANALSNVIT TQINNATIVA TKSVNTPTAA IGDIVTYTIA
VTNTGNIPAS ATVLTDGLGA GASFVQNSVT INNIPQPGLD PSLGIHLADI PPGDTVFITF
QAQILAIPPS GTLTNNALVN YEYTVNPNQS PAVNSTITNT TVTPIIDATL VLNKNASTTF
ATIGDTITFT SSVTNTGNTT ANNIVFTDSI PNGTAFVPNS FKINGVTVPN ANPQNGINIG
NLNANASITL SFQVNITTLP NPNPIPNKSS LQYSFIVDIN EPPVSRTVQS NKTFTQVNTA
SVIATKTASS AFAAVGDTIT YTTTLTNSGN TTANTPVFID ILPPELSFVP DSVQINSIPQ
LGFRPDSGIS LDSIPVGGTI TISFQAIVSS IPATNPTLNQ SSTTYSIIVD PTQPPVTETA
TSNPTLVQIN EAIIQATKSV DRLFSDIAPG NSFLTYTVLL ENIGNTTATN IIFTDPTPNN
TIFIEDSVRV GGVLLPGVNP ANGIPIGDII AGDFINVTFR VQVVSIPNPI FTIGPGGPNS
PVVNGASIDY QFMTGPNLPL VSRNTTSNSV STQINSGEIV AIKSVDKTFA TIGDTISYTI
TLSNPGNVTS QNIIFTDTLP DGTAFISGTL TNDSGTQQIG NPSNGIQIGN INPNGTAVIT
LNVLVTNIPS INPISNFSSV QFAHVVDPSQ PAVTQTNVSN SVSTTINSAI LTTQKSVDKS
IISVGDTITY TTTITNTGNT TATNITFTSA IPANTTFIPN SVKINGVQQS GAQPALGVTI
PNIAPGETVT VTFQATVLSV PPSSSIMGSD TILYSYTVDP NGTPITTSTS TNIVTNPVLD
AMITMVKSVD QTIVTLGDTI TYTTLLTNIG NTNATNITFT DLIPNGTTFI TDSVTIGGIT
QIGLNPNTGI TIGSIAPNSS ISIAFQVTAT STPAQNPIAN SATASYTFIA DPNAPIVSRN
VTSNTTFTTI NTATILSSKQ VDKAFSHIGD TLTYTVTLTN NGNSSAQNVI FTDTMPSGTT
FIANTFSING VPQSGANPVN GVNIGPITAG ATVNVSFQVN VTSLPTENPI VNFSSTSYQL
VSPPDTETSI SNPVSTQIKE AILSMTKNES ASFADIGQTA FYTTSISNIG NTDATNIVFT
DVLPSGLTFV PNTLTVDGVL QPNANPNTGV LLAALPPNEI YSIVFQVTVS SIPPINPAPN
IASTTYEFTV DPVNPPVSSS ATSNTTLLQI NNANIISTKT ADLTFADVSN TITFTLNLPN
TGNVAATDVT VIDILDSNLS FVPNSFTVNG QTIPNADLST GVNIGSINGG NTAIVTFQAT
VTTLPTLNPI SNSASITYHY VVDPSQPSIT TSNQSNTTTT QINSAILTAQ KNSNVSTVDI
GQDIVYSVTI TNSGNVSATN VIFTDLIPDG TSFEPNSFTL NGTSIPTADI VTGVPIGDIA
PNQSVIVAFN IIAHEIPPIN PITNQASVNF QHIVNPKNPP VSKNITSNNV TTKIESAILN
TIKIGDKAFA TIGDTITYTT TITNTGNIPA NNVVFSDPLP SWAQLVAGSV VVDGTSLPSA
SITSGIGINT VNPDQTVTII FQVQIVSNPT TFTPELQNLA FVNFQYNVGS ALQAQPGNVE
TNVFVTSIHS AILSAVKTAS TAFANIGDTI TYTVLIQNSG NTNATNVNFS DLIPAGTTFV
ENSFTVNGNT IPGANPNSGV TIGTVSAGSS LTVTFQVIVT STPPSNPITN VASIQYEFIV
DPASPPVTGT ITSNSASTQI NNATVTTLLE ANRTIVSIGD IITYTATLTN TGNFPANSVL
LINGVPEGAL FVPNSVTLNG ISLPDTSPTL GIPVGIIAPG DSATITFQFL ANSIPPQGAI
INQALTSYTY IVDPSQPPVT STSSSNTVST AVVDASLSVI KNTDSLVQST NGTITYTVVV
QNNGNTTANT VTLTDLVPEG TAFIPDSVTI NGVSAPGADP NVGIPLNSIA PSDIITITFQ
VIVQSIPSVN PISNIARIAY TFIADPTAPI ISRTITSNPA VTQISDANVL SLKAVNAQQA
TTGDILTYTI ALENTGNIPA TNLIFSDTIS EETTFIENSF TLNGTAILGA NPNAGVTLPN
LAANATHLIS FQILINDPFS QQSITNQSNT TYTIQPDPGK PPITETSTSN IVITNFVQAQ
LTITKTSNPT TVDIGGTILY ISEVKNIGNV DAINIIFTDS IPAGTTFVPD SVTINGVLQP
DTNPENGISI GTIPSNSSKT ILFQVQTNNP PTETEIVNQS SATYQYVSIP TAPPVNRSAT
SNIVTTSLQN ANIISVKQAD VTFVSIGQNI TYTNTLQNIG TVPANNTLFI DNIPEGTIFI
EDSLSINNVI QPGANPENGI TLSTIQPNET VTISFQVQLT NIPEGNTVTN ISDTSYEYQI
DSSSPIIQRR SLSNAVTTEV RTANVSALKS ANRSITRIGQ IITYTVAVTN AGTVPITNTL
LIDAISAGTT FIPNSILVDG IPRSNENPST GITLNIILPN NTIIVTFQVN VVSIPPQNNI
NNIAVIHYEY QPDPSLPPIS ETTSSNTTNI QFIDAILIAT KSANTILANI DETIEYTVLI
QNNGSTTTNS IFFTDTIEDG TVFIPGSVIV NNTVLPAADP NIGFSIPNIA SSQATTITFQ
VSVTNLPAVN PTPNTANLVY DFIFNPDFAP IQKSTTSNTT FVQINDADIV SLKTVDLTSV
TIGDILTYTT TLTNTGNTDA TAVVFTDNIP GGTTFIDGSV LVNNIPQLNA NPSTGILVGT
IAPNISIPVT FSVTVIALPA SGHVQNQSTS RYTINGAEQI STSNITFTEV IIATIVATKT
TPIQYADLQT IIPYTISITN NSNIQVENIN VTDIIPANTS FIENSVIVNG NARPNDNPLS
GIQIDNIPPN TTATILFQVR VTSIPQTNPI SNTSTIEYQY TLPDRPPITE TIISSAAVTE
INHATLNSNK AVDLAFATVG DTLTYTIILN QTGNVAANDV VIQDMIPQGT TFIENSVIVN
GETLPGVNPV SGIPIGTIIV GGDAITSFQV TVTSIPIQNE LTNQAITTFN YIVNPNNIPV
TNTTTTNTVT TTVQNDNVIA IKAVDFTSAL PGQTLTYTIT ITNSGNITIE DLLLVDTAPV
DTTFVIGSVT INGINQPNEN PENGITLGTL APSESVIITF QITISSSTLQ PTINNDASVS
YTVIIDPTKP PITITKQTNI VTTTVIDPMV RIEKTADKSI VVIGDIITFT LAVFNHSPIP
TINTSVIDTI PAGTTFIENS VTINGILVPT VRPDTGMNIG ALPAGSVATI TFQVLVTSIP
SKNTIINSAT VTAAFQLTPQ DPIITFVVNS NIVRIPVQFV TATVTKNASV SSAYLNQYFD
YTVRITNTSE ISLLNISLQD TIPVGLQFMS GTVSINGERS PLANPNIGFL VATNLEPSET
IIVLFTVQVI SPPVTNEFKN TANISLQLQA SPTDPPITVT VTSNENIVTF VPENPDKTLP
NFNCFFDGER FIRITPRNVG NYLWTWIWWN