Gene BCAH820_1686 details


Gene Information

Locus tag: BCAH820_1686
Symbol:
ID: 7188439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus AH820
Kingdom: Bacteria
Replicon accession: NC_011773
Strand:
Start bp: 1590560
End bp: 1605613
Gene Length: 15054 bp
Protein Length: 5017 aa
Translation table: 11
GC content: 38%
IMG OID: 643555098
Product: conserved repeat domain protein
Protein accession: YP_002450637
Protein GI: 218902803
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
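
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record above.
gene_len, protein_len = expected_lengths(1590560, 1605613)
print(gene_len, protein_len)
```

Under these assumptions the result matches the Gene Length and Protein Length fields of this record.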


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 161
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATTA CGAATCGATT TTCTACCACC ACTAACGGCG CACTTGCGAT TACAGGAAAC 
ACACTCGGTT TAAGTAAAAT CAGTAATCAA AACCGTGCTG GTACAATCGG GGCAATTGGT
GCATTCGCAA CTACGAATAC CGCTTTACAA GTAACTTCTT TTCCTGCTGG TACAACATTA
AACTATACAC AGAATAGTTC TACCGCTCTT TTAAATATCC CTGCTGGTAG TACGATTCTT
TACGCAGAAC TCATCTGGGG CGGCAACTAC TTATCTCGTG ATCAAAACAT TACAAGTGTT
TTAGGAAACC CTGTTTCTTT TACAACACCT GTTTCAACAT ACTCGATTAC TCCTTCTGCT
GTTACAGCTT CCAATCAAAC ATTCGTTTCT GGATCTATCA CATTTGGATT CTATACACGT
TCTGCAGATG TAACATCCCT CATTCAAGCG GGAGGATCTG GCTCTTATAC AACCGGCTCT
GTCCCTGGAC TTGTAGATCC TATAGATGCT TCTAATGGAA CAATTAATTC AGCTGGATGG
ACGCTTATCG TCGCTTACCA AAATGGAACA TTACCTGCAA GAAACTTAAC CATTTATGTA
GCAGGTAACC GGGTTTCTGC AGAAACTGGT AGTGCCGATG TATCTGTTTC AGGATTTTTA
ACACCTTCAG GAGGGCCTGT AAGCGGTAGA TTATTTTTAA GTTCTACCGA AGGAGATGCT
GATTTAATTG GGGATCAGGC TCTATTCGGG CCAAATTTCA GTTCATTAAA TGCCTTATCT
GGACCTAACA ATGCTGTAAA TAATTTCTTC GGTTCTCAAA TTAATAATGC CGCTGGAAAC
TTAGATACAA CCGGGACATT TGGAACGCGA AATCAAAGTG CTTCTACAGG TACAAACATC
TCCGCTGGAA GACAGGGCTG GGATATTACT TCCATTGATA TTTCTCCTTA TTTAATAAAT
TCTCAAGTGT CCGCCGCAAT CCGTTTAACA ACTAACGGAG ACGCATATAT GTTGAATACA
GTCGGTTTAC AAATCAACAT AAATTCACCT AACATACAAG CAACAAAAAG CGTGAATAAA
AGTGTTGCAG CAATTGGGGA CGTTCTCACT TATACAGTTA CTATCCCTAA TACAGGGCTT
CTTCCCGCCA ATAACGTTAT TTTTACAGAC ATTCTCCCTA ACGGTACTTC CTTTATACCT
GGAACTGTAA CAGTAGATAA TGTCCCGCAA ACGAATGCAA ATCCGGCCGC TGGTATATCT
CTTGGAACCA TTAATAACAG CGCTTCTCGT ACAGTAACTT TCCAAGCTAC TGTCGTTTCT
TTTCCAAGTC AAAATCCTAT CTCCAACACT GCTAATATTA CATTTCAATA TACACCAATT
GCCGGAGGAA CGACTTTTAA CGGTCTTGCA ACAAGCAACT CTGCTGGAAC ACAAGTTAAC
CTCGCAGATA TTAATGGCAC AAAATCAGTT AACAAACTTT TTACCGATAT TGGTGAAACG
TTAACTTACA GTATCGCCTT AGCTAATATA GAGAATATTG CTGCAACTAA CGTAATATAT
ACGGATCCGA TTCCTAGCGG GACTACTTTC GTTCCGGGGA GTGTAACTGT TAACGGAGTT
ACTCAGGCTG GAGCAAATCC CGCTACTGGT ATATCAATTG GCGCTATTGC TGCTAATTCT
ACGACTATTG TTTCATTTCA AGTACTCGTT CCTTCTATTC CCCAAACAAA TCCAGTTTTA
AATAGCGGAA CAACAACATA TCAATACATT CCTGTGCCAA ATCAACCGGC AATAAGTGGG
ACTGATACGA CCAATATCGT ATCTACTCAA GTGAATAACG CTACTGTAAC TATGGCAAAA
TCAGTAGATA AAAATTTTGC AGATATTGGT GATACACTAA CGTACACCGT TTCCTTTACA
AGTACAGGTA ATACAAATGC GAACAACGTT ATTTTTACAG ATGTTATTCC TACTGGAACA
ACTTTTGTTC TAAACAGTTT AACAATAGAT GGCACGACAC AAGGTGGAGC AAATCCCGCT
AACGGTGTGA ACATTGGATC AATCTCAACT GGTACAACAA AAAATGTTTC ATTTCAAGTA
GTTGTAAATA CAATACCCGC GCTAAATGTC GTATCTAATG GATCAAGCGC TTCTTATCAG
TACACTGTCA ATCCAAGCCA ATCACCCGTT ACAAAAAACA TTTCTTCTAA TCTCGTTTCC
ACTCAAATTA ACAATGCGAA TTTAGCATTA ACAAAATCAA CAAATAAACA ATTTGCCACA
ATTGGGGAAA CGATAAGTTA TACAATTCTT ATTACAAACA GCGGAAATAC AGCTGCAACT
AATGTACAAC TAACAGACCC ACTTCCAAAC GGAACAATAT TGACCCCTGG TTCTGTAACA
CTCAACGGCG TTTTGCAAAA TGTAGATTCT CTCGTCGCTT TACCTATCGG CACAATTCCT
GGCGGAGCTA CTTTTACACT TTCTTTCCAA GTAACAGTCA TCAATATTAC CATCCAAAAT
CCTATCATTA ATAATGCTTT CGCCTCTTAT CTATATACTG TAAATCCAAA TCTGCCACCA
ACTTCAAAAA CAGTAAATTC TAATTCTGTT ACATCAACAA TTAGACTAGC AAACCTTCAA
GCTACTAAAT CTGTAGATAA AACGTTTGCG GAAGTTGGGG ATGTATTAAC TTATACCTTT
TCTCTTACAA ACGATGGAAA TGTTGCGGCG AACAATGTAG TACTATCCGA TTCAATCGCG
AATGGTACTG CCTTTGTACC AAACAGTGTT ACGATTAACA ATGTTACTCA ACCAGGCGTT
ACACCAGCTA GCATCAATAT CGGTAGTATC ACTGCTGGTA CTACAATTAC AGCTTCATTC
AAAGTATTAA TAACTAGTAT TCCAAACCCA AATCCTATTT CAAATAGCGC TTCTATTTCC
TATAACTTTA TCGTTGATCC AAACGCTTTC CCTATAAGTA AGAACACAAC TTCAACCACT
ACATTTACTC AAGTAAATGA CGCAAATATT ATTTCAGCAA AAACAGTGGA TCGAGCGTTC
GCTACTGTTG GGGATGTATT AACTTATACC GTCGTTTTAA CGAATGCAGG AAGTGTTTCT
GCTGATAGTC CTACTTTCGT AGATACGAAT CCAGACGGTA CTACCTTTAT CCCAAACACT
TTCCTTATTA ATGGTGTACT CCAAAATAAC GCAGATCCAA ATGTCGGTGT TCCCTTACCT
TCCATTCCTG CGAACGGTTC ACTTACCGTC TCTTATCAAG TAACTGTCAC CTCTTTACCA
ACACAAAATC CAACAATAAA TTCATCTAGT ACACAGTATA GTTTTATTTT AAATCCGGGC
GATCCACCAA CTATAGAAAC ATCTTTAAGT AATACTGTAA GTACACAAAT TAATTTAGCA
AATGTAGTTA TTGTCAAACA GGTAGATTTA ACTATTGCTG ACGTTGGGCA ACCAATCACA
TATACAATTG CTTTAGCTAA CCCTGGGAAT ACTCCCGCAA ATAATGTAGT TGTTACCGAT
ATACTCCCTC CTGGTACAAC TCTCGTACCA AATAGTATTT TTATAGGCGG GGCTTTACAA
CTTGGTGCGG ATCCAAGTGC TGGTCTTCAA GTTGGTACGA TTCCAGCTGG TGGTTTTACA
ACAATTGTCT TCCAAATTGG TGCAAATAGT TTACCTTCAC CAAATCCAGT TCAAAACAGT
GCTGTACTTC AATATAACTT TATCGCAGAT CCAAATTCAC CTCCCGTTGT AAGAAACTCT
GCTAGTAATA TAGTAACTAC ACAAATTAAC ACTGCTAATA TTGTTGCTAC GAAACTAACA
AGCACAAATT TTGCTGATGT TGGCGATGTC ATAACTTATG CAACGATTTT AACGAATAAC
GGCAATATCC CTGCCTCGAA TGTAACGTTT ACAGATATCA TTCCAGCTGG TACCATCTTC
CTTCCTAATA CTGTAACGAT TAACGGTGTC CCTATAGCTA ATGCAAACCC TGCTAATGGC
ATTTTAATTG GTACGATAGG AGCGAATTCA TCACGTACTG TTGCATTCCA AGTTTTTGTA
CCAACCATTC CTAGTGCAAA TCCGATTGCG AATCAATCGA GCACTACATT CCAATACACG
TACGACCCAT CCAAACCAGC TGTCATGCAG ATGGTTGCTT CTAACACTGT ACAGACAACT
ATTAATAATG CTACGATTAC CGCTGTAAAA TCTGCAGATA AACAGTTTGC GAACGTAAAT
GATATCATTA CGTACACGAC TACTTTAACG AATAATGGAA ACACACTTGC ATCAAATATA
GTATTTACAG ATGCTATTCC AAGCGGGACA TCTTTTATTC CAAATAGTGT AACAGTAAAC
GGCACTACAC TCTCTAATGC AAATCCAGCA AATGGCATAG CAATTGATCC GATAAATCCG
AATGCAAATA CGATAATCTC GTTCCAAGTG CAAGTAAATT CTATTCCAAA TCCGAATCCT
ATCCCGAACC AAAGTAACAC AACGTACCAA TATATCGTAA ATCCTAACTT ACCTCCAGCG
TCCTCTAATA CGCTAAGTAA CGTAATAACA ACTCAAATTA ATAACGCTAC AATCATCGCT
ACGAAGTCAG TAAATACACC GAATGCTGCA ATTGGAGATA TCGTTACTTA TACGATTGCA
GTTACGAACA CAGGGAATAT TCCTGCTAGT GCTACAGTTT TAACAGATGG ACTTGGACCA
GGTGCATCCT TCATCCCAAA TTCCGTTACG ATAAATAACG TATCTCAACC TGGATTAGAT
CCTTCACTAG GTATTCATTT AGACGATATT TCACCAGGAG GAACTACGTT TATTACATTC
CAAGTGAAAA TCCTTGCTAT TCCACCTAGT GGCACTTTAA CGAATAATGC TCTTGTAAAC
TACGAATATA CAGTGAATCC AACTGAAACG CCAGCTGTTG GAAGTACCGT TACAAACACG
ACAGTTACAC CTATCGTTGA CGCTACCTTA GTAATAAATA AAAATGCTAG TACAACTTTC
GCTACAATTG GAGATATGAT TACATTCACC TCAGTTGTTA CAAACACAGG AAATACTACC
GCGAATAACA TTGTTTTTAC AGATTCAATT CCAAATGGTA CTACCTTTGT CCCAAATAGT
TTTAAAATAA ACGGTGTAAC GGTCCCGAAT ACAAATCCAC AAAACAGTAT CAATATTGGT
AACTTGAATG CAAATGCATC GGTTACACTT AGTTTTCAAG TAAACATTAC AACTCTTCCA
AATCCTAATC CAATTCCGAA CCAATCATCG CTTCAATATA GATTTATTGT TGATATAAAT
GAACCGCCTG TTTCACGAAC CGTTCAATCC AATAAAACTT TTACACAAGT AAACTCTGCT
TCCGTTATCG CCACGAAAAC TGCTAGCAGT GCATTTGCCG CTGTTGGAGA TACAATTACG
TATACAACGA CTCTCACTAA TAGCGGAAAT ACTACTGCAA ACACACCTGT TTTTATTGAT
ATATTACCAG CTGAACTGTC ATTCGTTCCT GATAGCGTAC AAATTAATAC CATCCCACAA
CTTGGATTTA GGCCTGATAC TGGTGTTCCT TTAGACTCAA TTCCAGTTGG AGGAACGATA
ACAATTAGCT TTCAAGCTAT CGTTGGTTCA ATACCAGCTA TAAATCCAAC ATTGAACCAA
TCTAGCACAA CATACTCTAT CATCGTTGAC CCTACCCAGC CACCGGTGAC AGAAATAGCT
ACAAGCAATC CAACTTTAAT TCAAATTAAC GAAGCGATTA TTCAAGCAAC GAAAAGTGTG
GATCGACTAT TCTCTGACGT CGCACCTGGA AATTCATTTT TAACGTACAC TGTTTTATTA
GAAAATATAG GGAATACAAC TGCTACGAAT ATCATTTTTA CAGATCCGAT TCCAAATAAT
ACAGTATTTA TAGAAGATAG CGTTCGAGTA GGCGGGATTT TATTACCTGG AGTAAATCCA
GCAAACGGAA TACCAATTGG GGATATTATT GCAGGAGATT TTATAAATAT TACCTTCCGC
GTACAAGTAG TTAGCATTCC AAATCCAATT TTCACAATTG GACCTGGGGG GCCAAATTCA
CCGGTTGTAA ATGGCGCTTC TATTAATTAT CAATTTATGA CAGGACCTAA TTTACCACTC
GCTTCAAGAA GTACGACATC CAATCCTGTT TCAACACAAA TAAATTCTGG GGAAATCGCA
CTTGTTAAAT CTGTAGATAA AACTTTCGCA ACGATCGGGG ATACACTTTC TTATTCAATT
TCATTAAGTA ACCCTGGAAA TGTCACTTCA CAAAATATAA TTTTCACGGA TGTTTTACCT
GAAGGAACAA CTTTTATTTC TGGAACACTT ACAAACGATT CTGGTACACA GCAAATTGGA
AATCCAGCTA CCGGGATTCA AATTGGAAAT ATAAATCCCG GTAGTACGGC TAACATTACG
ATAAATGTGC TTGTAACAAA CATTCCAAGT ATAAATCCAA TTTCTAATTT TAGTTCCGCA
CAATTTGAAC ATGTAGTTGA TCCTAGCCAA CCTTCTGCAT TACAAACAAC TATATCAAAT
ACAGTTTCAA CAACCATTAA TAGTGCAATA TTAACTACAG CAAAAAGTGT TGATAAATCT
ATTATTTCCG TCGGGGATAC CATTACGTAT ACAACGACTA TTACGAATAC AGGAAATACA
CCCGCTACAA ATGTAACTTT TACAAGCACC ATTCCAGCTA GTACTACATT TATACCAAAT
TCAGTCACAA TAAATGGAAT TCAGCAGCTT GGTGTGCAAC CAGCACTTGG AGTAAACATA
CCAAATATCG CGCCTGGTGA AACAGTAACC GTTACTTTCC AAGTAAATGT TCTTTCTGTT
CCCTCTTCAA GTTCAATTAT GGGAAATGAT ACCATTTTAT ATTCTTATAC TGTCGATCCA
AACGGAACTC CTGTTACAAC TTCTACTTCA ACGAATACCG TTACAAATCC TGTATTAGAT
GCTATCATTA CGATGGTAAA ATCCGTCGAT CAAACACTTG TAACACTAGG TGATACCATT
ACCTATACGA CCATTTTAAC GAATAGTGGT AATACAAACG CTACAGATAT TGCTTTCATT
GACCTTGTAC CAGATGGAAC AACGTTTATT ACTGATAGCG TTACAATAGA TGGCATCACG
CAAATTGGAC TCAATCCTAA TACAGGTATA ACGATTGGAT CAATTGCTCC TAACAGCTTA
ATATCTATAG CATTTCAAGT TACCGCTACT TCTACACCAG CCCAAAATCC TATTGCCAAT
TCCGCTACTA CTTCTTACAC ATTTATCGCT GATCCTAATG CCCCTATTGT TTCAAGGACC
GTTACTTCAA ACACAGTGTT CACTACGATT AATACAGCTA CCATTCTTTC ATTAAAACAA
GTCGATAAAT CCTTTAGTCG TATTGGAGAC ACACTCACTT ATACTGTCGC TTTAACAAAC
AATGGAAACT CATCTGCGCA AAATGTTATA TTCACAGATA CCGTACCGAG CGGAACAGCA
TTTATTGCAG ACACATTTTC TATTAATGGA ATTCTTCAAA GTGGTGCAAA TCCAGTGAAC
GGTGTAAATA TCGGAACTAT AACAGCTGGG ACTACAGTAA CAATTTCGTT CCAAGTTACT
GTAACGTCAT TACCAACTGA AAACCCCATT GTAAATTTCT CATCAACATC GTACCAATTA
GTCTCACCGC CTGATGCAGA AACTTCAATT AGCAATCCTG TTTCAACGCA AATTAAAGAA
GCACTATTAT CCATGACGAA AAATGAAAGT GTATCCTTTG CAGATATCGG GCAAACTGCT
TTTTACACTA CTTCTATTTC GAATATAGGA AATACAGATG CAACTAATAT TGTATTCACA
GATGTATTAC CAAATGGAGT CACATTTGTT CCTAACACAT TAACTGTCGA TGGTGTTTTA
CAACCTGACG CGAATCCAAA TACAGGTGTA TTACTTGCAA CACTTCCGCC TAATGAAATA
TATAGTATCG TCTTTCAAGT TACAGTGAAC AGCATTCCCC CTGTTAATCC AGCACCAAAT
ACAGCATCAA CAACATATGA GTTTACTGTT GATCCTGTTA ATCCTCCAGT ATTAAGTGCG
GCTACTTCCA ACACTACGCT TCTTCAAATA AACAACGCAA ATATTATAAG TACAAAAACA
ACAGACCTTA CTTTTGCGGA TGTTGGTAAT ACAATAACAT TTACACTTAA CCTCCCGAAT
ACAGGGAATG TGACTGCAAC TGATGTTACA GTTATCGATA CGCTTGATAG TAATTTAACT
TTTGTTCCAA ATAGTTTCAC AGTTAATGGG CAAACCATTC TAAATGCTGA TTTATCTACT
GGTGTAAATA TCGGTTCAAT TAACGGTGGT ACTGCGGCAA TTGTCACATT CCAAGCTACC
GTTACAACCC TACCAATCAA CAATCCTATT TCTAATTCAG CTCTTACAAC TTATCGTTAC
ATTGTTGATC CAGACCAGTC ACCTATTACA ACTTCCAATC AATCTAATAC AACGACAACA
CAAATTAATA GTGCTATTCT TACTGCACAA AAAAGCACAA ATGTATTTAC GGTAGATATC
GGGCAAGATA TTGTCTACTC CGTTACAATT ACAAATAGCG GAAATGTTAA TGCAACGAAT
GTTATTTTTA CCGATGTTAT TCCAGACGGA ACTTCCTTTG AACCAAATAG TTTTACACTT
AATGGAACTA TTATCGAAAA TGCAAACATC ATTACAGGCG TCCCGATTGG TGATATCGCG
CCAAACGAAT CTGCCATTGT AGAATTTCAT ATTACTTCAA ATGAAATCCC GGCTATTAAT
CCAATTACTA ATCAAGCTAG CGTTAGCTTT CAGCATATCG TTAATCCAGC TAATCCTCCT
GTTTCAAAAA ACATTACTTC AAATAGTGTT ACAACAACAA TTGAAAGTGC TATTTTAACT
ACTACTAAAA TCGGTGATAA AGCTTTTGCA ACGATTGGTG ATACAATTAC GTATACAACT
ACGATTACGA ATACTGGAAA TATCCCTGCC AATAACGTTA TTTTCTCAGA CCCGATACCA
TCGTGGACAC AATTTGTTGC AGGATCCGTT ATTGTTGATG GCACTCCATT ACCATCCGCT
TCTATCACCA GCGGTATTGG CATAAATACG ATCATTCCAA ATCAAACTGT AACAATCATA
TTCCAAGTTC AAATCGTAAG CAATCCACCA ACATTCACAC CTGAACTCCA AAACTTAGCA
TTTGTTAACT TCCAATATAA CGTAGGCAAT GCATTACAGG CTCAGCCTGG CAATGTGGAA
ACGAACGTCT TCGTTACCGC TATTCATTCA GCAATACTTT CAGCTGTAAA AACTGCTAGT
ACAGCCTTTG CGAATATTGG AGACACAATC ACTTATACAG TTTTAATACA AAATAGCGGC
AATACAAACG CTACGAATGT AAATTTCTCA GACCTCATTC CAGGAGGAAC GACCTTTGTT
GAAAATAGTT TTGCTGTAAA TGGAAATACC ATTCCAGGTG CAAATCCAAA TAGCGGCGTT
AATATCGGGA CCGTTAGCGC GGGTAGTTCC TTAACCGTTA CTTTCCAAGT CATAGTTACA
TCTACTCCTC CTTCCAACCC AATTACAAAC GTTGCATCTA TTCAATTCGC ATTCATCGTT
GATCCGGCCG CTCCTCCTGT TACAGGCACA GTAACTTCTA ATAGTGCTTC TACACAAATT
AATAACGCTA CTGTTACAAC GCTTTTAGAA GCAGATCGAA CAATCGTATC TATTGGAGAT
ATAATTACGT ACACTGCAAC ATTAACAAAC ACTGGAAACT TCCCTGCAAA CTCGGTATTA
CTCATTAACG GTGTTCCTGA AGGGGCATTA TTTGTTCCAA ATAGTGTCAC GCTTAACGGG
ATTTCACTTC CAGATGCAAG TCCAACTCTC GGTATTCCAG TTGGTATTAT CGCACCAGGT
GATTCTGCTA CAATTACGTT CCAATTTCTT GCAAACTCTA TTCCGCCGCA AGGAGCAATT
ATAAATCAAG CACTTACAAG TTACACATAT ATTGTCGATC CAAGTCAACC TCCAGTTACA
GCAACATCTT CATCTAATAC AGTTACTACA GCTGTCGTTG ATGCATCGCT ATCTGTAATT
AAAAATACAG ATTCCATCGT ACAATCTACT GACGGTACAA TCACTTACAC TGTCGTCATT
CAAAACAACG GGAATACAAC TGCAAATACA GTTACTTTAA CAGATTTGGT CCCAGAAGGA
ACTGCATTGA TTCCAAATAG CGTGACCATT AATAGCATCT CAATTCCAGG TGCCGATCCA
AACGTAGGAA TACCATTAAA CTCCATTGCG CCGTCAGAAA TCGTCACCGT CACATTCCAA
GTTATCGTTC AATCTATTCC AAGCGTGAAT CCAATTTCTA ATATAGCCCG TATTGACTAT
ACTTTTATTG CGGATCCAAC TGCTCCTATC GTCTCTCGAA CAATTACTTC GAATCCAGCT
TTCACACAAA TTTCAGATGC GAATGTTCTT TCTTTAAAAG CCGTCAATGC ACAACAAGCA
ACAACTGGCG ACATTTTAAC CTACACGATA ACACTAGAAA ATACCGGAAA TATTCCAGCT
ACAAATCTCA TATTTTCAGA TACGATTCCA GTTGGGACTA CATTCGTAGA AAATAGTTTT
ACACTTAACG GAACAGCTAT ACTGGGTGCA AATCCAAATG TAGGTGTTAC TTTGCCTAAC
CTAGCAGCAA ACGCTACTCA CCTTATTGCT TTCCAAGTTC TTATTAACGA TCCATTCTCG
CAACAATCGA TTACAAATCA ATCTAATACA ACATATACAA TTCAACCAGA TCCAGGGCAA
CCGCCTATTA CTGAAACATC TACAAGTAAT ATTGTCATTA CAAATTTCGT GCAAGCACAA
TTAACAATTA CAAAAACGTC CAATCCAATA ACTGTAGATA TTGGTGGAAC TATACTTTAT
ATTTCTGAAG TGAAAAATAG CGGCAATGTT GACGCAATAA ATATTATTTT TACAGATTCG
ATTCCAGTTG GGACTACATT CGTTCTCGAC AGTGTCACAA TTAACGGTGT ACTTCAGCCT
GATGCAAATC CCGAAAACGG AATACCAATT GGAACGATTC CACCAAACAG TTCCAAAACA
ATACTATTTC AAGTACAAAC AAATAATCCA CCTACTGAAA CCGAAATTGT AAATCAATCT
TCAGTAACTT ACCAATATGT AAGTATTCCT ACAGCTCCAC CAGTGAATCG CTCTGCAAAT
TCTAACATTG TTACAACATC ACTTCAAAAT GCGAATATTA TTTCCGTTAA AAGCGCAGAT
GTAACTTTCG TCTCCATCGG GCAATTTATT ACCTACACAA ATACACTACA AAATATCGGA
ACGGTTCCAG CTAACAATAC GGTGTTCATT GACAACATTC CAGAAGGGAC AATATTCATT
GAAGATAGCT TATCAATAAA TAATGTCATT CAGCCTGGTA CGAATCCTGA AAATGGAGTA
ACTCTCGGCA CGATACAACC AGATGAAACA GTCACTATTT CATTCCAAGT ACAACTTACA
AATATACCAG AGGGCAACAC AGTCATTAAC ATCTCAGACA CTTCATATGA ATACCAAATT
GACCCTAGTT CTCCAATTAT TCAGCGTAGA TCATTATCAA ATACAGTAAA CACGGAAGTC
CGTACAGCAA ATGTTAGTGC AATTAAATCT GCTAACAGAT CCATTACACG CATCGGTCAA
ATTATCACAT ATACCATCGC AGTTACAAAT GCTGGTACAG TACCTATTAC AAATACTCTC
CTAATTGACG CAATCGCTGC TGGCACCACA TTCGTTCCAG ATAGCATTCT TGTAGATGGC
ATACCAAGAC CTAACGAAAA TCCAAGTACC GGAATCTCCC TTAATATTAT CCTTCCAAAC
AATACAATTA TCGTTACATT CCAAGTAAAT GTAGACTCGA TACCTTCTCA AAATAACATG
AATAATATCG CCGTCATCCA CTATGAGTAT CAGCCAGACC AAAGCTTACC ACCAATTTCA
GAAACGACAT CTTCCAATAC TACAAATATA CAATTTATTG ATGCCATTCT TATCGCTACA
AAATCCGCTA ATACAGTATT AGCTAATATT GATGAAAACA TTGAATATAC AGTACTCATT
CAAAATAACG GATCCACTAC AACTAACTCC ATCTTTTTTA CTGATATTAT AGAAGATGGA
ACAGTATTTA TTTCGGGGAG TGTAACAGTT AACAACACTG TACTTCCTGC AGCAGATCCG
AATATCGGCT TTTCCATCCC GAATATCACA TCAGGTCAAG TAGCTACAAT AACATTCCAA
GTTTCGGTTA CGAATTTACC TGCTGCAAAC CCAACACCTA ATACTGCAAA CATCGTCTAC
GACTTTATTT TCAACCCTGA CTTTGCACCA ATTCAAAAAT CGACTACTTC CAACACTACT
TTCGTTCAAA TTAATGATGC TGATATCGTT TCACTTAAAA CTGTTGATTT GACTTCTGTA
ACAATTGGTG ATGTTTTAAC TTATACAACA ACTTTAACAA ATACAGGGAA TACGGATGCC
ACTGCCGTTG TATTTACAGA CAATATTCCT GGTGGAACAA CCTTTATAGA CGGTAGCGTT
TTAGTAAATA ACATTCCGCA GCTTAATGCC AATCCAAGTA CCGGTATATT GGTAGGAACG
ATTGCTCCTA ACATTTCTAT CCCAGTCACA TTTTCTGTTA CTGTCGTAGC GCTTCCAACT
AGCGGCCATG TTCAAAATCA AGCAACTTCT CGTTATACAA TAAATGGAGA AGAACAAATA
TCGACTAGTA ACATTACTTT CACTGAAGTT ATTTCTGCTA ATATAATCGC AGTAAAAACA
ACACCTATCC AATATGCTGA CCTACAAACC ATTATCCCTT ACACAATTTC CATCACAAAC
AATGGGAATA TACAAGTGGA AAACATTATC GTTACAGATA TCATCCCAGC AAATACAAAC
TTTATAGAGA ATAGTGTTAT TGTGAATGGA AACACTCGTC CAAATGACAA TCCACTTAGT
GGGATACCAA TTGATAACAT TCTGCCTAAT ACGACAGCAA CTGTTCTATT CCAAGTACGG
GTTACTTCGA TACCTCAAAC AAATCCAATC TCTAACACAA GTACAATTGA ATATGAATAC
ACGGTAGGAG ATCAACCACC TATTACCAAA ACTATTATTT CATCAGCTGC TTTAACAGAA
ATTAATCATG CGAATTTGAA TAGTAATAAA GCTGTTGACC TTGCATATGC AATGGTCGGT
GATACGTTAA CGTATACGAT TACACTCAAT CAAACTGGTA ATGTTGCAGC AAATGATGTA
ATCATTCAAG ATATGATTCC TCAAGGTACT ACATTTATAG AAAATAGCGT TATTGTAAAC
GGAGAGGCTC TTCCGGGAGT GGATCCAGCA AGCGGCATAC CAATTGGTAC TATAATTGTA
GATGGGGACG CTATCGCTTC ATTCCAAGTA ACTGTGACTT CTATTCCAAT ACGAAACGAG
CTCAACAACC AAGCAATCTC TACTTTTAAC TATATAGTCA ACCCAAATAA CGTACCTGTT
ACAAATACGA CGACAACAAA TACAGTCACA ACAACCGTTC AAAATGATAA TATCATTGCG
ATAAAAGCTG TTGATTTCAC GAGTGCCTTA CCTGGTCAAA CTTTAACGTA TACCATTACG
ATTACTAATA ATGGTAATAT CACTATTGAA GATCTTCTTC TAGTAGACAC GGCACCTGTA
GATACGACAT TTGTTATTGG CAGTGTTACG ATTAACGGAA TCAATCAGCC TAATGCGAAT
CCTGAAAATG GTATTCTGTT AGGAACTCTT GCTCCTAATG ACTCTGTTAT TATTACATTC
CAAGTGACAA TATCTTCTTC TACTCTTCAA TCTACAATCA ATAACGATGC TACTATTTTC
TATACACCTA TTGTCGGTCT AATCGAACCA CCTATTACAA TTACAAGGCA AATAGATATC
GTCACAAAGC AAACAAATAC TGTTACAACG ACAATCATTG ATCCAATGGT TCATATTGAA
AAAACGGCTG ACAAATCTAT TGTCGTTTTA GGAGATATTC TTACTTTCAC GTTAGAGATA
TTTAATGATT CTCCAATCCC AACAGTAAGT ACTTCCGTTA TAGATACCAT TCCAGCTGGT
ACAACATTTA TAGAAAACAG TGTTACGCTT AACGGTACTC CGGTTCCAAA TGTCCGTCCA
GACACAAGTA TGAATATCGG ATCTTTACCT GCAGATGCAG TAGCAATACT AACATTTAAA
GTGCTCGTAA CTTCTATTCC TTCAAACAGT TCAATTATTA ATTCTGCAAC AGTTACAGCT
GCTTTCCAAT TGACACCTCA GGAGCCGATT ATTACTTTTA TCGTTAATTC GAATATTGTT
CGTATACCAG TTCAATTCGT AACTGCGACA GCCACGAAAA GTGCTTCCGT CACTTCAGCT
TATTTAAATC AATATTTTGA TTACACGGTG CGTATTACGA ATACTTCCGA GATTTCACTC
TCAAATATTT CTTTACAGGA TGCCATTCCA GCAGGTTTAC AATTTATAAA CGGCACTGTC
TTCATTAACG AAGAACGCTC TCCACTAGCG AATCCGAATA TCGGTTTCCT AGTCGCTACT
AATTTGGAAC CAAACGAAAC AATTATCGTC TTATTCACAG TGCAAGTGAT AAATCCGCCT
GTTAATAATG AATTTAAAAA TACGGCCAAT ATTTCATTAC AACTTCAAGT CTCGCCTACC
GATCCACCAA TTACAGAAAC TGTTACAAGT AACGAAAACA TCGTCATCTT TGTTCCAGAA
AACCAAGATG AAATAGTTCC AAATTTAAGT TGTTTCTTTG ACGGGGAACG TTTTATACGC
ATTACTCCTC AGAATATACG AAACTACCTT TGGACTTGGA TTTGGTGGCG TTAA
 
Protein sequence
MPITNRFSTT TNGALAITGN TLGLSKISNQ NRAGTIGAIG AFATTNTALQ VTSFPAGTTL 
NYTQNSSTAL LNIPAGSTIL YAELIWGGNY LSRDQNITSV LGNPVSFTTP VSTYSITPSA
VTASNQTFVS GSITFGFYTR SADVTSLIQA GGSGSYTTGS VPGLVDPIDA SNGTINSAGW
TLIVAYQNGT LPARNLTIYV AGNRVSAETG SADVSVSGFL TPSGGPVSGR LFLSSTEGDA
DLIGDQALFG PNFSSLNALS GPNNAVNNFF GSQINNAAGN LDTTGTFGTR NQSASTGTNI
SAGRQGWDIT SIDISPYLIN SQVSAAIRLT TNGDAYMLNT VGLQININSP NIQATKSVNK
SVAAIGDVLT YTVTIPNTGL LPANNVIFTD ILPNGTSFIP GTVTVDNVPQ TNANPAAGIS
LGTINNSASR TVTFQATVVS FPSQNPISNT ANITFQYTPI AGGTTFNGLA TSNSAGTQVN
LADINGTKSV NKLFTDIGET LTYSIALANI ENIAATNVIY TDPIPSGTTF VPGSVTVNGV
TQAGANPATG ISIGAIAANS TTIVSFQVLV PSIPQTNPVL NSGTTTYQYI PVPNQPAISG
TDTTNIVSTQ VNNATVTMAK SVDKNFADIG DTLTYTVSFT STGNTNANNV IFTDVIPTGT
TFVLNSLTID GTTQGGANPA NGVNIGSIST GTTKNVSFQV VVNTIPALNV VSNGSSASYQ
YTVNPSQSPV TKNISSNLVS TQINNANLAL TKSTNKQFAT IGETISYTIL ITNSGNTAAT
NVQLTDPLPN GTILTPGSVT LNGVLQNVDS LVALPIGTIP GGATFTLSFQ VTVINITIQN
PIINNAFASY LYTVNPNLPP TSKTVNSNSV TSTIRLANLQ ATKSVDKTFA EVGDVLTYTF
SLTNDGNVAA NNVVLSDSIA NGTAFVPNSV TINNVTQPGV TPASINIGSI TAGTTITASF
KVLITSIPNP NPISNSASIS YNFIVDPNAF PISKNTTSTT TFTQVNDANI ISAKTVDRAF
ATVGDVLTYT VVLTNAGSVS ADSPTFVDTN PDGTTFIPNT FLINGVLQNN ADPNVGVPLP
SIPANGSLTV SYQVTVTSLP TQNPTINSSS TQYSFILNPG DPPTIETSLS NTVSTQINLA
NVVIVKQVDL TIADVGQPIT YTIALANPGN TPANNVVVTD ILPPGTTLVP NSIFIGGALQ
LGADPSAGLQ VGTIPAGGFT TIVFQIGANS LPSPNPVQNS AVLQYNFIAD PNSPPVVRNS
ASNIVTTQIN TANIVATKLT STNFADVGDV ITYATILTNN GNIPASNVTF TDIIPAGTIF
LPNTVTINGV PIANANPANG ILIGTIGANS SRTVAFQVFV PTIPSANPIA NQSSTTFQYT
YDPSKPAVMQ MVASNTVQTT INNATITAVK SADKQFANVN DIITYTTTLT NNGNTLASNI
VFTDAIPSGT SFIPNSVTVN GTTLSNANPA NGIAIDPINP NANTIISFQV QVNSIPNPNP
IPNQSNTTYQ YIVNPNLPPA SSNTLSNVIT TQINNATIIA TKSVNTPNAA IGDIVTYTIA
VTNTGNIPAS ATVLTDGLGP GASFIPNSVT INNVSQPGLD PSLGIHLDDI SPGGTTFITF
QVKILAIPPS GTLTNNALVN YEYTVNPTET PAVGSTVTNT TVTPIVDATL VINKNASTTF
ATIGDMITFT SVVTNTGNTT ANNIVFTDSI PNGTTFVPNS FKINGVTVPN TNPQNSINIG
NLNANASVTL SFQVNITTLP NPNPIPNQSS LQYRFIVDIN EPPVSRTVQS NKTFTQVNSA
SVIATKTASS AFAAVGDTIT YTTTLTNSGN TTANTPVFID ILPAELSFVP DSVQINTIPQ
LGFRPDTGVP LDSIPVGGTI TISFQAIVGS IPAINPTLNQ SSTTYSIIVD PTQPPVTEIA
TSNPTLIQIN EAIIQATKSV DRLFSDVAPG NSFLTYTVLL ENIGNTTATN IIFTDPIPNN
TVFIEDSVRV GGILLPGVNP ANGIPIGDII AGDFINITFR VQVVSIPNPI FTIGPGGPNS
PVVNGASINY QFMTGPNLPL ASRSTTSNPV STQINSGEIA LVKSVDKTFA TIGDTLSYSI
SLSNPGNVTS QNIIFTDVLP EGTTFISGTL TNDSGTQQIG NPATGIQIGN INPGSTANIT
INVLVTNIPS INPISNFSSA QFEHVVDPSQ PSALQTTISN TVSTTINSAI LTTAKSVDKS
IISVGDTITY TTTITNTGNT PATNVTFTST IPASTTFIPN SVTINGIQQL GVQPALGVNI
PNIAPGETVT VTFQVNVLSV PSSSSIMGND TILYSYTVDP NGTPVTTSTS TNTVTNPVLD
AIITMVKSVD QTLVTLGDTI TYTTILTNSG NTNATDIAFI DLVPDGTTFI TDSVTIDGIT
QIGLNPNTGI TIGSIAPNSL ISIAFQVTAT STPAQNPIAN SATTSYTFIA DPNAPIVSRT
VTSNTVFTTI NTATILSLKQ VDKSFSRIGD TLTYTVALTN NGNSSAQNVI FTDTVPSGTA
FIADTFSING ILQSGANPVN GVNIGTITAG TTVTISFQVT VTSLPTENPI VNFSSTSYQL
VSPPDAETSI SNPVSTQIKE ALLSMTKNES VSFADIGQTA FYTTSISNIG NTDATNIVFT
DVLPNGVTFV PNTLTVDGVL QPDANPNTGV LLATLPPNEI YSIVFQVTVN SIPPVNPAPN
TASTTYEFTV DPVNPPVLSA ATSNTTLLQI NNANIISTKT TDLTFADVGN TITFTLNLPN
TGNVTATDVT VIDTLDSNLT FVPNSFTVNG QTILNADLST GVNIGSINGG TAAIVTFQAT
VTTLPINNPI SNSALTTYRY IVDPDQSPIT TSNQSNTTTT QINSAILTAQ KSTNVFTVDI
GQDIVYSVTI TNSGNVNATN VIFTDVIPDG TSFEPNSFTL NGTIIENANI ITGVPIGDIA
PNESAIVEFH ITSNEIPAIN PITNQASVSF QHIVNPANPP VSKNITSNSV TTTIESAILT
TTKIGDKAFA TIGDTITYTT TITNTGNIPA NNVIFSDPIP SWTQFVAGSV IVDGTPLPSA
SITSGIGINT IIPNQTVTII FQVQIVSNPP TFTPELQNLA FVNFQYNVGN ALQAQPGNVE
TNVFVTAIHS AILSAVKTAS TAFANIGDTI TYTVLIQNSG NTNATNVNFS DLIPGGTTFV
ENSFAVNGNT IPGANPNSGV NIGTVSAGSS LTVTFQVIVT STPPSNPITN VASIQFAFIV
DPAAPPVTGT VTSNSASTQI NNATVTTLLE ADRTIVSIGD IITYTATLTN TGNFPANSVL
LINGVPEGAL FVPNSVTLNG ISLPDASPTL GIPVGIIAPG DSATITFQFL ANSIPPQGAI
INQALTSYTY IVDPSQPPVT ATSSSNTVTT AVVDASLSVI KNTDSIVQST DGTITYTVVI
QNNGNTTANT VTLTDLVPEG TALIPNSVTI NSISIPGADP NVGIPLNSIA PSEIVTVTFQ
VIVQSIPSVN PISNIARIDY TFIADPTAPI VSRTITSNPA FTQISDANVL SLKAVNAQQA
TTGDILTYTI TLENTGNIPA TNLIFSDTIP VGTTFVENSF TLNGTAILGA NPNVGVTLPN
LAANATHLIA FQVLINDPFS QQSITNQSNT TYTIQPDPGQ PPITETSTSN IVITNFVQAQ
LTITKTSNPI TVDIGGTILY ISEVKNSGNV DAINIIFTDS IPVGTTFVLD SVTINGVLQP
DANPENGIPI GTIPPNSSKT ILFQVQTNNP PTETEIVNQS SVTYQYVSIP TAPPVNRSAN
SNIVTTSLQN ANIISVKSAD VTFVSIGQFI TYTNTLQNIG TVPANNTVFI DNIPEGTIFI
EDSLSINNVI QPGTNPENGV TLGTIQPDET VTISFQVQLT NIPEGNTVIN ISDTSYEYQI
DPSSPIIQRR SLSNTVNTEV RTANVSAIKS ANRSITRIGQ IITYTIAVTN AGTVPITNTL
LIDAIAAGTT FVPDSILVDG IPRPNENPST GISLNIILPN NTIIVTFQVN VDSIPSQNNM
NNIAVIHYEY QPDQSLPPIS ETTSSNTTNI QFIDAILIAT KSANTVLANI DENIEYTVLI
QNNGSTTTNS IFFTDIIEDG TVFISGSVTV NNTVLPAADP NIGFSIPNIT SGQVATITFQ
VSVTNLPAAN PTPNTANIVY DFIFNPDFAP IQKSTTSNTT FVQINDADIV SLKTVDLTSV
TIGDVLTYTT TLTNTGNTDA TAVVFTDNIP GGTTFIDGSV LVNNIPQLNA NPSTGILVGT
IAPNISIPVT FSVTVVALPT SGHVQNQATS RYTINGEEQI STSNITFTEV ISANIIAVKT
TPIQYADLQT IIPYTISITN NGNIQVENII VTDIIPANTN FIENSVIVNG NTRPNDNPLS
GIPIDNILPN TTATVLFQVR VTSIPQTNPI SNTSTIEYEY TVGDQPPITK TIISSAALTE
INHANLNSNK AVDLAYAMVG DTLTYTITLN QTGNVAANDV IIQDMIPQGT TFIENSVIVN
GEALPGVDPA SGIPIGTIIV DGDAIASFQV TVTSIPIRNE LNNQAISTFN YIVNPNNVPV
TNTTTTNTVT TTVQNDNIIA IKAVDFTSAL PGQTLTYTIT ITNNGNITIE DLLLVDTAPV
DTTFVIGSVT INGINQPNAN PENGILLGTL APNDSVIITF QVTISSSTLQ STINNDATIF
YTPIVGLIEP PITITRQIDI VTKQTNTVTT TIIDPMVHIE KTADKSIVVL GDILTFTLEI
FNDSPIPTVS TSVIDTIPAG TTFIENSVTL NGTPVPNVRP DTSMNIGSLP ADAVAILTFK
VLVTSIPSNS SIINSATVTA AFQLTPQEPI ITFIVNSNIV RIPVQFVTAT ATKSASVTSA
YLNQYFDYTV RITNTSEISL SNISLQDAIP AGLQFINGTV FINEERSPLA NPNIGFLVAT
NLEPNETIIV LFTVQVINPP VNNEFKNTAN ISLQLQVSPT DPPITETVTS NENIVIFVPE
NQDEIVPNLS CFFDGERFIR ITPQNIRNYL WTWIWWR