Gene BCG9842_B3691 details


Gene Information

Locus tag: BCG9842_B3691
Symbol:
ID: 7183425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 1515635
End bp: 1530667
Gene Length: 15033 bp
Protein Length: 5010 aa
Translation table: 11
GC content: 38%
IMG OID: 643549363
Product: hypothetical protein
Protein accession: YP_002445033
Protein GI: 218896622
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
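
The coordinate and length fields in this table are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that the start and end coordinates are both inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields.
START_BP = 1515635
END_BP = 1530667


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp)                     # 15033
print(protein_length_aa(length_bp))  # 5010
```

Both results agree with the Gene Length and Protein Length fields.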


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATTA CGAATCGATT TTCTACCACC ACTAACGGCG CACTCGCTAT TACAGGTAAT 
ACTTTAGGCT TAAGTAAAAT TAGTAATCAA AACCGTGCTG GTACAATCGG GGCCATTGGT
GCGTTCGTAA CTACGAATAC AGCTTTACAA GTTCCTACTT TTCCTGCTGG TACAACGTTA
AACTATACAC AAAATAGTTC TACCGCTATT TTAAATATCC CTGCTGGCAG TACTATTCTT
TACGCAGAAC TCATATGGGG AGGAAATTAT TTAACTCGCG ATCAAAATAT TACAAGTGTT
CTAGGAAATC CCATTTCATT TACGACACCC GTTTCAACAT ACTCAATTAC TCCTTCAGCT
GTTACAGCTT CAAATCAAAC ATTCGTTTCA AACTCTATTA CATTTGGCTT TTATACTCGT
TCAGCAGACG TAACAACTCT TATTCAAGCG GGAGGTTCTG GTTCCTATAC TACAGGAGCT
GTCCCTGGAC TTGTTGATCC TTTAGATGCT TCAAATGGAT CTATTAACTC GGCTGGATGG
ACACTCATTG TCGCTTATCA AAACGGGACA TTGCCTGCAA GAAACTTAAC CATTTATGTA
GCGGGTAACC GTGTTTCTGC TGATACCGGT AGTGCAGATG TAGCTGTTTC TGGATTTTTA
ACACCTTCAG GAGGACCTGT AAGCGGGAGA TTATTTTTAA GTTCTATTGA AGGAGATGCA
GATTTAATAG GGGATCAAGC ATTATTTGGA CCAAATTTCA GTTCATTAAA TGCTTTATCT
GGGCCAAATA ATGCTGTAAA TAATTTCTTC GGTTCTCAAG TTAATAATGC TGCTGGAAAC
TTAGATACAA CTGGGACATT TGGAACGCGA AATCAAAGTG CTTCTACAGG CACAAACATC
TCCGCTGGGA GACAGGGCTG GGATATTACT TCTATTGATA TTTCTCCTTA TTTAACAAAC
TCACAAGTGT CCGCCGCAAT CCGTTTAACA ACTAACGGAG ATGCGTATAT GTTAAATACA
GTCGGTTTAC AAATCAACAT AAATTCACCT ACTATACAAG CAACAAAAAG CGTGAATAAA
AGTGTTGCTG CAATTGGAGA TATTCTCACT TATACAGTTA CGATCCCTAA CACCGGACTT
CTTCCGGCGA ATAATGTTAC CTTTACAGAC ATTCTCCCTA ACGGGACTTC CTTTATACCT
GGCACTGTAA CAGTAGATAA TGTCCCGCAA ACAAACGCAA ATCCCGCCGC TGGTATATCT
CTTGGAACAA TCAATAATGG TGCTTCTCGT ACAGTAACTT TCCAAGCTAC TGTCGTTTCT
CTTCCAAGTC AAAACCCTAT CTCTAATACC GCTAATATTA CATTTCAATA TACACCTATC
GCCGGAGGAA CAACCTTTAA CGGTCTTGCA ACAAGCAATT CTGCTGGGAC ACAAATTAAC
CTGGCAGATA TTAATGGAAC AAAATCAGTC AACAAAATTT TTACCGATAT TGGTGAAACA
TTAACTTACA GTATTGCCTT AGCTAATATA GGAAATATCG CTGCAACGAA CGTTATATAT
ACGGATCCAA TTCCTAGCGG AACTACTTTC ATTCCAGGAA GTGTAACTGT TAACGGAGTT
ACTCAAGCTG GAGCAAATCC CGCTAATGGT ATATCAATTG GTTCGATTGC CGCAAATTCC
ACTACTACTA TTTCATTCCA AGTGCTCGTT CCTTCTATCC CTCAAACAAA TCCAATATTA
AATAGTGGAA CAACAACATA TCAATATATT CCTATACCAA ATCAACCTGC AGTAAGTGGG
ACTGATACGA CAAATATCGT ATCTACTCAA GTGAATAACG CTACTGTAAC AATGGTAAAA
GCAGTAGATA AAAATTTTGC AGATATTGGT GATACACTAA CGTACACCGT TTCTTTTACA
GGTACAGGTA ATACAAATGC GAACAATATT ATTTTTACAG ATGTCATTCC TACTGGAACA
ACTTTTGTTT TAAACAGTTT AACAATAGAT GGCACTACAC AAGTGGGCGC AAATCCCGCT
AACGGCGTGA ACATTGGATC CATCCCAAGC GGCACAACAA AAAATGTTAC ATTTCAAGTA
GTAGTAAACA CAATACCCGC GTCAAATGTC GTATCTAATG GATCAAGTGC TTCTTATCAA
TATACCGTCA ATCCTATCCA ATCACCCGTT ACAAAAAACA TTTCTTCTAA TCTCGTTTCC
ACTCAAATTA ACAATGCGAA TGTAACATTA ACAAAATCAA CTAACAAACA ATTTGCCACA
ATTGGTGAAA CGATAGGTTA TACAATCCTT ATTACAAACA GTGGAAATAC AGCCGCTAAT
AACGTACAAC TAACAGATCC ACTTCCAAAT GGAACGATAT TAACATTAGG TTCGGTAACG
CTCAACGGCG TTTTGCAAAA TGTAGATTCT CTCGTTGCTT TACCTATTGG CACAATTCCT
GCTGGAGCTA CTTTTACCCT CTCTTTCCAA GTAACAGTTA TTAATATTAC TTCACAAAAT
CCTATCCTTA ATAACGCTTT TGCCTCTTAC ATTTACACTG TAAATCCAAG TCTGCCACCA
ACTTCAAAAA CAGCAAATTC TAATTCTGTT ACATCTACAA TTAGACTAGC AAATCTCCAT
GCTAATAAAT CTGTAGATAA GACGTTTGCG GAAGTTGGCG ATGTATTAAC TTACACCTTT
GCTCTTACTA ACGATGGAAA TGTTGCAGCG AACAACGTAT TACTATCCGA TTCCATCGCG
AATGGCACTT CCTTTGTACC GAACAGTGTT ATAGTTAACG GTGTTACTCA GCCGGGCATA
ACACCAGCTA GCATCAATAT TGGTAGCATC AATGCTAATA CTACAATTAC AGCTTCATTC
CAAGTAGTAA TAACTAGCAT TCCAAATCCA AACCCTATTT CAAATAGCGC TTTCGTCTCC
TACAACTTTA TCGTTGATCC AAACGCTTCA CCTGTAAATA AAAACACTAC TTCAACCACT
ACATTTACTC AAGTAAACGA TGCAAATGTC ATTTCAGCAA AAACAGTGGA TCAAGCGTTC
GCTACCGTAG GGGATATATT AACTTATACT GTTACTTTGA CGAATGCAGG AAGTGTTTCT
GCTGATAGTC CTACTTTCGT AGATACAAAT CCAGACGGTA CTACCTTTAT TCCAAACACT
TTTCTTATTA ATGGTGTACT CCAAAATAAC GCAGATCCAA ATATAGGTGT TCCACTATCT
TCCATTCCTG CTAACGGCTC ACTTACCGTC TCTTATCAAG TAACCGTCAC CTCTTTACCA
ACACAAAACC CAACGATAAA TTCATCTAGT ACACAGTATA GTTTTATTCT AAATCCGGGA
GATCCACCAA CTATAGAAAT ATCTGTAAGT AACACTGTAA GTACACAAAT TAATTTAGCT
AATGTAGTTA TTGTCAAAGA AGTAGATTTA ACTATCGCCG ATGTCGGGCA ACCAATCACA
TATACAATTT CTTTAGCTAA TCTTGGCAAT ACTACAGCAA ATAATGTTAT TGTTACTGAT
ATAATCCCTA ATGGCACCAC TATCGTACCA AATAGTATTT TTATAGGTGG TGCTTTACAA
CTAGGTGCAG ATCCCAGTAC CGGTCTTCAA GTCGGCTCCA TTCCTTCGGG TGGTTTTACA
ACGATTGTTT TTCAAATAAG TGCAAATGGG CTACCTTCAC CAAACCCAAT TCAAAACAGC
GCTTCACTTC AATATAGCTT TATCGCTGAT CCAAATTTAC CTGCTCTTGT TAGAAACGCT
GCTAGCAATA TAGTAACTAC ACAAATTAAT ACTGCTAATA TCATTGCTAC AAAGCTAACA
AGCACAAACT TCGCTAATGT CGGTGATATC ATATTATATG CGACTATTTT AACAAATAAC
GGGAATATCC CTGCTGCTAA TGTAACGTTT ACAGATATCA TTCCAGCGGG TACCCTCTTC
ATTCCTAATA CTGTAACGAT TAATAATGTC CCTATAGCTA ATGCAAATCC TGCTAACGGC
ATTTCGATTG GTATGATAGG AGCAAATTCA TCACGCACCG TTGCATTCCA AGTTTTTGTA
CCAACTATCC CTGCTGTAAA TCCGATTACA AATCAATCAG GCACAACATT CCAATATACG
TATGATCCAT CCAAACCCGC TGTGATGCAG ATGGTTGCTT CTAACACTGT ACAGACAACT
ATTAATAACG CCTCAATTGC CGCTACTAAA TCGGCAGATA AACAGTTTGC TAACGTAAAT
GATATTATTA CGTACACGAC TACTTTAACG AATAACGGGA ATACACTTGC ATCAAATGTA
ATATTTACAG ATGTAATTCC AAACGGGACA TCGTTCATTC CTAATAGTGT TTCTGTAAAC
GGGAATACAC TGCCTAATGT CAATCCAGCA AGCGGAATTG CAATCGACCC AATAAATCCG
AATGCAAATA CACTCATCTC GTTCCAAGTA CAAGTAAATT CTATCCCGAA CCCGAATCCA
ATCCCGAACC AAAGTAATAC AACGTATCAG TATGTCATAG ATCCTAACTT ACCTCCAGCA
TCCGCTAATG CGCTGAGTAA CGTAATAACA ACTCAAATCA ATAATGCCAC GATTATCGCT
ACAAAATCAG TGAACACACC GAATGCTGCA ATTGGAGATA TCGTTACTTA TACAATTGCA
GTTACGAACA CAGGAAATAT CCCTGCTAGC GCTACAGTTT TAACAGATGG GCTTGGACCA
GGTGCATCCT TCATCCCAAA TTCCGTTACA ATAAATAACG TTTCCCAACC TGGATTAGAT
CCTTCATTAG GGATTCATTT AGACGATATT TCACCAGGAA ACACTACTTT CATTACATTC
CAAGTGAAAA TTCTGGCTAT TCCACCTAGT GGAACTTTAA CGAATAATGC TCTTGTAAAC
TACGAATATG CAGTGAATCC AGCTGAAACG CCAGCTATTG GTAGTACCGT TACAAACACA
ACAGTTACAC CGATTATTGA CGCTACTTTA GTAATAAATA AAAGTGCTAG TACAACTTTC
GCTACGATTG GTGATACAAT TACATTCACT TCATCTGTTA CAAACACAGG AAATACTACT
GCGAACAACA TTGTTTTTAC AGATACAATT CCAAACGGTA CTACCCTTGT CCCAAATAGC
TTTAAGATAA ATGGTGTAAC CGTCCCGAAT GCAAATCCAC AAAACGGTAT CAATATTGGT
AACTTAAATT CAAATGCATC GGTTACACTC AGTTTCCAAG TAAACATCAC AACGCTTCCA
AACCCTAATC CAATTCCGAA CAAATCATCG CTTCAATATA GCTTTATTGT TGATATAAAT
GAACCGCCTG TTTCACGAAC AGTTCAATCC AATACAACTT TTACGCAAGT AAATTCAGCT
TCCGTTATCG CAACGAAAAC TGCGAGCAGT GCATTTGCCG CTGTTGGCGA TACAATTACG
TATACAACTA CTCTCACTAA TAGCGGAAAT ACTACCGCAA ACACACCTGT TTTTATCGAT
ATATTACCAC CTGAACTTTC ATTCGTTCCT GATAGCGTAC AAATTAATAC CATCCCACAA
CTTGGATTTA GACCAGATAG CGGGATCTCT TTAGACTCAA TTCCAGTTGG AGGAACGACA
ACAATTAGCT TTCAAGCTAT CGTTGGCTCA ATACCAGCTA CAAATCCAAC TTTGAACCAA
TCCAGTACGA CATACTTGAT CATTGTTGAC CCTACCCAGC CACCGATGAC AGAAACAGCT
ACAAGTAATC CAACTTTAGT TCAAATTAAT GAAGCAATTA TTCAAGCAAC GAAAAGTGTG
GATCGACTAT TTTCTGACGT AGCACCTGGA AATTCATTTT TAACGTATAC AGTTTTATTA
GAAAATGTAG GGAACACAAC TGCTACGAAT ATCATTTTTA CAGATCCTAT TCCAAATAAC
ACAATATTTA TAGAAGATAG CGTTCGAGTC GGCGGAGTTT TATTGCCTGG GGTAAATCCA
GCGAATGGAA TACCAATTGG GGATATTATT GCAGGAGATT TTACAAACGT CACCTTCCGC
GTTCAAGTAG TTAGTATTCC AAATCCAATT TTCACAATTG GACCTGGTGG ACCAAACTCA
CCGGTTGTAA ATAGTGCCTC CATTGACTAT CAATTTATTA CAGGACCTAA TTTACCACTC
GTTTCAAGAA GTACGACATC CAATCCCGTT GCGACACAAA TAAATTCCGG AGAAATTGTG
GCAATCAAAT CTGTAGATAA AACTTTCGCA ACGATTGGCG ATACAATTTC TTATACAATT
ACATTAAGTA ACCCTGGAAA TGTCACTTCA CAAAATATAA TTTTCACAGA TATTTTACCT
GACGGAACGA CATTCATATC TGGCACTCTT ACAAACGATT CTGGTACACA GCAAATCGGA
AATCCAGCTA GCGGGATACA GATTGGAAAT ATAAACCCAA ATGGAACGGC CGTTATTTCT
CTCAACGTAC TTGTTACAAA CATTCCAAGT ATAAATCCTA TCTCTAATTT TAGTTCCATA
CAATTTGAAC ATGTGATTGA TCCTAGCCAA CCTTCTGCAT TACAAACAGC TGTATCTAAT
ACAGTTTCAA CGACCATTAA TAGCGCAGTA TTAACTACAA CAAAAAGTGT TGATAAATCT
ATTATTTCCG TCGGGGATAC ACTTACGTAT ACAACGACTA TTACGAATAC AGGAAATACA
CCAGCTACAA ATATAACTTT TACAAGTGCG ATTCCTCCTA GTACTACTTT TGTACCAGAT
TCGGTCACTA TAAATGGCAT CCAGCAGCTT GGTGCACAGC CAGCACTCGG AGTAAATATA
CCAAATATCG CTCCTGGTGA AACAGTAACT GTTACTTTCC AAGTAAATGT GATTTCTGTT
CCCCCTTCAA GCTCAATTAT GGGTAACGAT ACAATTTTAT ATTCTTATAC TGTTGATCCA
AACGGAACTC CTGCTACTAC TTCTACTTCA ACAAACATTG TTACAAACCC TGTATTAGAT
GCTATGATAA TGATGATTAA ATCAGTTGAT CAAACAATTG TAACGCTAGG TGATACCATT
ACCTATACGA CAATTTTAAC GAATAATGGT AATACAAATG CAACGAATAT TACTTTCACC
GACCTTATAC CAGATGGTAC AACGTTTATT ACTGATAGCG TTACAATAGA TGGCATCACG
CAAATTGGTC TTAATCCTAA TACAGGTATA ACGATTGGAG CAATTGCTCC TAACAGCTCA
ATATCTATAG CATTTCAAGT TACCGCTACT TCTACACCTG TTCAAAATCC TATTGCCAAT
TCCGCTAGTG CTTCTTACAC ATTTATCGCT GATCCAAATG CCCCTATTGT TTCAAGAAAT
GTTACTTCAA ACACAGTGTT CACTACGATT AATACAGCTA CCATTCTTTC ATTAAAACAA
GTCGATAAAT CCTTTAGTCG TATTGGAGAC ACACTCACTT ATACTGTCGC TTTAACAAAC
AATGGAAACT CATCCGCACA AAATGTTATA TTTACAGATA CAATGCCGAG CGGAACTACT
TTTATTGCAA ACACATTCTC TATTAATGGG GTTCCTCAAA GTGATGCGGA TCCATCGAAT
GGTGTGAATA TTGGGACTAT AACAGCCGGG ACTACAGTAA CCGTTTCGTT CCAAGTTACT
GTAACGTCAT TACCAACGGA AAACCCCATT GTAAATTTCT CATCAACATC GTACCAATTA
GTCTCACCAC CTGATGCAGA AACTTCAATT AGCAATCCTG TTTCAACGCA AATTAAAGAA
GCCATATTAT CCATGACGAA AAATGAAAGT GTATCCTTTG CAGATATCGG GCAAACTGCT
TTTTATACTA CTTCTATTAC CAATGTAGGA AATACCGATG CAACTAACAT TGTTTTCACA
GATGTATTAC CAAGTGGACT CACATTTGTT CCTAACACAT TAACTGTCAA TGGCGTTTTA
CAACCTAACG CGAATCCAAA TACAGGTGTA TTACTTGCAA CACTTCCACC AAATGAAATA
TATAGTATCG TCTTTCAAGT GACAGTGAAC AGTATTCCTC CTAGTAACCC AGCACCAAAT
ACAGCATCAA CGACGTATGA GTTTACTGTT GATCCAGGTA ATCCGCCAGT ATCGAGTACA
GCTACTTCCA ACACTACACT CCTTCAAATA AACAACGCAA CTATTATTAG CACAAAAACA
GCAGATCTTA CTTTCGCGGA TGTTGGTAAT ACAATAACAT TTACACTTAA CCTCCCTAAT
ACCGGGAATG TGACTGCAAC TGATGTAACT ATTATTGACA TTCTTGATAG TAATTTAAGT
TTCGTTCCAA ATAGTTTCAC AGTTAATGGG CAAACCATTC CAAACGCTGA TTTATCTACT
GGTGTAAATA TTGGTTCCAT TAATGGTGGT AATACGGCAA TCGTTACATT CCAAGCAACT
GTTATTACAC TTCCAACACT TAATCCCATT TCTAACTCTG CTTCTATCAC ATATCATTAT
GTCGTTGATC CTAGCCAGCC ACCTATTACA ACTTCCAATC AATCTAATAC AACGACAACA
CAAATTAATA GCGCTACCCT TACTGCACAA AAAAATTCAA ATGTATCTAC GGTAGATATC
GGGCAGGATA TTACCTACAC CGTTACAATT ACAAATAGCG GAAATGTTAG TGCGACGAAT
GTTATTTTTA CCGACCTTAT TCCAGACGGA ACTTCCTTTG AACCGAATAG TTTTACACTG
AACGGAACTA GCATCCCAAA TGCAAATATC ATTACAGGCG TTCCAATTGG TGATATTGCG
CCAAACGAAT CTGTCATCGT AGCATTCCAT ATTAATGCCA ATGAAATTCC GCCTATAAAT
CCAATTACTA ATCAAGCTAG CGTTAGCTTT CAATATATCG TTAATCCAGC TAATCCTCCA
GTTTCAAAAA ACATTACTTC TAATAGCGTT TCAACACAAA TTGAAAGTGC TATTTTAAAT
ACGATTAAAA TAGGAGATAA AGCATTTGCA ACGATTGGTG ATACGATTAC GTATACAACA
ACTATTACGA ATACAGGAAA TATCCCAGCT AACAATGTTG TATTCTCAGA CCCTATACCA
TCGTGGACAC AATTTGTTGC AGGATCAGTT GTTGTTGATG GAACTCCATT AACATCCGCT
TCTATCATTG ACGGCGTTGG CATAAATACA ATAACTCCAA ATCAAATCGT AACAATCGTA
TTCCAAGTTC AAATTGTAAG CAACCCAACA ACGCTCACAC CTGAACTCCA AAACTTAGGA
TTTGTTAACT TCCAATATAA CGTAGGCAAT GCATTACAGG CTCAACCTGG CAATGTGGAA
ACGAACGTCT TCGTTACCTC TATTAATTCA GCAATACTTT CAGCTGTAAA AACTACTAAT
ACAGCCTTTG CAAATATTGG AGATACAATC ACTTATACAG TTTTGATTCA AAATAATGGC
AATACAAACG CTACGAATGT AAATTTCTCA GACATGGTTC CAGCAGGAAC AACCTTTGTT
GAAAATAGTT TTACTGTAAA TGGAAGTAGC CTTCCAGGCG CAAATCCAAA TAACGGAGTT
AATATCGGAA CAGTTAACGC GGGTAGTTCC TTAACCGTTA CTTTCCAAGT CATTGTTACA
TCAACCCCAC CTTCAAATCC GATTACAAAC GTAGCATCTA TTCAATACGC ATTCATCGTT
GATCCAGCCT CTCCTCCTGT TACAAGTACG ATAAATTCTA ATAGCGCTTC AACACAAATT
AATAACGCGA CTGTTACAAC TGTTTTACAA GCAAATCGAA CAATCGTATC TATCGGAGAT
GTAATTACGT ATACAGCAAC ATTAACGAAT ACTGGAAACT TCCCTGCAAA CTCTGTATTA
CTCATTAACG GAGTTCCTGA AGGAGCATTA TTTGTTCCAA ATAGTGTTAC GCTCAACGGG
ATTTCACTTC CAGATGCAAG TCCAACTCTT GGTATTCCAG TTGGTATTAT CGCACCAGGT
GATTCCGCTA CTATTACGTT CCAATTTCTT GCAAGTTCTA TTCCACCGCA AGGAGCGATT
ATAAATCAAG CACTTACAAG TTACACGTAT ATTGTCGATC CAAGTCAACC TCCTGTTACA
GCAACATCCT CGTCTAATAC GGTTAATACA GCTGTTGTTG ATGCATCGTT ATCCGCAATT
AAAAGTACAG ATTCTCTCGT ACAATCTACT GACGGTACAA TCACTTACAC AGTAGTCGTT
CAAAACAACG GAAATACTAC TGCAAATACA GTTACTTTAA CAGATTTGGT CCCAGAAGGA
ACTGCATTTA TTCCGAATAG CGTAACTATT AATAGTGTTT CCGTTCCAGG TGCCGACCCT
AACGTAGGAA TACCATTAAA CCCCATCGCA CCGTCAGAAA TCGTCACTGT AACATTCCAG
GTTATCGTTC AATCTATTCC AAGCGTGAAC CCAATTTCTA ATACAGCCCG TATTGACTAT
ACTTTTATCG CCGATCCAAC TGCTCCTATC ATCTCTCGAA CAATTACTTC GAATCCAGCT
TTCACACAAA TTTCGGATGC GACTATCCTT TCTTTAAAAG CAGTCAATGC ACAACAAGCA
ACAACAGGTG ATATTTTGAC TTATACAATA ACATTAGAAA ATACAGGAAA TATCCCAACT
ACGAATCTCA CATTTTCAGA CACTATCCCT CAAGGGACTA CCTTTGTAGA AAATAGTTTT
ACACTTAACG GAACAGCTAT ACTTGATGCC AATCCTAATG TAGGTGTTAC TTTGCCTAAC
CTAGCTGCAA ACGCTACGCA CCTTATTTCG TTCCAAATTC TTATTAACGA TTCATTCTCG
CAAGAATCAA TTACAAATCA ATCCAACACA ACTTATACAA TTCAGCCAGA CCCAGGGCAA
CCGCCTATTA CTGAAACATC TACTAGTAAT ATCGTCATTA CAAATTTCGT GCAAGCACAA
TTGACAATTA CAAAAACATC CAATCCAACA ACTGTTGATA TTGGCGGAAC TATACTTTAT
ATTTCTGAAG TAAAAAATAT CGGCAATGTT GACGCAATAA ATATTAATTT CACAGATTCT
ATTCCAGCTG GGACTACATT CGTTCCCGAT AGTGTCACAA TTAACGGTAT CCTTCAGCCA
GGTGTAAATC CAGAAAACGG AATACCGATT GGAACAATTC CAGCAAACAG TTCCAAAACG
ATACTATTTC AAGTGCAAAC AAATAATCCA CCTACTGAAA CCGAAATTGT AAATCAATCT
TCAGCAACTT ACCAATATGT AAGTATTCCT GCAGCCCCAC CAGTAAATCG TTCTGCAAAT
TCTAATATCG TTACAACATC ACTTCAAAAT GCAAATATTA TTTCTGTTAA AAGTGCGGAT
GTAACTTTCG TATCAATCGG GCAAATTATT ACCTACACAA ATACACTTCA AAATATAGGA
AGCGTTCCAG CTAACAATAC CGTTTTCATT GATAACATTC CAGAAGGTAC TATATTCATT
GAAGATAGCT TAGCAATAAA TAATGTAATT CAGCCTGGCG CTAATCCTGA AAACGGAGTA
ACTCTCGGCA CGATACAACC AAATGAAACA GTCACTATTT CATTCCAAGT ACAACTTACA
AATATACCAG AGGGCAATAC AGTCATTAAT ATTTCAGACA CTTCGTACGA ATACCAAATT
GACCCTAGTT CTCCAATTAT TCAGCGTAGA TCGTTATCAA ATGCAGTAAA TACTGAAGTG
CGGACGGCAA ATGTTAGTGC CAGTAAATCC GCCAATAGAT CCATTACACG CATTGGTCAA
ATCATCACAT ATACAGTCGC AGTTACAAAC GCCGGTACAG TACCTATTAC AAATACTCTC
CTAATTGACG CAATTGCAGC TGGGACCACA TTCGTTCTAA ATAGCATTCT TGTAGATGGC
ATACCGAGAC CTAATGAAAA TCCAATTACC GGTATCAACC TTGATATTAT CCTTCCAAAT
AATACAATTA TTGTTACTTT TCAAGTAAGC GTAGTCTCTA TACCGCCACA AAATAACATT
AATAATATCG CAGTCATTCA CTATGAGTAT GAACCAGACC CAAGCGCACC ACCAATTTCA
GAAACGACAT CTTCCAATAG TACAAATATA CAATTTATTG ATGCTATTCT TATTGCTACA
AAATCCGCTA ATACTGTATT AGCTAACATT GATGAAACCA TTGAATATAC AGTATTCATT
CAAAATAACG GATCGACTAC AACTAACTCC ATCTTTTTTA CAGATACTAT AGCGGATGGA
ACAGTATTTA TTCCAGGAAG TGTAATAGTT AACAATACTG TACTTCCTGC AGCAGATCCG
AATATCGGCT TTTCCATTCC TAATGTCGCA GCAGGTCAAA TAGCTACAAT AACATTCCAA
GTTTCCGTTA CGAATTTACC TGTTGTAAAT CCAGCACCTA ATACTGCAAA CATCGTCTAC
GACTTTATTT TCAATCCTGA CTTTGCACCA ATTCAAAAAT CTACTACTTC CAATACTACT
TTCGTTCAAA TTAATGATGC TGATATCGTT TCACTTAAAA CTGTTGATTT GACCTCTGTA
ACAATTGGTG ACATTTTAAC TTATACAACA ACTTTAACGA ATACAGGGAA TACGGATGCC
ACTGCTGTTG TATTTACAGA CAATATACCT GATGGAACAA CTTTTATTGA CGGTAGCGTT
TTAGTAAATA ACATTCCTCA GCTTAACGCC AATCCAAGTG CAGGTATACT TGTAGGAACG
ATTACTCCTA ACACTTCTAT CCCAGTCACA TTTTCTGTTA CCGTCGTAGC TCTTCCTGCT
AGCGGCCATG TTCAAAATCA ATCAACTTCT CGTTATACAA TAAACGCGGA AGAACAAATA
TCGACTAGCA ATATTACCTT CACTGAATTT ATTTCTGCTA ATGTAATTGC GACAAAAACA
ACGCCTATCC AATATGCTGA CTTACAAACT ATCATCCCTT ATACAATTTC CATCATAAAT
AACGGAAATA TACAAGTCGA AAACATTATT GTTACAGATA TCATCCCAGC AAATACGAGC
TTTATAGAGA ATAGTGTTAT CGTGAACGGC AATGCTCGGC CAAATGACAA CCCTCTTAAC
GGTATACAAA TTGATAACAT TCCGCCTAAT ACGACAGCAA CTATTCTATT CCAAGTACGG
GTTACTTCGA TTCCACAAAC AAATCCAATC TCTAACACGA GTACAATTGA ATATGAATAC
ACGGTACCAG ATCGACCACC TATTACCGAA ACTATTATTT CATCAGCTGC CGCAACAGAA
ATTAATCACG CGAATTTGAA TAGTAATAAG GCTGTTGACC TTGCATTTGC AACGGTTGGT
GATACGTTAA CGTATACGAT TACACTAAAT CAAACCGGTA ATGTTGCAAC AAATGATGTA
AACATTCAAG ATATTATTCC TCAAGGTACT ACGTTTATAG AAAATAGCGT CATTGTAAAC
GGAGAAGCTC TTCCAGGAGT GAATCCAGTA AGCGGCATAC CAATTGGCAC TATAATTGTT
GGTGGAGATG CTATCGTTTC ATTCCAAGTA ACTGTGACTT CTATTCCAAC ACCAAATGAA
CTAAACAACA ACGCAATTAC TACTTTTAAC TATATAGTCA ACCCAAATAA CGTACCCGTT
ACAAATACGA CTACAACAAA TACCGTCACA ACTACTGTCC AAAATGATAA TGTCATTGCC
ATAAAATCTG TTGATGTAAC GAATGCCTTA CCTGGTCAAA CTTTAACGTA TACAATTACG
ATTACGAATA GCGGTAATGT AACGATTGAA GACCTTCTTG CCATAGACAC CGTACCAATA
GATACGACTT TTGTTGCTGA TAGTGTTACG ATTAACGGAA TCAATCAGCC TAATGAAAAT
CCTGAAAATG GTATTACGTT AGGAAATCTT GCTCCTAATG AATCTGTTAT TATTACGTTC
CAAGTGACAA TATCTTCTTC TACTCTTCAA TCTACAATTA ATAATGATGC TTCTGTTTCC
TATACCGTTA TCATTGATCC AACAAAACCA CCTATTACAA TTACAAAACA AACAAATACC
GTTACAACGA CAGTCATTGA TCCGATGGTT CGCATTGAAA AAACAACTGA CAAATCTATT
GTCGTTATAG GAGATATCAT TACATTCACA TTAGCGGTAT TTAATCACTC CCCCATTCCG
ACAATCAGTA CTTCTGTTAT AGACAGCATT CCAGCTGGTA CAACATTTAT AGAAAATAGC
GTAACAATTA ACGGTACTTC GGTTCAAAAT GTTCGTCCAG ACACTGGTAT TAATATTGGT
TCTTTATCTG CAGATACAGT AGCAACTATA ACATTTCAAG TTCTCGTAAC TTCTATTCCT
TCAAACAGTA CAATTATAAA TTCTGCAACA GTTACCGCTG CTTTTCAATT GACACCACAG
GATCCAATTA TTACTTTCAT TGTTAATTCA AATATTGTTC GTATACCAGT TCAATTTGTA
ACTGCAACAG TCACGAAAAA CGCTTCCGTC AGCTCCGCTT ATTTAAACCA ATACTTTGAT
TACACGGTGC GTATTACAAA TACTTCCGAA ATTTCACTCT CAAATATTTC TTTACAAGAT
ACCATTCCAG CAGGTTTACA ATTTATAAAC GGCACTGTCT TTATTAACGG TGAACGCTCT
CCACTAGCGA ACCCAAATAT CGGTTTCCTA GTTGCCACTA ATTTAGAGCC AACTGAAACA
ATTATCGTGT TATTCACCGT ACAAGTAATA AGTCCACCTA TTAATAATCA ATTTAAAAAT
ACGGCCAATA TTTCGTTACA ACTTCAAGTC TCGCCTACCG ATCCACCAAT TACAGTAACC
GTTACAAGCA ACGAAAACAT CGTCACCTTT GTTCCAGAAA ATCCAGACGA AACACTTCCA
AATTTAAATT GCTTCTTTGA CGGTGAACGC TTCATACGGA TTACTCCCCA AAATGCAAGA
AATTACCTTT GGACTTGGAT TTGGTGGAAT TAA
 
Protein sequence
MPITNRFSTT TNGALAITGN TLGLSKISNQ NRAGTIGAIG AFVTTNTALQ VPTFPAGTTL 
NYTQNSSTAI LNIPAGSTIL YAELIWGGNY LTRDQNITSV LGNPISFTTP VSTYSITPSA
VTASNQTFVS NSITFGFYTR SADVTTLIQA GGSGSYTTGA VPGLVDPLDA SNGSINSAGW
TLIVAYQNGT LPARNLTIYV AGNRVSADTG SADVAVSGFL TPSGGPVSGR LFLSSIEGDA
DLIGDQALFG PNFSSLNALS GPNNAVNNFF GSQVNNAAGN LDTTGTFGTR NQSASTGTNI
SAGRQGWDIT SIDISPYLTN SQVSAAIRLT TNGDAYMLNT VGLQININSP TIQATKSVNK
SVAAIGDILT YTVTIPNTGL LPANNVTFTD ILPNGTSFIP GTVTVDNVPQ TNANPAAGIS
LGTINNGASR TVTFQATVVS LPSQNPISNT ANITFQYTPI AGGTTFNGLA TSNSAGTQIN
LADINGTKSV NKIFTDIGET LTYSIALANI GNIAATNVIY TDPIPSGTTF IPGSVTVNGV
TQAGANPANG ISIGSIAANS TTTISFQVLV PSIPQTNPIL NSGTTTYQYI PIPNQPAVSG
TDTTNIVSTQ VNNATVTMVK AVDKNFADIG DTLTYTVSFT GTGNTNANNI IFTDVIPTGT
TFVLNSLTID GTTQVGANPA NGVNIGSIPS GTTKNVTFQV VVNTIPASNV VSNGSSASYQ
YTVNPIQSPV TKNISSNLVS TQINNANVTL TKSTNKQFAT IGETIGYTIL ITNSGNTAAN
NVQLTDPLPN GTILTLGSVT LNGVLQNVDS LVALPIGTIP AGATFTLSFQ VTVINITSQN
PILNNAFASY IYTVNPSLPP TSKTANSNSV TSTIRLANLH ANKSVDKTFA EVGDVLTYTF
ALTNDGNVAA NNVLLSDSIA NGTSFVPNSV IVNGVTQPGI TPASINIGSI NANTTITASF
QVVITSIPNP NPISNSAFVS YNFIVDPNAS PVNKNTTSTT TFTQVNDANV ISAKTVDQAF
ATVGDILTYT VTLTNAGSVS ADSPTFVDTN PDGTTFIPNT FLINGVLQNN ADPNIGVPLS
SIPANGSLTV SYQVTVTSLP TQNPTINSSS TQYSFILNPG DPPTIEISVS NTVSTQINLA
NVVIVKEVDL TIADVGQPIT YTISLANLGN TTANNVIVTD IIPNGTTIVP NSIFIGGALQ
LGADPSTGLQ VGSIPSGGFT TIVFQISANG LPSPNPIQNS ASLQYSFIAD PNLPALVRNA
ASNIVTTQIN TANIIATKLT STNFANVGDI ILYATILTNN GNIPAANVTF TDIIPAGTLF
IPNTVTINNV PIANANPANG ISIGMIGANS SRTVAFQVFV PTIPAVNPIT NQSGTTFQYT
YDPSKPAVMQ MVASNTVQTT INNASIAATK SADKQFANVN DIITYTTTLT NNGNTLASNV
IFTDVIPNGT SFIPNSVSVN GNTLPNVNPA SGIAIDPINP NANTLISFQV QVNSIPNPNP
IPNQSNTTYQ YVIDPNLPPA SANALSNVIT TQINNATIIA TKSVNTPNAA IGDIVTYTIA
VTNTGNIPAS ATVLTDGLGP GASFIPNSVT INNVSQPGLD PSLGIHLDDI SPGNTTFITF
QVKILAIPPS GTLTNNALVN YEYAVNPAET PAIGSTVTNT TVTPIIDATL VINKSASTTF
ATIGDTITFT SSVTNTGNTT ANNIVFTDTI PNGTTLVPNS FKINGVTVPN ANPQNGINIG
NLNSNASVTL SFQVNITTLP NPNPIPNKSS LQYSFIVDIN EPPVSRTVQS NTTFTQVNSA
SVIATKTASS AFAAVGDTIT YTTTLTNSGN TTANTPVFID ILPPELSFVP DSVQINTIPQ
LGFRPDSGIS LDSIPVGGTT TISFQAIVGS IPATNPTLNQ SSTTYLIIVD PTQPPMTETA
TSNPTLVQIN EAIIQATKSV DRLFSDVAPG NSFLTYTVLL ENVGNTTATN IIFTDPIPNN
TIFIEDSVRV GGVLLPGVNP ANGIPIGDII AGDFTNVTFR VQVVSIPNPI FTIGPGGPNS
PVVNSASIDY QFITGPNLPL VSRSTTSNPV ATQINSGEIV AIKSVDKTFA TIGDTISYTI
TLSNPGNVTS QNIIFTDILP DGTTFISGTL TNDSGTQQIG NPASGIQIGN INPNGTAVIS
LNVLVTNIPS INPISNFSSI QFEHVIDPSQ PSALQTAVSN TVSTTINSAV LTTTKSVDKS
IISVGDTLTY TTTITNTGNT PATNITFTSA IPPSTTFVPD SVTINGIQQL GAQPALGVNI
PNIAPGETVT VTFQVNVISV PPSSSIMGND TILYSYTVDP NGTPATTSTS TNIVTNPVLD
AMIMMIKSVD QTIVTLGDTI TYTTILTNNG NTNATNITFT DLIPDGTTFI TDSVTIDGIT
QIGLNPNTGI TIGAIAPNSS ISIAFQVTAT STPVQNPIAN SASASYTFIA DPNAPIVSRN
VTSNTVFTTI NTATILSLKQ VDKSFSRIGD TLTYTVALTN NGNSSAQNVI FTDTMPSGTT
FIANTFSING VPQSDADPSN GVNIGTITAG TTVTVSFQVT VTSLPTENPI VNFSSTSYQL
VSPPDAETSI SNPVSTQIKE AILSMTKNES VSFADIGQTA FYTTSITNVG NTDATNIVFT
DVLPSGLTFV PNTLTVNGVL QPNANPNTGV LLATLPPNEI YSIVFQVTVN SIPPSNPAPN
TASTTYEFTV DPGNPPVSST ATSNTTLLQI NNATIISTKT ADLTFADVGN TITFTLNLPN
TGNVTATDVT IIDILDSNLS FVPNSFTVNG QTIPNADLST GVNIGSINGG NTAIVTFQAT
VITLPTLNPI SNSASITYHY VVDPSQPPIT TSNQSNTTTT QINSATLTAQ KNSNVSTVDI
GQDITYTVTI TNSGNVSATN VIFTDLIPDG TSFEPNSFTL NGTSIPNANI ITGVPIGDIA
PNESVIVAFH INANEIPPIN PITNQASVSF QYIVNPANPP VSKNITSNSV STQIESAILN
TIKIGDKAFA TIGDTITYTT TITNTGNIPA NNVVFSDPIP SWTQFVAGSV VVDGTPLTSA
SIIDGVGINT ITPNQIVTIV FQVQIVSNPT TLTPELQNLG FVNFQYNVGN ALQAQPGNVE
TNVFVTSINS AILSAVKTTN TAFANIGDTI TYTVLIQNNG NTNATNVNFS DMVPAGTTFV
ENSFTVNGSS LPGANPNNGV NIGTVNAGSS LTVTFQVIVT STPPSNPITN VASIQYAFIV
DPASPPVTST INSNSASTQI NNATVTTVLQ ANRTIVSIGD VITYTATLTN TGNFPANSVL
LINGVPEGAL FVPNSVTLNG ISLPDASPTL GIPVGIIAPG DSATITFQFL ASSIPPQGAI
INQALTSYTY IVDPSQPPVT ATSSSNTVNT AVVDASLSAI KSTDSLVQST DGTITYTVVV
QNNGNTTANT VTLTDLVPEG TAFIPNSVTI NSVSVPGADP NVGIPLNPIA PSEIVTVTFQ
VIVQSIPSVN PISNTARIDY TFIADPTAPI ISRTITSNPA FTQISDATIL SLKAVNAQQA
TTGDILTYTI TLENTGNIPT TNLTFSDTIP QGTTFVENSF TLNGTAILDA NPNVGVTLPN
LAANATHLIS FQILINDSFS QESITNQSNT TYTIQPDPGQ PPITETSTSN IVITNFVQAQ
LTITKTSNPT TVDIGGTILY ISEVKNIGNV DAININFTDS IPAGTTFVPD SVTINGILQP
GVNPENGIPI GTIPANSSKT ILFQVQTNNP PTETEIVNQS SATYQYVSIP AAPPVNRSAN
SNIVTTSLQN ANIISVKSAD VTFVSIGQII TYTNTLQNIG SVPANNTVFI DNIPEGTIFI
EDSLAINNVI QPGANPENGV TLGTIQPNET VTISFQVQLT NIPEGNTVIN ISDTSYEYQI
DPSSPIIQRR SLSNAVNTEV RTANVSASKS ANRSITRIGQ IITYTVAVTN AGTVPITNTL
LIDAIAAGTT FVLNSILVDG IPRPNENPIT GINLDIILPN NTIIVTFQVS VVSIPPQNNI
NNIAVIHYEY EPDPSAPPIS ETTSSNSTNI QFIDAILIAT KSANTVLANI DETIEYTVFI
QNNGSTTTNS IFFTDTIADG TVFIPGSVIV NNTVLPAADP NIGFSIPNVA AGQIATITFQ
VSVTNLPVVN PAPNTANIVY DFIFNPDFAP IQKSTTSNTT FVQINDADIV SLKTVDLTSV
TIGDILTYTT TLTNTGNTDA TAVVFTDNIP DGTTFIDGSV LVNNIPQLNA NPSAGILVGT
ITPNTSIPVT FSVTVVALPA SGHVQNQSTS RYTINAEEQI STSNITFTEF ISANVIATKT
TPIQYADLQT IIPYTISIIN NGNIQVENII VTDIIPANTS FIENSVIVNG NARPNDNPLN
GIQIDNIPPN TTATILFQVR VTSIPQTNPI SNTSTIEYEY TVPDRPPITE TIISSAAATE
INHANLNSNK AVDLAFATVG DTLTYTITLN QTGNVATNDV NIQDIIPQGT TFIENSVIVN
GEALPGVNPV SGIPIGTIIV GGDAIVSFQV TVTSIPTPNE LNNNAITTFN YIVNPNNVPV
TNTTTTNTVT TTVQNDNVIA IKSVDVTNAL PGQTLTYTIT ITNSGNVTIE DLLAIDTVPI
DTTFVADSVT INGINQPNEN PENGITLGNL APNESVIITF QVTISSSTLQ STINNDASVS
YTVIIDPTKP PITITKQTNT VTTTVIDPMV RIEKTTDKSI VVIGDIITFT LAVFNHSPIP
TISTSVIDSI PAGTTFIENS VTINGTSVQN VRPDTGINIG SLSADTVATI TFQVLVTSIP
SNSTIINSAT VTAAFQLTPQ DPIITFIVNS NIVRIPVQFV TATVTKNASV SSAYLNQYFD
YTVRITNTSE ISLSNISLQD TIPAGLQFIN GTVFINGERS PLANPNIGFL VATNLEPTET
IIVLFTVQVI SPPINNQFKN TANISLQLQV SPTDPPITVT VTSNENIVTF VPENPDETLP
NLNCFFDGER FIRITPQNAR NYLWTWIWWN
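
The protein sequence is the conceptual translation of the gene sequence. As a minimal sketch, the first and last lines of the gene sequence can be translated and compared with the corresponding ends of the protein sequence. The codon table below is the standard genetic code. Translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, so it is sufficient here. The helper name `translate` is illustrative.

```python
# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCCCATTA CGAATCGATT TTCTACCACC ACTAACGGCG CACTCGCTAT TACAGGTAAT"
last_line = "AATTACCTTT GGACTTGGAT TTGGTGGAAT TAA"

print(translate(first_line))  # MPITNRFSTTTNGALAITGN
print(translate(last_line))   # NYLWTWIWWN
```

The two outputs match the first 20 and last 10 residues of the protein sequence, and the final codon TAA is a stop codon.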