Gene OSTLU_43542 details

Gene Information

Locus tag: OSTLU_43542
Symbol:
ID: 5006508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 738
End bp: 13925
Gene Length: 13188 bp
Protein Length: 4395 aa
Translation table:
GC content: 51%
IMG OID: 640421929
Product: predicted protein
Protein accession: XP_001422630
Protein GI: 145356835
COG category: [Z] Cytoskeleton
COG ID: [COG5245] Dynein, heavy chain
TIGRFAM ID:
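
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 738
end_bp = 13925
gene_length_bp = 13188
protein_length_aa = 4395

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1
codons = span // 3

# The span should equal the reported gene length and be a whole number of codons.
assert span == gene_length_bp
assert span % 3 == 0

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
assert codons - 1 == protein_length_aa

print(span, codons - 1)  # 13188 4395
```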


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCAAA GGGCAATCAA AAGTATAGAA ACGAGTTCAC CGTTGACCGA TCTCGCGGCT 
CATGCGCACA CGTTCATGTC CAAATTGCTC GATGCGTCGA ATTCCCATCG AACAGTGCTG
TACGTTCCAG ATGATCAGAG TACTGGTAAC TTGGAGAGCG TCGTCGCAAA CTGGTCCCGT
CAACTTCGCC AGGTGCTCGG GCGCGGCGGG GACGTCGGCG AAGACGACGA GAGCAAGCGC
GACGCACAGA GACTGGACGA ACGAGTACAG TCCGATGGAC CTTTGATGCG CATCATGAAG
TTTTGGCCCG AACGCGCGAC GGACTTGCGC GGTGTTCAGG AACAGCTGCA ATCGGAAAGC
GTTCGGTTGA TTGTGAAGAA ACTCGCGTCG CAGAAAAACG CAAACTACCT GTGCGATGAC
ATCGCGGCGT TTGAACAGTT GATCGTCGAC AGAGTTCACT TGGCTGAACG CATCAGTGAC
ATGTTCAGCC GCGTCAAGCC CGCATGCGAA GCTTTGAGCG TCGCACGACT GGCAGACGTC
CCGCAAATCT TGCCGCGCCT CGTTGACGCA ATCTTCCTGG CGTGGCTGAA TACGGATTCA
AACATCAAAG ACGCGGTCAG TGATCAGTAT GTGGACCTGC TTAAACGTAT CGCCGATCAA
ATTATGATTA TTTGCAAAGG TGAGTTGAAT CTGAACCACA TCATTCAGAA CGATGGTTCG
AACGAAGAAG CGTACAACGG CGCAGTTTGT ACTCTTCGCG AGTGCGCATT CGTGGCTGAT
GTGTGGAAGA ATATATACTT CGAGCGCGCG GCGATCGAGA GCAGTGGAGC ACTGTTGATT
GATGCGACGA CCGTGTTTTC GCATTTGGAT GCATTCGCGC ATCGTTGCAG AGATCTGTTG
GAATTGTGCG AAGCGCAAGA ACAATTTGCG CCGCCGGCGT CTCGCTCATT CCCAAGCGCA
GCGAACGTGG TGCGTTACAG GCTCGACGAT CGACTTGAAG TAATCCGTCG CGTTTTCCTT
GATGTCGCGA TAAATGACTT GACCAAGATT GAGTATGATA TTTTGGATGT GTCCTCGATG
CGATGGCACG ATGATTTCGC CGCGTTCAAG TCTCGCGTGA AAAAGTTGGA GATGAAGATG
TGTGCCATAT TCGGCGAAGC GTGCGAACGG GCGGAGACGT TACAAGCGCG TGTTTCGCTC
CTTCGCTTGT TATTCACGAA AATTGCACGT CGAGATGGTG TTGTTCGCGC AGTCGAATCT
TGCGCCGCTC GCGCGTACTC TATGTTTGCC GAGGATTTGC AGTGTGCCAA ACGACAGTTT
GACGCAAATC GGATGAAGCC TCCGATAGAA TCGAGCTTGC CTCGAACGTT CGGTGCCGTG
CACTGGGCCA ATACATTGCT GCATCGTCTC GAGAACGACT TCGGTGAGTT ACGAAGGGCA
CACGCGGAGG GTGCGCTACC GCAACTGCCC GAATCCGTGG ATTTGCTCGC CGTGTACGAT
TTAACCATTC CACAAATAGA GCGCTTTATC AACGAAAAAT ACGACGAATG GGTCGTTTAT
GTGAACGCAA TGAACGTCGA AGAGCGCCTT GATCAGCCTT TGCTACGCGC CGACGATGAT
GGGTTATTGT TCATCACGTA CGCCGTCGAC GTAGGCGTTG CGTGCGAGGA TGCTCGAGGA
TTCGCATCCA TGTCTCGAGA AGTACCTGCC AATATCAAAG AAGTCGCGTT TGGTGGTATG
CGCTGTGCCC ATGATGCGTT GTGCGAACGA TTGAGTTCGC TCGTCGAAAG GAACAACACG
TTATTGCGGC AAATCGGTCG CGACAAGCAC GTCAATCGTC TTTTCCGCGC AAATGTCGAT
ATTTTGAAAA ATCTTTTCGC CGCGGCGTCA AAAGAGCTGA CATGGCGCTC CGAAATCTCC
ACCATCGACG CTTTTTGTGC AGAGTCGGAC AAACTCTGCT CGTTTATGAG TGAAAGTTTG
GAAAAGTTTA GAGGATCGGC TGCGCGCATC ACTGAAATTC TCGAAGAACT CTCGGCGGTT
TCATTACGGC GCAGGGAAGG TGACCACGTC GACCCGCTGG ACGAATTTAA GAACATCTAC
GAACAGGAAA CCGTTCATAT TCGCTCGAAG TTCGGAGAAA CAAGTGCGAC TATCTTACAT
CTCACTTCAC ATCAGCGACA GAGCATCGGT GACGTTGACA CCGCCAGTGA AGCGTGGATC
GCATATATCA AGGACATTGA AGACGCCGTC GTTCGTGCAT TGCGCAAGGT CTTGTTGCGC
ACGCTATCTC ATTTGCATCT CACGATGGGC CGAGTGGAGT CTGACTGCGC TCCATTGAAA
CCAGTCGTGC TGATGAGCAT GAACCTTTCA TCCTGCGATG TTGGCACCAT GCGAGTCGAA
ACTTCACCGC AAACCACGCA ATTGTGCGAA ACCATCAACA CCGTTTTCGA CGCCGCGCTC
GATGCGTACG GCTCCGTGCC TCAAATATCT CCGCTGAACG AGTCCATTCA AGATAGATTG
CTGCGTGATA ACCGGGATAT GATTTCGCGC GTAAGATCGA GTGTCACAAA CTCGGTGAAA
GACATCGCCG AAGATTTGAT GAGATTTGCC GCACGTTGGG AGGAGCGCTA TTTCCACTTG
TGGCGAGAAA ACGCCGTCGA CGTCAGCGAC AACAGCGCGA TTGAAGCCCG CATTCATCGC
GTGCAGGAGT CGATGGAGGA CGTCAGAAGT GAAGAGCACA CGCATGTGAT CGGGTTCGTC
AGAATCGAAT GTGGATATCT CAAGCAAAGT CTTTTGGCTT TAGCTCAGAA TAACGTGCAA
GGCATGATTT CAGAATTACG CGCCAAGGCG GCGACTGAAA TCGACGACGT TATGAGATAC
TTTGAAATGA GTGTCGCCCG TTTCGAACAG CCACCGAGTG ACATAAATGA GCTCGTCAAG
CACGTGAGCG AACTCAAGCA ACTACAAACA AACGTTGACT CTGTTCCTGC ATCATTCGTC
AAGCCGCGCG ATAGTTTTCG TATTTTGAAC GAATTCGTAG ACGTCACGGA GACGGAATCT
CAGCGCTTGG CGTCCATCGA TGACAAGTTT GAAGAGTTCA AGTTTCAGCT AGAGAAGATC
GAGTCAAGCC TCGATGCGAC AAAAGATGTG TTTCAGAATG AACTGCGAAC ATCAATCGTG
CAACTCGAAA AAGATGCGAG AGAGTATAAA GTGAAATTCG ACCGTTTGGC GCCGATGCAG
ACGCACCCGG CGTGCGCGAA ACGCGCGTGC GACAAGTCGT GGGAGTTTAT CGCGGGCGTT
GATGAGGTCA TCGAACAGTT TGATCATCGA CTGCGTGTGA TACTGAACGG TTTGGTTGTA
TTTGAAGATC TCGCTCCACC CACACTGGTC GATATGGAAG AATTAAAAGC CGACTTGGAA
CAACTCAAGA TGGTTTGGCG AGTGGTGCGA GAATGGAACG ACTTGTACGA AGGTTGGAAG
GATGGTAAAT TTAATGACCT AGATGTCGAG TCCATGGAAA ACGGCGCTAC TCTTTTGTAC
AAACAAATCA CCAAGCTCGG GCGAGGTAAG GTCAAGGAGT GGGGAACTTG GATCAGTTTG
AAAGATACGG TTGATTCGTT CAAGCGCATC ATGCCGCTCA TCGTTGACAT GCGTAATCCG
GCGGTTCGTC GACGACACTG GGAACTCGTC ATGGAGGCAT GTGGCAAGCA ATTCGATCCC
ACAAGCGAAG GTTTCACTCT GGATAAAGTC GTTGAACTTG GCTTGGACCA TTACGCCGAA
GCCATTTCCG AAATTAGCAC GGATGCTACG AAGGAATTAT CCGTGGAGAA TACGCTGCGC
GGCATCGCCG ACGTCTGGAC AAACGTTGTT CTCGACACCG GTCCGTTCAA GGAAGGGCGC
GACGACGTGA TGAAGTTGAG AAGTGCAGAT GATATATTTA CCGCATTGGA AGACAATACC
GTAACTCTGA GTACTTTAAA AGCGAGCAAA TTTTTCTCCG TCTTTGAGCG CACAATTACG
TCGTGGGAAA AGACGCTCGG TGTAGTAAAT GACGTCGTCG AGATGGTGCT CAAAGTTCAA
CTAGCGTGGA TGTACTTGGA GAATATCTTT ATTGGCAGTG AAGACATCGC GCGTCAGCTT
CCTAGTGAAA CGGAAATGTT TGGCACGATT AATACGCGTT TTATCAAGCT CATGCAAGAA
ATGCACAAGA CGAGCAACGT CGTTTTGGCG TGCACGGCGA TGCGCGCGCC CGACATTGGC
GACACCCCAG ACGTGTCTCT TCTCAATGAG CTGAGCGCAA TGGATTCGAA TCTCGAGCGC
ATACAAAAGT CACTCGATGA TTACCTTGAG TCGAAGCGCC AGATGTTTCC GCGCTTTTAT
TTCCTCAGCA ACGACGATTT ACTGCAAATT TTGGGACAAG CGAAGGAACC GCAGAACATC
CAGCCACATC TGAAGGGGAT GTTTGAGGGC ATTAAAAAGT TAGAGATGTA CGCTCCCGAT
CCCTTGACCG GTCGAAGACA TTGTGAATCT GTCGCAATGA CTTCTCCTGA CGGCGAAACG
ATCCCGTTCG ATAATCCTAT CCGCACCGAG GGCCGACCGG AGGAATGGCT GAACACAGTC
GAGGCGGCGA TGTACTCCGC AACAAGGACA CATTTGGCGA GCACGTTCGA GCAGTGCCGC
GCGAAGGGAA TCAAAAAAGA TAAATGGGTG AAAGACAATC CTGGACAGAT GTTGATTACC
GCTGGTTGCA TTTCGTGGAC GATGGAGTGC GAGAGGGCGC TTCGAGATCC GGAGAACGTG
AAGGAGGCGC TGAAAAAGTT ACGGCGTAAA TGGATTCAAT ACTTGAATAA ACTTGTTGAG
CTTACGCGCA CACCACTCGA CAAAGTGACT CGCAAGAAAG TGACGGCATT GATCACTATC
GAAGTACACG CGCGCGATGC GATAGAAAAA CTCATCAAGA CCGGATGCTC TTCTCCCAAC
GATTTTGAGT GGGTGTCACA GTTACGATTT TATTGGGATC GCGAGACGAG GCATTGCACT
GTGAAGCAAG TGCTCAGTGT TTTCGACTAT GGATATGAAT ACCAAGGTAA CAATGGCCGC
CTGGTTGTGA CACCACTTAC CGATCGCTGC TACATGACGC TCGGTGCCGC TATGTTTACG
CGTCGAGGCG GAAACCCACT CGGACCGGCG GGAACTGGTA AAACAGAAAC CGTGAAAGAT
TTTGGCAAGG CTTTAGCGCG GTACGTGATT GTATTCAATT GTTCGGATGG CGTAGACTAC
AAGATGACGG GTAAGATGTT TAGTGGGCTC GCGCAAACTG GCGCATGGGC GTGTCTGGAC
GAATTTAACC GTATCACGGT GGAGGTACTG TCAGTGGTGG CGACACAAAT TAGCGTCGTC
ATGGCCGCCG TGAAGCAGAA CTTGAAGATG TTCGATTTTG AAGGTCAGCG TATTCGCTTG
ATACCGTCAT GCGGTGTCTT CGTCACCATG AATCCAGGCT ACGCCGGTCG CGCCGAATTG
CCCGACAATC TCAAAGCAAT CGTGAGGCCA GTGTCAATGA TGGTTCCAGA TTTTTGTTTG
ATTGCCGAGA TCATGATGTT CAGCGAAGGC TTCACAAACG CGAAACCGCT GGCCAAGAAA
ATGGTTGCAA TCATGGAGCT CAGTCAGCAA CAGCTCAGTA AGCAAGATCA CTACGATTAC
ACCTTGCGTT CTTTCATCAT TCCAATATCT CGAGCCGCTG GCGCGAAGAA GCGTCAAGCT
CCGCAAGCCG ACGAACAATT GATCTTATTC AACGCGATGA GAGACCTGAT TATTCCAAAG
CTCGTGTACA TTGATATTCC ACTATTCAAG GCTCTTCTCA ACGACTTGTT CCCTGAAGTC
AACGCGCCGC ATGAGGACTC GGCTACTCTG CGAGAGGCAC TCGTCCAGGA GTGCCGACTG
AACAATTTGC AACCAGTGGA CGCATGGATT TCCAAGATTG TGCAAATTTT CGATTGCAAA
TCGGCCCGAC ACGGTAACAT GATTGTCGGT AAAACTGGCT CGGGCAAAAC TCGGGCGCGA
GAAATTCTGA TCAAGGCCAT GTCGCGCTTG AAACAAAGCG GCGTTCAAGG CGATTTCCAA
AATGTCGAAG TGTATCCAAT CAACCCACTC GCGTTGTCGA ACGACGAACT GTACGGGTCA
TTCGATGAAG CAACACACGA ATGGTCTGAC GGAGTTTTGG CCAAAATTAT GCGAAACGTG
TGCAAAGACG AATCGGCGAA TCAGAAATGG ATTCTCATGG ACGGGCCCGT CGATACGCTG
TGGATTGAAA GTATGAATAC GCTCTTGGAC GACAATAAAC TTTTAACGTT GCTTTCGGGC
GAACGAATCA TGATGAGCCC GCAAGTTTCA ATTTTGTTTG AAGTCGAAGA TCTGTCGCAG
GCTTCGCCAG CGACGGTCTC GAGAGCTGGA ATGATTTATT TCAACGTCGA AGATCTTGGT
TGGCAGCCGT ACGTGGCGAG CTGGCGATGT GAACGAAAAA ATAGAGATCG CAGTGATGTA
GACATAGAAG ACGCGCTGTC TAACTGCATG GACAAGTACA TGGACGAAGT TTTGGGGTTC
AAGCGTTCAA AATGTCGAGA GCCCGTGGCG ACGGATGAGT TAGCGAGCAT TCATCAGTTT
ACGACGTTGT TCGATGCGCA CCACATCCAG AGCGTCGGGG ACGTAGAGCC TATTTTTGTG
TTTTGTATAG TCTGGTCGAT AGGAGGGTCA ATTGATCACG CAAGCCGCTT ACGTTTCGAC
GCAATGCTTC ACCGTATAAT GCCTTTGAAG CTGTTCCCGC ACACGCCGGC GTCGGCGCCG
ACGGACACAA CTGTCTTCGA CTTTTACTAC GACGCGGAGC GCCGTGCGTT CGTGCCTTGG
GTGGAGAAGA TACCTACCTA TCATTTACCT CACGAAAACA CGCCGTTCTT CAAGATCATG
ATTCCCACCG TCGATTCCGT CCGTACAAAG CATCTAGCGA CGCTACTTCT CAAAGCCGGC
ATGAACACTC TCATCGTCGG TAACGTCGGC GCGGGTAAAT CAATGGTGGT AGACTCTTGT
CTTTCCGAGC TACCAGAAGG ATACATTGGA AGTAGAATAA CATTCAGTGC ACAAACGAGC
TCAAATTCGC TTCAGGAAAC AATCGAAGGT AAGCTTGAGA AGCGCTCAAA AGGAAGCCTC
GCGCCTCCCG GCGGTCGAAA GCTCATCCTG GCAATTGACG ATTTGAACAT GCCGAAGAAG
AGCGAGTTTG GTTTCATCCC TCCGCTCGAG TTGCTAAAAT TATGGCATGA CAATGGGTTT
TGGTATGACC GATCGAAGCA AGAGCGAACA CACGTCAGTG ATATGAAGCT TCTCGCGGCG
ATGGCGCCCC CGGGCGGCGG CAGAAATCCG TTCTCGCAGC GTGTTCTATC CATTTTTGCG
GTTCTAAACA TGGTGGATCC ATCCGACGCA CAGCTTGAGC GTATTTATGG CACAATTCTT
GGGGAAACGC AGGGTGGGTT CGACCAATCG ATCGCATCGA TCGGCACGAC TATTGCCAAG
GCGTCCATAG CGGTGTACAA CTCCCTCGCT CGCGAACTTC TCCCGACTCC GACGAAGAGT
CATTATCTGT TCAACACTCG CGATCTCGCA AAAGTCATTC AAGGAGTCAC TCGCGCTACA
AAACAGTTTT ACGACTCGAA AGAGTCCATC TTGCAACTGT GGATTCACGA AAACTTGCGT
GTCTTTGGTG ATCGTCTTTG GGATGTCAAC GATTCGTCGT GGCTCAGGAG GCAAATCGAC
ACCAACATGC GACTACACTT CGGAGTCTCG TGGAATGAGG TGCTCTCGAC GGGTGCGACG
ACGTCTGTCG CGAGTGAAGT CGACGAGAAA CTCAATGAAT GTCATCCATT CGTATCATTC
ATGCGCCAAG GTTTAGATGT TCCGCCGTAC GAAGCCGTCG TCGACGCGCC GGCCTTGAAA
GAGTTTCTCA CAGAAAAGCT TGAAGACTAC GGATTGGAAC CGGGCAACGC GCCTATGGAT
CTCGTACTTT TCAACGACGC AATAATGCAC GTTTGTCGAA TACACCGTGT CTTGACTCAA
CCTCGCGGAC ACGCGATGCT TGTGGGCGTT GGTGGAAGCG GTCGCAAATC GCTCGCTCGA
CTCGCTGCTT ACGTTGCCGA GATGAAATCA TTTTCAATCG AGATCACGAA GAACTACAAG
CAGCTCGAAT TCAGGGAAGA TCTCAAATCA CTGTATCGTC AAACAGGAGT CGCCGGCAAG
CCTACGGTGT TCGTGTTGGA TGACACGCAA ATCGTCAAAG AGACTTTTCT GGAGGACGTG
AACAACGCGT TGACGTCTGG GGAAATACCA GGTTTATTTG CAAAGGATGA AATTAGCGCG
ATTTGCGAGG ATATGCGCAA GATCGCAAAA GCACAGAGTA TTCGTGCGGT GACTCACGAC
GAGCTTTTCG CATTCTTCAT GGAGCGAGTG ATGCAAAATC TTCACATCGT CTTGTGCATG
TCGCCAATTG GAGACGCGTT TCGCGAGCGA ACGAGAATGT TCCCGGGGTT GGTTAATTGC
TGCACGATTA ATTGGTTCAA AGATTGGCCC GTTGACGCGT TGGAAGAAGT CGCGATGAAG
AAATTACGAG ACGACGACGT GAATGCGAAG GTGAAAGCGG ACTTGTGCAA AATTTTTGGA
ATGATTCACG CATCGACGGT TTCGACGGCG GATGAAATGT TTAATGCGAT TAAGCGCAAG
ATGTACGTCA CGCCAACGAA CTACATCGAG TTTGTCAACT TTTTTAGAGC ACTTATGGTG
GAAAAGAAGC GCGAGTTCAA CGCGAAAATT ACAAAGCTTC GAGGTGGTTT GACCAAGCTC
GCGGAAACAG AAGTCCAAGT TCGTGAAATG CAGAGCGTGT GCAAGGACAA GGCTGCAGTC
GTCGCACAGG CAAAGAAAGA CTGCGAAGAA TTGCTTAAAG TCATCGTCCA AGACAAACGC
GCAGCAGACG AACAAAGTAT GCGCGTGAGC GCGGAGGCGG AGCGCATAGA GGTTGAGGCA
AAGAAGGCGA ATGCAATCGC CGACGAGTGC CAACTAAAAC TTGACGAGGC GTTGCCTGCG
CTCCAAGAAG CAGAGGCAGC TTTGAACGTT TTGACGAAAA AGGACATGGG TGAACTCAAG
TCTTACGTTA AACCTCCAGC CCTAGTGGAA CTGTGCTTAA AGGGTGTGCT CACCGTACTC
AAGCGGCCGA CGACATGGGA TGAGTCGAAG AAGCAGCTTG GCGACTCTGG ATTCTTGGAG
CGATTGCTGA ACTTCGACAA GGACACGCTC GTCGACAGCT TGCTGACGAA GATCGCAAAG
TTTGTGAACA ATCCCGATTA TCAGCCTGAC GTCATCGGGA AAGTCTCCAA CGCCGCAAAG
GGGTTGTGCA AGTGGGTGCA TGCCATGTAC TCGTACGGAA ACGTCGCGCG GGAAATTGCG
CCAAAAAGAT TAATGCTCAA GCAAGCACAA GACGAGCTCA AAGGCAAGCA GGACGCACTC
GCGCTCACAC AAGCTAACTT GGCGGAAGTG ATGGCGAAAG TCGCCGCGCT GAAGGAAAAT
TACGAAAAGT CTGCGTCGAA TAAGGCGTCG TTGGAAAACG AACTCGCAGA TTTGGAGCTC
AAACTCGAAC GAGCCGAGGC GTTAGTCGAT GGATTATCCG GTGAGAAAAA GAGATGGGTG
AAATCGATTG AAGACTTTGA GTCGCAGATC GAGCGACTTC CGGGTGATGT GTGCATTGCC
GCAGCTTTCA TGAGTTACGC CGGGGCGTTT CCATCAGAGT ATCGCAAGGC GCTCGTGACG
GATTGTTGGA TGCCAATGTT GAAGGAGATG GCGATTCCGT GCACCTTGGA GTTTGACTTT
GCGAGTTTTC TCGCGAATCC TTCCGATGTG CGAGACTGGA ATATTCAAGG TTTACCAGCC
GATTCGTTTA GCACCGAGAA CGCCGTCATC GTCACTCGCG GTAATCGATG GCCGCTTCTC
ATCGATCCGC AGGGGCAAGG AAACAAATGG ATCAAATCGA TGGAGTCGGC GAATGGTTTA
ATCGTCACAA CTTTACACGC GCCTGATATG GTGCGACAAG TTGAACACGG CATTCAGTTC
GGCGTTCCTG TGCTCATTCA GGATGTGAAA GAAACGATAG ATCCGATTCT GGAAAACATC
GTGGCGAAAG TATTCATAAA AAAGGGCGGC TCGGTAACTG TGAAACTTGG CGACAAAGAT
TTGGATTACT CGCCAAAGTT CCGACTATAT TTCACGACAA AAATGATGAA TCCGCACTAC
ACGCCAGAAG TGAGCACCAA GCTCGCCGTC ACCAACTTCA CCGTGAAGGA ACAAGGATTG
AATGCACAGC TAAGAGATTT AGTCGTTCGC CGCGAGAGGC CGGAGCTCGA CGCGCAAAAG
AATGAACTCG TCGTAAAGGT TGCGCGGGGT AAACGGAAGC TCAGTGAACT CGAAGACCTA
ATTTTGGATT TACTAAGCAG AGCCTCTGGA TCTTTGCTGG ATAATATCGA GCTCATCGAT
TCTTTAACGC GCTCAAAAAA TACAAGTGAG GAAGTCACGG CGAGCCTAAA AGTCGCCGAA
ACCACCGGAG TGGAAATTGA GAACGCAGCT GCGGCTTACG CCCCTGTGGC GAAGCGAGCG
ACGATTCTGT ACTTTACGTT ATACAACCTG GCCGATATAG ATCCAATGTA CCAGTTTAGC
TTGGATGCAT ACACGAGCTT GTTCGACGCG TCGATCACAA AGTCGCGCCG GCAAAACGCG
GACGAAGATA GCGTCGACGT CAACGAAAGA CTGGCCATCT TGAACGATTA TCACACATAC
GCCGTGTACA GGTACACCTC CAGAGGCTTA TTTGAGTCGC ATAAACTTCT TCTCAGCTTA
CAAATGTGCG TTCGCATTCA ATCAGAAGAA GGAGCGATAC CTTCGGCTGA GTGGCAATAT
TTCGTGCGGG GAAGCGCCGG TGGCGTGCGC GAGAAACACG AAGTGGTCAT GCGCGCTGAA
GTGGGCGCGT GGTTGAGTAC AGATCAGTGG TCGAACGTCG TCGCGCTCGC TCGAGCGTTT
GAAGATGCTC TCTCAGACCT TCCAACTTCC ATAAATGAAA TGCACGCGGA CGAGTGGCGA
GTCTGGTATC GACATTCAAA GCCAGAGACG CAAGCGTTGC CTGGCGCGTG GGGTGAGAGT
CTCAGCGCAC TTCAAAAACT TTTGATTTTG CGCGCGCTTC GCTTAGACCG TGTGGAAAGC
GCGATAAGAC AGTTTGTTGC GGATAATCTT GGCGCAAAAT TCGTCGATCC GCCGGTGCTC
AACTTGAATG AAGTTTATTT CGATTCAGCG TGCGAGGTTC CGTGTATCTT TGTCTTGAGT
GCGGGCGTCG ATCCCGTGGC AAACTTGAGA CAATTGGCCA CTAGCAAAGG AATGAGCGAC
AAGCTGTTCA CCGTCGCTCT CGGACAGGGG CAAGCTTCGA TTGCGACCGA ACTCATCGAA
CGCGGAAGAA AGGAAGGCCA CTGGGTGTTC CTCGCTAACT GCCACCTCAT GATTTCATGG
CTGCCAAAGC TTCAAGAAAT CATCGAAGAC TTTGATAGCG AAGCTCCGCA CGAAAACTTT
AGACTTTGGC TATCCTCAAA TCCGACTCCC GCGTTCCCGC TGGCGATTCT CCAACGCGGT
CTGAAAATGA CGACGGAACC GCCAAAAGGT TTGCGCGCGA ATTTAGCGCG ACTGTATAGC
ACTTGCGTTT CGGAAGAATC TTTCGCGCAA TGCGCGAACA AAACCAAATA TGGTCGATTG
TTGTTTTCGT TATCGTTCTT CCATGCACTG TTGCTAGAGC GTAGAAAGTT CGGAACTCTC
GGTTTAAACA TTCCGTACGA TTTCAACGAC ACGGATTTTT CCGTCTCTGA CGATTTACTT
AAATCGTATT TGGATGGCTA CGAAGAAACA CCCTGGGATG CGTTGCGATA TTTAATCGGC
GAAGCAAATT ACGGCGGACG CGTGACCGAC GAAATCGATA GACGCGTGAT TAAAGCGTAT
CTGCTGCAAT TCTTTTGCGA AGACGTATTG CGAGAGGAAG CGGCGTACGA ATTGTGCGAA
GGCCCGAGCG GCAAAGTGTA TTGTATCCCG TCGAATACGA ATGAACTCAA AAATCATCGC
GAGTTTGTGA ACAAACTTCC GCTCAGTGAT CAAGCCGACG CTTTTGGGCA GCATCCAAAC
GCTGACATCT CCTACATGAT TGCGGAAAGC GAAGCCATTC TTAGCGCGAG CGCTAAATTT
CAAGTCTGCG ACGGCGGTTC GACAGCGAGT TCTAGTTCCA GCGCCGCCGC CCAAATGGAG
GCAAACGTGC TCGCAGTTAT CGGCGAAATG CTCGAGACGA TTCCCAAACC GCTGGATTAC
GGAGCGATCG CGCAGAAAAA AAGCACAGAC ATGAGCCCTC TCAACGTCAA TTTGCTCCAA
GAAATCGAGC GCTACAACGC GCTACTGGGT ACGCTTTCCC TCACTTTGCA ACGACTTCGC
AAGGGCATAA ATGGAACGGT TCTGATGTCG AGCGAGCTGG ATGACGTGTA CGATGCGTTG
GCGGCAAACA AAGTCCCAGC TGCGTTTCTC AAGGCGTATC CGTCTCTCAA GCCTTTGACT
TCATGGACGC TCGACTTGAT TTGTCGTGTC GAACAGCTCT CCGCGTGGGG ACACGGAACG
TATCCCAAGA CTTACTGGTT GGGCGGATTT ACGTATCCGA CGTGCTTTCT GACGTCTGTA
CTCCAGATGA CGGCGCGACG AGACGGCATC GCCATCGACG CGCTCTCGTT TTCATTCTCA
ATCGTCGACG AAAGCGACAG CGATCAATCG CCAGAGGACG GCGTCTACGT TCGAGATCTC
TACCTGGAAG GTGCGGGTTG GAATCACGAG AAAAAGTGCT TATGTGAACC GGACACCATG
AAACTCATCG TCAAAATGCC CGTGATGCAC TTCAAACCTA CGGAGCGGAA GAACAAAGCG
TCCGCGTCGA ACGTCACGTA TCAAGCGCCG CTGTATATGT ACGGTCGCAG GACGGGGACG
CGCGAGCGCC CGTCGTTCGT CACGATGGTC GACCTAGATG CGGGCGGCGT CGACCCATCG
CAGTGGATCA AGCGCGGCAC CGCTCTACTG CTGTCGCTGT CGAACTGA
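
The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below is one possible way to do that and to run basic sanity checks. The helper names (clean_sequence, gc_fraction, looks_like_complete_cds) are hypothetical, and the stop codon set assumes the standard genetic code, since the Translation table field of this record is empty.

```python
# Assumption: standard genetic code stop codons (the record leaves "Translation table" blank).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for the 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Check length is a multiple of 3, starts with ATG, and ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First and last display lines of the gene sequence above.
first_line = "ATGCGGCAAA GGGCAATCAA AAGTATAGAA ACGAGTTCAC CGTTGACCGA TCTCGCGGCT"
last_line = "CAGTGGATCA AGCGCGGCAC CGCTCTACTG CTGTCGCTGT CGAACTGA"

fragment = clean_sequence(first_line)
print(len(fragment), fragment[:3])      # 60 ATG
print(clean_sequence(last_line)[-3:])   # TGA
```

Applied to the full joined sequence, gc_fraction can be compared against the GC content field in the record, and looks_like_complete_cds gives a quick check that no block was lost when copying.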
 
Protein sequence
MRQRAIKSIE TSSPLTDLAA HAHTFMSKLL DASNSHRTVL YVPDDQSTGN LESVVANWSR 
QLRQVLGRGG DVGEDDESKR DAQRLDERVQ SDGPLMRIMK FWPERATDLR GVQEQLQSES
VRLIVKKLAS QKNANYLCDD IAAFEQLIVD RVHLAERISD MFSRVKPACE ALSVARLADV
PQILPRLVDA IFLAWLNTDS NIKDAVSDQY VDLLKRIADQ IMIICKGELN LNHIIQNDGS
NEEAYNGAVC TLRECAFVAD VWKNIYFERA AIESSGALLI DATTVFSHLD AFAHRCRDLL
ELCEAQEQFA PPASRSFPSA ANVVRYRLDD RLEVIRRVFL DVAINDLTKI EYDILDVSSM
RWHDDFAAFK SRVKKLEMKM CAIFGEACER AETLQARVSL LRLLFTKIAR RDGVVRAVES
CAARAYSMFA EDLQCAKRQF DANRMKPPIE SSLPRTFGAV HWANTLLHRL ENDFGELRRA
HAEGALPQLP ESVDLLAVYD LTIPQIERFI NEKYDEWVVY VNAMNVEERL DQPLLRADDD
GLLFITYAVD VGVACEDARG FASMSREVPA NIKEVAFGGM RCAHDALCER LSSLVERNNT
LLRQIGRDKH VNRLFRANVD ILKNLFAAAS KELTWRSEIS TIDAFCAESD KLCSFMSESL
EKFRGSAARI TEILEELSAV SLRRREGDHV DPLDEFKNIY EQETVHIRSK FGETSATILH
LTSHQRQSIG DVDTASEAWI AYIKDIEDAV VRALRKVLLR TLSHLHLTMG RVESDCAPLK
PVVLMSMNLS SCDVGTMRVE TSPQTTQLCE TINTVFDAAL DAYGSVPQIS PLNESIQDRL
LRDNRDMISR VRSSVTNSVK DIAEDLMRFA ARWEERYFHL WRENAVDVSD NSAIEARIHR
VQESMEDVRS EEHTHVIGFV RIECGYLKQS LLALAQNNVQ GMISELRAKA ATEIDDVMRY
FEMSVARFEQ PPSDINELVK HVSELKQLQT NVDSVPASFV KPRDSFRILN EFVDVTETES
QRLASIDDKF EEFKFQLEKI ESSLDATKDV FQNELRTSIV QLEKDAREYK VKFDRLAPMQ
THPACAKRAC DKSWEFIAGV DEVIEQFDHR LRVILNGLVV FEDLAPPTLV DMEELKADLE
QLKMVWRVVR EWNDLYEGWK DGKFNDLDVE SMENGATLLY KQITKLGRGK VKEWGTWISL
KDTVDSFKRI MPLIVDMRNP AVRRRHWELV MEACGKQFDP TSEGFTLDKV VELGLDHYAE
AISEISTDAT KELSVENTLR GIADVWTNVV LDTGPFKEGR DDVMKLRSAD DIFTALEDNT
VTLSTLKASK FFSVFERTIT SWEKTLGVVN DVVEMVLKVQ LAWMYLENIF IGSEDIARQL
PSETEMFGTI NTRFIKLMQE MHKTSNVVLA CTAMRAPDIG DTPDVSLLNE LSAMDSNLER
IQKSLDDYLE SKRQMFPRFY FLSNDDLLQI LGQAKEPQNI QPHLKGMFEG IKKLEMYAPD
PLTGRRHCES VAMTSPDGET IPFDNPIRTE GRPEEWLNTV EAAMYSATRT HLASTFEQCR
AKGIKKDKWV KDNPGQMLIT AGCISWTMEC ERALRDPENV KEALKKLRRK WIQYLNKLVE
LTRTPLDKVT RKKVTALITI EVHARDAIEK LIKTGCSSPN DFEWVSQLRF YWDRETRHCT
VKQVLSVFDY GYEYQGNNGR LVVTPLTDRC YMTLGAAMFT RRGGNPLGPA GTGKTETVKD
FGKALARYVI VFNCSDGVDY KMTGKMFSGL AQTGAWACLD EFNRITVEVL SVVATQISVV
MAAVKQNLKM FDFEGQRIRL IPSCGVFVTM NPGYAGRAEL PDNLKAIVRP VSMMVPDFCL
IAEIMMFSEG FTNAKPLAKK MVAIMELSQQ QLSKQDHYDY TLRSFIIPIS RAAGAKKRQA
PQADEQLILF NAMRDLIIPK LVYIDIPLFK ALLNDLFPEV NAPHEDSATL REALVQECRL
NNLQPVDAWI SKIVQIFDCK SARHGNMIVG KTGSGKTRAR EILIKAMSRL KQSGVQGDFQ
NVEVYPINPL ALSNDELYGS FDEATHEWSD GVLAKIMRNV CKDESANQKW ILMDGPVDTL
WIESMNTLLD DNKLLTLLSG ERIMMSPQVS ILFEVEDLSQ ASPATVSRAG MIYFNVEDLG
WQPYVASWRC ERKNRDRSDV DIEDALSNCM DKYMDEVLGF KRSKCREPVA TDELASIHQF
TTLFDAHHIQ SVGDVEPIFV FCIVWSIGGS IDHASRLRFD AMLHRIMPLK LFPHTPASAP
TDTTVFDFYY DAERRAFVPW VEKIPTYHLP HENTPFFKIM IPTVDSVRTK HLATLLLKAG
MNTLIVGNVG AGKSMVVDSC LSELPEGYIG SRITFSAQTS SNSLQETIEG KLEKRSKGSL
APPGGRKLIL AIDDLNMPKK SEFGFIPPLE LLKLWHDNGF WYDRSKQERT HVSDMKLLAA
MAPPGGGRNP FSQRVLSIFA VLNMVDPSDA QLERIYGTIL GETQGGFDQS IASIGTTIAK
ASIAVYNSLA RELLPTPTKS HYLFNTRDLA KVIQGVTRAT KQFYDSKESI LQLWIHENLR
VFGDRLWDVN DSSWLRRQID TNMRLHFGVS WNEVLSTGAT TSVASEVDEK LNECHPFVSF
MRQGLDVPPY EAVVDAPALK EFLTEKLEDY GLEPGNAPMD LVLFNDAIMH VCRIHRVLTQ
PRGHAMLVGV GGSGRKSLAR LAAYVAEMKS FSIEITKNYK QLEFREDLKS LYRQTGVAGK
PTVFVLDDTQ IVKETFLEDV NNALTSGEIP GLFAKDEISA ICEDMRKIAK AQSIRAVTHD
ELFAFFMERV MQNLHIVLCM SPIGDAFRER TRMFPGLVNC CTINWFKDWP VDALEEVAMK
KLRDDDVNAK VKADLCKIFG MIHASTVSTA DEMFNAIKRK MYVTPTNYIE FVNFFRALMV
EKKREFNAKI TKLRGGLTKL AETEVQVREM QSVCKDKAAV VAQAKKDCEE LLKVIVQDKR
AADEQSMRVS AEAERIEVEA KKANAIADEC QLKLDEALPA LQEAEAALNV LTKKDMGELK
SYVKPPALVE LCLKGVLTVL KRPTTWDESK KQLGDSGFLE RLLNFDKDTL VDSLLTKIAK
FVNNPDYQPD VIGKVSNAAK GLCKWVHAMY SYGNVAREIA PKRLMLKQAQ DELKGKQDAL
ALTQANLAEV MAKVAALKEN YEKSASNKAS LENELADLEL KLERAEALVD GLSGEKKRWV
KSIEDFESQI ERLPGDVCIA AAFMSYAGAF PSEYRKALVT DCWMPMLKEM AIPCTLEFDF
ASFLANPSDV RDWNIQGLPA DSFSTENAVI VTRGNRWPLL IDPQGQGNKW IKSMESANGL
IVTTLHAPDM VRQVEHGIQF GVPVLIQDVK ETIDPILENI VAKVFIKKGG SVTVKLGDKD
LDYSPKFRLY FTTKMMNPHY TPEVSTKLAV TNFTVKEQGL NAQLRDLVVR RERPELDAQK
NELVVKVARG KRKLSELEDL ILDLLSRASG SLLDNIELID SLTRSKNTSE EVTASLKVAE
TTGVEIENAA AAYAPVAKRA TILYFTLYNL ADIDPMYQFS LDAYTSLFDA SITKSRRQNA
DEDSVDVNER LAILNDYHTY AVYRYTSRGL FESHKLLLSL QMCVRIQSEE GAIPSAEWQY
FVRGSAGGVR EKHEVVMRAE VGAWLSTDQW SNVVALARAF EDALSDLPTS INEMHADEWR
VWYRHSKPET QALPGAWGES LSALQKLLIL RALRLDRVES AIRQFVADNL GAKFVDPPVL
NLNEVYFDSA CEVPCIFVLS AGVDPVANLR QLATSKGMSD KLFTVALGQG QASIATELIE
RGRKEGHWVF LANCHLMISW LPKLQEIIED FDSEAPHENF RLWLSSNPTP AFPLAILQRG
LKMTTEPPKG LRANLARLYS TCVSEESFAQ CANKTKYGRL LFSLSFFHAL LLERRKFGTL
GLNIPYDFND TDFSVSDDLL KSYLDGYEET PWDALRYLIG EANYGGRVTD EIDRRVIKAY
LLQFFCEDVL REEAAYELCE GPSGKVYCIP SNTNELKNHR EFVNKLPLSD QADAFGQHPN
ADISYMIAES EAILSASAKF QVCDGGSTAS SSSSAAAQME ANVLAVIGEM LETIPKPLDY
GAIAQKKSTD MSPLNVNLLQ EIERYNALLG TLSLTLQRLR KGINGTVLMS SELDDVYDAL
AANKVPAAFL KAYPSLKPLT SWTLDLICRV EQLSAWGHGT YPKTYWLGGF TYPTCFLTSV
LQMTARRDGI AIDALSFSFS IVDESDSDQS PEDGVYVRDL YLEGAGWNHE KKCLCEPDTM
KLIVKMPVMH FKPTERKNKA SASNVTYQAP LYMYGRRTGT RERPSFVTMV DLDAGGVDPS
QWIKRGTALL LSLSN
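
The protein sequence can be checked against the gene sequence by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1), because the record does not state a translation table; the translate function is a hypothetical helper, and only the first three 10-base blocks and the last nine bases of the gene are used here.

```python
# Assumption: standard genetic code (NCBI table 1), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the gene sequence and start of the protein sequence from this record.
gene_start = "ATGCGGCAAA GGGCAATCAA AAGTATAGAA"
protein_start = "MRQRAIKSIE"
print(translate(gene_start) == protein_start)  # True

# Last nine bases of the gene sequence; the protein ends in "SN".
print(translate("TCGAACTGA"))  # SN
```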