Gene OSTLU_25682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25682 
Symbol 
ID5006069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp289149 
End bp301484 
Gene Length12336 bp 
Protein Length4076 aa 
Translation table 
GC content53% 
IMG OID640421490 
Productpredicted protein 
Protein accessionXP_001422029 
Protein GI145355563 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.1711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGCG GGTTGGTCGT CGCGAGCGCG CTGTCGTCGC TGCGAGGGGT CCACGGGGGG 
TCGTCGCCGT CGCCGCCGAA AACGGATTGT GTAGGCTTTT GGCAAGGGAG TTGCACGGCT
TTTTGCGGAC CGGCGACGTA CTCGCAGACG TTCACGATCA CCGTCAAGCC GCGAAACGGT
GGACGAAGTT GTGGAGCGTA TGAGGGACAA ACACGAACGG AGAATTGCGA GGGTATAGCG
AATTGTTCGC CGCCGCCGAG CCCGCCGCCG CCGCCGAGTC CGCCGCCGCC GCCGAGCCCG
CCGCCGCCGC CGAGCCCGCC GCCGCCGAGC CCGCCGCCGA GCCCGCCGCC GCCGAGTCCA
CCGCCGCCGC CTGGTGCCGG TGCTGCGATT GAAGTGAGAT TTGGTTCCAC CACGGAGAGG
GTTGCGGCGT ACTCGTACGA TTTCCAAGTC GCGGTGCATC ATCCCGCGTG CGCTGAAGCG
AGCGATTGTT TACTGAATGT GAATGAAGAG AGCGTGAGCA ATCACGCGCT TGGTCTCCGA
TGCGATGTTC GCGTGAAACG AGACGATCAG TCGTGTTCCG AGTCGGATTA CGCGGATAAC
TTACCGTGTC GCAAATGGAA TGATTTTTTC CCGAAGGCGC TCGCGCTCAC AGAGTTGGAT
GGGACGCACT CGGTTGGATC TCACACCATA ACTTATAACT GTCACTTTGT TACGGTCGAT
GGCATCGTTC CAGGTTCGGA AACCGTTGGT CAACACTCTT TCGAAGTTAT CAAGGGTTGC
GATTTGCACC TGCCTCTCGG ATTCAACGCC GCCGGCCAAG CGAGCGCGAA TCAGAATTCG
GGGCTTGAAG AACTGGTCCG GTACATATTG CTGAGTAACG ATCAATTCAA CAACGTGCTG
TGCGATGATG GCGAAATTTA CAAGACAAAG GAAAGCATCT TTCGTAGGCA CGACGTCGAT
CCCAGCGACG GACTGCTGCG ACTGAATGAA CTTCAGGCCA TGTTGGTGGA ACAAAGCGTT
TCGAGGTACA TTCTCAACGT TTGGAATGAC CAATTTCGCG CCGCGCACGA CACAGATCTG
GCAGTGTCTA TTGTCGACGT GATGGAAATC CCCGTTCGAC CCGTCAAGTG CAGCGGAGCT
CCGTCGAACA GTTTCAGTAT CGATAGCATC ACCTATCCGA CTCTCGACTG GCAAACGGAG
AATCGCTGTG TGAGTGGCCT CAACGGGCTC ACCACTGCAT GGTCGTACAG TCCTAAGCCA
ACGAACGGCG ATTACGTCTG CGCATACGTG GATGGTATAC TCTTCAATCA GATCAACGTT
GCCGACGGTA CAGTCGACGA CGTCGTCGAT GTGCGCTTTG AAGAGTATGA AGTCGGCGGC
ACGACGCTTC AACTTCCCAT CTCCTTCACG GACATTCGCC CAGCAACGAG TGACTTTCTC
GACGAACACG CTTCTCTCGT GGCAGCATTT ACGTTTGATG AAGACCCTTT ACACGACGGA
TCGCTTGATT CGGTCGCGCC AATCGTCGAC GGACAGTCTT CCGTCCCGAT GAGATTGCAG
CCGAAATCCG CTCGCACAAT ATCGGGATGT TCGGGGGGCA ACGGATTTCG CTGCCTTCAA
GCAGTGGATG ATCCTGGGAC GAACGATTAT TCATACTCTC TTGATAAAGA AGATACGCTT
GGGCACGACA TCGCGATGAG CACATGGGTG CGCATCGATC CAAGTCGTTG CTCGCAACCC
AAATCCTCCA ACGACGAATC ACTCATGACG GTCGCTCAGT TTCAAGGACA GTTGGGACTT
CACGATCACA GACTCCACGT ATACCTCAAG ATACGTACGA GCGACGGCAT GGTCGAACTG
CGGGCATCTC ATCAGGTCAA GGAGTCTGTG GATCCTCCGA GCGCTTTCAC ATCCTCAAAG
GATGTGAAAC TTGATTTTGC GTGCGACGGG GAGTGGCACC TCATCGCGTT CGCCGTAAAC
TCGGTGAATT TCATGACATT GTACGTAGAT CCAACGAGTG ACGCAAAGGC CACAGTAATC
GATGATACGA CTTACACGCG CGATGAAACT TGGCCGCTGA GCGCGCTCGA GTCAATGACG
GACGTTACAC TGCTTGGCGC GGTGGATTTT GAATTCGACG ACGTTCGAGT GTATACCGGA
GTGATACGCG AAGTGATGTT CATTGATACC GTAAGATGTG GACATTACAA GCGCTGCGTG
CTTCGTCAAT CATCCGCACC AAAAGCTCGA AGAATTGTGT GTTTAAGTGG CGTCATTTCG
GACGCTCGCG AGGAAACTTA CTCAAAGTTT GAATGTGGAG GTGGATTGTA CTACGATGGT
TCTACTATTG ACGTCAACGG AAAACTGGAT CCAGTAGGAG TCACTTTCAG CTTCCGTGAT
ACCGCCTGGA ATGACGTTGG GTTCGAGATT CTTCGTCAAA GCATGCAGGA TTCTCTGACG
GATGCACCTT ATGAGTCAGT GGTGATGATT GACAGTGGCC TCACGTACTG TCATCGATCG
TACGCGCCGT TGCTTTTCGC CGACAGAGAG GCGGCGGCAA TCCCAGGACA GATTTGGAAG
TATAAGGTGG TGACGAAAAC ATCATCCGGT GGAAGTATAA ACTCTGTCCC TTTCAAATTC
ATCACGCCCT GGCACGGAGA GGTAGGTGGT TACGTGTACG CCGGTGATTC CGCTGTTCCA
GTACCCGACG TCCGAATATG CGCTAATTTT CACGACCCGA TCGTAATAGA TGGGGAGGTC
GCAAATTACC GAGGGTCCAG GCCTCCAGAC GCGGATGAAA ATGAGAATTC GACGATAACT
TTACGGCCGA GCGACGACGT CGAATCATCT TCCAGCAATG TCGCACTTTT CAAGCACGCA
TACTCAAACC CTTTGAATGA ACAAGCGTAC ACGCTCACCG ATGGGAATCA CATACCTACC
GGTGAATCTG TCGTCGTATC GACGGGTGGC TACGCGCGCG TCGATCTTGG TCTTTGGATG
TCAATAGGCA GTGTTCAAGT ATGTCTACAG CCGCAATCGA TCAAGGCACG CCAACTTGCC
GAGGGTTCTC TCACCGACCG ACTGGGGTAT TCTGGCGATA CAGGCGATGA TCCAGACGTG
ATCGTTGATC CACCTCGTTA CGAGAATGAT GTCATCCGAC TCGTGCGAGC AGACGAGAAA
CTCTGGGCGA AAAATACTGA TCGACTAGAC GCCGAAGCTG GTGATACAGT GTTGTTGGAA
TGCTGGGTAC GCGCCGTCGG GAAGGAGGCG ATTATGACTT TCCAGATTTT AGGGTATGAA
CAACAAACTA CAGCAGCAGG CCGTCAGACC GTTGGTGACG CGCTCGTCAT TACAGCTGAA
CCGAATAAAT GGAGGAAAAT AAGGCTTAAG CACACGTTAG AGGACGCCGC TTTCGTCGGA
TTTACTGTAT CAGCTTGGTC GGAAAGTACG TTCTACTTCA CCGGAGCTCA TTTCACTCTT
AATACTCTCA CCGAACGCCT CGAGCTTGTT CAGACAAGCG CCATCAGTCC TTTAACCGCG
AGCCCAACTG CGCCTCCACC TGGTTTCGAG GACGTTGTAA AATCGGTGGA CGCAGGCGTT
TCAGTCACGG TGCACGTTCC TGGCGAGTCT TTGCGCGCCC TGACTGGAGA TTCTGTCGTG
GTGTCGTTTT GGGCGAAGTC TTTGGGTGGT GAACGCGCGG TGTCAATCGA CTTGCGCGGA
ACATCCGCCA CTGACGCCAG TACGGAATCA TTCACCATAC ATGAGGAATC GTGGAGACGT
TTCAACGCCA CCCTCATATC ATCTCAAGAT CAATACATAA CTTTCCCCGA GATTTCGTTT
GGTGCGGAAT CGAGCAAAGT TGAAGTAACG GGATTTGAAT TTCGGCAACA GGCGAGTCCG
ACGCTTAGTT TACCGATGAA AGTGCACGTG CATGAAATCG ATCCCGAAGA CACGACTAAT
TATGGATATG CATGTAACCA CGATCCGTCT GATGTCAGTG ATGAATTCGC AGACTGTTTG
ACGTTTACAT GCGGGGGGGG CGGGGGCGAC ACTGTGCAGG CGATGCACGG GCAGTTTGTA
TCAGTCGTGG CGGTTCGAGA CGTCAATTTG ACTGAAATAA GAGTATTGGG TGGTGAGACG
CGATGCCCTT TCAGCGCGGT CACTGATTCG GAAGGATCGT TTGTTGTTCA TATCACAGAC
TCCACGGGTA TGACGCCAAT CAAAACTCAC GTTGACATCG GGGCGTACAA AGTGGAAGTT
TTTGAAAAGA CGACGCAGCC CATCTTGCAA ACGTCGGACG TGACAGACCC GCTGAACGTA
CCCAGTAAAG TCTTATTGGT CTTAAAGGAA GATGATTCGT CATCTCCTGA AACCACGACG
TCACTCTTGG GTAGGCAGCG CTGGGGGCGC CGCCTCCTGG GGAACATTGC GCAGTACATA
CCACCGCCGC CGTCGCCACC GCCGCCGTCG CCACCGCCGC CGCCGCCGCC TGAAAAAGCG
GCGGTTGGCA ACGTGGTCTG GGTGACGCTT CGCAGAGCCA AAGCGTCGAA TCGAAAAGTT
TGGCGGCGAT CTCCAAATTG GTGGAATTAT TTTTCCGCAG CCGGTGCGGT GAGTCGTCGG
TCGATCCACG GCCTGACGGA CTCAGTGCGC GGAATCTCGG CTCGTTGTGG GTCTGAGTCC
GGCCTCAAGT TCATCGGCTT GAATCGGCAG TCCGCGGCAG AGACCGTGTC CAGTCGAATG
GACTTCAGCA TCAGATGCAA TTTTCACTCC GATGCAGTTG AAGTGTACGA AGGTGGTAAA
AAAATTAAGG GTGCTGCAGT GGCGTATACG GACAGCGATA CACTGCAAGT CGTCATCAAC
GACGCCGGAG CCGTGGAGTA CTTCAAGAAT GAAATTCGCT TTTACACCAG TCAACAGACT
CCCGAGTACC CGCTGCACGC GGATGTTGCT TTTATTATGC GTGGTTCCTT CAGCGATGTT
AAATGGGTCG AGCGCATGGC GTCACCACCA CCGCCATCGC CGCCGCCGCC GTCGCCGCCA
CCGTCACCGC CGCCGCCGTC GCCGTCGCCG CCGCCGCCGC CGCCAGGAAA GGCGGTGGCG
GGCGAGATAC TTGGTAGCGT CGCGTGGGTA TCGCAAGTCG GCGTCGAATC CACAGAAAGT
GGCCAAGTCA CGCGCACGGC TGCAGTGACT GGCGCTTGGG ACGCCGGTGC GGTGAGTCAG
CGCGCTATTC GAAATGGTTC TGACTCTGTG CGTGGCATCT CCGCAGTCTG TGCCGGAGGA
TACAACAACA GGATGATCGG TTTGAACTCT CAGTCGACGA CGGATTCGAC GAATTATCAA
AGTCTTGATT TTGCGATTTT TTGTACGTAT ACCAAACGGA TCAAGGTGTA CGAGAAAGGC
AAGCGTAAGT ATACGATGGC TGGGACTTAC TCTTCGTCCG ACTCGCTGCA AATCGTCATC
AACGACGCCG GAGCCGTGGA GTACTTCAAG AATAAAATTC GCTTTTACAC CAGTCAACAG
ACTCCCGAGT ACCCGCTGCA TGCGGATATC TCTGTCAAGG CCGGTTCTCT CACAAACGTT
CACTGGGTCG AGCGCATCGC GTCACCACCG CCGTCGCCGC CGCCGCCGCC GTCGCCGCCG
CCACCGTCAC CGCCGCCGCC GTCGCCGCCG CCACCGTCAC CGCCGCCGCC GCCGCCAGGA
AAGGCGGTGG CGGACGATCA TGTCGCGTGG GTATCGCAAG TCGGCGTCGA ATCCACAGAA
AGCGGCCAAG TCACGCGCAC GGCTGCAGTG ACTGGCGCTT GGGACGCCGG TGCGGTGAGT
CAGCGCGCTA TTCGAAATGG TTCTGACTCT GTGCGTGGCA TCTCCGCAGT CTGTGCCGGA
GGATACAACA ACAGGATGAT CGGTTTGAAC TCTCAGTCGA CGACGGATTC GACGAATTAT
CAAAGTCTTG ATTTTGCGAT TTTTTGTACG CATACCAAAC AGATCAAGGT GTACGAGAAA
GGCAAGCGTG AGTATACGAT GGCTGGGACT TACTCTTCGT CCGACTCGCT GCAAATCGTC
ATCAACGACG CCGGAGCCGT GGAGTACTTC AAGAATGAAA TTCGCTTTTA CACCAGTCAA
CAGACTCCCG AGTACCCGCT GCATGCGGAT ATCTCTGTCA AGGCCGGTTC TCTCACAAAC
GTTCACTGGG TCGAGCGCAT CGCATCACCA CCGCCGCCAT CGCCGCCGCC GCCGTCGCCG
CCACCGTCAC CGCCGCCGCC GTCGCCGCCG CCGCCATCGC CGCCGCCGCC ACCGTCGCCA
CCGCCGCCGT CGCCGCCGCC GCCATCGCCG CCGCCGCCAT CGCCGCCGCC GCCATCGCCA
CCACCGCCAT CGCCTCCACC ACCGATGCAC GATTTTGACG AGACAGACAC AGATGGAAGC
GGGGGCATCG ATAAGTCAGA GTTGACCGCC ATCATCGAGA AAGGTGCGAG TTTTCCAAGC
AATGGGTACG CGATAATTTC AGAAGAACTT TGGAATTCGT CGTTGGATAT GGACGGTGAT
GGACTTCTGA ACGCACAAGA ATTCCTCAGT GTGGCGCATC GTCTGGCTCA ACGGACATTG
TTTGTCGAGC CAGTGCTCGT ATTCCCTCCG CGGAAAATTA CATACGACTT CGATTTGTTC
ACCGTGCAGA GTGTTAGCGC TGCGATCGCC GACGAGTCTC ACCAGGATGC GCGTAGCCAA
GTGTTTGGCA AAGCGTGCGA AAATTTCATC GTTCGAAGAT CGACATCAGC CCACGTTCCG
TACTCTTCCG AGGGATGGAA CGAGTGGTAT GATGAGCTCA GCGCAGAGAG TGGGGGTGAT
GTTGTGTCAA AAATCGATCA CTGTCGCCCT CTCGAAGACG CCGAGTTCGA TGCCATTGTC
GTCGGCGATG AGACCTCCGT GTATGCAACA CCGTTACTTG CTACATCTCC CGGCTCGAAT
AAGATTTTCG TTGACTTAGA TGCCGCACTA CTGCACGACG TGGTTCAAGA TGATCCGCAG
ACGCTGTTCG CGCGCGTCAC TCGTCCTGGA TTGGATGTCG CGCTGTGGAG CACGCAGCAG
CTGCCAGAAA TCGTGCATGT ATTCAACAAC AATGACGTCG TCAAGCAAGC TGGAACTTCG
GATGACGATG GTGACCTATA TGAAAGTGAT ACCGTATTTT CGGATACGGC TGATATTGAT
GTCATCAACC TGGAGCTGAC CCATAATCAT CAGGTAGAGC AGAAAATGCA GGACGATACG
ACAGTGGTCA TTTTGGGTGC CGTCATGTTC CCAATAGAAT GGACCGAGTT GTCCGAGTGT
GGACTTTTTG AGGCAAGCAT TTATGTCAAA GATGAAAGCG AGGCTGGTGC GCCGCGACAA
TTCACTACAG ACGAGACTGG CTGGTTCGAA TTCGCAGTCA CAAACGGGAA AACCTACACG
ATAACCGCGG AATATCCCGG TCACGAAATC TGCTATTCGG GAACCACCAT CGTGGAAGCT
ACGAGAATTT GGAGCTGCGA CGGTAAACCG ACCACAGTTA CTCTCCACGC CGTATCATCG
AAAAAGTACA TTTTCTTCTC AGACACGACG ACGGCAAACG TCGATTTGGG TGTCTATCAA
GGTGAGTGTG AAAAGCTTTA CACAGGGGCG ACTTTCAAAA TTACCCCCAT CAATGGCTGT
CACCCACCAG CCATCTTCAC TTCGGCTCAA GTATCCGGCT GGACTCTACC TGACCGCGAC
AAGGAAAACT CAGACGTTAT ACCAACATAT CAAGAAGTTC CGAATGGAAG ACGGTGGCCA
CTCGCCGCGA TGGATTACTC GGTAGTGATC GAAGAGGCCC CATCCACAGC GAATTTCACG
GCGATGCGCG AGGAGAAATA TCCGAACGCG TTGTGTCATG TCGGAGTAGA CGGCATCATT
CCATACTTCC GCCAGCATCC AACAACGATC GAAAGGCTTG TACCATTACG CACTGAGCAC
ACTTGGTATT CGGCGAGATA CAAGTACCAC GGTTATTTGT GCGCAGAGTT CGTAGATCTT
TCTGTAATCA CCGACGATGG CGATTACTGC TGGGACGTTG ATGGCGTGAA AGCTGGTGGC
ATTAGACACA ACCACTTTGT TGGGATATCG CCATTTTTTG ATGATTTCGG TGGAGAAGGG
CTCTACGCTT TGGGGCCGAA ACTTGTGAGC GCCAAAGTAC TTGAACTGCA TGTGGAGAAT
GGTTCACTCG ATGAATGCTT GACACTGCCT AGCGCCGAGA GCGGCGGGAT GACGGTGGCC
ATGTTCAGAC AAAGCGTCAC CGATATAGCC GAAAATCCAT GTCATACCGA TCGCAATGGA
GGACCATTGT GCGACTTTGA CGTTGAGATC GATCCGGAGA GCGACAAACT TTTATTCCCC
ACGGATGAGG GCGAGAAGAC GACGGATCGA CTCATTGTCG CGGGAGATCC TAAGCTCTCA
GGAAATTACC GCAGATCTGT CGAACTGACT GTGGGTAGAT TTGATGGTTC GATGACAGTG
ACAGTTCCGT TGAAACGAGA GCTCATCTCT CTTGGCTCGA AACCGCGAGG CGAAGGGATC
TACGGCGAAG AAAGCGATGA CGTGTATTGG GCAACTGTTC CACTCGAAGG TTTGGTGTAC
ATGACGGTAC ATGATCCGCC CGGTGGCAAC TCTTACGCCG AACTTCTTAT GGGCACGGAA
GTGACCATGT CCGTGGAGCT TTCAGACGAA CAAGCGGCGT CTGTGTCGAG TTCACACGAA
GGCGGTGGCG GCGTAGAACT CGAAGTGGAG GCGAAACCTG GGTTCGGCGC TGGGTATGGC
ATGGAGGTTT TATTCCAGCT CGGAATGCCT CTCTTTAAGA TAGATTTTGA CGCATACCAT
GAGGAGTCTG GACCAGAATT TTCGGTCGCT CAGAGTTCCG CCGTCGGTTG GGATCTCACG
GCTACGATAA ATCGTGTCAT ACGGACAAGT ACCGACGTCG CCATCCCGGG ACGACAGGGC
GATGCGATTC TAGGTGGCGG CGTTGAACTT GTCTACAGGC TAGCCGATAC GCTCGATCTT
GCGCTAAGGG AAGACGATAA ACCGTGCTTG CGGATATCGA CGGCGATCAC ATGGATGCCG
AGGAAACCCA CGTCCTACAT ATTCACGGTA CACTCGATTG AAGCGAAAGT GATTCCAAAC
TTGCGGTATT TGCTTTCCGT CGTGGAAAGC GGCCTCATCG TCGGGGATGA TTCTAAAATG
GAAGCGCCTA ATTGGCCACA GTACATCGCT GACAAAATTA ACGTGTGGCA ACGCACACTT
TTGTGGGCTT CGCCAACAGT GTACAAGGTT CCTATGGAGC CGACTGCGGA AGGCGCTCCC
GTGTTGTACG GTAAAAACTA CGGCGCAATG GAGCGCATCA TGGTTCCTTT TATGGATGAA
CAATCCGCGT TCGGTATGGA AGCCGAGAAA ATGGAAACAT CGTTCATAAG TGACGCGAGC
GGTCCGATTT CTACCCCAAT AGACGAGTTG AAGCTTGCTT GGAAGAGAAT CGACAGAGTT
AGTCCATACA TACATGGCGT TCCAGGGGAC TTGGGCGACG CGGATGGTAC TATAGACACA
CTTAACGACT TCATTCTCTC GGATGAGACG GATGAATCTC GATTGTCCCA GCTCATCAGT
GGCTACGTTA GCAATCCCGG GACTTTATTT TCAGACTTGT TGAGTGGAAA CCTTGGGTTC
CGCGGTGACG ACGACGATAA CGACGCCACC CCAGACGTCG GCAAGGTTTT CTGGTCACGA
AAGAGTGAAA GGATGAAGCC GAAGTTCCTC GAGCAATTTC AAGAAGCGGA TGAGCTCGTC
ACGACAATGG GGATGAATAG CCCCACTCGT GAAGATCTGA ACAACACGGT TTCATCCTGC
ATCACAGAAA AATGCAAAAC GCAGGCTCTA GACGTGTCGT CACAACTACT TCAATCGAGC
GATCTTAGCG ATCTTACGCT CGGAGAGGAT GCATCCATTT ACATGAACGA TACGAAACGA
GTCGATGCGT CGTTTACAGG GCGTATGGGT CCCCGCGGCT ACTCCACCAC AGATTCGAAT
GGACCCGCTG ATGAAGCAAT TTTACTATCG TTTAGCGGCG GTGGTGCTTC GACAGAGTAC
ATATTTTCTT CCAACGAAAA CGTCGACGGT CAAGATTACG CATGGACGCT CAGCCTGGAC
GGCTCAGCTG AAAACGGATA CTCCTTCGCG GGTTCAATCG CCATCGTCAA GGGGCATGTA
GGTGCTACCA CGGATATGAG CAAGTCAGTT TCAAAAAGTC GTGCCTTTGC CTGGGCAAAG
TATGAACACA TGAGCACGAT GTACTCATTA GGTGACCCAG ATTTTGGGGA TAAGTTTGTG
CTCCAGGTCA GTTCCGACAA ACGTTTCGGA TCTCCAGTGT TCATCACGAT GGGCGGCAGA
AGTCAATGTC CGGGAGAAAA ATGGACGATG TTCCGAGAAG CTGGCGTTAC CATAGGTAAG
GAGTCGACGC ATAACACGAA TCTGAACCCA GGAGAGCACG CTTTGATCCA ACTTCTCATC
AGCAATGAGT CGCCCTACAA GGAAGTGGCC AACATGGGCC TTCGTTTGGT AGATGGAGTC
GCCGACTGCG TTGGAAAAAT CATCGCGGCT GCTCATAACG CGGCACAGTT GAACGAGGAC
AACGCAACGT TCGTAAAGAA TGCAGCATAC GCCATGGCGG ATAGCCACGA ATGCTTTGCG
TCAGAGTCTG ATGACATTCA AGACCTGAAG GCGCGGATTG AAGACATCGT AGCGTCATAC
GAACAGTCAA TGAGCGTCAG AGCAGGGAGA TTGTTGGCAA ACGCAATCGC TCGTCTCACG
CAAACCACTA CAGCACAAGG AACACTCCTA CAGGGAATGA AGTTCTCGAT CAATGGTATT
CAAATGTGGT CATTCGGTGA AGTCCTACCT CTACGAAGAT TGGCTGGAGA GCGTATGGAT
TCGCAAAGCG TTGTGCGCGA GAGTCGCGTG CTTCTGAGCG TGGAGCCAAG CGACGGAGTG
TATGAAAGCT CGTACCTGGG GATTTCCTTG GTGAGCTTGT GTGAGTCGAT GATTGAAGAA
TACATGTATC GCCCGATCAT CTCGAGCAGT GTTACACTCG GTGCCATGTC GTGGAGTAAA
GATTGCCCTC AAGTTGCTTT TCACTCGTCT ACGCTCAGCA AGGAAGCAAG TTACTTTGAG
AAGAGCGCAG CGTCTGATTC GAGCGTTTTG AGCATCTCGG TCGTAAATCC GAATAGATAT
TCTCTCTGGC CCAAGGAAAC TGCATCCACG TCCGAAGCGT TGGTGACGAA TGGGAATCTC
GCCTACGTCT ACGTGCAATA TCGTTCGGTA TCAGGAGGCG AATGGATCAC AGCGAAAGAC
GCCGAGCGCG GTGCCGGCAC GAATAAGAAT TTCAATTTAC TTTGCCCGGA GTCGCGGGGT
GGAGACGGTT GCATCTTTGA CTGGGATCTA AACGACCCTT ACAACAAACT TCTGAGTGGA
TATAAAGATG GGAAATATGA TATCCGATTG AAAACAATGT GCGTGGGCGG ATCTCACCTC
GCAAAGCCGT CAGTGCACGA ATTTGTGAGC GATCAAAATC TCATGGTGAA GATAGATACG
AAGGACCCGC TCGTCGGTGA TTTCAAATAT GTGAGCTCAC AGGTAACGCA ACGCGTCGAT
TTCATGGAAG ACATTGATTG CACGAAGCAA GTGATCACGG CCAAGCGAGG CAACTCGTCG
ACGGGTCCCT TCGAAGCCGT TTCCAACGAA GACTTGAGAC AGTACGTCCT TCAATGCGTT
AACGATGGCG CTGGTGGTCA CTGGTTGATG AAGTTTCCAT ACTTTTCACA AGGCTATCAC
AAAGTGACTG TTACGGAAGT GACCGACGTC GCAGGCAACC CAGCGGCTGA GTTTGAATTC
GTCGCACCTG TGCGCGTCGG CGCGAGCACC GCCGAAACGC CACAACTAGG GTCGTCGGGA
TCTTCTCCAC GACAAAAATC ATCTCTCGCA TCATCGAGCC CTCAAGATGA CATCGACGGC
AGTCAAAAGG TCGCAATTTT TTACACTCCC TTCGCGTTCA CGCTGTGCGT CGCGCTGTTC
GCGTTCGCGT TCCGCGCGCG CGCGCGTCCA GCCGACGCCG AAATGTCCGA ACGTACGTCG
CTCGTCGCGA CGACGTCACG CCAAGTCGAC GGCTACAGCT CCGTCATCTG ATTGACTTGG
ACGTAAAATC CAAAACACCG TCATTTTAAC TATTTCAACT CTACAGTACA ACTTACAGTG
CGACATCTTT AATGCAACAT CTTTATTGAA CATTAA
 
Protein sequence
MMRGLVVASA LSSLRGVHGG SSPSPPKTDC VGFWQGSCTA FCGPATYSQT FTITVKPRNG 
GRSCGAYEGQ TRTENCEGIA NCSPPPSPPP PPSPPPPPSP PPPPSPPPPS PPPSPPPPSP
PPPPGAGAAI EVRFGSTTER VAAYSYDFQV AVHHPACAEA SDCLLNVNEE SVSNHALGLR
CDVRVKRDDQ SCSESDYADN LPCRKWNDFF PKALALTELD GTHSVGSHTI TYNCHFVTVD
GIVPGSETVG QHSFEVIKGC DLHLPLGFNA AGQASANQNS GLEELVRYIL LSNDQFNNVL
CDDGEIYKTK ESIFRRHDVD PSDGLLRLNE LQAMLVEQSV SRYILNVWND QFRAAHDTDL
AVSIVDVMEI PVRPVKCSGA PSNSFSIDSI TYPTLDWQTE NRCVSGLNGL TTAWSYSPKP
TNGDYVCAYV DGILFNQINV ADGTVDDVVD VRFEEYEVGG TTLQLPISFT DIRPATSDFL
DEHASLVAAF TFDEDPLHDG SLDSVAPIVD GQSSVPMRLQ PKSARTISGC SGGNGFRCLQ
AVDDPGTNDY SYSLDKEDTL GHDIAMSTWV RIDPSRCSQP KSSNDESLMT VAQFQGQLGL
HDHRLHVYLK IRTSDGMVEL RASHQVKESV DPPSAFTSSK DVKLDFACDG EWHLIAFAVN
SVNFMTLYVD PTSDAKATVI DDTTYTRDET WPLSALESMT DVTLLGAVDF EFDDVRVYTG
VIREVMFIDT VRCGHYKRCV LRQSSAPKAR RIVCLSGVIS DAREETYSKF ECGGGLYYDG
STIDVNGKLD PVGVTFSFRD TAWNDVGFEI LRQSMQDSLT DAPYESVVMI DSGLTYCHRS
YAPLLFADRE AAAIPGQIWK YKVVTKTSSG GSINSVPFKF ITPWHGEVGG YVYAGDSAVP
VPDVRICANF HDPIVIDGEV ANYRGSRPPD ADENENSTIT LRPSDDVESS SSNVALFKHA
YSNPLNEQAY TLTDGNHIPT GESVVVSTGG YARVDLGLWM SIGSVQVCLQ PQSIKARQLA
EGSLTDRLGY SGDTGDDPDV IVDPPRYEND VIRLVRADEK LWAKNTDRLD AEAGDTVLLE
CWVRAVGKEA IMTFQILGYE QQTTAAGRQT VGDALVITAE PNKWRKIRLK HTLEDAAFVG
FTVSAWSEST FYFTGAHFTL NTLTERLELV QTSAISPLTA SPTAPPPGFE DVVKSVDAGV
SVTVHVPGES LRALTGDSVV VSFWAKSLGG ERAVSIDLRG TSATDASTES FTIHEESWRR
FNATLISSQD QYITFPEISF GAESSKVEVT GFEFRQQASP TLSLPMKVHV HEIDPEDTTN
YGYACNHDPS DVSDEFADCL TFTCGGGGGD TVQAMHGQFV SVVAVRDVNL TEIRVLGGET
RCPFSAVTDS EGSFVVHITD STGMTPIKTH VDIGAYKVEV FEKTTQPILQ TSDVTDPLNV
PSKVLLVLKE DDSSSPETTT SLLGRQRWGR RLLGNIAQYI PPPPSPPPPS PPPPPPPEKA
AVGNVVWVTL RRAKASNRKV WRRSPNWWNY FSAAGAVSRR SIHGLTDSVR GISARCGSES
GLKFIGLNRQ SAAETVSSRM DFSIRCNFHS DAVEVYEGGK KIKGAAVAYT DSDTLQVVIN
DAGAVEYFKN EIRFYTSQQT PEYPLHADVA FIMRGSFSDV KWVERMASPP PPSPPPPSPP
PSPPPPSPSP PPPPPGKAVA GEILGSVAWV SQVGVESTES GQVTRTAAVT GAWDAGAVSQ
RAIRNGSDSV RGISAVCAGG YNNRMIGLNS QSTTDSTNYQ SLDFAIFCTY TKRIKVYEKG
KRKYTMAGTY SSSDSLQIVI NDAGAVEYFK NKIRFYTSQQ TPEYPLHADI SVKAGSLTNV
HWVERIASPP PSPPPPPSPP PPSPPPPSPP PPSPPPPPPG KAVADDHVAW VSQVGVESTE
SGQVTRTAAV TGAWDAGAVS QRAIRNGSDS VRGISAVCAG GYNNRMIGLN SQSTTDSTNY
QSLDFAIFCT HTKQIKVYEK GKREYTMAGT YSSSDSLQIV INDAGAVEYF KNEIRFYTSQ
QTPEYPLHAD ISVKAGSLTN VHWVERIASP PPPSPPPPSP PPSPPPPSPP PPSPPPPPSP
PPPSPPPPSP PPPSPPPPSP PPPSPPPPMH DFDETDTDGS GGIDKSELTA IIEKGASFPS
NGYAIISEEL WNSSLDMDGD GLLNAQEFLS VAHRLAQRTL FVEPVLVFPP RKITYDFDLF
TVQSVSAAIA DESHQDARSQ VFGKACENFI VRRSTSAHVP YSSEGWNEWY DELSAESGGD
VVSKIDHCRP LEDAEFDAIV VGDETSVYAT PLLATSPGSN KIFVDLDAAL LHDVVQDDPQ
TLFARVTRPG LDVALWSTQQ LPEIVHVFNN NDVVKQAGTS DDDGDLYESD TVFSDTADID
VINLELTHNH QVEQKMQDDT TVVILGAVMF PIEWTELSEC GLFEASIYVK DESEAGAPRQ
FTTDETGWFE FAVTNGKTYT ITAEYPGHEI CYSGTTIVEA TRIWSCDGKP TTVTLHAVSS
KKYIFFSDTT TANVDLGVYQ GECEKLYTGA TFKITPINGC HPPAIFTSAQ VSGWTLPDRD
KENSDVIPTY QEVPNGRRWP LAAMDYSVVI EEAPSTANFT AMREEKYPNA LCHVGVDGII
PYFRQHPTTI ERLVPLRTEH TWYSARYKYH GYLCAEFVDL SVITDDGDYC WDVDGVKAGG
IRHNHFVGIS PFFDDFGGEG LYALGPKLVS AKVLELHVEN GSLDECLTLP SAESGGMTVA
MFRQSVTDIA ENPCHTDRNG GPLCDFDVEI DPESDKLLFP TDEGEKTTDR LIVAGDPKLS
GNYRRSVELT VGRFDGSMTV TVPLKRELIS LGSKPRGEGI YGEESDDVYW ATVPLEGLVY
MTVHDPPGGN SYAELLMGTE VTMSVELSDE QAASVSSSHE GGGGVELEVE AKPGFGAGYG
MEVLFQLGMP LFKIDFDAYH EESGPEFSVA QSSAVGWDLT ATINRVIRTS TDVAIPGRQG
DAILGGGVEL VYRLADTLDL ALREDDKPCL RISTAITWMP RKPTSYIFTV HSIEAKVIPN
LRYLLSVVES GLIVGDDSKM EAPNWPQYIA DKINVWQRTL LWASPTVYKV PMEPTAEGAP
VLYGKNYGAM ERIMVPFMDE QSAFGMEAEK METSFISDAS GPISTPIDEL KLAWKRIDRV
SPYIHGVPGD LGDADGTIDT LNDFILSDET DESRLSQLIS GYVSNPGTLF SDLLSGNLGF
RGDDDDNDAT PDVGKVFWSR KSERMKPKFL EQFQEADELV TTMGMNSPTR EDLNNTVSSC
ITEKCKTQAL DVSSQLLQSS DLSDLTLGED ASIYMNDTKR VDASFTGRMG PRGYSTTDSN
GPADEAILLS FSGGGASTEY IFSSNENVDG QDYAWTLSLD GSAENGYSFA GSIAIVKGHV
GATTDMSKSV SKSRAFAWAK YEHMSTMYSL GDPDFGDKFV LQVSSDKRFG SPVFITMGGR
SQCPGEKWTM FREAGVTIGK ESTHNTNLNP GEHALIQLLI SNESPYKEVA NMGLRLVDGV
ADCVGKIIAA AHNAAQLNED NATFVKNAAY AMADSHECFA SESDDIQDLK ARIEDIVASY
EQSMSVRAGR LLANAIARLT QTTTAQGTLL QGMKFSINGI QMWSFGEVLP LRRLAGERMD
SQSVVRESRV LLSVEPSDGV YESSYLGISL VSLCESMIEE YMYRPIISSS VTLGAMSWSK
DCPQVAFHSS TLSKEASYFE KSAASDSSVL SISVVNPNRY SLWPKETAST SEALVTNGNL
AYVYVQYRSV SGGEWITAKD AERGAGTNKN FNLLCPESRG GDGCIFDWDL NDPYNKLLSG
YKDGKYDIRL KTMCVGGSHL AKPSVHEFVS DQNLMVKIDT KDPLVGDFKY VSSQVTQRVD
FMEDIDCTKQ VITAKRGNSS TGPFEAVSNE DLRQYVLQCV NDGAGGHWLM KFPYFSQGYH
KVTVTEVTDV AGNPAAEFEF VAPVRVGAST AETPQLGSSG SSPRQKSSLA SSSPQDDIDG
SQKVAIFYTP FAFTLCVALF AFAFRARARP ADAEMSERTS LVATTSRQVD GYSSVI