Gene OSTLU_31292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31292 
Symbol 
ID5001487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp635360 
End bp647434 
Gene Length12075 bp 
Protein Length4003 aa 
Translation table 
GC content52% 
IMG OID640416908 
Productpredicted protein 
Protein accessionXP_001417304 
Protein GI145345623 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGCGC TTCTGCTCGC GCTGTGCGCT TTGGCGTCGC TCTTAGGCGT CGCTCGGGGC 
GGTGGTTCGC CGTCGCCGTC CCCCGTGCAC TGTTCGGCGA GGTGGATCGA CGTCAACAAA
GGGGGCTGCC CAAGCTGCGG ACCAGGCACG TATAGTAAGC GATACCAAAT CGATCGCGAT
CCCGCGTACG ATGGACGACG GTGTCCTCAT GTCAACGGGC ATGTGTCTTA TGGACATGAT
TGCGAGGCAC CGCCATGTCC GCCGCCGCCG CCATCGCCGC CGCCGCCGTC GCCACCGCCG
CCGCCGCCGG GACAGTTTGT TCCCGTGCAG GTATCCATCA ACGATCCCGT GCCGAGAGTG
TCAGATGCGT ACGACTTTCA CATTCTCACC TCACATCCTT CGTGCGATTC ACAAAACGAT
TGTTTTGAGA GTCACATCGA GACGGACGTG AGCGAGCATC ACGTCGGACT CAGGTGCGAC
GTCACCGTGC AAAAGAAGGA CCAAGTTTGT GCGACATTCG GTCGTCACAT GCCTTGCAGG
CTGTGGAATG ACTTTTATCC AACAACTTTA ACCCTCACGG ATGAGCATGG TGACATCGTC
GAAGGCGATT ACGTCATTGA GTATGAATGC TACTTTGAAC TCGCAAACGG CGAGCTCGTA
CCTGAGCCGA CCAAGGTGAA GAAGACGCTG CACGAGTTCA GCGTATCCAA AGGGTGCGAT
TTGAAAATGA ATTTGAATCG AGATTCCAAA GCAGAGCTCG CGGCGTATCT CCTCAGTGGC
GCCGATGAAT TCAATATCGC TGACTGCTCG AGTTTGAAGA GTATTTACGC CGCGAAGGAG
GCCGTGTTCC GTATTTTCGA TTCAAACCCG ACAGATGGCC GACTCGATCA CGAAGAAATC
CTCATCGGTG TTGAGGCACA CAGCATGGAT ACACAGATAG TGAATCATTG GAAGGACATC
GTCGACGCTG AGGGTGGCGA TGGACTGTTC CTCACACTTT CGAACTTCAT GTCGGCAGAC
ATCGCGCCTC GCGGTTGTCA ATCCGGCGCC GGTGACGTCG TGACGTTCAC GGAGGCGACG
TATCCAACGA GTGATCCTGG ACGTGAAACG GGATCCAGGC AATGCGGAGA GATGGCGGGA
TCTATGAAGA CCACGTGGTC GTTTCCTCGA TCAATGCAAC TCGGCGATTG GTCGTGCGTA
TACATCGACG GGCTTTTGTA CAGTCAATTG CTCGCCCAAA CGAGCCGAAA CGCCGTATCG
ATGCAGAACA GCGTGACAGA TATTCGCCCT GTCATTGGCT CTTTTCATGA TGCAGAGCAA
TCTCTCGCTG CGCATTTCAC GTTCGCAGGC GTGTCAAACG CAAGGCTGAT TTCTTCAGAA
TCGCCGCGGG CGGATGGCCA AGAAGTGGTT CCAACCCTCT CCATCGATGA TACGTATACA
GGTAGTCTGC GAGTCTTCAC GGCTTCTCAA CACGGTAAAC ACGGGGCGTG CCTGAACAGC
TGCCAAGGTT GTACCTTCCA TTCGTACAAG TACATTGGGG AACCGTTCAA GAATGATTTT
GCCGTGTCGA TGCAAATATT CATTCCTACA CAGCTTTCGA GCAATTCTGG CTTTCGGAAG
AAGTCGGAAC CTCTCATGAG CTTCACTTCA GATGCATCTG GTAGTATGCA GCACAGCATC
AAGTTACGAT TGGGCACGGC ACTCTCGAAG TTTTCACTTC AATTTATACA TGAAAAAGTA
AACAGCGTGA CGGGTCATCG AGATGAGCTA TCAGAGCCCA TTCCTCTGGG TGCGTTTGGA
CGCTGGACGT CAATTGGATT TTCTTTCCAC CCAAAAGATG GTCTCACGTT GTACAGGTTC
AGCGTTGAGG GAGGTAGGGT AACACATGAA ATCAAGGTGC CGAGCAATAA CGATGAGTGG
AAATCGAGAC TGCGAGACAC GTTGCATTTG ATGCACCGTC GTGAATTTTT CAAAGAGAAC
CGAATAGAAT ACGACGATCT TCGCGTGTAC ACTGGCCGAG TAAAGAACAA CACATTCATA
GACGCGTATA AGTGCGACGC GTTGGGTGCT CAGTGTGCAC CTCGAGCACA CGCAACGCCA
AACTCACGTC GCGTCGTATG TGTCATGCTG GATGTCCACG ACAGTAGTGG AGGAGACGCC
CGTGCGCCGT ATTCCTGCAC CGGTGCCCTG TACTACGACG GCGGCGCCAT TGAAGTGCGC
GCGAAAATGG ATCTCACTGG AGTCGCATTC GAATTCCGCG ACACGGCTTG GCACGAATCA
TCTTTTGAAA TCTTGCGACG CACGCACGAT CCAAGTGGCG ATAACGCGTT TGATACAGTC
GTACTCGTTG ACGGTGGTTT GAACGGCTGT GCGTCGATGT TTTCGTCGAT TACGTATCTC
GACCGAGAGG CTGGTCAGCA GCCCAACTCT CGCTGGCAGT ACAAGATCAA AACAAAGACG
GACGATGTCG ATGTAAGCTT TTTGTCGCTC ACTACTCATT TTAAAACGCC TTGGATTGGT
CAAATTGTGG GCGAAGTGAT GGCCGGTAAG TCGGTTGTTC CTGTTGGCGA AGTCCGCATT
TGCGCAGACT TCGTGAAGCC CAACGGTACT TTTCTGATGT ATCGCAGTCA CAATCACTCT
AATCTGGCTT TGCACATGCC CGTCGTTCAC ACATCTAATC TCACTATCTC AGCAGAACAA
CAGGCGCATC GAGCTACGGA CGGCTTACAC TTGAAGCCTG ACTCGATTCG TGTCGCAAGA
GGACAGTATT TGCGCGTCAA CCTCGACTAT TGGAGCGCAA TACAAGAGGT CGAAGTGTGC
ACCGCAACCG GAGAGAACAA TTTCAGAACT TTCGTGCGGG AGTTCGATCC GGGATACTCT
GTGAATCACG GTCACGAGTG CAAATTACGA GAAACTAACG CAACCCAAAC TTTCAGCGCA
TCGCGCGTGA CTTGCTTCTT CTTCACCTGT CGGGGCTCAA AGCTTACGAC ATTGCATGGT
CAGTTCGTAA CCGCAGCCGT CGAAGAAGGC GACGACGTGA GAATGACGCA AATTCGTGTT
GTTGGTCAGC GTGGCAGATG CACGTTCACG GCGATAAGTG ATGACGATGG GCGCTATGCA
ATCGATGTAC TCGATCGCTC CGGGAACGTC CCCGTCAAGG CGAATATGCT CATCGGCGCG
TATAAGGAAG AAGTTTTTGG TGAAGAATTC GAAGTTCCCC TCCTGGATAC GAGCGATTCA
GATGGCGATG ATGATTCGCT AAACGTTCCG ACGCGAGTGT TACTCGTACT TCGTCGAAAG
AACAGTTCGC TTGCGGCGCG TCTCGGGCAT GAAGTCCCGG ATTACAGCAG CTCCTACTAC
GACGGCGAAA CGCATCAGTA TCTGGATCAA CATGGATATA GTCCACCGTA TGGCAGCTCC
TACTACGACG GCGAAACGCA TCAGTATCCG GATCAACATG GATACAGTCC ACCGTTCGGC
AACACGCCAT ATGTGCACAC TATTTACGAT AACGCCTACG ATTCAGGCGT TCCTGCCGCC
AAAGTAGAGT TCCAAAGTTT GACGAATTTA CGTCTGGAAA ACACAACTCT TATGAAGAAA
GATGGAGTTG CTGATGGTTG GAATGCAGGG GGGTCCTCGA ATCAGTCAAT GTTTAATGTC
ACGGGTCGTT TTACTGGAAT GGACGCGGTT GAGGGTATCA GGGTCGAATG CTCCTGCGAT
GATGGCGAAT ATGTCGTTGG TTTGAGCTAC GCAGAGCACA TAGCTAACTT TTCCAAAGCA
GCGACCGAAT TTGCCATTCA CTGTCGTGGT AAAGGCTATG AGCCAGCCAT CATTTCAGGT
GGTCGCTCTT ACGCAAATTC AAATAACTAT CGGATGGCCA TGGAGACGAG ACCATCGCAT
CACATGACGG ATCTATACGA TGTGTACCAG TCAACTGACA TCACATCTTC GCCACCCATC
TCCACGGCGA ATGATTTCTC ATATGCAGCA GCGCAGAGGC GACTACTGGC TCCAACGAGT
TCGTACCCTG CAAGAAGCCC GACGTCAAAC GAGTATCCAA CGACGAGCCC GACGCCAAGC
AGTCAATTAT TGAACTCTTA TCCTCAGTCG TATGATGCGC AAACGAATGA GTATCCTCAG
TCGTATGATG CGACGACGTC TACTTATTTT CCACCCTACC ACATCGATGA CTATCTCGAT
CGAGGCTATC GTTCAAACAT GTGTATGAAG AAGACATTCA CCATTAAACT TACACGCAGT
GGGCGTGTTT TGTTTTATGA AACGCAAGGC CATTGGCTGT ACGATCATCC GGTGCAGCAT
GAAAATTTTC ACCATACTCA GACGTGGCAA ACTGGTGGAC AGTCGCCGCC GTACGGCTAC
ACGGAGCAAA CGCCACCGTA CGGCTACGCG GAGCAAACGC CGCCGTACTC TGCGCGCGAC
AAGTCTTCCC TGGGCGCACA TGCCGACGTT TCGCACACGG GCCAAGCTTT GCATGAGTAC
CATGAACCTG CTCAACTCAT AGTACCGGTC GCGTACGCAG AAAGAGCACA CAGCCCCTTG
TTTGTGCGAA TGGTGGTAAT TTCACAAAAT GTGACGCTGA GCGACGCCGA CATACGTTGG
ACAGTGCCGA CTCCCTCACC GCCGCCATCA CCGCCACCAT CACCACCGCC ACCGTCACCA
CCGCCATCAC CACCACCACC GCCGCCGCCG CCGTCCCCGC CGCCGCCGCC GCCTTCACCG
CCGCCGCCGC CGCCGTCGCC ACCGCCGTCG CCGCCGTCGC CGCCGCCGCC GCCCCCTGCG
AAAGCTATCA GAGGTAACAG CGTCGATTGG ACGGTGAAGA AGAACGTCGA TAGTACCTCA
TCGGGGGGAG TTTTCTTGGT CGGATGGAAT TCGAAGGGTG CTGCAGTGAG CGCGCGGGCG
ATCCATTCGT CCTCAAATGT CACTCGAGGG ATATCTGCAA CTTTGGAACG TGGGCCTTAT
TCAATGGGTC ACACTTGCGT AGGTCTGACG AGCTCTGCGG ATCGCGAGAC GGTTGATAGT
TCTAACATCG ATTTCGCCAT GTGTTGCGAA TGGGGTTATC TTGGTGTTTT TGAAAAGGGA
TCGCGCAAAT GGCGTGCTAG TGGAAACTTG CAGGATAGTT GTTCACAAAC GGACACGCTA
CAAATCGTCG TGAACGATGG GGATGGCGCG ATCGAGTACT TCAAGAATCG GCGTCGCGTG
TATACCAGTC AACGGACACC TCAGTATCCG CTTCACGCAG ACGCTGAAGT CTATTTCGGA
AGACTGGGTG AAGTTGTTTG GATTGAACGA ATTCCATCAC CGCCTCCTTC TCCTCCTCCA
CCGCCTTCAC CGCCGCCGCC ACCATCACCG CCCCCGCCGC CCCCGCCTTC ACCGCCGCCA
TTGCATCAGT TTACATCCGC GGATGTAGAT CCACGCATCA ACACGGCCGC GGCTATTCCG
CACATCACAC GCGAGGAGTT CGAGGATTTC ATCGAAAATA TAACAGGCTT CCCTAATTTG
GGACACGCCA TCGTGATGGA TGAAATCTGG CAGTATATCG ACGACGACGG GGACGGTGTT
TTGAGTGAGA TTGAGTTCCA GCACGCGTCT TACAAAATGC ACACTCACGA GCTGTACGTT
GATCCCATCA TCGTCTACCC CAGCCGCTCG ATAAAACATC GCTACGGCTT CAAGGTTTCC
AAATCAGTCA GGAGTGATTC TAGTGCCTGC GTGGGTGACG CCGTATCTTA CGACGGTCCC
TCTGCGTCGC TTGGTTCGTG CGAATTGACT TCGGAGAAGG AGCCTGTAGA GATTACATTA
TGGGGAAATG GTTTTGCTGG CACACCGTCG ACGACCACTA TTTACAGCTT GGGAGCTTCC
AAGAATGGAG ACACTACGTA CATCGCGAGC GCGCGCGTTT CTGTCGGCAC CTGGTTCACG
TACAACGTGT GTGCGGTGAT AAAGTTTCGC GTGATAAAGA AAGATGGTCA GTGCTCGATC
GTCGTTGACC AAGTTCATAA GATGTGGCCA AGCCTGGTGT CTTCGAAATG TGGCATCACG
CCACCCATCT ATGACAGATG GAGCCGCTAC ATGGTTGCAT CCGTTGCGAC GTCGGCGACG
AACGGCTTTG CTGGCATGAA ATCTTCTATG TTCAAGAGCT GCGCTGCGAG TTGTGAGAAT
GCGATTCTAC TTCGACACAA TAAAACTGTG CCGTACAATA CATCTGAGTG GCGAAATTAT
TACGAGTCTC TAGTAAAACG AGACATGGAG CCGACTACTC ACGTCGACTG CTCTCACCGA
GAAGACATGG TTGTGCTCGG CAGAGAAGAG GACACGCTCG CCATGCCCAT GGTGCTCTCA
TCCATGGCTA GGAAGCAATG GCTTCTTGAC CTCGATTCGG CGCGGCGAAA GCAGAGTACC
AGTGAGAATG TCACCAAATT GCTCGGCACC ATGCACTCCA ATTTTGCCAT CAGTTCAGAT
TTATGGCTGC CGGATATTGT ACACGTGTTT GACAAAATTC CAAAGCCTGA AAATGCTTCT
TCGGGTCCAT CCAAGGAGGA TGAAATTCAC GTGGTGGATA ACGATCTCGG GAATTCTTTC
TCAAACTTTG CGCGTATAGA AGTAAACCGA CTCCAACTTG AACACAAGCG CATTGGAGAA
AAAACATTTA CAGATGACAC CGCCGCTGTG GTCAAAGGCG CTATTCTTTT CCCACAGCAT
CTCACGGGCG GCAGTCCAAA CTGTGGATTG TTTGAAGCCA TGATTCGTGT TTACGACGTG
GAAGAAGACG GCGAGCCAGA TGTATACTTG ACTGAAGAAG ATGGCTCGTT CGAGATGGCT
TTAACGCGTG GAAAGACCTT CAGAATTGAA GCCTTTTATG AAGGACACAC CATTTGCTAC
GCCGGAAGTC AGCCTCAAGA CGCGACGTCA CCCAATGGCT ATGATTTCGA AGGAGGACAG
CAGGCGTGTG TCAAGACGAC CCACACGCTC GCTCGTGTCG GCGATGGCAA CTTTTTATAC
TTTACAGATA TTACGCGCGG AAATATTGAC CTTGGTTTGT ATGCGGGAGA ATGCGAAGAA
AGGTACACGG GCGAAGATGT CAAGTTTAGG GTAACGCCCG TCAACGGCTG TCATAAACCC
GTCATCGTTT CCGAAACCGA CATCAACGAG CGATGGCATA CGATTGAGGA TATCCTACCG
GAAGGCCAGC AGGTTGGCGA CAACGCCCGC GTGTGGAAAT TCGCGGCGAT GGATTACTCC
ATCACGCTAG CGAGTGGTTC CGACATCAGT AGAGTTACCG AAAAAGCGTG TCCAGACGAG
GCATGCCCAG ATTACTGGGA TGGATGTCAA GTCGAGCCAG ACGACATGCG TCAATTTTTC
CGCAATCGAA ACTCACTCGA AAGACTTGCG CTGATGAGAG ATGAATTTGC GTGGAATTCC
ATCAGGTACA AGTATCACGG ATATATCTGC GCACAAATAA TGGACATTCC TCGAATCACT
GATAACGAGC TCAGACTCGA GACTTGCGTA GGCAACTTAA CTTCCGGAAT GTTGACAGAG
AAGCATTTAC TGGGTTCGTC TGATCACGAC ACAATCAGCG TCGCCGCTGA CAAATCGTTG
AGGGTGAAGG TGTTTGAACT TCACGATCTC ACACCGAATG CCGCAGATGC CCCATTTAAA
TGCACAAAGT TTCCAAATAC GCAAAGTCGC ACGGGAAACA CGAAATTTTC CGTTCGACAG
ACTGTCACAA ATGAAGAAGA TAACGTGTGT CATCCAAATC GCGGGGGTGG GCCCCTGTGC
GATTTTGAAG TTAATATATC AACGGCTGAC GTCGGCAAAC TCATCTTTCC GACGGAAGAA
GATACGGATC AGGAAACTAC CCAAATGGTC ATTGCCGCGG GGTATCCAAA CCTTGCAAAC
CCGTATCGAC GAGAAGTCAC GATCACTGTG ACACGCGACG ATACACCCTT GGACGGTGGT
CTTGGGCGAT CGCTGCAAGC AACGATGACG CGAGTACTTA TCCCATTGGG CTCCAAGCCA
CGAGGAGGAT TTGACACGGG AGATGACACC TTGTGGGCGA CAGTGCCTAT TGATGGATTG
GTATACACAG TGGTGCACGA TCCGCCAGGT GGTAATTCTG TAGCCGAGCT GCAATCTGGA
TCTGAGATTA CGATTCAGTG GGAAATCGCT TCAGCTCGCT CTGTGAAAGT TGGCAGATCG
ATCGAGCTCA ACCTCGGTTT CGCAGGTAAA ACTACGTTTG ATCTAGGCTT TAACGCCGGT
TACACCGCAG AGGCTGCTAC AAAATCGAGC GTTCTCACGT TGGAGACGGA GAAAGATTTC
GGACACAAAG AATCTGGTCC ACACTTCACC GCGAAGGCGA CGTCCGAAGA AGTGTGGGAG
ACGACGGTTA CGATGGATCG ACACATCCGG AGCAGCGACG ATGAAGGCAC ACCTGGACGT
CCGGGCGACG TCATTTTAGG AGGTGGTATC GAACTCGTGT ACAAGGTATC AGATACGCTC
GACGTAGGTA CTGTTGACGA CTGCTTAACT ACGGGTGTTG GTATAACCTG GTTACCGCGT
CGACCAACCT CTTACGTGTT CAACGTCTTT TCCATCGAGT CTCAAGTGCT GCCGAATCTC
TACTTTCTGT ATACTGTCGC CAAGTCTGGC GAAAATACGG CGAAGACACC GTCAGATAGA
ATCGTCAAGG ATGGATCGGG TATGCGATAT GAGTGCGCGT CATCGCCGTG CTCTAACTCA
GAAATGAATC AACATTGGGC AAGCTACATA CTGAGACGAA TCCATACGTG GAAGCGAACG
TTGCTTTGGT CCTCGCCCGA AGTTTACTGG ATGCCAAAGG GTGCGAGTTA TGAAAAGAAC
TACAAAGCAT ACGATCGCAT CAATGAACCG TACATCGATT CACGTTCCCT CTTTGAACGG
CGCATGTCAT CCGCGTTGGC ACAGCGTGAT TCCAAGCGTG ATCCATCGAT TCTGAGGGTT
GCATTCGAGC TGAGTAAAAT GTGGTACGAG CAATCCCTCC TCAGTGTGGG TGACAGAGGC
ATGCTTCGTT CGAAACCTCA CATCCCTGCT TTATTTCTCA CTGCAGGACT GGAAATTCCG
AGACTAGCAC CATTTCTCGA AAATTCACGA GCAGCGTCTT TTGCTTCGAT CGCGTACCCA
CGTTCGTCAA AGGGTGCGTG GTCATTCGAT GAAGTGGTAA ATGACCAAAC AAACATCGGA
ATGTACTCTT TTGGGATGAA CGAAGTCGCC GCTCGTGACC TCGATGCGCA GGTCAAGAAG
TGCGACGGTG TACTGTGCGA GACCAGTGGA GAATTCGACG CCGCACCACA TGACGTGTTG
TACGGAGGAT CACTGAACGC CATGGGTGAG ACCGGCGACC TTCGCGCCAC GGACCCTTCG
CGCTTAATGG CGTCTCTTAC TGGGACTCCT GGTCCAACCG GAATGCTTCA AAAGGATGCC
GTCGCGAGCG GCGGAGAAGA GGAGACGATA TACTTAACTT TTGGTGGTGG TGGACATGCG
TTGGAATTCT CTTTCAGCGC AAAGGAGTCC ATCAACCGCG ATGCATTCGC GTACTCGCTT
GACTTTGATG CAAGCGCCGA AAACACAAAC AACATGAAGT TCTCGGGCTC ACCAATACCC
ACCATCGAGT TTAATCGAGC GACAGACATC GCCAGAACGC TGTCCACGGA TCGTATCTTT
GCGTGGAACA AGTACGGAAC GATGACGACG GTGTATTCTC TCGCGGATCC AGATTATGGC
GATAAATTTG TTCTGAAAGT CGGAAGCGAT GCGCGATTCG GCACACCGCT CTTCATGACA
ATGGGCGGTA GATCCATGTG TCCCGGCGAA TTGACGACGA TGTGGCGCGA ATCGGGAAAT
ATCATTGAAA GACCTCGTTC GTTATCCTTT GGTACGCTCC CGCTGAACCC AGGAGAGCGC
GCTGTTCACG AAATAACTAT TCAGAATGAA TCTCCATACA GAGAAGCCCG ACCGCTGTAT
TTACGTATCG TTGATGGACA CTCAGAGTCG TTACGATCGC TGGTGCGCGC AGCGCTGAAC
ATCATCAAGG ATGATGAATC AGCTTCGGCA CAACATGTAG CCACAGTTGT CACGAACACT
GCTGCAAACT CAGTAGCTTC GGCATCCCTG TCCATGGCCA CTGTCATAGA GAAAGTTGAA
GAGGCAGCTG GCAGCGGCGA CGCGACTGCC CTGAGTGTTA TGCAAGTCGT CGTCGCCGAA
GTCGAGAGCG TCAGTAAATC GGCTATGACG CCGTTACAGG ATGTAGAGAT TACCGTGAAC
GGCAAGAAGG TGGTACCTGT CGCTGAGTGG CTTCCTTTGA AGCATGTCAT CGGCGATGCT
CTTGAATCAC AACGGACCGT TCACAAGACT GTGTTCAACT TGGGTTTCAA GCCAACGGAA
ACCTCGCATC CTATCATTGA ACACATTCAA GTTGAAATTG GATCTCTGTG CGAAATGCAA
ATCTCATATG ATGGAGCTGG TCTGTATCGC GATCCAATCG CCGTCAAGGC TCAACTTGAA
AAGATGCGAT GGGAATCCAA GTGTCCGAGC GTGACGTTTT CCAAGTCCAC CGTCATTGAC
AGTGAAATCA CGCGCGCAAA CGCGACTGCG CCGCCGCTTC GTTTGGTGGT GGTCAATCCC
GATTCTGGAA ACTTGTGGGA TCCTAGCAGC GCCAGTAGAC TCGAAAAGGT TGTCGTACAG
TACAGACCGC TTTCGGGTGG CGAATGGATT ACTGCGAAGG AGGATCATGT GAAGTACAAA
GACGACTCGT ACAAGAAAAA TCTCATCTGC GAACATTCTC GAACGAGCGG ATGCACGTGG
GACTGGGATT TGAACAACAA ATACGACAAG TTACTCAGCG GATACAAGGA CGGTGCGTAC
GAGGTTCGCG TTAAAAACTT TTGCTCGGGC GGTGACGCTT TCGCCGCGAC CGAGGTGCAC
GAGTACGTCG GCGATAAGAC GCTATTACTC TTCACGGATA CGAAAAACCC GCTCGTCGAG
CAGCACGTGT CGTCACCCGC GGCGCGCACC GTCACCATCG TGTACGCGGA GGAAATCGAT
TGCGCAGATC CACCGACCTT CGAAGTGGCG CAAACTCACG ACGAATCCTG CGTCGCCGTC
GCGGGCGATG GCACAGTCAG CGCGGCTATC ATTCGTGAGT CTTACGAGCA AACGTGCCTC
AACACCGGCG TTCAAGGAAA GTGGATGATG CAATATCCCA ACAATGAGGA TGCTCCGAAT
CCTGGATTGT ACAGAGTCAC CGTCAGAGAC ATACGTGACA AAGCCGGTAA CCCTTTCGTC
GGCGATTTGA CGATAAACTA CGTCGTGGGT GACGTCGTCG ATGACTGTGA CATTTTGAAA
GCATCCAAAG ATGACCTCAC CCCGCTCATC GGTTTGGGAA ATTTCGCCGA TGTTCAAGGC
AATACCATCT GGGGGGTGTA CTCTAAAAAT ATCGACTGTC GCTCGCAGCG AGTCACCATC
ACGAAGAGTT TTGATCAATC GTGCCGATCC ACGAGCACAG AAATTGGCAC AACCAACTTC
ACGCTTGCGT GCGTGAACGT CGGCGGAACG GGACAATGGA TCATGAAATA TCCTGAAGAT
CTGGAATCTG GAGTTTACAC CGTGCGAGTT CAAGGCGTGA AAGATCGCAC TGGTCACGAA
GCGTCTTCTT TCATTCGCAA GTTTACAACG CTTCATAACT CGGATGGAAC GTTGAACGAA
TGCGACGACG CGTCCGCCTC TTTTGCCATC GGACTTGGGC GGATGCGTCG TCGACGTCTC
GCTCCGAGCG CGTCCGCGAC AAAGACGTCC ACAAACTTGA TTCTGTCCGA TCTCGCCATC
GTCGTGTTCG CTATCTGCGC GCTCGTGATG ATCATCAAGA AAAGAATCGC GCTGCCCGCG
GATGAGAGTG AGCTGCGCGA ACGCGAAGCC TTCGACAAGG TGGACATTCC GCGCTATGGT
TCATCTTTGT GATTCCACCG CGCGAATTGA CCGTCGGCGA CAAAGTCGAG AAACCGACGA
ACCTACGTCG CCGTC
 
Protein sequence
MSALLLALCA LASLLGVARG GGSPSPSPVH CSARWIDVNK GGCPSCGPGT YSKRYQIDRD 
PAYDGRRCPH VNGHVSYGHD CEAPPCPPPP PSPPPPSPPP PPPGQFVPVQ VSINDPVPRV
SDAYDFHILT SHPSCDSQND CFESHIETDV SEHHVGLRCD VTVQKKDQVC ATFGRHMPCR
LWNDFYPTTL TLTDEHGDIV EGDYVIEYEC YFELANGELV PEPTKVKKTL HEFSVSKGCD
LKMNLNRDSK AELAAYLLSG ADEFNIADCS SLKSIYAAKE AVFRIFDSNP TDGRLDHEEI
LIGVEAHSMD TQIVNHWKDI VDAEGGDGLF LTLSNFMSAD IAPRGCQSGA GDVVTFTEAT
YPTSDPGRET GSRQCGEMAG SMKTTWSFPR SMQLGDWSCV YIDGLLYSQL LAQTSRNAVS
MQNSVTDIRP VIGSFHDAEQ SLAAHFTFAG VSNARLISSE SPRADGQEVV PTLSIDDTYT
GSLRVFTASQ HGKHGACLNS CQGCTFHSYK YIGEPFKNDF AVSMQIFIPT QLSSNSGFRK
KSEPLMSFTS DASGSMQHSI KLRLGTALSK FSLQFIHEKV NSVTGHRDEL SEPIPLGAFG
RWTSIGFSFH PKDGLTLYRF SVEGGRVTHE IKVPSNNDEW KSRLRDTLHL MHRREFFKEN
RIEYDDLRVY TGRVKNNTFI DAYKCDALGA QCAPRAHATP NSRRVVCVML DVHDSSGGDA
RAPYSCTGAL YYDGGAIEVR AKMDLTGVAF EFRDTAWHES SFEILRRTHD PSGDNAFDTV
VLVDGGLNGC ASMFSSITYL DREAGQQPNS RWQYKIKTKT DDVDVSFLSL TTHFKTPWIG
QIVGEVMAGK SVVPVGEVRI CADFVKPNGT FLMYRSHNHS NLALHMPVVH TSNLTISAEQ
QAHRATDGLH LKPDSIRVAR GQYLRVNLDY WSAIQEVEVC TATGENNFRT FVREFDPGYS
VNHGHECKLR ETNATQTFSA SRVTCFFFTC RGSKLTTLHG QFVTAAVEEG DDVRMTQIRV
VGQRGRCTFT AISDDDGRYA IDVLDRSGNV PVKANMLIGA YKEEVFGEEF EVPLLDTSDS
DGDDDSLNVP TRVLLVLRRK NSSLAARLGH EVPDYSSSYY DGETHQYLDQ HGYSPPYGSS
YYDGETHQYP DQHGYSPPFG NTPYVHTIYD NAYDSGVPAA KVEFQSLTNL RLENTTLMKK
DGVADGWNAG GSSNQSMFNV TGRFTGMDAV EGIRVECSCD DGEYVVGLSY AEHIANFSKA
ATEFAIHCRG KGYEPAIISG GRSYANSNNY RMAMETRPSH HMTDLYDVYQ STDITSSPPI
STANDFSYAA AQRRLLAPTS SYPARSPTSN EYPTTSPTPS SQLLNSYPQS YDAQTNEYPQ
SYDATTSTYF PPYHIDDYLD RGYRSNMCMK KTFTIKLTRS GRVLFYETQG HWLYDHPVQH
ENFHHTQTWQ TGGQSPPYGY TEQTPPYGYA EQTPPYSARD KSSLGAHADV SHTGQALHEY
HEPAQLIVPV AYAERAHSPL FVRMVVISQN VTLSDADIRW TVPTPSPPPS PPPSPPPPSP
PPSPPPPPPP PSPPPPPPSP PPPPPSPPPS PPSPPPPPPA KAIRGNSVDW TVKKNVDSTS
SGGVFLVGWN SKGAAVSARA IHSSSNVTRG ISATLERGPY SMGHTCVGLT SSADRETVDS
SNIDFAMCCE WGYLGVFEKG SRKWRASGNL QDSCSQTDTL QIVVNDGDGA IEYFKNRRRV
YTSQRTPQYP LHADAEVYFG RLGEVVWIER IPSPPPSPPP PPSPPPPPSP PPPPPPSPPP
LHQFTSADVD PRINTAAAIP HITREEFEDF IENITGFPNL GHAIVMDEIW QYIDDDGDGV
LSEIEFQHAS YKMHTHELYV DPIIVYPSRS IKHRYGFKVS KSVRSDSSAC VGDAVSYDGP
SASLGSCELT SEKEPVEITL WGNGFAGTPS TTTIYSLGAS KNGDTTYIAS ARVSVGTWFT
YNVCAVIKFR VIKKDGQCSI VVDQVHKMWP SLVSSKCGIT PPIYDRWSRY MVASVATSAT
NGFAGMKSSM FKSCAASCEN AILLRHNKTV PYNTSEWRNY YESLVKRDME PTTHVDCSHR
EDMVVLGREE DTLAMPMVLS SMARKQWLLD LDSARRKQST SENVTKLLGT MHSNFAISSD
LWLPDIVHVF DKIPKPENAS SGPSKEDEIH VVDNDLGNSF SNFARIEVNR LQLEHKRIGE
KTFTDDTAAV VKGAILFPQH LTGGSPNCGL FEAMIRVYDV EEDGEPDVYL TEEDGSFEMA
LTRGKTFRIE AFYEGHTICY AGSQPQDATS PNGYDFEGGQ QACVKTTHTL ARVGDGNFLY
FTDITRGNID LGLYAGECEE RYTGEDVKFR VTPVNGCHKP VIVSETDINE RWHTIEDILP
EGQQVGDNAR VWKFAAMDYS ITLASGSDIS RVTEKACPDE ACPDYWDGCQ VEPDDMRQFF
RNRNSLERLA LMRDEFAWNS IRYKYHGYIC AQIMDIPRIT DNELRLETCV GNLTSGMLTE
KHLLGSSDHD TISVAADKSL RVKVFELHDL TPNAADAPFK CTKFPNTQSR TGNTKFSVRQ
TVTNEEDNVC HPNRGGGPLC DFEVNISTAD VGKLIFPTEE DTDQETTQMV IAAGYPNLAN
PYRREVTITV TRDDTPLDGG LGRSLQATMT RVLIPLGSKP RGGFDTGDDT LWATVPIDGL
VYTVVHDPPG GNSVAELQSG SEITIQWEIA SARSVKVGRS IELNLGFAGK TTFDLGFNAG
YTAEAATKSS VLTLETEKDF GHKESGPHFT AKATSEEVWE TTVTMDRHIR SSDDEGTPGR
PGDVILGGGI ELVYKVSDTL DVGTVDDCLT TGVGITWLPR RPTSYVFNVF SIESQVLPNL
YFLYTVAKSG ENTAKTPSDR IVKDGSGMRY ECASSPCSNS EMNQHWASYI LRRIHTWKRT
LLWSSPEVYW MPKGASYEKN YKAYDRINEP YIDSRSLFER RMSSALAQRD SKRDPSILRV
AFELSKMWYE QSLLSVGDRG MLRSKPHIPA LFLTAGLEIP RLAPFLENSR AASFASIAYP
RSSKGAWSFD EVVNDQTNIG MYSFGMNEVA ARDLDAQVKK CDGVLCETSG EFDAAPHDVL
YGGSLNAMGE TGDLRATDPS RLMASLTGTP GPTGMLQKDA VASGGEEETI YLTFGGGGHA
LEFSFSAKES INRDAFAYSL DFDASAENTN NMKFSGSPIP TIEFNRATDI ARTLSTDRIF
AWNKYGTMTT VYSLADPDYG DKFVLKVGSD ARFGTPLFMT MGGRSMCPGE LTTMWRESGN
IIERPRSLSF GTLPLNPGER AVHEITIQNE SPYREARPLY LRIVDGHSES LRSLVRAALN
IIKDDESASA QHVATVVTNT AANSVASASL SMATVIEKVE EAAGSGDATA LSVMQVVVAE
VESVSKSAMT PLQDVEITVN GKKVVPVAEW LPLKHVIGDA LESQRTVHKT VFNLGFKPTE
TSHPIIEHIQ VEIGSLCEMQ ISYDGAGLYR DPIAVKAQLE KMRWESKCPS VTFSKSTVID
SEITRANATA PPLRLVVVNP DSGNLWDPSS ASRLEKVVVQ YRPLSGGEWI TAKEDHVKYK
DDSYKKNLIC EHSRTSGCTW DWDLNNKYDK LLSGYKDGAY EVRVKNFCSG GDAFAATEVH
EYVGDKTLLL FTDTKNPLVE QHVSSPAART VTIVYAEEID CADPPTFEVA QTHDESCVAV
AGDGTVSAAI IRESYEQTCL NTGVQGKWMM QYPNNEDAPN PGLYRVTVRD IRDKAGNPFV
GDLTINYVVG DVVDDCDILK ASKDDLTPLI GLGNFADVQG NTIWGVYSKN IDCRSQRVTI
TKSFDQSCRS TSTEIGTTNF TLACVNVGGT GQWIMKYPED LESGVYTVRV QGVKDRTGHE
ASSFIRKFTT LHNSDGTLNE CDDASASFAI GLGRMRRRRL APSASATKTS TNLILSDLAI
VVFAICALVM IIKKRIALPA DESELREREA FDKVDIPRYG SSL