Gene OSTLU_31401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31401 
Symbol 
ID5001271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp807897 
End bp821672 
Gene Length13776 bp 
Protein Length4591 aa 
Translation table 
GC content55% 
IMG OID640416692 
Productpredicted protein 
Protein accessionXP_001417362 
Protein GI145345747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCGCG CCGCGGGCGC GTCCGCGCCG TCCACGGCGA GCTCGCACGT TGCGTTCAGT 
GGCTGGCTGA CCAAGCGCGG AGGAAAGGAT TTATACGGTA AAGCCACGTG GCGCCTGCGA
TATTTCGTGC TTCGACGACC GAGACACGCG AAAGAGCGGG CGTGCGTGCG GTACTTCGCC
AACGAACCCG CGGACGGGGC GGAGAGCGCG GCGAAAGGGG AGTTTGCGAT TAACGGACAG
AGTGCGGTGC GAATCTTGGA TTCCGATGAA ACGTTCGCGG AATTCGGGCT CAAAGCGCAC
AAGTATGCCG GGAAGCGGAT GTTTGCGTTT CAGACGATGC CGGGGCAGGG CGGGAAGAGC
GCGGCGCTTG TCGTGGAGGC GGAGGATCTG ACGACGATGC AACGGTGGGT GACGGCGATT
AGCGAAGCAA TCACTCGGGC AAGAAGTTTG GAGAATGAAG CGGCGAGTAA AACCGGTTTG
GCGGTGTTAA ACGCGCAAGT CGATGGCGAT GAAAACGATG TTTTACGAGG GAAGAACTTT
CGGGGGACGC TTGAATTGGC GGGACTTTGC GGACCGCTCG GCGACGGCGC GCTCCCCGTG
CATTTGGACG CGTGCTGGGA GAATTTGACC TTTGTTGATC TCATGGCGGA TTCGTGCTTG
AACGAAAAGC TCGGCAAGAT GCGACGATCG AAATACGTGC GCGAGACGTT GACGAGCTTG
GTCGACAGTG CGATCACGAT TGGACGGTGC TTGGGGGCGT CGCCGATTCC CATGAGCGGG
CCAGAGTCTA ACGCGATGAG CACGATTCGT CGAGCTCTTG ATGATATGCT CGCACTCGCT
CGCTTGATGC CAGTCGGTAC GAAAGACAAT AATATTTTTG TTTCGTTTCT TGCGGCGCTG
TTGGATCGCG TTCGTCAGCT TCGACCAGGT GAACTGATCA TGTGTCCGGG AGGTTGGGCA
AACGAAGAAG GCGGCGCTGC GGTGATTTAC ACCATTCACC GTCTCCCCGC ACACTTCGTT
GTGAGCTTGA CCAACTGCGG CGACGGTGTG GAGTTTCATC CTATTCAAGC TGATCCTGCG
AAAGATGCTT TCAAGTATAT TCATACCATT CAGTTGATGG ACGTTCCGAT AGCCGTCGCG
ATGGACTCCT CGGCGTGGGC GCTGCTGTTC CGCCCGTTGA TTTTCCCTCA AGACGCCGCC
AGTGCGAAAG ACATGATTTA CAACAAGCTC CTTCCCATGA TGAATGGTCG TCCTCTACTC
GCTTCGGTCA CCCCCAAACC ACTGCGCACT GTTAAGTTTG CGAAACAGCC AAGAGGACGC
GATTCAAGCG GCGTCTTTAC CGCACTCGAA GCCGCTCGAT GCGGTCTCGC TAGTTTAGGT
TGCCCGCCGA ATCGTGCAGA TGCGTTGGTG AACCTTGGAG TAAGGTTTGT CATGCTTGAA
GAGGCCAAGC GCGATTTGGA GCGCGTGAAC ATGATCGGTG CAAGCGCTCT GGTTACGTTG
CAAGCGGCGT GTAGAACGGT TGCGGAATAC GCGGCGAATC ATGCCGACTT GTGCGTCGAG
GCAGAAGAAG CAAGTAATGG TGAGCCACCG AAGAAAACGA CAATGCCCCG AGAAACGCTG
GCGCAGATGT TGCGCTCAGT TGAATCGGTG ATTCTGCGGA CAAAGGCCCT GCACGGCGCA
ACGACACTAC CGCCGGCATT GCAACCGCCG AAGGTACCGA TCACGGCGTG CGCTTTACCA
CTCTTCAACC GCCTCGCCAG CGAAGGTAAC GTAGATTCGT TAGCAGGTGC AGCTAGAACG
CCACCGATTT TGAGGCCAGT CGAACTCACA TTGGTGCCGG ATGCCGTGGC GAAGCTAGAA
GACGTACCCA AGGCGTTGAG AAACGCGCTG CATTGCTGCA CGATTCTCGC AAACCAAGCG
GATTTGATGC CAAACTCTTA CGCCTTGCGC GTGAGCTTGC TCGTGCACTT GTTTGTACAT
GTCCTGCCGG CACCTTTACC TATTACGCAC CCTGAGAGAC ATACGAAATG CTTTTGGGCA
TCGGTGAACA CCACCTATGA GCTCCAAGCT GAAATTTTGA GGAGTCTAAA TTTGTTGCTG
CGGCACTTTT CCGCCGCAGC GCTTTCAATC AGCGTCACGC AGACTTTCGA TGCAGCGCGT
TTACTCACCA TTGCCAATTT CGCGGCAATC GCGGATGCGA CGCTTCGTCT TAAAGTAGCA
GACGCGCCGT CTGCGTTCAT GCAACACTAC GCCGGATACG CCGAAGGTCC GATTTCGGCG
TTTGGTTTTG AAATCGGTCC GTTCGCCGTT GAGGCAGAGT ACTTGCGTTT CCACTGCCCA
GAACGTACCA TTAGGTTGAC TCAGACGCTG GATTACTTTC ATAGCTTGAA GAAAACGATC
GAGAGCGATC ACATGATTTA CCGTTGGGAG AATGAAGCCG CGGGTGGGAT GTCGTTTGGC
GTCGGCGAGG CGAGACTCAT CGATCAAGTC TGTTTGCAAA TGGGTATCGG CCGAGAAGAT
CCAGAGTACT TGGCAAAGTA TCACAGTGGC GAGTTGCGTG AGTTTGCCGA TAGCTATCCG
GAAATGAGTG CACTTCGAGA TATCATTTTC ATGTTCAAGT TACTCATGGC GCCGACGTCA
GACGACCTTC CAGAGCTCGC ACGTTGGAAG CCCATCGACG CTGCGTTGGA TTGGAGTGTG
GCGCGACGCG AAAGTTCGCG TAGTTACTTT GACTTGGAAG TGCGGGCTTT CGGTCGTGCT
TTGACTATTC AGCCTTGGAA AGAACCAGAG GTAGAACAGG GCACGTCTAA GACAGGTTGG
CGTAACATTT TTGGCGGTGG ATTTTTCGCA AGAATGCAAG ATCGCCGGCC GCGATGCCCA
CCTTCGGGCG CCAATCCCAG CAACCTGGCT GGTCAGCGAG TCGACACCGA AGAAGACGTC
CTTCATTTGC GCGAGGTACC AACGTTTGAC AATCTGCTTA AACCAAGCGA GTCGGAGCTG
CTCCTGACTT ACTTGACCGC TCCCATGATT CGTTTGCCGC TCGTGATGAA GCTGTTCGCA
GAACCGACAC GCGTGCGCGG GTTGACGCAC AACGACATTC AGAGTGTTTT AGACGCTGTG
ATTTTCGAGC CCGGACCTTG GGCGCCTCCA GAGCCGACGA CAATACCAAA GGAAATACCT
GCAACAACGC GAGCGCATAT GGCAACGCCG ACTGGTTTAC TCGTGCACGA GCTCATCAAC
TGTCCGGCGC CTCTCGCGGA TAGCGCCGAA GAGCTGTTGG CGTTGAGTTT GGAATTAGAT
ACCGGGCGTC ACGACGCACC GAGCGCACAA GGGGTGTTGT TCGCGTGTAG GATGCTCACT
AGACTGCTGG CGTTCGTAGA TAGCATCCTC AAGGCAAATT CGCTCGATTC GGAATTCACG
ACATCCACTG GCTTGACGCG AGGGCTGGAT TGTTCGTCGG CGTCTTTGTC GGCACTCGCG
GACGTCAAAG TCCGCATCAC CAAAGCACTC AAGGAGAGAT GTCGACCAGT GATTCAGCGA
TGGTTACGTC ACGCGGTGAA GAAAAATGCA ATTCGTGCGG CTTGCGCACT TCACGCACAT
CTCGCGTACA TGTACTACTG GACGCCTCGA GTAGAAATAG ACAAACAAGC GGCCCTCATT
ATTCTCACGG CACAGCAGTA CATTTTCGTG AACTACCACT TCACAGATAC ACCGGATGGA
AGCGAGACGA GCAATAGAAA GTTGGCGGAC ACGGGGAGTA TGAGCGGTGT CGATACTGGT
CTCGGCTTTG CGCCGACGGA GATTTTTGAC CTGTTCCAGC GAAAGCGACG GAGCATTCTG
GAATGGGCGG CGGCGAATCC GCAAGACGCT AATAGCATTC TGGAGAACGT CGTCGAGACG
CTCACTACTA AGAAGACAGA TGGCTCGAAC TCACCAACGA ACGTGTTGGA TGCAGAGGAC
ACGTTTAGCG CGAAGCGTGC GTGGCGCCGT GCACCAGGTG CGGGTGGTGC TGGTAGATTT
GTCAGCTACG TGTCGGATAT CCCATTCGCA GATCTCGTCG AGAGGGACAA ACGGGCCGCG
GCGCTTTTAC TTCAAAACCA TCGTCACTAC GGAGAGTGGC TTCTGGAGAC GACGCTGCAG
TCTTTCGATA TGGAAATTAA TGCTCAGCTC GGCGAATTCA CCGTTCGAAA GAACCGATTG
CGTCATTTAG AAGCTAGTAT TCGTGAAATG CCGGACTTCG TCGCAGCGCT GGGACCAGTG
CTCGCCAAGC GCGGGATTGA GCCGTCGCAC TTCCAAGAAC TCAGCAAAGG GGACGAGCTC
GTGATCGGCG ATTCGGGGGA TGTCGTTCAC TGTGCGGAGG TAAAGAATAC GGAGCATCGT
ACATGGATGC GTCTCGTGGG TCTCAATCAT GATATTCAGT TGTGGGATCG AGATCCAAGA
CCGCCGCTGA ACGAATTTAC GCGACCGCTC GTTAGCCGCG TGCAGACAAA GTCACAAGCA
TTAGAACTTT TGGGTCGTGC AGTCGGTGCT GGTGGTCTCG CAAGCTGCGA GCAATGGATT
CTTTCTATTC TTGATCCCAT CATCAAGGGT CCGGGCGCGG GATATCTGCA CGGTGTCGAG
CTCTTTTTAC CCAAAAATAC AATTCACGGG TCCATAGCTC GCCTCGCCGG CACTGGCGCG
CCTGAGAATG CGGTGAGCGA ACCTTGGAGT TTTTGGAAGC GAAAGACTGT GAGCACATCC
AATGACGATC AACGTCGCGT GGAAGAGGCC AAGCAGCTCT TGACGGAGGT AATGATCGTT
AGAGAACCTC CTCTGGTACA AGTTTTCCGC GTGGAATCGT ACGGTCGACG TTGGCGACGT
TCGTTAGTGT TCGCATCAAA TGCTTCTTTC TGCCTCGCAG ATATCGATCC CGATGGTCAA
CCAATTCTAA AGGATGAAAA TGAACGATTT TTACTATCTG CCGCGCGTCT AGGCTCGCGA
GCTCCGAGTT TAGTCGTCAC GAGACTCTTG AATGGCTTCA TCGGACGCGA GTCGTTTGTT
CCGGCGCGAC ATTTGCGTGG CTTACTGCCC CAGGTTTTGC TCGAAGAGTA CCAGTTTTGG
CGAAGCGAAA CTACTGGTGA TTTATTTGGT ACCGCGAATG AAAGTGCTTC GCACCCGAAT
ACTATGATCC ATGTGCGATT GGGCTATGAA AACAATTTAC TGCTCGATGG AGGATTTCCC
GTAGATACTC GCGCCATCGT GCGAAGACTT CCAACAATGG GTGCTGATGG GATTTCGCGT
CTCCCCCCGC TAGATGGTTC GTACGTCGAT GGTCAGCTTA CATTAGTAGA TCTTCTGTAT
GCGCCCTCGC TGGCGGGTGA ACAAGGGCGC GCTCTACTCG CGCTCACCAA GTCTCTCGCC
GCCGTCGAAG GATTGGCACA TTCGCTTGTC TGGTCATCAA AACCTGGGCT CGAAAGTGGG
CCGAATGAGC ACATCGCGGG TGTGTGCGAT ATATCGCGTG TTGAACTTCC AAGACTTGGT
GTTTCGTTCA ATGCGATAGA GTCAGGTGGT GCTATTCGCT TGGAGTGTGA GGAACACGCC
GGGTTGTTTT TAGCGCAAAG ACGCAACGCC GCGCTCGAGA TTCTGCTTAC CGGGCTGCCG
CATGCGCTTC TTCTCGAAGC TCGAGACGGT GCACTGTCTG TGCTATTATC GGCGAGCGCA
ATACCTCGAC CCGCGGCGAC TGCGTCGGAC TTCGCGCATG GAGCGAGCAC GGGACTCGGC
TCAGATCTCA TAATCGATCG TTGGAGCACG CAGTGGGTGG AAAACATAGG CAGCACAAAG
CACTATGTGT ATCCTGTGCA TCCTTCGCAT GCGACAATCA GTGCAACGTC GATGCCAGCC
GCTCTATATC TCGCAATCCT GCGATTCATA GGTCGCGATT ATCTCGCCGT GCTCAGACTT
GCGGAACTCT GTGTGTCAGA AGCGGTGCCC ACGCCAGAGG AAGCGCAGCT GTGGGACGCG
CTCGCGCATG CGTGCTCTTT GGATCCAAGC CCCGACGCCA CAGCAGTTCG CGTGAAGCTT
ATGTTGATGG TAAAAGACAC TCCAATCGAA CCCCGACTCG CGCTGAAGTG GCCACCCGTG
GCGGATGTAC TGAATTACGT CAATTGCTGG TCGTACATCA GCGCAAATTG TCGCCTCACT
GGACATGAAG AGCTCACCGC GATCGAAAGC TTGGTTACGG AGACCATGGC AAAGAAGAAA
ACTTCCAAGT TCAAGGCTGT CATTAGTTTC ATGTTCCCCG GCTCCAAGAA GGCGAGCGCG
GCGGATATGA GTATGGATCC AACGAAGACC CCAGAGTTGT TGAATCGTGC GGCGTACTTG
AGGCAGCTCG TGGGCGGAAT CGCGCGCATG AGCGCCGGAT CTACGCCGAC GAGTTCTGGA
TTCCCTCTAG TATATCCAGT GGCGCCAGAA TATGACAGAT TCGATACCGT GGACGACCTG
ACTTGCTTGT CCGATAAGGT AACGGGTGAA GGACCTCTGG GCAAACTCGC GGGTCTGATT
CAAACGTACA ATCGTCCAGA TGACGTGAGC GTCAACGGTA GTCAGGCGCT CGCTTTGATC
GGCCAATGGA TCGACAAGGA AAAGTTCAAT CTCGACGGTA ACTTGGGATT CCCGTTATTC
TTTGAGCTCA TGACTGGAAC CCTGCCGCTG CGCGTTCTTC CTGGTGATTC TCCCCATCGT
TGGGGTTGCG TTCTCCTTCG TTTTGTGCCA AACGAGCAAT CGATGAAGTG CGGTACACTC
ATGAGTACGT TACGCGTACT TGCTTCCAGC CCCAACATCG CGCAGGATTG TCCTAAAATG
GAAGAACAGA GCGTTGGTCA ACGGTTTAGT TCAATGTTTA CTTCGGACGG TGCGATAGGT
ATGTTACTTC GCAAAGTACG ACCGTACCTG CAACAGCGAA AAGAGGAATT CCCCGGTCGC
ATATCTTGGA CGGGCAAGGA GTACTTGACG CAGAGCATAT ACGAGTCTGA CGTGCAGACT
TGGGCGCGTA CGATTGCTAC TGGCGGCAGA TGGGCGCTGA CTCGATTCAT GAATACATCG
TGCTCTGAGC GCGAGTTCTC TCTCAAGTCG AGCGATAGCG CTGAGATGGC TGGTGGTACC
ACGTGGAGTG GGAAAACACT CCAGCACGTG GATCTGGAGA TGCTCGTGAA GCGCCCACTC
GGTGTCCTTG ATTTGAACAA GTATGTCGTT CCTCTTACGG AAGATAGAGC CGATGAACTC
GCGGAATCCG TCGCGCTCGG CGGTGTCGGT GGTTTGCCGT TTGACGTCAG CCAGCATCCA
GCGGCTACGA CGTCCGTCGC TAGACAGATG CTCAATCGTT TGCGCGCCGA CGTAACCGGA
GCATCAGACA AAGCTAAGGC GCGACGCATG AATGGTGATT TTAGTGCGGT GACGTTGAAG
GGTTTCTCCC CCAGTGATAC TGCCGCGATC GCATCTGAAG CGGGTGCAGG CCGCGCCGGT
CATGCCGCGG GGACCGCCCG TGCGGTTCTA CGTGAATTGC TAGAAGGAAT TCGTCGTGTC
ACCGCCGACG ATAAGGCGGC CGCATCTGAG GCTATGTCAG CTGCACTTGC TACGGCGAAT
GGTTTACGAA AAGACGTCGA GTCTGGGACA AAGATGTTAC GTCTCGGATT GCAGCGCCGC
GCAGGGGCTT GGGCCGAGGT GACATTTGAA GATTTGGTTG AGTGTTTGAC GTGCGAAGGC
GGCGAACACA TTTTATCCCG TTTGTCTCCA GATGTGACCG CAAAGCAAAT AGTTGATGCG
TTATATTTGA CGTCGCAAGC GTTGCTGCTC ACGCTTCGCG CGACGCTAGC CGCGAAAGCG
GCGGGTATGG TGGCTGACAC GCTCATAGCG TTGACGAATT GTGCAAAGAC AGCGGCTATT
GACGGTATTG ATTCTGTCGA TCGAGACTTG AGGCTCAAAG CTCTGGCAAT CGCGGAGTTG
ATTTCAGTGG AACGACATTA CTTCCAGGCG GACACTGCGA ATCCATTCAG CAAGGCAACG
CGCTTGATCT TTGATCCTCG TTTGCTAGTC TACGAGTACA GTCAGAGCAT CGTGCTGCGC
AAGCGCCAAG TCGAGCTGAC TCGAGATTTC GTTGACACGG CTGAGAACGG TGGCGGCCAG
TGCGCACAGA TGCTCATGGG TGAAGGTAAA ACAACTGTGG TGTGTCCATT ACTGGGCTTC
GTTCTCGCCA AGAGCAGCTC TGTCGTCGTC CAAGTAGTCC CTCACGCACT TCTCGAGTTT
TCGCGAAGCA CTTTGCGCAA GGCGTACTCA GGAGTCATCA GACGTTCGGT CGTTACCCTC
GAGTTCAACC GATACACATC CCTCGCCGAT GGCTCAATAC TCGCAACTTT AAGGAAAGCG
AAAGAGAATC GAGCCATCGT CATCTCGTCA CCGACAAGCA TCAAGTCTCT CGCGCTTAAA
TTTTTGGAAG TCTGCCACCT GCTCGATCAA TCGTACATCG CTGACCGTGA TGGCAAGACC
GGTGGTATGC TCGCACAAGG AGCTGATTTA GTCAGTCGCG TTTTTGGTGG CGCTAAAAGT
CGGAGTGAAC GTGTTGCTGA GGCTGGTGGG TTAACCATCG AAGAAATAAA CACATTGCGC
TCAGAGGCAA CGGCGGCCGC GGAAATTTTT GAAATTTTCC AAACTGGCAC GCTTATTTTG
GACGAAGTCG ACTTGATTCT TCATCCTTTG AAATCCGAAC TCAACTGGCC AATGGGTCGA
CGTATTCCGC TCGACTTCAG CAAGGCTGCG TCAGGCGACG GTTTGAGATG GAAGCTTCCA
TATTACATCC TTGACGCCAT CTTCGCAGTC ACGTACGGGC GCTCGACGGC TTCGGAGGCT
GAAGGTTCGC AAGAGGCAAT CGATCTGCTC GACGGTATAC GCGCCGCGGT TCGAACGGCG
ATCGAATCGA AACAATTGCA GACATCACCG CATCTAGCGT TGCTCGATAA GTCTTGGTAT
CAAGATATCT TGCGTCCGCT CTTGGCGCAG TGGACGAGCA TTTGGCTGCG TGCTCATGGC
GCGTTGCGTG GCATGTCAGA GGCACTCATT ATGGCGTATC TTCTGCGGGG ACCGGGCGCG
GCTGAAGCAA AAGCGCAGCT GAATCGTGAA TGTGAAGACG AGGCTGTGAA GATGACCAAC
CTCGCGCGAG ATTGGCTTTG CTCTCTGTTG CCTCACATGC TCGCGAAGAT TAATCGCGTA
ACGTTTGGTT TGTTACAGCC CGCGGACGTA TCGCTACTCG AAGAAGTGAC GGGATCGCGT
CTGCCGAAAA CCCGTAGACT TTTGGCAGTT CCTTTCGTGG GTAAGGATAT TCCGAGTCGA
ACGAATGAGT TCTCACATCC AGATGTCGTT CTCGGGTTGA CAATTTGCGC ATACCGCCTC
GAGGGTTTGC GCCGTGCCGA CTTCAAGCAA ATGCTTCGAA TCTTAATGGA AGAGTACGAT
AACGAGGCCG GTCCTCCTCA GAAGCGACCG GCTGCTCGTA GGTGGGCGGA CTGGGTGCGT
ACGACTGGGA AGCGAGTGCG AGGTGAAGCG CGCGCTGAGG CAGCGCTCGC CGCAAGACGA
CGCGGCGTTT CGCAGCAAAC CGGAACTACA AATGTCGGGG ATACCCGCGG ATTGGCGTGG
CTCGAAGACG ACACCGCAGG TCTGCAAGAC GACGAGGTTT GGCCTTTGCA GCTTATCGAT
CTCAGAGATC ACGATCAGGT TGAAGAAATC TACCAACTTC TGCGAAGGAC GCCGCCGTGT
ATCGAGTATT ATCTCGACGA ATTTGTCTTC CCTGAAACGG CGCAGCATCA AGATTTGAAA
CTGAGCGCCA CTGGCCAAGA ACTCGGTGGT GATGTGTTGT TCCCCAACCG ACTCGGTTTT
AGCGGTACCC CGAGCGACTT ACTGCCGCTC GAGCTCGGTG CTCCGCGATA CGAATCCGGC
TCTGACGCCA AGATGCTGGC ATACTTGACG GATCCGGAAA CCGCAGCGAC GATGCCTTTG
GACGTTGATT GGAACGTGCA GACTCTCCTC ACCCGTATTG CGACGGTAGA CCCACCGATA
CGTGCGCTCA TCGATACGGG TGCGCTTGTC ACGGGCATGT CCAACCTCGA TGTCTCGAGA
TTCTTACTCG AGAAGGGTCT GAAATGGGCT GAAGGATGTG TGTTCCTCGA CGAAAATGAT
CGACAGATGA TTCTCATGCG CAAAGGATGG GAAGTCATTC CTCTTCAACG CGTCGCCGCA
ATGCCTTTGA CGAAACGATT TACGTTCTAT GATCAAGTTC ATACGACCGG TATGGACATC
AAACAGGCCG CGGCGTCGCA GGCAGCCATC ACCCTCGGTA AAGACATGTG TCTTCGCGAT
TATGCGCAGG GTGCTTGGCG CATGCGCGGT TTGGGCAAGG GTCAGACGCT AAAGGTGATT
GTGATTCCGG AGGTTGCGAG ACTCATCTCG CACGAAGTTG CCAAAGGAGG TGGACAGATC
CCATCGACGC GCGATGCTGA ACTCGCGACA ATGTCGGCCG AAGCGGTCGA GAAGCGTAAG
TTACGAGACA TCACGGCTTG GCTCGTCGTG AACAGTATGC GAAGTGAGAA TATTCAAGCC
GGTTTGCTCG CAGAGCAACG CGCGGCGAAT ATCTGGAGAA AGCATTCATA CAGGAAGCTT
GTGTCATCGC CCACTGCACC GGGAACACCA AAGGCGGAGG GTATGATTCA CGCGTGCCTT
GACGTCTTCC GAGATCGCGT GTCGTACGTC GTGGCGAATG TCATTCCTCA AAGTGTATCG
CCTTCAGAGA AGTTAGCGAG GAGCGTACAG CAGTTCGGAC ATTTGCTCGA CGATAGTCCT
CGCGCAAAAG AGCAATTACG ATTGATCGTG AACGACGTGA AGCAAATGGA GGAGACGACG
AATGCGGCGT TGAAGCGGAT TCGCGCTGGC ACACACATGG CTCCAGATGC GGCGGAGTCC
GGTGATCAAG GTTTGGCTGG TGGTTTGGAG CACGCCTTGG ACCTCGAGCA AGTTCAAGAG
AATGAGGAAG AACAAGAAAA TGAGCAAGAA CAAGAGCAAG AGAAGGAAGA AGAACGTGAG
GAAGAAATCA TCGAGGAAGC CCCAGATGGC TTGAAGTACG TGCGCGACGA CGAGAAACAG
CCGTCGTGGA GTTACGAGTC TTTGTGCAAA AGTCCAAGCT CAATGGCTCA AGGTTTCTAC
CCACTCAAAA CGTTCAAGAT ACATAAAGGT GTCGGCAAGA CTACAGCACC TTTGGATTTC
CCGGAGTATC TATATCTATC GAAAAATTAC TTCCGTGAAG GATGGGCGCT CAACGGCCAT
CGACGCTTGA AGAATCTCAC CGTCGTGCTG GACTGGGTAC CGGATAGCAG CGCTTTATCG
ACCGACGCCG GTGGTAGCGT CGCGGCGGGT ACGTTCACAG CAGACCAGGA GTCAGCGCTA
CGAACTGCGT TCGATATGTT TGACTCTTCC GGAAACGGTC GTTTGACAGA GTCCGACCTC
CGAGAGGTTT TGCGTGAGAT CGACGCTTGC GTAGAAGACG ATGAGCTTCG TCTTCAACAG
TTGGCGCGTG AGGTTGCTGC AGAATCACGC ACGTTCAATA CCGGGGGTAA GTCATCTGGG
ACGTCTTTCG AGGAGTTAAA GCGCGTCCTT CGTTCGAAGA ACATTTACTC ATTGCAACGT
GGGCGATACT ACGTCGCGTT GTCTCTGGCT GAAGCCGAGT CGCTGCGCGT TGTCATGCAT
CTCACGAAGG GTAGTAAAGC GAGTGACGGC AAACTGATTC CATCCAAACT CACCGAAGCT
GCGTTGCGAA TCGGTGGCGG TACGCTCATC GATAACACGT CGAGATTTAC GCCCGCACAG
GTGTTTCAAG GTGCGACGGC CGAGCAATGT TTCAGATTTC TCGACAGTCA GCTGGATTTC
GAAGACCGCG AAGTTTCATT GCTGCTTCGT GCGCTCCAAG GGTCGTCGTG CGATAAGCGA
GCGCAATTTT TCGTGTCTGT TCGGGCGTGC AGGCGGCGCG CTAAAATTCC ATGGGCGTCG
TCGCCACTCG CCAAGGTTCT CACTACGGTG GATGAATTCC ATTTACTCGC GTCGAGAGCA
CTCGTCGCTC GTGTACGTAC GGCGCTCAAG TCCAAACGTA TGCGTCTTTT AGATGCGTTC
CGGGCTTTTG ACGGAGATAA CATTGGAAAG CTTACGTATG AAGCGTTATA TGGTGGCTTG
TCGTGGCTCG GCCTTTCGCT CACGTCGGCT CAAATGCTCG AGCTGGCTGC GCGTGTGGAC
AAGACGCACG ACGGCTACAT CAGTCGAGAA GAGTTCGAGG AAGCTTTCGG GCCAGATGAC
GATTGGGTTT TAGATGGCGA TTTCCACGAA CAGGTTCCCA TACCGACTCT AGCTCCCTCG
GTGCCAGTGC CGACGTCTCA GCTTGCGCAA GTGAGCTTGA TGGATATTCT CGGCGACGTT
GGAAGCGACG AGGGGTCGAA ATCGTCGAGT AAGGATCTTG TCGCTCCTGA GATACAGTCG
CCATCGAAGG TAATGAGCAC GGAAAATTTG AGCATCGATG ACGGTACAGG ACTTGGTGAC
TGGGGCGCGC TCGGAATGGC GGCGACTTTA CCAGTTGTTG CGAAGAAGAA CCCGCCTCTC
GCCAAGGAAC AGCCCAAGCC AAGCGCGAAC GATCCGTTTG GTTTGGGTCC TCCAGCCGGC
GCGTCGTCAA CCGCGTTCGA TGATTGGTTG TCTGGCGGTT CGGGCAACAT GCTTGGCGGT
GGCTCGTTGA ACGATCCCCT CGGCACTCAA GCAAACCCAA ATGCCGCAGA CGGCTGGGAA
GGCAACGACA TGATTCAGCT CGACACCTCG ACTGGTACCA TCGCCGCGCC GGCGCGAGAT
AAAATCCAGG CAAGAGCTCT TGTGGATCCG GCAAAGGAAG CTGAAGAGCG CAGGCACCGC
CAAGTCGATT CCAAGTTGAC GAGCTCGGCT CTTAATGGGT TCCAAGTAGC CGTCGTGCAC
CAAAAATCGT TCCAAAAAGT TTGGACGAGC GACGGCACTG GAAGCCGTTC GACTGGAAGC
GTGTGGCAAG CAAACTTGGG TAAATCGACG CTGAAGAAAA AGACTGAGCG CATTTCGCTC
GGTCACTACG CTTCGCCACA CTTTTCGCAA CCGAAACCTG CGCCATTTGC TGTCGAAGTG
ACCGACATAA ATGCGTTCGC GCTCACTGGT TCGAACTATA TGCCACGAGT GTTGGATCGC
TTGTTCCCAC ATCCGAGCCG GTTCAGACAG GTTTGGGGAC AAGAATGGAA AGGAACGGCG
GTGTATGCGT GGACTCCCGT GCCGCCACCT GGATGCGTAG CCATTGGTAT GGTGTTGACG
TCTATCCCGC AGGCACCGGA CGTGTCCACC ATGCGATGCA TTCCAGAGAA ATGGGCAATA
AAGCCAACGA TGGAACCCCA ACTCGTGTGG ACGAATGAAG GCAGTGGTGG TCGTCCGGCG
AGTATTTGGT TGGTGAACTC GCTCGGTTTG TTGTACGTGA ATGTCGGGCA CAATCCTCCT
CCGGCGAGAG ATGTTTGGGA CATCAAACGA GGTCAGCTCA CCGCGGACCA GGTGCTTCAG
TATCAGGCGT CGGCGCCGCA AACGTCCTAC GGCGGAACAG GAATTTCGAG CGACAGCTTT
GACGGACCGA CGTACAACCC TCCGCGGCCT CCACCCCCGA GCGGCAACAG CGGTGGATCA
AACGCTTTGC CCGCGGGATT AGGATCGCTG ATTTAG
 
Protein sequence
MSRAAGASAP STASSHVAFS GWLTKRGGKD LYGKATWRLR YFVLRRPRHA KERACVRYFA 
NEPADGAESA AKGEFAINGQ SAVRILDSDE TFAEFGLKAH KYAGKRMFAF QTMPGQGGKS
AALVVEAEDL TTMQRWVTAI SEAITRARSL ENEAASKTGL AVLNAQVDGD ENDVLRGKNF
RGTLELAGLC GPLGDGALPV HLDACWENLT FVDLMADSCL NEKLGKMRRS KYVRETLTSL
VDSAITIGRC LGASPIPMSG PESNAMSTIR RALDDMLALA RLMPVGTKDN NIFVSFLAAL
LDRVRQLRPG ELIMCPGGWA NEEGGAAVIY TIHRLPAHFV VSLTNCGDGV EFHPIQADPA
KDAFKYIHTI QLMDVPIAVA MDSSAWALLF RPLIFPQDAA SAKDMIYNKL LPMMNGRPLL
ASVTPKPLRT VKFAKQPRGR DSSGVFTALE AARCGLASLG CPPNRADALV NLGVRFVMLE
EAKRDLERVN MIGASALVTL QAACRTVAEY AANHADLCVE AEEASNGEPP KKTTMPRETL
AQMLRSVESV ILRTKALHGA TTLPPALQPP KVPITACALP LFNRLASEGN VDSLAGAART
PPILRPVELT LVPDAVAKLE DVPKALRNAL HCCTILANQA DLMPNSYALR VSLLVHLFVH
VLPAPLPITH PERHTKCFWA SVNTTYELQA EILRSLNLLL RHFSAAALSI SVTQTFDAAR
LLTIANFAAI ADATLRLKVA DAPSAFMQHY AGYAEGPISA FGFEIGPFAV EAEYLRFHCP
ERTIRLTQTL DYFHSLKKTI ESDHMIYRWE NEAAGGMSFG VGEARLIDQV CLQMGIGRED
PEYLAKYHSG ELREFADSYP EMSALRDIIF MFKLLMAPTS DDLPELARWK PIDAALDWSV
ARRESSRSYF DLEVRAFGRA LTIQPWKEPE VEQGTSKTGW RNIFGGGFFA RMQDRRPRCP
PSGANPSNLA GQRVDTEEDV LHLREVPTFD NLLKPSESEL LLTYLTAPMI RLPLVMKLFA
EPTRVRGLTH NDIQSVLDAV IFEPGPWAPP EPTTIPKEIP ATTRAHMATP TGLLVHELIN
CPAPLADSAE ELLALSLELD TGRHDAPSAQ GVLFACRMLT RLLAFVDSIL KANSLDSEFT
TSTGLTRGLD CSSASLSALA DVKVRITKAL KERCRPVIQR WLRHAVKKNA IRAACALHAH
LAYMYYWTPR VEIDKQAALI ILTAQQYIFV NYHFTDTPDG SETSNRKLAD TGSMSGVDTG
LGFAPTEIFD LFQRKRRSIL EWAAANPQDA NSILENVVET LTTKKTDGSN SPTNVLDAED
TFSAKRAWRR APGAGGAGRF VSYVSDIPFA DLVERDKRAA ALLLQNHRHY GEWLLETTLQ
SFDMEINAQL GEFTVRKNRL RHLEASIREM PDFVAALGPV LAKRGIEPSH FQELSKGDEL
VIGDSGDVVH CAEVKNTEHR TWMRLVGLNH DIQLWDRDPR PPLNEFTRPL VSRVQTKSQA
LELLGRAVGA GGLASCEQWI LSILDPIIKG PGAGYLHGVE LFLPKNTIHG SIARLAGTGA
PENAVSEPWS FWKRKTVSTS NDDQRRVEEA KQLLTEVMIV REPPLVQVFR VESYGRRWRR
SLVFASNASF CLADIDPDGQ PILKDENERF LLSAARLGSR APSLVVTRLL NGFIGRESFV
PARHLRGLLP QVLLEEYQFW RSETTGDLFG TANESASHPN TMIHVRLGYE NNLLLDGGFP
VDTRAIVRRL PTMGADGISR LPPLDGSYVD GQLTLVDLLY APSLAGEQGR ALLALTKSLA
AVEGLAHSLV WSSKPGLESG PNEHIAGVCD ISRVELPRLG VSFNAIESGG AIRLECEEHA
GLFLAQRRNA ALEILLTGLP HALLLEARDG ALSVLLSASA IPRPAATASD FAHGASTGLG
SDLIIDRWST QWVENIGSTK HYVYPVHPSH ATISATSMPA ALYLAILRFI GRDYLAVLRL
AELCVSEAVP TPEEAQLWDA LAHACSLDPS PDATAVRVKL MLMVKDTPIE PRLALKWPPV
ADVLNYVNCW SYISANCRLT GHEELTAIES LVTETMAKKK TSKFKAVISF MFPGSKKASA
ADMSMDPTKT PELLNRAAYL RQLVGGIARM SAGSTPTSSG FPLVYPVAPE YDRFDTVDDL
TCLSDKVTGE GPLGKLAGLI QTYNRPDDVS VNGSQALALI GQWIDKEKFN LDGNLGFPLF
FELMTGTLPL RVLPGDSPHR WGCVLLRFVP NEQSMKCGTL MSTLRVLASS PNIAQDCPKM
EEQSVGQRFS SMFTSDGAIG MLLRKVRPYL QQRKEEFPGR ISWTGKEYLT QSIYESDVQT
WARTIATGGR WALTRFMNTS CSEREFSLKS SDSAEMAGGT TWSGKTLQHV DLEMLVKRPL
GVLDLNKYVV PLTEDRADEL AESVALGGVG GLPFDVSQHP AATTSVARQM LNRLRADVTG
ASDKAKARRM NGDFSAVTLK GFSPSDTAAI ASEAGAGRAG HAAGTARAVL RELLEGIRRV
TADDKAAASE AMSAALATAN GLRKDVESGT KMLRLGLQRR AGAWAEVTFE DLVECLTCEG
GEHILSRLSP DVTAKQIVDA LYLTSQALLL TLRATLAAKA AGMVADTLIA LTNCAKTAAI
DGIDSVDRDL RLKALAIAEL ISVERHYFQA DTANPFSKAT RLIFDPRLLV YEYSQSIVLR
KRQVELTRDF VDTAENGGGQ CAQMLMGEGK TTVVCPLLGF VLAKSSSVVV QVVPHALLEF
SRSTLRKAYS GVIRRSVVTL EFNRYTSLAD GSILATLRKA KENRAIVISS PTSIKSLALK
FLEVCHLLDQ SYIADRDGKT GGMLAQGADL VSRVFGGAKS RSERVAEAGG LTIEEINTLR
SEATAAAEIF EIFQTGTLIL DEVDLILHPL KSELNWPMGR RIPLDFSKAA SGDGLRWKLP
YYILDAIFAV TYGRSTASEA EGSQEAIDLL DGIRAAVRTA IESKQLQTSP HLALLDKSWY
QDILRPLLAQ WTSIWLRAHG ALRGMSEALI MAYLLRGPGA AEAKAQLNRE CEDEAVKMTN
LARDWLCSLL PHMLAKINRV TFGLLQPADV SLLEEVTGSR LPKTRRLLAV PFVGKDIPSR
TNEFSHPDVV LGLTICAYRL EGLRRADFKQ MLRILMEEYD NEAGPPQKRP AARRWADWVR
TTGKRVRGEA RAEAALAARR RGVSQQTGTT NVGDTRGLAW LEDDTAGLQD DEVWPLQLID
LRDHDQVEEI YQLLRRTPPC IEYYLDEFVF PETAQHQDLK LSATGQELGG DVLFPNRLGF
SGTPSDLLPL ELGAPRYESG SDAKMLAYLT DPETAATMPL DVDWNVQTLL TRIATVDPPI
RALIDTGALV TGMSNLDVSR FLLEKGLKWA EGCVFLDEND RQMILMRKGW EVIPLQRVAA
MPLTKRFTFY DQVHTTGMDI KQAAASQAAI TLGKDMCLRD YAQGAWRMRG LGKGQTLKVI
VIPEVARLIS HEVAKGGGQI PSTRDAELAT MSAEAVEKRK LRDITAWLVV NSMRSENIQA
GLLAEQRAAN IWRKHSYRKL VSSPTAPGTP KAEGMIHACL DVFRDRVSYV VANVIPQSVS
PSEKLARSVQ QFGHLLDDSP RAKEQLRLIV NDVKQMEETT NAALKRIRAG THMAPDAAES
GDQGLAGGLE HALDLEQVQE NEEEQENEQE QEQEKEEERE EEIIEEAPDG LKYVRDDEKQ
PSWSYESLCK SPSSMAQGFY PLKTFKIHKG VGKTTAPLDF PEYLYLSKNY FREGWALNGH
RRLKNLTVVL DWVPDSSALS TDAGGSVAAG TFTADQESAL RTAFDMFDSS GNGRLTESDL
REVLREIDAC VEDDELRLQQ LAREVAAESR TFNTGGKSSG TSFEELKRVL RSKNIYSLQR
GRYYVALSLA EAESLRVVMH LTKGSKASDG KLIPSKLTEA ALRIGGGTLI DNTSRFTPAQ
VFQGATAEQC FRFLDSQLDF EDREVSLLLR ALQGSSCDKR AQFFVSVRAC RRRAKIPWAS
SPLAKVLTTV DEFHLLASRA LVARVRTALK SKRMRLLDAF RAFDGDNIGK LTYEALYGGL
SWLGLSLTSA QMLELAARVD KTHDGYISRE EFEEAFGPDD DWVLDGDFHE QVPIPTLAPS
VPVPTSQLAQ VSLMDILGDV GSDEGSKSSS KDLVAPEIQS PSKVMSTENL SIDDGTGLGD
WGALGMAATL PVVAKKNPPL AKEQPKPSAN DPFGLGPPAG ASSTAFDDWL SGGSGNMLGG
GSLNDPLGTQ ANPNAADGWE GNDMIQLDTS TGTIAAPARD KIQARALVDP AKEAEERRHR
QVDSKLTSSA LNGFQVAVVH QKSFQKVWTS DGTGSRSTGS VWQANLGKST LKKKTERISL
GHYASPHFSQ PKPAPFAVEV TDINAFALTG SNYMPRVLDR LFPHPSRFRQ VWGQEWKGTA
VYAWTPVPPP GCVAIGMVLT SIPQAPDVST MRCIPEKWAI KPTMEPQLVW TNEGSGGRPA
SIWLVNSLGL LYVNVGHNPP PARDVWDIKR GQLTADQVLQ YQASAPQTSY GGTGISSDSF
DGPTYNPPRP PPPSGNSGGS NALPAGLGSL I