Gene OSTLU_119561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119561 
SymbolPks1 
ID5000513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp555043 
End bp568623 
Gene Length13581 bp 
Protein Length4526 aa 
Translation table 
GC content63% 
IMG OID640415934 
Productpolyketide synthase 
Protein accessionXP_001416177 
Protein GI145342265 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00517] acyl carrier protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAACG ATGAGTTCGC GCATGTACTT AGTTCTGTCA TCGAGCTCGC AAATCGCGTT 
GACCTGACAC GACTTAGGCA ACATCACTCC ATAGATAACT TTGCCACACA CGTTCTCACT
TTTCGGTTCA TTCTGGACCT GTTGAGGCAA ACTTATCAGG AATACATCAA AGGAAGCATA
GAAAATAATT CGTTTGAGCA TGTCGAATTT ACTTTCAACA ATTGCATGGA TGGTACGCAC
GGAATGGCAA CATGCTTTAA ACCGAATAAA AAGCATAGTC ACCTTTGTTA CATTTTGGAT
ATTTCGCAGG CAGAAAGAAG TTGCAATGGT CGTACTGATG ACACAAAATG CGTGCACGAC
AAAATTACTA TGAACGATCG CGTTCTTTCC GCGGCTAAGA TCCCAGACAA CGCAGTGAAT
AAACAATTCA GTGCTAGTAT ACTCAATGAT GTGTATGGTT TCGATTCAAG TCTGTTCCAT
ATTTCTCAGT CGGAAACCCT CTGCATGGAC CCACAACAAC GACTATTACT TCTTGCAGCA
GCAGACTTAT ACCTGAGTAA CACACGTGTT TGTCAACACG ACGACTCCTT CTGCGTGCAC
GTCGGTGTAT CGTGGAATGA CTTTGAGGAA ATTCGCAGAA AGTATAGTGC AGACTTTTTC
TCCACTTATG ATTCGACGGG AACATCAATA AGTGTCGCAT CGGGTAGAAT TGCGTATTAT
TTTGATTTCA AAGGGTGTGC TGTGACGATT GATACGGCTT GTTCATCATC TTTGGTTGCG
TTGCATCAAT CCATGTACGC AGCTGGCTCG CTCCGTGAAT CAAGTGCACT TGTATGTGGC
ATCAACTTGA TTTTGACAAC CACGATGACA CATCGATTTC AGCTTGCTGG AATGATGTCT
TCAGAGCATC GATGTATGAC ATTTGATGCA CGCGCGGATG GCTACGTCCG GAGTGAGGCA
TGCAACATCG TACTCTTACA CTTCAATGGC TTCACAAAAA CTCATAGGAA TGTACTGCGG
CATGACGTTC ATATGTCATG CGTCGTCAAT CAAGATGGAA CATCAACTGG ACTTACGGCA
CCTAATGGAA CAGCTCAGCG GAACCTGATG AATGAGTTAC TATCGGACGT TTATAACCCT
TCAAAACACG TTGGTGCTAA AATACATGCC CACGGCACAG GTACGCCTCT TGGGGATCCG
ATAGAGGTGT TTGCTATCAA TTCTGCCCTC TCGTTTTCGT CACTGTGCAG GCTGTATGTG
TCAGAAAAGT CGATGCGAGG GCACACAGAA CCTGCTTCAG GTCTATGCAG CACCATACTT
GCTTTCGTAT CAGGTGCACA GTGCAAGCTA CATGGTCAAC CACATTTACA GCAGTTAAAC
AATCATATAC TAACATCGGT TGATGAAAGT ATTTTTCAGC GTTCACTGCA TTCCAGTTTT
CCCTTAACTT CTATGGCGAG TGTCACCACA GTAAGCTCCT TCGCGTTCCA AGGGACGAAC
GCAATGGCTT CACTGACATA TATCGACAAA TGCAGTATTA GAGTGATTCG CAACACGTCA
GGTCTGAGAT CTTTGGTCAA GTTGGAGACT GAATCCCCTG TAAGACTCCC GTACACATTT
TCTTGCCTTA GCATCGTCAA TGGGAAAATG ATACAGCTAA CTTCAATGAT TCACGACAAA
ACATTGAAGC GACTTTTGGA TCATCGTGTA GCTGATATGC ATCTTATGCC AGCCACAGGC
CATCTTGAAC TTGAAAGAGA GCTCACGAAG TGCGCTCTCG TAAGGAATGA ACCATATCTT
CTGACAAAGA TCGCATTTAT GCTGCCTCTC AAGCTTGATG AAAGCAATCA ACTTATCACA
GAGATGAACA CACATGGTCG CTTACGCGTT GGATGTGGCT CTGGCATATC CAATTACTGG
TGTAGTACTG CACGACTTCA AATACCAGTA GGACGTGCCT GGTGCACGGA CCGTACTTTT
TATCTGACTA GTAAACTCGT TTCTCGGCGT TTCAAAGGTG CGACCGCAAC AATAGCAGGG
ACGACGCCAA TTCTCGATTC ACAAAATGTC GTGAATCCGT GTATCCAGGA TGCGAGTCTA
CAATGCTCTG CATACGTCTT GAACACGTCA GCTTTCGCAT CTGATACAAC CTACATGGAC
TGCGTATCGT TACTTAAGCT ACCAACAACA ATTCAAGACG TCATAGTTGT TAATCAGACA
GGAAAAATAC TTGTTAGATC GACGCATGTT GACATGTTGC AGAAGACGCA GCATGTGACT
GAAAGTAATA ACTCTGTCAC AAGTAACTCT TCGAGTAACA ATAAAACGCA GACATGGATA
CATCGTTTAC TGTGTAAAGT ATCAACCGTA CGGATGAAAG CTGGCTTACG CACTCGCACA
TGCTTGGAAC TCGCGAGATT GTTGTACCAC GCGAGACAAC AAAGAGTGCT GATATCAAAT
ATGCAAAACT TGCGTGAAGT AGGCAGTTTA CATGCTCCAG AACTGATGCC CATCATGCGC
GAAATGGGTC GTAAGTCAAT GCAAGATATT AATTCGGCTA GTGCCTCTCT TTGCGCGGTT
GCGCTGCTGC AGGCTTGCGG CGCTCGTGCG CGCGTTTCCG GCGCCCACGT GCGTGCAAAC
GACATCGTCC ACGAGGACGC TTCGGCCGCG CCCGCAGTCT GTGGATCGTG CGTGCGCGCC
ACCGTGCAAT CTTCCGCGCA GGAACGCAGC GAGCCATTGG TGCTGAGTGT CGCGAGCGCC
CCGCACACCG CGCCAGCCCG AGCGTTGCGA TCGATCGCCG CCGGGGCGTC GATCGCCGGG
GCGCTGTGCG TTGAGCGGCT GCTTCCGGCG TTCGGTCGCC GGACGCCCGC CGCGTCGCGC
GACGGCGCGG CGACGACCAC GGCGTCGCCG GCACCGAAAG TCTCGAGCGT GTTCGTGACT
GGCGGTCTGG GCGCGTTGGG TCAGACTTCG GCGGCGTGGT ACGCGCATCG AGGCGTTCGC
CACGCCTACT TGACTGGTCG CCGCGGGCGC GGAGAGTCCC TTCTGCTGCG ATGGTCGTCG
CGCGGCGGTA CCGACGTGAG CGCATTGCAC GTGCTTCGCG TCGACGCGTG TGCGACCGAC
GAGGCGTCGT ACGGCGGTGC GCGCGCGCAC GCGCGCGTGC GCGCACACGG CGTGCTGCAC
GCGGGCGGCG TGCTACGCGA CTCTTTGACT GCTCGCCAAT CCGCGTCGGC CCATCGATAC
GTGTGGTCGT CCAAAGTAGT GTCCGCGCAC GCATTGCTCC GCCAGTGCGT CGGCGTCGAG
CGCGGCGCGC GTCACATCGC GCTGTTCTCG TCGGTGGCGG CGCTGCTGGG GAGTCCAGGG
CAATGCAACT ACAGCGCGGC CAACGCCGCG CTGGACGCGC GAGCCGCACA CGAGTGGTGC
CGCGGGTACG CGGTGCAATC CGTGCAGTGG GGCGCGTGGG ACGGCGCGGG AATGGCGCTG
ACAAACGCGT CTGTACTCAG GAACGCGCGC ACGACAGGCA TCGGCGTGCT CGATCCGCAA
GCCGGGCTCG GGGCCGTACA CGTCGCGCTT CAGGGCGCAT CAGTGACGGG TCGCTGGCGG
GCAATCCAAG GCGTCCTGGC CGTCGTCCCG TTCCAGTGGT CCGTGCTCGC GCGCGCGCAG
CGCTCGCGCG CGCTGATTGC GGAGTACCGC GCGGAGTCGG CGCGGTCGAG CGCTCCAGTC
GCCGTCCATC CGGGCGCCGA GGAAGACGCG GCGCGCGATG TGTCAGATGA GAGCGTGCGG
CAGATCGTGG CCTCCGCCGT GACGCGCGCG CTCGGCAGAG ACGTGTCGGT GGACGCGTCG
CTCATGGAAG AGGGCCTAGA CTCGCTCGCG GCGGTCGAGC TCGGAGGGGC GTTACAAAAG
GAAACGGGCG TGAACATGCC GGCCACGCTG GCATTTGACT ACCCGACGAT CACCGCGATA
TCTGGGTACC TGAAGGGTGC GATGCTGGCG CGATCGCGTG CGCGCCAATC GAGCGGCGCG
GCGTCCAGCG CCGCTCGCAC GCGCGCCGCG AGCGGCGCGG TCGCGCATTC GCGCGACTCG
ATGCCAGGTA TTGCATCCAA CGCGCGCGAA ATCTTGGGTC TTGCGCCCGT TGTTATGGCC
ATCAGACGGA GCGTGCGACG ACTCGACGCG GATGATCGAG AGCCAAAGCC TGCTCGACAT
GGCGGCTCGG CGTTGTTTTC ATCGTTCGTG AGGTGCGTCA TGGCAGAGCG AGCCTCTGAA
GATCGCTTCG ACACGCCATC GTTGGGCAGG CAGCCGAGCG CGACAGCGCG CGTGGTCTCC
TTGACGAAAG CCAGAGATTT CACTACACAA GCTCTCGACA GCATGCCGAA ATGCAACGAG
ATTGAGCTGA GCGTCGTCAG CGCTGTGTCG AGTAAGGCAT CAGGCGCGCA CGATGCCGTG
CGTGTGAACA ACATGCGACA TGCGTGCGGC GAAGTGTGTG ATCCACAAAG CTCAAAATCG
CACGAGTTCG TCGCTCGGCG TCTTGTGCAG TTTGGGTACT TCATCGAGCG CAGCTTGTTT
GACGGGGAGC AGTTCTCCGT AACGGCGCAC GAGGCGCGGT TGATGGACCC GCAGCAGCGC
GTGCTCCTGG AGAACGCGCT CGCCGCGACG AGTGCGACGC CGCACAGCGC GCCGAAGCCA
GCCACAGCCG TGTCCGTCGG CGCTTGGACG TCTCAGTACG CAGACATGTG CCGCGTGTAC
GAGCCAACGC CGGGGCCGTA CTCTGGCGTT GCTTCCGCCT TGAGCGTGCT GTGCGGTCGC
GTATCGTACA CGTTTGGTAT GCGTGGCCCG TCCGTGTGTG TCGACACCGC GTGCTCCTCC
TCGCTCGTGG CCGCGCATGT TGCGAGCTCT CACTTGCGCG ATGGATCGTG CGAGGACGCC
CTCGTCGCCG GCGTGAGCGT GAACGTCGGC ATCGGCACCT TCCTCGTGGC CACCGCCGCG
AGCATGCTAT CGTTCGATGG TCGGTGCAAA ACCTTGGATG CCAGCGCCGA CGGTTACGCC
AAGGGCGAGT GCTGCGTGGT CCTGCGCGTG TCGGTGGACG GGGCGAGCGC TGGAGCGCCC
AAAGCGCCCG ATCTTCCAAA CGCCGAGTCT CCCTCTCTCG CTCTTCTCGA ACAGACCGGT
GTGAACCAGG ATGGACGGTC GAGCTCGCTC ACCGCACCAA ACGGGCCGTC GCAGCAGGCA
CTCGTGGCCG ATGCGCTTCT CCGCTCGGGG CTGCACGGCG ACGCGCTCGG CAGGCTGGAG
ATGCACGGCA CGGGTACTTC GCTGGGAGAT CCGATAGAGG TCGGCGCCGC GCTTGAGGTT
CTCGCGCCGC CGACGCGCGC GGACCGGGCG CCGTCCAGGG GCCCCGCGCT CACGCTTGAG
GGCGTGAAGC CACGCGCTGG CCACTGCGAA CCAGGGGCTG GCGCGGTTGG GTTGCTGTTT
GCGATGGCAA ACTTGACGGA GCGAGCCGTC GCGTCCCTCG TCCACCTGCG CCAGCTCAAC
CCTCACGTCG AGGCCGCGAC GCGCCGAAAC GCACCGTCGC CGCATGCGCG CGCGCCGGTG
CTCGCGCGAC AAGGCATGCC GCGACCGGAT GACGGCGCGT CTGAGGCTGG AGCAGCGGAC
GCTGCTCGAG CGCGCGTGCG GCGATCCGGC GTGAGCGGCT TCGCGTTCCA GGGCACAAAC
GCGCACGCAA TCATGGCCCG GGGACCCGGA CAGGCGACTG CGCCCGGGTA CACGGGAAGC
ACGCACACGG CGGCGCGTGC GTGGGAGCGT CAGCGCCACT GGTGCGCGCC AGAACGGCAC
TGCCTCATCG AGTCCGTGCG CGCTCTGCAT CCTCGCGAGG GAGACTCGAG CGCGTTGGCG
CGGTCGTGCG TGCGCCTGTG TCGGTCCGCG TCGCTCGCTG GCATGCTTGA TCACGCCGTC
GGGCTCCGGA CGCTGCTACC AGGCACAGCG CACGTCGAGA TATGTCGAGC CATGGCGGGC
GCGCTGGCAG ACCAGCCGCT CGCCGACATC TCGCTGTCGC ACGTATCTTT TGCAGCGCCC
CTCGAGCTTC GAGCGCCGAC GACCGACGTG CTGTGCGACG TCACGCCGCA AGGCGTGGCC
CGCGTGGGCG TGCGCGTGTC GTCGTCCTCT GCCGCCGCGC CGTTCCTCGC GGGCGCGCTC
AGCCGCAGAA GCGCCTCAAG AATTACTGCG TCTCCGGCGC ACCTCTCGCG AGGACCGGCG
ATGCCCGCGC TGGACGTGCA GCCTTGTGTT CACGGCGCGA GCTGGCTGCG CTCGTCGCAG
CGACGAGGCC CGGGCGCGGA GCACCGCGTG TACGGCGACA TACGCGCGCG TCAGATGTCC
ACGCTGTCGG CTGCGGCGTA TCACTCGCCG CCCGCAGTGC TCGATGCCGC GCTGCACGCG
CTCAGCGCGT GCGCGCCGCC GCCGGGTAGC GATGCGACAG AAGAGAGCCC GCACGTCCCG
GCCACGATCG GCGGCGTCCG CACGCGAGGC GGGCACTCGG ACATATACCG CGCCCGGTTC
GCGCTCGCGA CGGCTCCGCC TGAGCGCGGG GAGCGGAACG CGTCCACGCG CAGATCTCGT
CACCACCTGT GGGACTGGTC GAGTCTTTCG GCGGCGCATC GAGGGTGCGT GGTGCACGCG
CTGGAGTCTC GACGACTCGG TGGCGCCGCG GTCGTGCGCT CGCGAGCACA ACAGCGCTCG
AGGGCACCGC CGCCGCGTCG CGCGGCCGCG CCGCGTGTGC TTTATGGAGT CGGGCAACAG
CGCGCGTGCG GCGCGGCGGA TGGCCAGGGC GCGCGTCGAC GCTCAACCCT CAACTCGCAG
TGCTTTGTGG ACGACGCGTC GCACCGCCGC GAGCTGATGG AGACCCACGC GTCACCATCG
ATCGCGCAAG GCAGCACGCG AAGCAGCACG CGCGCCTCTC TTTGCGCGGT TGCGCTGCTG
CAGGCTTGCG GCGCTCGTGC GCGCGTTTCC GGCGCCCACG TGCGTGCAAA CGACATCGTC
CACGAGGACG CTTCGGCCGC GCCCGCAGTC TGTGGATCGT GCGTGCGCGC CACCGTGCAG
TCTTCCGCGC AGGAACGCAG CGAGCCATTG GTGCTGAGTG TCGCGAGCGC CCCGCACACC
GCGCCAGCCC GAGCGTTGCG ATCGATCGCC GCCGGGGCGT CGATCGCCGG GGCGCTGTGC
GTTGAGCGTC TGCTTCCGAC GTTCGGTCGC CGGACGCCCG CCGCGTCGCG CGACGGCGCG
GCGACGACCA CGGCGTCGCC GGCACCGAAA GTCTCGAGCG TGTTCGTGAC TGGCGGTCTG
GGCGCGTTGG GTCAGACTTC GGCGGCGTGG TACGCGCATC GAGGCGTTCG CCACGCCTAC
TTGACTGGTC GCCGCGGGCG CGGAGAGTCC CTTCTGCTGC GATGGTCGTC GCGCGGCGGT
ACCGACGTGA GCGCATTGCA CGTGCTTCGC GTCGACGCGT GTGCGACCGA CGAGGCGTCG
TACGGCGGTG CGCGCGCGCA CGCGCGCGTG CGCGCACACG GCGTGCTGCA CGCGGGCGGC
GTGCTACGCG ACTCTTTGAC TGCTCGCCAA TCCGCGTCGG CCCATCGATA CGTGTGGTCG
TCCAAAGTAG TGTCCGCGCA CGCATTGCTC CGCCAGTGCG TCGGCGTCGA GCGCGGCGCG
CGTCACATCG CGCTGTTCTC GTCGGTGGCG GCGCTGCTGG GGAGTCCAGG GCAATGCAAC
TACAGCGCGG CCAACGCCGC GCTGGACGCG CGAGCCGCAC ACGAGTGGTG CCGCGGGTAC
GCGGTGCAAT CCGTGCAGTG GGGCGCGTGG GACGGCGCGG GAATGGCGCT GACAAACGCG
TCTGTACTCA GGAACGCGCG CACGACAGGC ATCGGCGTGC TCGATCCGCA AGCCGGGCTC
GGGGCCGTAC ACGTCGCGCT TCAGGGCGCA TCAGTGACGG GTCGCTGGCG GGCAATCCAA
GGCGTCCTGG CCGTCGTCCC GTTCCAGTGG TCCGTGCTCG CGCGCGCGCA GCGCTCGCGC
GCGCTGATTG CGGAGTACCG CGCGGAGTCG GCGCGGTCGA GCGCTCCAGT CGCCGTCCAT
CCGGGCGCCG AGGAAGACGC GGCGCGCGAT GTGTCAGATG AGAGCGTGCG GCAGATCGTG
GCCTCCGCCG TGACGCGCGC GCTCGGCAGA GACGTGTCGG TGGACGCGTC GCTCATGGAA
GAGGGCCTAG ACTCGCTCGC GGCGGTCGAG CTCGGAGGGG CGTTACAAAA GGAAACGGGC
GTGAACATGC CGGCCACGCT GGCATTTGAC TACCCGACGA TCACCGCGAT ATCTGGGTAC
CTGAAGGGTG CGATGCTGGC GCGATCGCGT GCGCGCCAAT CGAGCGGCGC GGCGTCCAGC
GCCGCTCGCA CGCGCGCCGC GAGCGGCGCG GTCGCGCATT CGCGACGTAC AACCATTACT
CAGCTCCCTC GTCTACGTCT ATCATACGTT CCGCATGAAG TCAGCCGACG AACGACTCTT
GGCATTAGCG GTCAATATTC TCACGAGCTT AGACACTTTA GCTCTGCGGG ACCTTCGACC
AGTAACTTTG AGGGAGTCCT CTTTGTTCGC ATTCATAACT CCACCAAGCC CTTACCCGTG
CATCCATATT TCATTGGCAA CTTTTTGCGT TGTGACGAAC TCGAGTTCGG CTTGTTTGGC
GCACCGCACA GCGAATGGAC TACTCTGGAC GCGCGGCAGA TTCTCTTGAT TGAGAGTGTA
CACACTATCC AGCTGAACAA CAGTTCAGAC GTTTCATTGC GCACTCTAGA GACGACGGCT
GTTCTTGTTG GAGCACAAGC GGTAGATTCA ACTGACGAGC ATAGCGTTGA GATACAGTAC
AACGCCTACT CTGGCGTTGC TTCCGCCTTG AGCGTGCTGT GCGGTCGCGT ATCGTACACG
TTTGGTATGC GTGGCCCGTC CGTGTGTGTC GACACCGCGT GCTCCTCCTC GCTCGTGGCC
GCGCATGTTG CGAGCTCTCA CTTGCGCGAT GGATCGTGCG AGGACGCCCT CGTCGCCGGC
GTGAGCGTGA ACGTCGGCAT CGGCACCTTC CTCGTGGCCA CCGCCGCGAG CATGCTATCG
TTCGATGGTC GGTGCAAAAC CTTGGATGCC AGCGCCGACG GTTACGCCAA GGGCGAGTGC
TGCGTGGTCC TGCGCGTGTC GGTGGACGGG GCGAGCGCTG GAGCGCCCAA AGCGCCCGAT
CTTCCAAACG CCGAGTCTCC CTCTCTCGCT CTTCTCGAAC AGACCGGTGT GAACCAGGAT
GGACGGTCGA GCTCGCTCAC CGCACCAAAC GGGCCGTCGC AGCAGGCACT CGTGGCCGAT
GCGCTTCTCC GCTCGGGGCT GCACGGCGAC GCGCTCGGCA GGCTGGAGAT GCACGGCACG
GGTACTTCGC TGGGAGATCC GATAGAGGTC GGCGCCGCGC TTGAGGTTCT CGCGCCGCCG
ACGCGCGCGG ACCGGGCGCC GTCCAGGGGC CCCGCGCTCA CGCTTGAGGG CGTGAAGCCA
CGCGCTGGCC ACTGCGAACC AGGGGCTGGC GCGGTTGGGT TGCTGTTTGC GATGGCAAAC
TTGACGGAGC GAGCCGTCGC GTCCCTCGTC CACCTGCGCC AGCTCAACCC TCACGTCGAG
GCCGCGACGC GCCGAAACGC ACCGTCGCCG CATGCGCGCG CGCCGGTGCT CGCGCGACAA
GGCATGCCGC GACCGGATGA CGGCGCGTCT GAGGCTGGAG CAGCGGACGC TGCTCGAGCG
CGCGTGCGGC GATCCGGCGT GAGCGGCTTC GCGTTCCAGG GCACAAACGC GCACGCAATC
ATGGCCCGGG GACCCGGACA GGCGACTGCG CCCGGGTACA CGGGAAGCAC GCACACGGCG
GCGCGTGCGT GGGAGCGTCA GCGCCACTGG TGCGCGCCAG AACGGCACTG CCTCATCGAG
TCCGTGCGCG CTCTTCATCC TCGCGAGGGA GACTCGAGCG CGTTGGCGCG GTCGTGCGTG
CGCCTGTGTC GGTCCGCGTC GCTCGCTGGC ATGCTTGATC ACGCCATCGG GCTCCGGACG
CTACTACCAG GCACAGCGCA CGTCGAGATA TGTCGAGCCA TGGCGGGCGC GCTGGCAGAC
CAGCCGCTCG CCGACATCTC GCTGTCGCAC GTATCTTTTG CAGCGCCCCT CGAGCTTCGA
GCGCCGACGA CCGACGTGCT GTGCGACGTC ACGCCGCAAG GCGTGGCCCG CGTGGGCGTG
CGCGTGTCGT CGTCCTCTGC CGCCGCGCCG TTCCTCGCGG GCGCGCTCAG CCGCAGAAGC
GCCTCAAGAA TTACTGCGTC TCCGGCGCAC CTCTCGCGAG GACCGGCGAT GCCCGCGCTG
GACGTGCAGC CTTGTGTTCA CGGCGCCATC AGACGGAGCG TGCGACGACT CGACGCGGAT
GATCGAGAGC CAAAGCCTGC TCGACATGGC GGCTCGGCGT TGTTTTCATC GTTCGTGAGG
TGCGTCATGG CAGAGCGAGC CTCTGAAGAT CGCTTCGACA CGCCATCGTT GGGCAGGCAG
CCGAGCGCGA CAGCGCGCGT GGTCTCCTTG ACGAAAGCCA GAGATTTCAC TACACAAGCT
CTCGACAGCA TGCCGAAATG CAACGAGATT GAGCTGAGCG TCGTCAGCGC TGTGTCGAGT
AAGGCATCAG GCGCGCACGA TGCTGTGCGT GTGAACAACA TGCGACATGC GTGCGGCGAA
GTGTGTGATC CACAAAGCTC AAAATCGCAC GAGTTCGTCG CTCGGCATCT TGTGCAGTTT
GGGTACTTCA TCGAGCGCAG CTTGTTTGAC GGGGAGCAGT TCTCCGTAAC GGCGCACGAG
GCGCGGTTGA TGGACCCGCA GCAGCGCGTG CTCCTGGAGA ACGCGCTCGC CGCGACGAGC
GCGACGCCGC ACAGCGCGCC GAAGCCAGCC ACAGCCGTGT CCGTCGGCGC TTGGACGTCT
CAGTACGCAG ACATGTGCCG CGTGTACGAG CCAACGCCGG GGCCGTACTC TGGCGTTGCT
TCCGCCTTGA GCGTGCTGTG CGGTCGCGTA TCGTACACGT TTGGTATGCG TGGCCCGTCC
GTGTGTGTCG ACACCGCGTG CTCCTCCTCG CTCGTGGCCG CGCATGTTGC GAGCTCTCAC
TTGCGCGATG GATCGTGCGA GGACGCCCTC GTCGCCGGCG TGAGCGTGAA CGTCGGCATC
GGCACCTTCC TCGTGGCCAC CGCCGCGAGC ATGCTATCGT TCGATGGTCG GTGCAAAACC
TTGGATGCCA GCGCCGACGG TTACGCCAAG GGCGAGTGCT GCGTGGTCCT GCGCGTGTCG
GTGGACGGGG CGAGCGCTGG AGCGCCCAAA GCGCCCGATC TTCCAAACGC CGAGTCTCCC
TCTCTCGCTC TTCTCGAACA GACCGGTGTG AACCAGGATG GACGGTCGAG CTCGCTCACC
GCACCAAACG GGCCGTCGCA GCAGGCACTC GTGGCCGATG CGCTTCTCCG CTCGGGGCTG
CACGGCGACG CGCTCGGCAG GCTGGAGATG CACGGCACAG GTACTTCGCT GGGAGATCCG
ATAGAGGTCG GCGCCGCGCT TGAGGTTCTC GCGCCGCCGA CGCGCGCGGA CCGGGCGCCG
TCCAGGCCCC CCGCGCTCAC GCTTGAGGGC GTGAAGCCAC GCGCTGGCCA CTGCGAACCA
GGGGCTGGCG CGGTTGGGTT GCTGTTTGCG ATGGCAAACT TGACGGAGCG AGCCGTCGCG
TCCCTCGTCC ACCTGCGCCA GCTCAACCCT CACGTCGAGG CCGCGACGCG CCGAAACGCA
CCGTCGCCGC ATGCGCGCGC GCCGGTGCTC GCGCGACAAG GCATGCCGCG ACCGGATGAC
GGCGCGTCTG AGGCTGGAGC AGCGGACGCT GCTCGAGCGC GCGTGCGGCG ATCCGGCGTG
AGCGGCTTCG CGTTCCAGGG CACAAACGCG CACGCAATCA TGGCCCGGGG ACCCGGACAG
GCGACTGCGC CCGGGTACAC GGGAAGCACG CACACGGCGG CGCGTGCGTG GGAGCGTCAG
CGCCACTGGT GCGCGCCAGA ACGGCACTGC CTCATCGAGT CCGTGCGCGC TCTTCATCCT
CGCGAGGGAG ACTCGAGCGC GTTGGCGCGG TCGTGCGTGC GCCTGTGTCG GTCCGCGTCG
CTCGCTGGCA TGCTTGATCA CGCCATCGGG CTCCGGACGC TACTACCAGG CACAGCGCAC
GTCGAGATAT GTCGAGCCAT GGCGGGCGCG CTGGCAGACC AGCCGCTCGC CGACATCTCG
CTGTCGCACG TATCTTTTGC AGCGCCCCTC GAGCTTCGAG CGCCGACGAC CGACGTGCTG
TGCGACGTCA CGCCGCAAGG CGTGGCCCGC GTGGGCGTGC GCGTGTCGTC GTCCTCTGCC
GCCGCGCCGT TCCTCGCGGG CGCGCTCAGC CGCAGAAGCG CCTCAAGAAT TACTGCGTCT
CCGGCGCACC TCTCGCGAGG ACCGGCGATG CCCGCGCTGG ACGTGCAGCC TTGTGTTCAC
GGCGCGAGCT GGCTGCGCTC GTCGCAGCGA CGAGGCCCGG GCGCGGAGCA CCGCGTGTAC
GGCGACATAC GCGCGCGTCA GATGTCCACG CTGTCGGCTG CGGCGTATCA CTCGCCGCCC
GCAGTGCTCG ATGCCGCGCT GCACGCGCTC AGCGCGTGCG CGCCGCCGCC GGGTAGCGAT
GCGACAGAAG AGAGCCCGCA CGTCCCGGCC ACGATCGGCG GCGTCCGCAC GCGAGGCGGG
CACTCGGACA TGTACCGCGC CCGGTTCGCG CTCGCGACGG CTCCGCCTGA GCGCGGGGAG
CGGAACGCGT CCACGCGCAG ATCTCGTCAC CACCTGTGGG ACTGGTCGAG TCTTTCGGCG
GCGCATCGAG GGTGCGTGGT GCACGCGCTG GAGTCTCGAC GACTCGGTGG CCTCACGACG
AATTCATTTT ATCATAGCTT TACGTCTACG CAGGCATATA AGTTGGCTCA TATTCCACGG
CCAGACATCC TATCGCAGTC TTTCACTTCA ATGAAGACTA TAAATGCTGT CGACAAAGCT
AGCAGCACAT TTACAGAACG CAGTAGTGAT GCGATGGAAA TTGTTAACGC CGTGCAACGC
GAGATTTTGG AAATTTTATC AACATTAACG ACCAGTCCTG TAAGTTTAGA TGATACACTT
GCGAGCCACG GATTGGACTC ACTGGGCATT GCATATTTTT TTATTCAAGT TGGCAAACGG
TTCAATGTTG AAGTGACTGT TGACAGTCTT GGCACGAACA CCACAGTCAG TAGTATAGTG
CATCAGATAT CACAGATGCT TAGTGCAAGG GTGATTTTCC TACACACCAA GGACCTGGTG
CGCGAGCGAA AAGCCGACAG CTCCTCAAGA AGAGGAAGGA GCACTGCGAT CACATCGATG
GTGTCTGCAT ACCGTCGACA CTTGCAGTCA ACTGAATTCG AGCCCAAAGC CTGTTTTGCA
AGAGGCGTAC TTGCTTGCGC AATATTTACT ATATTTATCA TTGCGTACGG ATATCACGCA
CACAGATTCG ACATGTTTTA G
 
Protein sequence
MHNDEFAHVL SSVIELANRV DLTRLRQHHS IDNFATHVLT FRFILDLLRQ TYQEYIKGSI 
ENNSFEHVEF TFNNCMDGTH GMATCFKPNK KHSHLCYILD ISQAERSCNG RTDDTKCVHD
KITMNDRVLS AAKIPDNAVN KQFSASILND VYGFDSSLFH ISQSETLCMD PQQRLLLLAA
ADLYLSNTRV CQHDDSFCVH VGVSWNDFEE IRRKYSADFF STYDSTGTSI SVASGRIAYY
FDFKGCAVTI DTACSSSLVA LHQSMYAAGS LRESSALVCG INLILTTTMT HRFQLAGMMS
SEHRCMTFDA RADGYVRSEA CNIVLLHFNG FTKTHRNVLR HDVHMSCVVN QDGTSTGLTA
PNGTAQRNLM NELLSDVYNP SKHVGAKIHA HGTGTPLGDP IEVFAINSAL SFSSLCRLYV
SEKSMRGHTE PASGLCSTIL AFVSGAQCKL HGQPHLQQLN NHILTSVDES IFQRSLHSSF
PLTSMASVTT VSSFAFQGTN AMASLTYIDK CSIRVIRNTS GLRSLVKLET ESPVRLPYTF
SCLSIVNGKM IQLTSMIHDK TLKRLLDHRV ADMHLMPATG HLELERELTK CALVRNEPYL
LTKIAFMLPL KLDESNQLIT EMNTHGRLRV GCGSGISNYW CSTARLQIPV GRAWCTDRTF
YLTSKLVSRR FKGATATIAG TTPILDSQNV VNPCIQDASL QCSAYVLNTS AFASDTTYMD
CVSLLKLPTT IQDVIVVNQT GKILVRSTHV DMLQKTQHVT ESNNSVTSNS SSNNKTQTWI
HRLLCKVSTV RMKAGLRTRT CLELARLLYH ARQQRVLISN MQNLREVGSL HAPELMPIMR
EMGRKSMQDI NSASASLCAV ALLQACGARA RVSGAHVRAN DIVHEDASAA PAVCGSCVRA
TVQSSAQERS EPLVLSVASA PHTAPARALR SIAAGASIAG ALCVERLLPA FGRRTPAASR
DGAATTTASP APKVSSVFVT GGLGALGQTS AAWYAHRGVR HAYLTGRRGR GESLLLRWSS
RGGTDVSALH VLRVDACATD EASYGGARAH ARVRAHGVLH AGGVLRDSLT ARQSASAHRY
VWSSKVVSAH ALLRQCVGVE RGARHIALFS SVAALLGSPG QCNYSAANAA LDARAAHEWC
RGYAVQSVQW GAWDGAGMAL TNASVLRNAR TTGIGVLDPQ AGLGAVHVAL QGASVTGRWR
AIQGVLAVVP FQWSVLARAQ RSRALIAEYR AESARSSAPV AVHPGAEEDA ARDVSDESVR
QIVASAVTRA LGRDVSVDAS LMEEGLDSLA AVELGGALQK ETGVNMPATL AFDYPTITAI
SGYLKGAMLA RSRARQSSGA ASSAARTRAA SGAVAHSRDS MPGIASNARE ILGLAPVVMA
IRRSVRRLDA DDREPKPARH GGSALFSSFV RCVMAERASE DRFDTPSLGR QPSATARVVS
LTKARDFTTQ ALDSMPKCNE IELSVVSAVS SKASGAHDAV RVNNMRHACG EVCDPQSSKS
HEFVARRLVQ FGYFIERSLF DGEQFSVTAH EARLMDPQQR VLLENALAAT SATPHSAPKP
ATAVSVGAWT SQYADMCRVY EPTPGPYSGV ASALSVLCGR VSYTFGMRGP SVCVDTACSS
SLVAAHVASS HLRDGSCEDA LVAGVSVNVG IGTFLVATAA SMLSFDGRCK TLDASADGYA
KGECCVVLRV SVDGASAGAP KAPDLPNAES PSLALLEQTG VNQDGRSSSL TAPNGPSQQA
LVADALLRSG LHGDALGRLE MHGTGTSLGD PIEVGAALEV LAPPTRADRA PSRGPALTLE
GVKPRAGHCE PGAGAVGLLF AMANLTERAV ASLVHLRQLN PHVEAATRRN APSPHARAPV
LARQGMPRPD DGASEAGAAD AARARVRRSG VSGFAFQGTN AHAIMARGPG QATAPGYTGS
THTAARAWER QRHWCAPERH CLIESVRALH PREGDSSALA RSCVRLCRSA SLAGMLDHAV
GLRTLLPGTA HVEICRAMAG ALADQPLADI SLSHVSFAAP LELRAPTTDV LCDVTPQGVA
RVGVRVSSSS AAAPFLAGAL SRRSASRITA SPAHLSRGPA MPALDVQPCV HGASWLRSSQ
RRGPGAEHRV YGDIRARQMS TLSAAAYHSP PAVLDAALHA LSACAPPPGS DATEESPHVP
ATIGGVRTRG GHSDIYRARF ALATAPPERG ERNASTRRSR HHLWDWSSLS AAHRGCVVHA
LESRRLGGAA VVRSRAQQRS RAPPPRRAAA PRVLYGVGQQ RACGAADGQG ARRRSTLNSQ
CFVDDASHRR ELMETHASPS IAQGSTRSST RASLCAVALL QACGARARVS GAHVRANDIV
HEDASAAPAV CGSCVRATVQ SSAQERSEPL VLSVASAPHT APARALRSIA AGASIAGALC
VERLLPTFGR RTPAASRDGA ATTTASPAPK VSSVFVTGGL GALGQTSAAW YAHRGVRHAY
LTGRRGRGES LLLRWSSRGG TDVSALHVLR VDACATDEAS YGGARAHARV RAHGVLHAGG
VLRDSLTARQ SASAHRYVWS SKVVSAHALL RQCVGVERGA RHIALFSSVA ALLGSPGQCN
YSAANAALDA RAAHEWCRGY AVQSVQWGAW DGAGMALTNA SVLRNARTTG IGVLDPQAGL
GAVHVALQGA SVTGRWRAIQ GVLAVVPFQW SVLARAQRSR ALIAEYRAES ARSSAPVAVH
PGAEEDAARD VSDESVRQIV ASAVTRALGR DVSVDASLME EGLDSLAAVE LGGALQKETG
VNMPATLAFD YPTITAISGY LKGAMLARSR ARQSSGAASS AARTRAASGA VAHSRRTTIT
QLPRLRLSYV PHEVSRRTTL GISGQYSHEL RHFSSAGPST SNFEGVLFVR IHNSTKPLPV
HPYFIGNFLR CDELEFGLFG APHSEWTTLD ARQILLIESV HTIQLNNSSD VSLRTLETTA
VLVGAQAVDS TDEHSVEIQY NAYSGVASAL SVLCGRVSYT FGMRGPSVCV DTACSSSLVA
AHVASSHLRD GSCEDALVAG VSVNVGIGTF LVATAASMLS FDGRCKTLDA SADGYAKGEC
CVVLRVSVDG ASAGAPKAPD LPNAESPSLA LLEQTGVNQD GRSSSLTAPN GPSQQALVAD
ALLRSGLHGD ALGRLEMHGT GTSLGDPIEV GAALEVLAPP TRADRAPSRG PALTLEGVKP
RAGHCEPGAG AVGLLFAMAN LTERAVASLV HLRQLNPHVE AATRRNAPSP HARAPVLARQ
GMPRPDDGAS EAGAADAARA RVRRSGVSGF AFQGTNAHAI MARGPGQATA PGYTGSTHTA
ARAWERQRHW CAPERHCLIE SVRALHPREG DSSALARSCV RLCRSASLAG MLDHAIGLRT
LLPGTAHVEI CRAMAGALAD QPLADISLSH VSFAAPLELR APTTDVLCDV TPQGVARVGV
RVSSSSAAAP FLAGALSRRS ASRITASPAH LSRGPAMPAL DVQPCVHGAI RRSVRRLDAD
DREPKPARHG GSALFSSFVR CVMAERASED RFDTPSLGRQ PSATARVVSL TKARDFTTQA
LDSMPKCNEI ELSVVSAVSS KASGAHDAVR VNNMRHACGE VCDPQSSKSH EFVARHLVQF
GYFIERSLFD GEQFSVTAHE ARLMDPQQRV LLENALAATS ATPHSAPKPA TAVSVGAWTS
QYADMCRVYE PTPGPYSGVA SALSVLCGRV SYTFGMRGPS VCVDTACSSS LVAAHVASSH
LRDGSCEDAL VAGVSVNVGI GTFLVATAAS MLSFDGRCKT LDASADGYAK GECCVVLRVS
VDGASAGAPK APDLPNAESP SLALLEQTGV NQDGRSSSLT APNGPSQQAL VADALLRSGL
HGDALGRLEM HGTGTSLGDP IEVGAALEVL APPTRADRAP SRPPALTLEG VKPRAGHCEP
GAGAVGLLFA MANLTERAVA SLVHLRQLNP HVEAATRRNA PSPHARAPVL ARQGMPRPDD
GASEAGAADA ARARVRRSGV SGFAFQGTNA HAIMARGPGQ ATAPGYTGST HTAARAWERQ
RHWCAPERHC LIESVRALHP REGDSSALAR SCVRLCRSAS LAGMLDHAIG LRTLLPGTAH
VEICRAMAGA LADQPLADIS LSHVSFAAPL ELRAPTTDVL CDVTPQGVAR VGVRVSSSSA
AAPFLAGALS RRSASRITAS PAHLSRGPAM PALDVQPCVH GASWLRSSQR RGPGAEHRVY
GDIRARQMST LSAAAYHSPP AVLDAALHAL SACAPPPGSD ATEESPHVPA TIGGVRTRGG
HSDMYRARFA LATAPPERGE RNASTRRSRH HLWDWSSLSA AHRGCVVHAL ESRRLGGLTT
NSFYHSFTST QAYKLAHIPR PDILSQSFTS MKTINAVDKA SSTFTERSSD AMEIVNAVQR
EILEILSTLT TSPVSLDDTL ASHGLDSLGI AYFFIQVGKR FNVEVTVDSL GTNTTVSSIV
HQISQMLSAR VIFLHTKDLV RERKADSSSR RGRSTAITSM VSAYRRHLQS TEFEPKACFA
RGVLACAIFT IFIIAYGYHA HRFDMF