Gene Cyan8802_3737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3737 
Symbol 
ID8393085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3825652 
End bp3837126 
Gene Length11475 bp 
Protein Length3824 aa 
Translation table11 
GC content40% 
IMG OID644981667 
Productamino acid adenylation domain protein 
Protein accessionYP_003139383 
Protein GI257061495 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0118377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTA TTGAAGAATT TTTAAATGAG TTAGCCAATT TAGAGGTTAA ACTTTGGCTT 
GAGGAGGGTC GAGTTCGTTG TCGAGCTACT AAAGGCAAAC TCACCCCCGA ATTACATCAT
CAATTGTCAG CACGAAAGCA AGAAATTCTC AAATTTTTAC AGAGTAATCA ACTTAATACT
TCTCCGATTA TTAGTCAAAT TAAATCGGTT TCTCGCTCTG AACCGTTACC CTTATCTTTT
GCTCAGCAGC GACTTTGGTT TCTTGACCAA CTAGAAGGAC AAAAGGCAGC TTATAATGAG
GTAGGAGTGA TTCGTCTAGA AGGAACGCTC AACGCTTCTC TTCTTAGCCA ATCTTTTGAG
GAAATTATCA GACGACATGA AATTCTTCGC ACTAACTTAC AAACTAAGGG AGAAGAAGTA
TTTCAGGTTA TTAGTGAGTC CAAAACCTTA GAAATTAAAA CAATTGATAT CGCTAGTGTC
CCAACAATAC AACAGCCTGA AGCGTTAAAA CAAATTGCAT CAAGAGAAAT TGAAATACCG
TTCAATTTAG AACAGGATTT ATTACTAAGA GTAGCATTAA TTCGACTCAC CGAAAAGAGC
CATATTTTAT TAGTGGTTAT GCACCACATT GTCTGTGATG GGTGGTCATT TGGGATTTTA
ATTGAGGAGT TATCCACGTT ATACAAAGCT TATCAAGAGG GAAAATCATC ACCTTTACCT
GAGTTAACTA TTCAATATGC TGATTTTGCG GTGTGGCAAC GAGAATGGTT AACCGGAGAA
ACCTTAGAAA AAAAGCTAGA TTATTGGACA GAAAAACTAT CAGGACTACC CCCTTTATTA
GAACTGCCAA CAGACTATCG TAGACCCCCG GTACAAAGTT TTAAAGGGTC ACATATTAGC
TTTAATCTCA ACTCAGAAAT CAGTGAAAAA CTGAAACAAC TAAGTCAACA AACAGGAGTC
ACCCTATTTA TGACCCTGTT AACAGCTTTC AGTATCTTAT TATCCCGGTA TTCAAGACAA
GAAGATATCG CTATTGGTTC ACCCATTGCT AACAGAAACC GAGGAGAAAT AGAGTCATTA
ATTGGCTTTT TTGTGAATAC TTTAGTGATG AGAGTGAATC TACAGGACAA TCCTACGGTA
GAGGAACTGC TCACACAAGT TAAAAAAAGT TGTTTAGAAG CTTACTCCCA TCAAGACGTA
CCCTTTGAGA AATTAGTCGA AGAATTGAAA CCCGAAAGAA ACATGAGTTA TAGTCCTATG
TTTCAGGTGA TGTTTGTTTT ACAGAATACG CCAAGCCAGG AATTAAGCTT ACCTGGATTA
ACTTTATCCT CAGTAGAAAT AGAATATAAC ATTGCTAAAT TTGACTTAAC GTTATCGATG
TCAGAAACAG ATAACGGACT AGCAGGAGAT TGGGAATATA ATACTGATTT ATTTGAAAGA
GAAACGATTG AGAGAATGAT AGGAAATTTC CAAGTGCTGT TAGAAGAAAT CGTCAGTAAT
CCCCAAGAGA AAATCGGAAA ACTGACCCTA TTAACCGAAG TAGAAAAAGA TAAAATTTTA
GTAGAATGGA ACGATACCAA ACGGGATTAT CCTAAGGATA AATGTATCCA TCAATTATTT
GACGAACAGG TAGAAAAGAC ACCTGATGCC ATAGCATTAG TTTATGAAGA AGAACAATTA
ACTTATCGAG AACTTAATCA AAAATCAAAT CAATTGGCAC ATTATTTACA GAAATTAGGT
GTTAAACCCG ATACCTTGGT GGGAATATGT GTTGATCGTT CTTTGGAGAT GATTATTGGA
TTGTTGGGGA TATTAAAAGC GGGAGGAGCT TATGTTCCCA TTGACCCGAA TTATCCAGCA
CAAAGAATTG AATATATGTT GTCGGATGCG AGAGTTAATT TATTGCTGAC TCAAGAGCGT
TTTAAGAGCT TGTTCTTGAA ATTTTCTGAG CAGATACTAT TCTGGGAGCA TGACCAGACA
AATTGGCTAG AAGAAATTCA GACAAATCCT ATCAATCAAA CTGAACTAAA CCACTTAGCA
TATATTAACT ATACTTCAGG GTCAACGGGT CAACCAAAAG GAGTGATGAT TCCCCATAAA
GGAGTTGTTC GTTTACTAAT CAATCCTAAT TATGTTGACT TAGATTCTCA GACAAACTTA
TTACATTTAT CACCAATTGC GTTTGATGCG TCAACGTTTG AAATTTGGGG AGCATTACTA
CATGGAGGAA AATGTGTTCT TTTTTCAGAG AAAATACCTA CAGCCTTAGC TTTGAAACAA
ACGATTGAAA AACACAAAAT TAATACTTTA TGGCTAACAT CAGCCTTGTT TAACTCTGTC
GTTGATGAGT TACCAGATAC TTTAGGAGAG ATTAAACAGC TATTAACTGG AGGAGAAGCT
TTATCTGTTA ATCACATCAA TCAAGCTCTA AAAGTCTTAC CTTCAACTCA ATTAATTGAC
GGTTATGGAC CGACAGAAAG TACCACGTTT ACTTGTTGTT ATCTAATTCC TCCATCTCTA
GCTTCGGATA TACTGGCAAT TCCTATCGGA AAACCCATTA GTAATACTCA AGTCTATATC
CTAGACACTA ACCTGCAACC AGTTCCCATA GGAGTAGCAG GAGAACTACA TATCGGAGGG
GACGGACTCG CCAGAGGCTA CCTCAACCGA CCCGAATTAA CTGCCGAGAA ATTCATCATT
AACCCCTTCG ACCCCACAGG AGAAAGCCGA CTCTACAAAA CCGGAGATTT ATGCCGATAT
TTACGGGACG GAAATCTAGA ATACATCGGA AGAATCGACC ACCAAGTCAA AATACGAGGA
TTTAGAATCG AACTAGGAGA AATTGAATCG ATACTCAGTA TTCATCCTGA CATACAAGAA
AGCGTAGTTA TCGCCAGAGA AGACCAACCA GGAAACAAAC GCTTAGTAGC CTATCTTGTC
TCAAAACTCA TCCCAGAACG CTTACATTAT GTTAAAGCTT GTCAACTCGA AATCAAGGGC
AAAACCTACC CAGTGGAAAG CGAAGACTTC TCGGTTGGGG GAATGGGATT AGGAAAAATT
CCGGTTGAAT TAGGCATTAA TGACCCCGTA AAAGTCCAGA TTCAGTTACC CGGACAAGAA
AGTCCTAGTT GGCTATCAGG GAAAGTAATC TGGTATCGAG ATAATCATGG CGGAATCGAA
TGGAGATTAA CCTCACAAGA ATCTGAAAGA GTTAAACAAA GCTATCAATT AATCAAAGAA
GAATTAGGAG TAATAGCGAG TTTACAGCGC AGTTTAAGCC AAGGATTGCG GGAATATCTC
AAAGAGAAAT TACCCGATTA TATGGTCCCT AGTGCCTTTG TGCTGTTAGA AAAACTGCCT
TTAACCCCTA ACGGCAAAAT TGACCGTAAA GCCTTACCTG CACCGGATTG GAGCAATAGA
GGACAAGAAG ACTATATCGC ACCGCAAACA CCTAATCAAG AAATCCTCGC GAGTATTTGG
CAAAATGTTT TGCCAAAGGA GAAAATTGGA GTCAAAGACA ACTTCTTTGA ATTAGGGGGA
CATTCCCTAC TAGCGACACA AGTTATCTCG CGCATCAGAG AAACCTTTAG CCTAGACTTA
CCCATTAGGA GCCTATTTGA AAATCCGACT CTAGAGGAAC TAGCCCAAGA GATAGAAAAC
AGCCAAAAAG TTGAGATTAA CCCGATTATC CCGATCAATC GAGCAGAAAA CCTGACCCTA
TCCTTTGCTC AGCAAAGACT GTGGTTTTTG GACCAACTCG AAGGAGAAAA CGCCACCTAT
AATATTCCTG GAGCCTTGAA ACTCGAAGGA AGCCTGAAGA TTGAAGCCCT AGAAAAAAGC
CTAAACCAGA TAATTAAAAG ACACGAAAGC CTGAGAACGC GATTTAAGAC GGTTAACGGA
GAGGCTGTGC AGATTATCGA CCCAGAAGGT CAAATTAACC TAAAAATGAT CACACTAGAA
AGCTTAGACG AATCTGAAAA AAAGAGTCAA ACCCAGAGTC TAATTAAACA AGAAGCAGAA
AAACCCTTTA ACTTGAGTCA AGACAGACTA ATACGAGCCA GCTTAATCAA ATTAGGCAGC
GAAAGCCACA TCTTGCTGAT TACCATGCAC CATATCATCT CTGATGGCTG GTCAATGGGA
GTCTTCGTGC AAGAATTAAC TAGCTGCTAC TCAGGCTATG GTCAAGGAAA AGAAACCCAA
TTAAAGCCCT TAAGCATACA ATACGCCGAC TTTGCCGTGT GGCAAAGAGA ATGGCTCTGC
GGAGAAAACT TACAAAAGCA ACTCAATTAT TGGAAAAAGA AACTAACAGG ATTACCGCCC
TTAATAGAAC TCCCAACAGA CCATCCAAGA CCCCCTATTC AAAGTTTTCA AGGGTCACAT
ATTAGCTTTA ATCTTACCCG AGAAATGAGT GAGAAGCTTA AACAAATGAG TCAACAAACA
GGAGTCACCC TATTTATGAC CCTGTTAACA GCTTTTAGTA TCTTATTATC GAGGTATTCA
AGACAAGAAG ACATTGCCAT TGGTTCACCG ATTGCGAACA GAAACCGAGC CGAAATAGAG
CCATTAATCG GGTTTTTTGT CAACACTTTA GTAATGAGAG TAAATCTAGA AGACAATCCT
ACAGTAGAAG AACTGCTGAA ACAGGTAAGA AAAACTTGTT TAGAAGCTTA CTCCCATCAA
GACGTACCCT TTGAGAAATT AGTCGAAGAA GTCAAACCCG AAAGAAACAT GAGTCACAGT
CCTCTGTTTC AGGTGATGTT TGTGCTGCAA AATGCGCCAG ATGAGGAATT AAGCTTACCT
GGATTAACAG TATCCCCTGT AGAAATCGAA TACAATATTG CCAAATTTGA CTTAACGTTA
TCCATGGCAG AAACGGAGAA AGGACTAGCC GGAGATTGGG AATATAATAC GGACTTATTT
GAGAGAAAGA CCATCGAAAG GATGATAGGA CATTTCCAAG TCCTGTTAGA AGGAATAGTT
AATCATCCTC AAGAGAAAAT TGGTCAATTA CCCCTATTAA CTGAAGCAGA AAAACAACAA
ATCTTAGGAG AATGGAATGA TACCAAAGCA GATTATCCGA AAGAGAAATG TATTCATCAA
TTATTTGAAG AACAAGTAGA AAGAACCCCT GATGCAGTAG CCGTGGTTTA TGAAGACCAA
CAATTAACCT ATCTTCAACT CAATCAAAAA GCCAATCAGT TAGCACATTA TCTAATTAAA
TTCGGAGTTA AACCTGATAC CTTAGTCGGG ATATGCGTTG AGCGTTCATT GGAGATGGTG
ATGGGGTTAT TGGGGATATT AAAAGCGGGA GGAGCCTATG TTCCTATTGA CCCCAATTAT
CCAGCAGAAC GCATTGAATA TATGTTGAAG GATTCTGCCG TTTCAATTTT ATTGACTCAG
GAAAGATTAG TCAAAGAGTT ACCTGAGACT CAAGCTCAGA TGATTTGTTT GGATAACGAT
TGGTTGACCA TTTCTCAAGA AAATCCTAAT AACTGTTTAT CTCAAGTTAA TGCCAAAAAT
CTGGCTTATA TCATTTATAC TTCAGGTTCA ACAGGCAACC CTAAAGGGGT GATGATTGAG
CATAATTCTT TAGTTAATTT AGCGATTAAT TTAAAACAAA AAATATACAG TCAAACAAAA
CAGCAAAAAA TTACTCTGAA CGGTTCCCTA TCCTTTGATA CCAGTGTGAA ACAATGGATA
CAACTTGCCT ATGGTCATAG TGTCTACATT ATTCCAGAAG ATATTAGACT AGACTCAGTA
ACTTTCTTAA AATATCTCCG AGACTATCGG ATTCAAGTGT TAGATTGTAC CCCAGGACAA
TTAAGAGGAA TGATTGAGTC AGATTTACTC ACCACGGAAA GCTATTTATC TAAAATTCTT
TTAGGAGGAG AATCAATAGA TGTATCTACT TGGGGCAATC TTAGTCAAAA CTCTCATATT
CAATTCTATA ACTTATACGG ACCAACAGAA AATAGTGTAG ACACGACAAT AAGTAAAATT
GAAGTGAATC AACCTCTACC TAATATTGGT AAACCTATTA ATAACGTCCA AGTTTATATC
CTAGACACTA ACCTGCAACC AGTTCCCATA GGAGTAGCAG GAGAACTACA TATCGGAGGG
GACGGACTCG CCAGAGGCTA CCTCAACCGA CCCGAATTAA CTGCCGAGAA ATTCATCGTT
AACCCCTTCG ACCCCACAGG AGAAAGCCAA CTCTACAAAA CCGGAGATTT ATGCCGATAT
TTACGGGACG GAAATCTAGA ATACATCGGA AGAATCGACC ACCAAGTCAA AATACGAGGA
TTTAGAATCG AACTAGGAGA AATTGAATCG ATACTCAGTA TTCATCCTGA CATACAAGAA
AGCGTAGTTA TCGCCAGAGA AGACCAACCA GGAAACAAAC GCTTAGTAGC CTATCTTGTC
TCAAAACTCA TCCCAGAACG CTTACATTAT GTTAAAGCTT GTCAACTCGA AATCAAGGGC
AAAACCTACC CAGTGGAAAG CGAAGACTTC TCGGTTGGGG GAATGGGATT AGGAAAAATT
CCGGTTGAAT TAGGCATTAA TGACCCCGTA AAAGTCCAGA TTCAGTTACC CGGACAAGAA
AGTCCTAGTT GGCTATCAGG GAAAGTAATC TGGTATCGAG ATAATCATGG CGGAATCGAA
TGGAGATTAA CCTCACAAGA ATCTGAAAGA GTTAAACAAA GCTATCAATT AATCAAAGAA
GAATTAGGAG TAATAGCGAG TTTACAGCGC AGTTTAAGCC AAGGATTGCG GGAATATCTC
AAAGAGAAAT TACCCGATTA TATGGTCCCT AGTGCCTTTG TGCTGTTAGA AAAACTGCCT
TTAACCCCTA ACGGCAAAAT TGACCGTAAA GCCTTACCTG CACCGGATTG GAGCAATAGA
GGACAAGAAG ACTATATCGC ACCGCAAACA CCTAATCAAG AAATCCTCGC GAGTATTTGG
CAAAATGTTT TGCCAAAGGA GAAAATTGGA GTCAAAGACA ACTTCTTTGA ATTAGGGGGA
CATTCCCTAC TAGCGACACA AGTTATCTCG CGCATCAGAG AAACCTTTAG CCTAGACTTA
CCCATTAGGA GCGTATTTGA AAATCCGACC CTAGAGGAAC TAGCCCAAGA GATAGAAAAC
AGCCAAAAAG TTGAGATTAA CCCGATTATC CCCATCAATC GAGCAGAAAA CCTGACCCTA
TCCTTTGCTC AGCAAAGACT GTGGTTTTTA GACCAACTCG AAGGAGAAAA CGCCACCTAT
AATATTCCTG GAGCCTTGAA ACTCGAAGGA AGCCTGAAGA TTGAAGCCCT AGAAAAAAGC
CTAAACCAGA TAATTAAAAG ACACGAAAGC CTGAGAACGC GATTTAAGAC GGTTAACGGA
GAGGCTGTGC AGATTATCGA CCCAGAAGGT CAAATTAACC TAAAAATGAT CACACTAGAA
AGCTTAGACG AATCTGAAAA AAAGAGTCAA ACCCAGAGTC TAATTAAACA AGAAGCAGAA
AAACCCTTTA ACTTGAGTCA AGACAGACTA ATACGAGCCA GCTTAATCAA ATTAGGCAGC
GAAAGCCACA TCTTGCTGAT TACCATGCAC CATATCATCT CTGATGGCTG GTCAATGGGA
GTCTTCGTGC AAGAATTAAC TAGCTGCTAC TCAGGCTATG GTCAAGGAAA AGAAACCCAA
TTAAAGCCCT TAAGCATACA ATACGCCGAC TTTGCCGTGT GGCAAAGAGA ATGGCTCTGC
GGAGAAAACT TACAAAAGCA ACTCAATTAT TGGAAAAAGA AACTAACAGG ATTACCGCCC
TTAATAGAAC TCCCAACAGA CCATCCAAGA CCCCCTATTC AAAGTTTTCA AGGGTCACAT
ATTAGCTTTA ATCTTACCCG AGAAATGAGT GAGAAGCTTA AACAAATGAG TCAACAAACA
GGAGTCACCC TATTTATGAC CCTGTTAACA GCTTTTAGTA TCTTATTATC GAGGTATTCA
AGACAAGAAG ACATCGCCAT TGGTTCACCG ATTGCGAACA GAAACCGAGC CGAAATAGAG
CCATTAATCG GGTTTTTTGT CAACACTTTA GTAATGAGAG TAAATCTAGA AGACAATCCT
ACAGTAGAAG AACTGCTGAA ACAGGTAAGA AAAACTTGTT TAGAAGCTTA CTCCCATCAA
GACGTACCCT TTGAGAAATT AGTCGAAGAA GTCAAACCCG AAAGAAACAT GAGTCACAGT
CCTCTGTTTC AGGTGATGTT TGTGCTGCAA AATGCGCCAG ATGAGGAATT AAGCTTACCT
GGATTAACAG TATCCCCTGT AGAAATCGAA TACAATATTG CCAAATTTGA CTTAACGTTA
TCCATGGCAG AAACGGAGAA AGGACTAGCC GGAGATTGGG AATATAATAC GGACTTATTT
GAGAGAAAGA CCATCGAAAG GATGATAGGA CATTTCCAAG TCCTGTTAGA AGGAATAGTT
AATCATCCTC AAGAGAAAAT TGGTCAATTA CCCCTATTAA CTGAAGCAGA AAAACAACAA
ATCTTAGGAG AATGGAATGA TACCAAAGCA GATTATCCGA AAGAGAAATG TATTCATCAA
TTATTTGAAG AACAAGTAGA AAGAACCCCT GATGCAGTAG CCGTGGTTTA TGAAGACCAA
CAATTAACCT ATCTTCAACT CAATCAAAAA GCCAATCAGT TAGCACATTA TCTAATTAAA
TTCGGAGTTA AACCTGATAC CTTAGTCGGG ATATGCGTTG AGCGTTCATT GGAGATGGTG
ATGGGGTTAT TGGGGATATT AAAAGCGGGA GGAGCCTATG TTCCTATTGA CCCCAATTAT
CCAGCAGAAC GCATTGAATA TATGTTGAAG GATTCTGCCG TTTCAATTTT ATTGACTCAG
GAAAGATTAG TCAAAGAGTT ACCTGAGACT CAAGCTCAGA TGATTTGTTT GGATAACGAT
TGGTTGACCA TTTCTCAAGA AAATCCTAAT AACTGTTTAT CTCAAGTTAA TGCCAAAAAT
CTGGCTTATA TCATTTATAC TTCAGGTTCA ACAGGCAACC CTAAAGGGGT GATGATTGAG
CATAATTCTT TAGTTAATTT CAGAACAACA GCCATTGAAA AATATCAATT TACTGTTGAA
GATCGAATTT TACAATTTTC TTCTATTAGT TTTGATGCAG CTAGTGAAGA AATTTATCCA
TGTTTAACAA TAGGAGCAAC TTTAGTGTTG CGAACTCAAG AAATACTCAC TGGTGGAATT
GGATTATTAG AACAATGTAG AAAGTGTCAA CTAACTATAC TAGATTTGCC AACAGCTTTT
TGGTATCAAA TTGTTTCTGA ATTATCTATG ACAAAGAATC GCTTTCCCGA AACATTGCGA
TTGATCATTG TTGGAGGAGA AGCAGTTACT ACCGAGCACA TACAAACTTG GATAGCTTGG
ACTGAAAATA CTCCTCAATT AGTTAATAGT TATGGCCCCA CTGAAGCTAC AGTCGTTTCT
ACAATATGGT TATTAGAATC TTCAGAAGTA GGCTTATCAG TTCCTATTGG CCGTCCTTTA
GCCAATATTC AGACTTATAT TCTAGACCCC AACCTCAAAC CAGTTCCCAT AGGAGTCGCA
GGAGAACTGC ACATCGGAGG AGATGGACTT GCCAGAGGCT ACCTCAACCG ACCCGAATTA
ACCGCCGAGA AATTCATCCC TAATCCCTTC GACCCGACCC GACAGTCAAA ACTCTACAAA
AGCGGAGACT TATGCCGCTA TTCACCCGAT GGCAACATCG AATACATCGG AAGAATCGAC
CATCAAGTCA AAATCCGAGG ATTTAGAATC GAACTTGGAG AAATCGAATC CTTACTCAGC
ACCCATCCTG AGATTCGAGA AAGCGTGGTT ATCCTCAGAG AACACGAACC TGGCAATAAG
AGATTAGTCG CCTATCTTGT TTCAAACCTA ATCCCAGAAC GCTTACATTA TGCTAAACCT
TGTCAACTCG AAATTAACGG CAGAAGTTAC TCAGTCCAAA GCGAAGACTT CTCGGTTGGG
GGGATGGGAT TAGCCAAAAT TCCGGTTGAA TTAGGCATTA ATGACCCTGT AAAAGTCCAG
ATTCAGTTAC CCGGACAAGA AACTCCTAGG TGGCTATCAG GGAAAGTAAT CTGGTATCGA
GACAGACATA CAGGAATCGA ATGGACATTA ACTCCACCAG AGAAAAAAAT AGTCGAACAA
AGCTATCAAT TACTCAAAGA AGAGTTAGGG ATAATGGCTA CCCTACAGCG CAGTTTAAGC
CAAGGATTGC GGCGATATCT CAAAGAAAAG CTACCTGACT ACATGATACC CAGTGCCTTT
GTCTTGTTAG AAAAACTCCC TTTAACCCCT AACGGTAAAA TTGATAGAAA AGCCTTACCT
GCCCCCGATT GGAGCGATAG AGGACAAGAA GACTACATAG CTCCCAGAGA TGCCATTGAA
ATCAAACTCG CTCAGATTTG GTCAAATGTC TTAAATGTTT CTCCCATTAG CATCAAAGAT
AACTTTTTTG AACTGGGAGG ACATTCTCTA TTGGCAGTTC GGTTGATTGC TGAAGTGAAA
CAAGAATTTA ATCAGCATCT TCCTTTAGTG ACACTATTTA ATAGTCCCAC GATTGAACAA
TTAGCCTCTC TATTGAGAAC AGAAACACCA TCAGTCTCTT GGTCTTCTCT CGTTCCCATT
CAAACCCAAG GAGATCAACC TCCCTTTTTC TGTGTTCCAG GGGTAGGGGG AAATGTTATC
TATCTTTACG ATTTGGCACG TTATTTAGGC AAAGACCATC CCTTCTACGG ATTGCAATCC
GTCGGTTTAG ATGGGGAATC TGCACCTTAT ACGACGATAA AAGCAATGGC TGAGCATTAC
ATTAAGTTAA TCCAATCGGT TCAACCCAAA GGCCCCTATT ACCTCGGTGG TCATTCCTTT
GGAGGTTGGG TAGCATTTGA GATGGCACAA CAATTACAAC AACAAGGACA AGAAGTTAAA
TGTTTAGCCT TGCTCGATAC TCCTCCTTTT CAACAAGATA AAGATAAAGA TATCCCAGAC
AATGAAGACG ATACCCGTTG GATGATCAGT CTGATTAAAA TGATTGAATA CTCTTATCAG
ACCTCTTTCC AAATTTCAGA GGAGCAACTG CGAAGACTGA CGGCTGAAGA ACAACTCCAT
TTACTCCATG AGATCCTACA AAATCTTAAT ATTGTTCCTT CTGGAAATGA TATCAGACAG
GTTCAAGGTG TAGTGCAAGT ATTTAAAGCC AGTTCTCAAA TTACCTATCA TCCTCAAACG
ATTTTGCCCT TGCCAATTAC TCTTTTCTGT GCCAAGGAAG AGTCTTTGAT TGATCGTCAA
TCTTGGGTCA CAGACTGGCA AAAATCGACA ACTTTACCGA TTCAGGTGGA ATGGGTGGCG
GGAAAACACC AAACAATGAT GGAATCACCT CATGTCCAAA AGTTAGCTCA ATGCCTTCAA
CAATATTGGG AATAA
 
Protein sequence
MKPIEEFLNE LANLEVKLWL EEGRVRCRAT KGKLTPELHH QLSARKQEIL KFLQSNQLNT 
SPIISQIKSV SRSEPLPLSF AQQRLWFLDQ LEGQKAAYNE VGVIRLEGTL NASLLSQSFE
EIIRRHEILR TNLQTKGEEV FQVISESKTL EIKTIDIASV PTIQQPEALK QIASREIEIP
FNLEQDLLLR VALIRLTEKS HILLVVMHHI VCDGWSFGIL IEELSTLYKA YQEGKSSPLP
ELTIQYADFA VWQREWLTGE TLEKKLDYWT EKLSGLPPLL ELPTDYRRPP VQSFKGSHIS
FNLNSEISEK LKQLSQQTGV TLFMTLLTAF SILLSRYSRQ EDIAIGSPIA NRNRGEIESL
IGFFVNTLVM RVNLQDNPTV EELLTQVKKS CLEAYSHQDV PFEKLVEELK PERNMSYSPM
FQVMFVLQNT PSQELSLPGL TLSSVEIEYN IAKFDLTLSM SETDNGLAGD WEYNTDLFER
ETIERMIGNF QVLLEEIVSN PQEKIGKLTL LTEVEKDKIL VEWNDTKRDY PKDKCIHQLF
DEQVEKTPDA IALVYEEEQL TYRELNQKSN QLAHYLQKLG VKPDTLVGIC VDRSLEMIIG
LLGILKAGGA YVPIDPNYPA QRIEYMLSDA RVNLLLTQER FKSLFLKFSE QILFWEHDQT
NWLEEIQTNP INQTELNHLA YINYTSGSTG QPKGVMIPHK GVVRLLINPN YVDLDSQTNL
LHLSPIAFDA STFEIWGALL HGGKCVLFSE KIPTALALKQ TIEKHKINTL WLTSALFNSV
VDELPDTLGE IKQLLTGGEA LSVNHINQAL KVLPSTQLID GYGPTESTTF TCCYLIPPSL
ASDILAIPIG KPISNTQVYI LDTNLQPVPI GVAGELHIGG DGLARGYLNR PELTAEKFII
NPFDPTGESR LYKTGDLCRY LRDGNLEYIG RIDHQVKIRG FRIELGEIES ILSIHPDIQE
SVVIAREDQP GNKRLVAYLV SKLIPERLHY VKACQLEIKG KTYPVESEDF SVGGMGLGKI
PVELGINDPV KVQIQLPGQE SPSWLSGKVI WYRDNHGGIE WRLTSQESER VKQSYQLIKE
ELGVIASLQR SLSQGLREYL KEKLPDYMVP SAFVLLEKLP LTPNGKIDRK ALPAPDWSNR
GQEDYIAPQT PNQEILASIW QNVLPKEKIG VKDNFFELGG HSLLATQVIS RIRETFSLDL
PIRSLFENPT LEELAQEIEN SQKVEINPII PINRAENLTL SFAQQRLWFL DQLEGENATY
NIPGALKLEG SLKIEALEKS LNQIIKRHES LRTRFKTVNG EAVQIIDPEG QINLKMITLE
SLDESEKKSQ TQSLIKQEAE KPFNLSQDRL IRASLIKLGS ESHILLITMH HIISDGWSMG
VFVQELTSCY SGYGQGKETQ LKPLSIQYAD FAVWQREWLC GENLQKQLNY WKKKLTGLPP
LIELPTDHPR PPIQSFQGSH ISFNLTREMS EKLKQMSQQT GVTLFMTLLT AFSILLSRYS
RQEDIAIGSP IANRNRAEIE PLIGFFVNTL VMRVNLEDNP TVEELLKQVR KTCLEAYSHQ
DVPFEKLVEE VKPERNMSHS PLFQVMFVLQ NAPDEELSLP GLTVSPVEIE YNIAKFDLTL
SMAETEKGLA GDWEYNTDLF ERKTIERMIG HFQVLLEGIV NHPQEKIGQL PLLTEAEKQQ
ILGEWNDTKA DYPKEKCIHQ LFEEQVERTP DAVAVVYEDQ QLTYLQLNQK ANQLAHYLIK
FGVKPDTLVG ICVERSLEMV MGLLGILKAG GAYVPIDPNY PAERIEYMLK DSAVSILLTQ
ERLVKELPET QAQMICLDND WLTISQENPN NCLSQVNAKN LAYIIYTSGS TGNPKGVMIE
HNSLVNLAIN LKQKIYSQTK QQKITLNGSL SFDTSVKQWI QLAYGHSVYI IPEDIRLDSV
TFLKYLRDYR IQVLDCTPGQ LRGMIESDLL TTESYLSKIL LGGESIDVST WGNLSQNSHI
QFYNLYGPTE NSVDTTISKI EVNQPLPNIG KPINNVQVYI LDTNLQPVPI GVAGELHIGG
DGLARGYLNR PELTAEKFIV NPFDPTGESQ LYKTGDLCRY LRDGNLEYIG RIDHQVKIRG
FRIELGEIES ILSIHPDIQE SVVIAREDQP GNKRLVAYLV SKLIPERLHY VKACQLEIKG
KTYPVESEDF SVGGMGLGKI PVELGINDPV KVQIQLPGQE SPSWLSGKVI WYRDNHGGIE
WRLTSQESER VKQSYQLIKE ELGVIASLQR SLSQGLREYL KEKLPDYMVP SAFVLLEKLP
LTPNGKIDRK ALPAPDWSNR GQEDYIAPQT PNQEILASIW QNVLPKEKIG VKDNFFELGG
HSLLATQVIS RIRETFSLDL PIRSVFENPT LEELAQEIEN SQKVEINPII PINRAENLTL
SFAQQRLWFL DQLEGENATY NIPGALKLEG SLKIEALEKS LNQIIKRHES LRTRFKTVNG
EAVQIIDPEG QINLKMITLE SLDESEKKSQ TQSLIKQEAE KPFNLSQDRL IRASLIKLGS
ESHILLITMH HIISDGWSMG VFVQELTSCY SGYGQGKETQ LKPLSIQYAD FAVWQREWLC
GENLQKQLNY WKKKLTGLPP LIELPTDHPR PPIQSFQGSH ISFNLTREMS EKLKQMSQQT
GVTLFMTLLT AFSILLSRYS RQEDIAIGSP IANRNRAEIE PLIGFFVNTL VMRVNLEDNP
TVEELLKQVR KTCLEAYSHQ DVPFEKLVEE VKPERNMSHS PLFQVMFVLQ NAPDEELSLP
GLTVSPVEIE YNIAKFDLTL SMAETEKGLA GDWEYNTDLF ERKTIERMIG HFQVLLEGIV
NHPQEKIGQL PLLTEAEKQQ ILGEWNDTKA DYPKEKCIHQ LFEEQVERTP DAVAVVYEDQ
QLTYLQLNQK ANQLAHYLIK FGVKPDTLVG ICVERSLEMV MGLLGILKAG GAYVPIDPNY
PAERIEYMLK DSAVSILLTQ ERLVKELPET QAQMICLDND WLTISQENPN NCLSQVNAKN
LAYIIYTSGS TGNPKGVMIE HNSLVNFRTT AIEKYQFTVE DRILQFSSIS FDAASEEIYP
CLTIGATLVL RTQEILTGGI GLLEQCRKCQ LTILDLPTAF WYQIVSELSM TKNRFPETLR
LIIVGGEAVT TEHIQTWIAW TENTPQLVNS YGPTEATVVS TIWLLESSEV GLSVPIGRPL
ANIQTYILDP NLKPVPIGVA GELHIGGDGL ARGYLNRPEL TAEKFIPNPF DPTRQSKLYK
SGDLCRYSPD GNIEYIGRID HQVKIRGFRI ELGEIESLLS THPEIRESVV ILREHEPGNK
RLVAYLVSNL IPERLHYAKP CQLEINGRSY SVQSEDFSVG GMGLAKIPVE LGINDPVKVQ
IQLPGQETPR WLSGKVIWYR DRHTGIEWTL TPPEKKIVEQ SYQLLKEELG IMATLQRSLS
QGLRRYLKEK LPDYMIPSAF VLLEKLPLTP NGKIDRKALP APDWSDRGQE DYIAPRDAIE
IKLAQIWSNV LNVSPISIKD NFFELGGHSL LAVRLIAEVK QEFNQHLPLV TLFNSPTIEQ
LASLLRTETP SVSWSSLVPI QTQGDQPPFF CVPGVGGNVI YLYDLARYLG KDHPFYGLQS
VGLDGESAPY TTIKAMAEHY IKLIQSVQPK GPYYLGGHSF GGWVAFEMAQ QLQQQGQEVK
CLALLDTPPF QQDKDKDIPD NEDDTRWMIS LIKMIEYSYQ TSFQISEEQL RRLTAEEQLH
LLHEILQNLN IVPSGNDIRQ VQGVVQVFKA SSQITYHPQT ILPLPITLFC AKEESLIDRQ
SWVTDWQKST TLPIQVEWVA GKHQTMMESP HVQKLAQCLQ QYWE