Gene BMA10247_A0845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_A0845 
Symbol 
ID4890714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009079 
Strand
Start bp781556 
End bp793861 
Gene Length12306 bp 
Protein Length4101 aa 
Translation table11 
GC content73% 
IMG OID640147119 
Producthypothetical protein 
Protein accessionYP_001078044 
Protein GI126447961 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01746] thioester reductase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGCAT TGACGCGCAC GGCGCGCGCT TTCCAGAACC CGACACAGAC GATTCCCATG 
ACCGCCTCCC CTCCCTCCAG CGCACTCGTC ACGGCCGTCG AAGCGGCCGT CCTGTCGCTC
GCCGGCGACG TCGCCGGCCG CGCGTTCGAC GCGTCGGCTG CGGCGTGCCC GCTGCACGCG
CTCGGCTTCG ATTCGGTGCA GTACGTCGAA TTGTCCGGAT GCCTGAACGA ATACTACGGG
CTCGATCTCG CGCCGACGCT GTTCTTCGAC GTGCACGTGC CGCGCCGGAT CGCCGAGCAT
CTCGTCGCGC GGCATCCGGC GGCGCTCGCG CGCAAGCACG GCATCGGGGC CGGGGACGAC
GCCGACACGG CCGCTCGGGC CCGCGCGGCC GCGGCCGAGA ACGGCGCGCC GCAGCCGGAC
ATGCGAGCCG GGGCGGCGCG GCCCGCGGGC GAGCCGCTTC TCGACACGCA TGCGAGCCCC
GGCGAGCCGC GCGGCGACGC ACACGAGAAT CCATGTGACG ACACGCGCGG CGCGGCCGCC
GCCGACGCGC ACGAATCGGC CGCCGATATC GCGATCGTCG GCATGGCCGG CATCTTCCCG
CAATCGGCCG ACCTCGACGC GTTCTGGCGG CATCTCGCCG CGGGCGACGA TCTGATCGCC
GAGGCGCCGG CCTCGCGCTG GGATTGGCGC GCGGGCGACG GCGAGCCCGC ATCGCGCTGG
GGCGGCTTCA TCCCGCGCAT CGAATATTTC GACGCCGCGT TCTTCGGCAT CTCGCCGCGC
GAAGCCGAGC AGATGGACCC GCAGCAGCGC CTGCTGATGC AGACCGCGTG GGCGGCGCTC
GAGGACGCGG CGGTGCGCCC GTCCGATCTG ATGGGCAGCG ACGCGGCGGT GTTCGTCGGC
ATCAGCACGT CTGACTACAT GGCGCTGCTG CCCGGCGCGG ACGGCCATCT CGCGGTCGGC
AACGCGCACG CGATGCTGCC GAACCGGCTG TCGCACCTGC TCGGCGCGCA CGGGCCGAGC
GAGGCTGTCG ATACCGCGTG CTCGAGCTCG CTTGTCGCGC TGCATCGCGC GGTGCGCGCG
CTGCGGCGCG GCGAAAGCAG CGTCGCGATC GTCGGCGGCG TCAACGTGAT GCTGACGACG
CGGCTGCACC GCGCGCTCGC CGCCGCCGGC ATGCTGAGCC CCGACGGGCG CTGCAAGACG
TTCGACGCGG CGGCGAACGG CTACGTGCGC GGCGAGGGCA TCGCGGCGCT CGTGCTGATG
CCGCTCGAGC GCGCGCGCGC GAACGGCCAC CCGGTGCGCG CGGTGATCAA GGGCAGCGCG
GTCAATCACG GCGGCCGCGC GGCGTTCCTG ACCGCGCCGG ACATCAACGC GCAGGCCGCG
CTGATCGAAG CCGCGTATCG CGACGCGGGC GTCGACCCCG CCACTGTTTC GTACATCGAA
GCGCACGGCA CCGGCACGTC GCTCGGCGAT CCGATCGAAG TGCAGGCGCT GCGCCAGGGC
CTCGACGCCT GCGCGCGCGA CCTCGCGGGC ACCGCCTCGC ACGCGCCGGC ACGCTGCGGC
CTGGGCTCGG TCAAGACCAA TATCGGGCAT CTCGAAGCGG CGGCGGGCCT CGCGGGCGTC
GTCAAGGTCG TGCTCGCGAT GGACCGGCGC ATGCTGCCGC CGAGCCTGCA TTGCCGTGAA
CTGAATCCGT ATCTGAAGCT CGACGGCAGC CGTTATCACG TCGTCACGGA ACCCACGCCC
TGGCCGGACG AAGCAACGCC GACGCCGCTG CGCGCGGGCG TCAGCTCGTT CGGGTTCGGC
GGCTCGAACG CGCACGTCGT GCTGCAATCG GCGCACGCGC GGCCGATCGC GCGAGCGAGC
GCGCCCCCAC CGCCGCACCC GAACGAACAG GCCGGTGCCG ACGCGCCCGC CGCCGACGGC
CCGCGCGCGT GGTTCATCCC GCTATCGGCG CGCACCGATG CCGCGTTGCA TGCGCGCGCC
GCTCAGCTCG CGCACTGGCT CGACACCGAG CCGGCCGACG ACGCGTGGCT GCCCGCGCTC
GCGAAGACGC TGTCGATCGG CCGCGAACCG ATGGCGCGCC GCTTCGGCAT CACGTGCGCG
TCGCTCGACG AACTGCGCGC GCAACTCGCG ATCGCGCTGG GCGGCCGCGC AACGTCGCTC
GCGCGCGATG ACGCCCGGCT GCGGCCGCAT GCGCCCGCCT GCGCGGCGTG GCTCGCGGGC
GAGACCGACC CGCTGCCCGC CGCGTGGGAT GACGCGACGC CGCGCCTGCG GTTGCCCGTC
TACCCGTTCG AAGGCGAGCG GCACTGGCCG ACCGAAGCAG CGCCGGCGGC GCGCTTCGCG
CTCGCGCCCG ACGCCGACGG CGCATACCGG ATCGCGATCG CACCCGACGC GCCGCTCGTC
GCCGACCATC GGCTCGCCGG CGAGCCGGTG CTCGCCGCCG CCGCGCAAAT CGTGATCGCG
TGGCGCGCGT TCGAGGCGGA CGCGCTCGCC GGCGATGCCG GCCAGGCGGG CGACGTCGGC
GAGTCGATGG AGTCGATGGA GTCGATGGAG TCGAACGGAT CGAGCGCATC GAAGCCGGCG
GCAACGTCCG CCGATTCGGG CACCGCCGCC GATTCACGCG ATCTGCACGA TTCATACCAC
TCGCACGACT TCCGCCACAC GATCGACACG ATCGACACGA ACGCCACGAG CGCCGCGACG
CCTATCGCGC TGTGCGACAT CGAATGGCTC GCGCCGATCG CGATCGGCGC GCCGACCGAC
CTGTGCATCA CGCTCGCGCG CGACGCCCAC GGCGACATCG ACGCGCGCCG CGGCGAAGCC
GCCCATCGGC GCGCAAACGG CCGCGCCGCC CGCTTCGCGA TCGCGGCCGC CCCCGCCATC
GATACGCCGC TCGGCCGCGG ACACGCGACG CGCATCGCGA GCGCGCCGTC GAACGCGCCC
GAGCTCGACA TCGAGGCCAT CCGCGCGCGC TGCACGCAAG CGGTCTCGGC CGACGCGTGC
TACGACGCGT TCGCCGCGAT CGGCATCGAT TACGGCCCGA CGTTCCGCCC GCTGCGCGCG
ATCGCGGTCG GCCGTGACGA AGCGCTCGCC GAATTCGACG CGTCGGCGCT CGCGCGCACG
ACGGGCGACG CGCGTATCGT CGCGCTGCTC GACGGCGCGT TCCAGGCGAT CGCGGGCCTG
ACGCTCGCGC ACGCCGCGAG CCTCGAAAGC GGCCTGCTGC CCGCGTCGCT CGCACGCATC
GAGTTCACCG GGCCGCTCGC GGACAGCGTC CGCGCGTGGA TTCGCGAAGC ACCGAGCGAC
ACGGGCCGCC GCACATTCGA TATCGACCTC GTGACGGCGA GCGGCCGGTC GTGCGCGTCG
CTGCGCGGCC TCGCGCTCGC GTCCGGCCGA AGCGCAACGT CGCGCGAAGC GCCACGCATC
ACGACGCCGG GCGACCATCT GTTCGCGCCG CAATGGCTGC CGTGCGCGAC GAACGCGGCC
GGCGCGGCAA CGCCGTCGCC GCGCGCCGGC GCGCTCGCGA TCATGGGCGG CACGCCGGCG
CAGCGCGCCG CGCTCGCGGC GACGCGCGCG GCGGCGCAGC GCCTGATCGA CGACATCGCC
GAACTCGACG CGAACGTGAG CCATCTCGTC TGGCTGCCGT CCGCGCCCGC GGACGCACAT
GCGCCGCTCG CGCAATGCGC GAGCCTCGAC GGGTTGCGCC TCGTGAAGCG TTTGCTCGCG
CTCGGCGCGG GCGATCGCGC ATTCGATCTG ACGGTGCTCA CCGTCCGTTC GTGGACGATG
CCGGGCGACG CGCCCGCGTT TCCCGCGCAC GCGGATCTCG CGGGGCTGTG CGGGGCGCTC
GCGAACGAAT ACCCGCACTG GCGCGTGCGG CTCATCGATC TGCCCGACGC CGCTGCGCTG
CCCGCCGACT GGCACGCGCG GAGCGCCGAA GGCGGCCATC CGCTGCTGCT GCACCGGCAC
GGCCAATGGT TCGCGCGCCG GCTCGTGCCG CTCGCGGCGC TGCCCGCGCC CGCCGCGCAG
CCGTATCGGC CGGGCGGCGT GTACGTCGCG ATCGGCAGCG CGGGCGGCCT CGGCCGGGTG
TGGACCGAGC ACGCGATTCG CGCCTGCGGC GCGCAAGTCG TGTGGATCGG GCGGCGGCCG
CTCGACGCGC AGATCGATGC GCACTGTGAC GCGCTCGCCG CGCTCGGCCC GCGCCCGTCG
TATCTGAGCG CCGACGCGAG CGACGCCGAG AGCTTGCGCG CCGCGCGCGA TGCGGTGCTC
GAACGCTTCG GGCGGCTCGA CGGCGTCGTG CACACGGCGA TCGTGCTGGA GGACGGTGGC
CTCGCGCAGC TCGACGAAGC GCGATTCAGC GCGGCGCTGA ACGCGCAGGT CGCGACGACC
GCGAACCTCG CCCGCGTGTT CGGCAGCGAT CCGCTCGATT TCATCCTGTT CTTCTCGTCG
CTGCAAAGCG CGTTCGTCGC GGCGGGCCAG AGCAATTACG CGGCCGGCTG CACGTTCCGC
GACGCGTTCG CCGACTGGCT GCGCACGCAG CTCCGATGCG CGGTCAAGGT CGTGAACTGG
GGCTACTGGG GGCAGACGGG CGTGGTCGCG ACCGAGCCGT ACCGCGCCCG CATGGCCGCG
CTCGGCATCG GGTCGATCGA GCCCGCCCCG GCGATGGCGG TCGTCGACGC GCTGCTCGCC
TCGAACGTCG ATCAGGTCGG CTATCTGAAG ACGATCGCGA GCGCCGCGGT GCCGACGCTC
GCGCCCGCGC TCGCCGCGCG CATCGCGCCG CGCACGCGCG CGCTTGCCGG CACGCCGCCG
CGCGTCGACG CGACGGACGA CAGCGCGGCG TGGCGGGACG CGCTCGCGGC GCTCGAACGC
GCGATCGCGC TCCGGCTGTT CGCCGAGCTC GGCGCGCTGC GCGTGTTCGG CGGAAGCGGC
GCGCCGGGCG GCCATGCGTT CGACGACGGC GCGGCCCGAA ACAGCGCGGC CGGCAAATGT
TCGGCCGATG ACCGCGCGCC CGACGCCGCG CCGCCGACGC CGGACGCGGC GCGCGCGGAA
TGGGCGCGCG CACGCGCCGA GCTCGAGCGC ACCGCGCTGC TCGACGCCCA TCTCGCGCTC
GTCGACGCGA CGCTCGACGC GCTGCCCGCG ATCCTGCAAG GCAGCGTGCC CGCCACGTCG
ATCCTGTTCC CGGACGGCGA TCTGAGCCGC GTCGAAGCGG TCTATCAGCG CAACGAGCAG
GCGGACCGCT GCAACCGCGC GCTCGCCGAT GCGGTGCTGC ACCTCGTCGG CGACGCATCG
TCCGCGCAAC CGGCCGCGCT CGCCGAAATC GGCGCGGGCA CGGGCGGCAC GACCGTGCCG
CTGCTCGCGG CGCTCGACGC GCGCGGCGCG CGGCTCGGCC GCTACGACTT CACCGACATC
TCGAAGGCGT TCCTGCTGAA CGCCGAGCAA ACGTTCGGCC GGGGCCGCGA CATGCTGCGC
TACCGGCTGT TCGACGTCGA GCGGCCGATC GCCGGGCAGG CGCTCGACAC CGGCGGCTAC
GACATCGTGA TCGCGACGAA CGTGCTGCAC GCGACGCAGG ACATCGGCGT CACGCTGCGC
AATGCGAAGG CGCTGCTGAA GGCAGGCGGC CATCTGATCA TCAACGAACT GCTCGGCACG
CACGGCTTCG CGCATGCGAC GTTCGGGCTG CTGCCCGGCT GGTGGCGGCA CCGCGACAGC
GCGCGCCGCC TGCCCGGCAG CCCGCTGCTG TCGCGCGACG GCTGGACGCG CGCGCTGCGC
GAAGCCGGCT TTGCGGTGCT CGACGGCGGC TCGGCCGGCG CCGCGGCGGG GCAAGGCGTG
ATCGTCGCGC TCAGCGACGG CGTGATCGTG CAGCCGTCGC ACGCCGACGC GCGGGCGGCC
TCATGTGCGG CTTCGCGCGC GGCCCCGGGC GACGACGCCG GCGCGCACGC CAGCGCCGCG
CGGCCGGCCG CATCGGCTTG TTCGACTGCC TCGCCCGCAC ACGCGCCCGC GGCTTCGCCG
ATCGCCGCCG CGCCGACCGG CGCGAGCCTG CGCGCGCGCT GCGTGCAGGC GCTCGCGCAA
CTCGTCGCGC GGACGCTGAA AATGCCGGTC GGCAAGCTCG CGCCCGATCA GCCGCTCGGC
AGCTACGGCG TCGATTCGAT TCTCGTGATC GGGCTCACGA AAACGCTGCG CGAGACGTTC
GGCGTCGCGC TGTCGAACGC GACGCTGTTC GAGCATGCGA CGCTGAACGC GCTCGCCGAA
TTCTTCGTCG CCGAACATCG CGCGGCGTGC GAACGCGTGC TGGGCGGCGA CGCGGAACCC
GCGCCGAATG CGCCGAACGG ATCAAACGCC GCGAGCGCAG CGGCGGCCAC GCGCCCGGCC
ATGCCACCGG CCCGCGCCGG CGCCCCATCG CCCGCCGCGG CTTCGGCCGC GCCGAAGCCG
CGCGAATCGA ACGTGTGCGC CCCGCCCTCC GCCGACGACA CCGCCGTCGC CGTGATCGGC
ATGTCCGGCC GCTATGCGCA GGCGGACAAC CTGCGCGAGT TCTGGGCGAA CCTGCGCGCG
GGCCGCCATT GCATCACCGA AGTGCCCGCC GAGCGCTGGG ACTGGCGCAC GCACTTCGAT
GCGGAAAAAG GCGCGCCGGG CCGCACGTAC AGCCGCTGGG GCGGCTTCCT GACGCAGATC
GACCGCTTCG ACGCCGCGTT CTTCCGAATC GCGCCGAACG ACGCCGAGCA GATCGATCCG
CAAGGCCGCC TGTTCCTCGA GGAATCGTGG GCCGCGATCG AGGATGCCGG CTATACGAGC
GACACGCTCA GCGCGGACCG CCGGGTCGGC GTGTTCGTCG GCGTGATGAA CGGCGACTAT
CCCACGGGCG CGCAGTTCTG GAGCATCGCG AACCGCGTGT CGCACGCGCT CGACCTGCAC
GGGCCGAGCC TCGCCGTCGA CACCGCGTGC TCGTCGTCGC TGACCGCGAT CCATCTCGCG
CTCGACAGCC TGCGCAGCGG CACCTGCGAC TGCGCGCTCG CGGGCGGCGT CAACCTGATT
CAGAGTCCGA AGCATCTGGT CGGGCTCGCG TCGCTCACGA TGCTCTCGGC GGGCGACGCG
TGCCGCGCGT TCGGCGCGGG CGCGGACGGC TTCGTCGACG GCGAAGGCGT CGGCGTGCTC
GTGCTCAAGC CGCTGTCGCG CGCGCTCGCC GATGGCGACG CGATCCACGG CATCATCCGC
GGCAGCATGA TCAACGCGGG CGGCAAGACG CACGGCCTCA CGGTGCCGAA CCCGCGCGCG
CAGCAGGCCG TCGTCGGCGC GGCGCTCGCG CGAAGCGGCG TGCCGGCGCG CGCGGTCGGC
TACATCGAGG CGCACGGCAC CGGCACCGCG CTCGGCGATC CGATCGAACT CGCGGGCCTC
ACGCGCGCGT TCGCCGAAGC GACCGACGAG CTCGGCTTCT GCGCGCTCGG CTCGGTCAAA
TCGAACATCG GCCATTGCGA AAGCGCGGCG GGCGTCGCCG GCGTGACGAA GGTGCTGCTG
CAGATGAAGC ATCGCGAACT CGTGCCGACG CTGCATGCGC ACGAGCCGAA CCCCGACATC
GATTTCGCGC GCTCGCCGTT CGTGCTGCAA CGCACGCTCG CGCCGTGGCC GCAGCCGGCG
CTCGACGGAT GGCCCCGGAT CGCGGGCGTG TCGTCGTTCG GCGCGGGCGG CGCGAACGCG
CACGTCGTGC TCGAAGAATT CATCGAGACG CGCGCCGCCG CCGGCGGCGA CGACGCCGGC
CCCGCGATCG TCGTGCTGTC CGCCGCGACC GACGCAGCGC TGCGTCGCCG CGCGCGGCAA
TTGCACGCCG CGCTCGCCGC CGGCGAAATC GGCGACGAGC GCCTGCACGA TCTCGCGTAC
ACGCTGCAGA TCGGCCGCGC CGCGATGGTC TCGCGCTTCG GCTGCGTCGC CGGCAGCGCC
GCCGAATTGC AGGCGCAGCT CGCCGCGTTC GTCGAAGGCG ACGCATCGCG CGGCTGGCAC
GCGCACCGGC TCGCCGGCGA CCGCCGCGGC CTCGCCGAGC TCGACGCCGA TCCCGAACTG
CGCGCGTCGC TCGTCGAGCA ATGCGTCGCG GCCGGCAAGC TCGACCGGCT CGCGGCACTC
TGGTGCCAGG GGCTCGGCAT CGACTGGCCC GCGCTGCATC GCGGCCGCGC GCGCCGGCGC
ATGCATCTGC CGACGTACCC GTTCGACGGC CCGCGCTACT GGCTGCGCGA CGACGCGGCG
CACGCCGCCG AGCCCGCGCC GGCCGACGGC GCCGCCGAAG ACGCAAGCGC CGACGCACCG
AATGCAGCGA ACGCGCCGAC GCCCGACGTC GCAACGCTCG TCCGTCGAAC GGTGGCGCAA
GTGCTCGGCT ATCCGGACGT CGACATGAAC GAATCGTTCC TGTCGCTCGG CGGCGATTCG
ATCCGCGCGG CGCGCGCGCA TCGGGTGCTG CAACGGGCGC TCGACACGAG GATTCCGCTC
AGCCTGATGC TGGAGGCAAG CACGCTCGCC GAATGCGCGC AAGCGATCGA TGCACTGCTT
TCGACGCAAC CGGAACCGGC GAGCGCGCTC GCCTGCGAAA CGAACGCGGG CGCGGCCGGC
GCGCCGATCG CCGACGCGGC CGCGTTCGAG TCGTCGGCGC CGCCCTCCCG GGAATCGGCC
TCCCCGCCAC ACCCGGCCTC CCCGCCGCGC GACGCGCGCC CGCGCGTTCA TCCGCTGTCA
TCGAACCAGC AGCAATTCTT CTTCCTCGAC CGGCTGAACC CGGCGAACCC GGCGTTCAAC
CTGCCCGGCG CGCTGCGCGT GCGCGGCGAA TGGCACGCGC ACGCGCTCGA AGCCACGTAT
CAGGCGCTCA TCGATACGCA CGACGTGCTG CGCACCCGCT TCGTCGTGCG CGGCGGCGAA
CCGTGCGCGG AAGTCGCGCC GCACCGCGCG GCCGCGATTC GCCGGCACGA TCTGACGGCG
CTGCTGCCGA AGCATCAGGC CGCGCGCGTC GCCGAGTGCC TCACCGAGTC GAGCCGCGAG
GGCTTCGCGC TCGAACAGGG CGAACCGAGC CGGCTGACGG TACTCGAACT GCGCGACGAC
GATCACGTGA TCCTGCTGAA TCTGCATCAC ATCGTCGGCG ATGCGGTGTC CGTCGTCGTG
CTGCTCGACG CGCTCGCGCG CGCCGCGCTC ACGGGCCGCG CGGCCGCGCC GGACCGCGCG
CGGCCGCAAT ACGCGCAATG GGCCGCGCAC GAACGCGATG CGCTGCCGGC GACGATCGCG
CGCGAACTGC CGTACTGGCT CGAGCGCCTG CGCGACGTGC CGCCGCCGTT GCCGCTGCCG
TGCGACCGCG CGCGGCCGCC GGTGCCGAGC TATCGTGGGC GCAGCGTGCC GCTCGCGTTT
GCGCCGGCGC TCATCACGCT GCTCGACGCA TACTGCAAGG CGCACGGGCT GTCGCGCTTC
GTCGTGATGC TCGCCGCGTT CAAGCTCGCG CTGCGCGTGC TGTCGGGCCG TGACGACGTC
GTCGTCGGCA GCCCGTACGC GAACCGCGCC GAGGACGACA CGGCCGACAT GATCGGCAGC
CTCGCCTACG CACTCGTGCT GCGCACGCGG CTTGGCGAAG CACAGACGTT CGCCGATGCG
GTCGCGCTCG TGCGGCGCAC CGTGCACGGC GCGTTCGACC ATCTCGGCGT GCCGTATCCG
CGCCTCGTCG AGGCGCTGAA TCCGGCGCGG CACGGCGGCG CGAACCCGCT GTATCAGATC
ATGTTCAACG TGATCCCGAT GCCCGCGCTG CCCGAGGGCG TCGAGCCCGT CGAAGTCGAT
TCCGGCTGGC TCGACTACGA TCTGTTCGTG CGGCTGCGCG CGTCGAGCCA CGCCATCGAC
GGCGTGCTGC AATTCAGCGC GGATCTCTTC GATCGTTCGA CGGCCGAAGC GATCGCCGCA
TACTACGTCG AGCTGCTGCA CACGCTGCTC GCGCATCCGT CGCTGCCGCT CGCGAGCCTC
GCGCCGCCCG CCGAGCTCGC GCTCGAACGG ACGATCGCCG ACGCGATGCC GCCGCTGCGC
ATCGAGATCG CGTCGACGTT CACCGACCGC CCGTTAGCCG GCACGCTGCG CTACTGGGGC
ACCGCGACCG GCCAGCCGAT CGAGCCGAAT TTCGCGCCGT ACGGACAACT GTTCCAGACG
CTCTACGATC CGTCCACGCC GTTCCATGCG AATCGTCACG GAACGAACGT CGTGCTGGTC
AGGCCGTGCG ACTGGCTGCG CTTCGACGAC GCGAACGCGG ACGCCGCCCG CGCCGACCTC
ACGGGCGACG CCGGCGCGGC GGCCGCCGAA CGCATCGCGC TGTACGCCGA CGAACTCGCC
GACGCGCTGC GCGACGCGGC GCCGTCGCTC GCGGTGCCCG TGCTCGTGCT GGTGCTGCCG
GACGATGCCG CGTCGCTCGC GGCGCGTGAC GAACACACGG GTACGGCAAC CGAAGCGCCT
GCCGAGGCGC TCGCCGACGC ACGCGCCGGC AAGCCCTCGC CCGACACGTC GCTCGCCCCT
TACCGCATGC TCCGCGCCGC GCTCGCGGAT CTGCCGTCGA TAACGGTCGC GCACTGGCGC
GATGTCGCCG CGATCTACCC GGTCGCCGAC GTGTTCGATC CGCATGCGGA CGCGGCCGGC
CACGTGCCGT TCACGAGCGA GTACTACGCG GCGCTCGCGA GCTACATCGC GCGCACCGCG
TTCCAGCACG CGTCGGTGCC GCTCGACGAC GCCTGGAACC GGCTCGCCGC GCAGATCCGC
GACGACGCCG AGCACCTGCT CGCCGCGCCG GCCGACGGCG CACGCGCGCG CCGCGCGCCG
CACGCCGCGC CGACGAACGA AACGCAGGCG ACGCTCCTGC CGATCTTCGC GGCCGCGCTG
AAGCTCGACG ATCCCGGCAT CGACGACAAC TTCTTCGACT GCGGCGGCCA CTCGATCCTC
GCGATCGGCG TCGTCCATCA GATCAACGAA GCATTCGGCA CGTCGCTGTC GGTCGCGGAC
ATCTTCATGG CGCCGACCGT GCGCCGCCTC GCCGAGCGCA TGCGCGACGC GCCGGACGGC
CCCGAGTACG TCGAGCTCGC GAGCGCGGCC GCGCTGCCCG ACGACATCGC GCCGCTGCCC
GGCCCAGTGG CCGACGCGCC GCGCGCGCTG TTGCTCACGG GCGCGACGGG CTTTGTCGGC
CGCCATCTGC TGCGCGAGCT GATCGATCGC ACCAGCGCGA CGATCTACTG CCTCGTGCGC
GCGCCGGACG CCGCGCAGGG CCTCGCGCGG ATCCGCGCGA CGCTCGGGCG CTGGTCGCTG
TGGCGCGACG GCGACGCCGC GCGCGTGATC GCGGTGCCGG GCGATCTCGG CCGCCCTCGC
ATCGGCCTGT CGGACGCCGC GCGCGCGCGG CTCGTCGCCG AAGTCGACGC GATCTATCAC
AACGGCACCA GCATGAACCA TCTCGAATCG TTCGAGATGG CGCGCGCGGC GAACGTCGGC
GGCGTGATCG AGCTGCTGCG GATCGCCACC GAAGGCCGGC CGAAGACGTT CAACTACGTG
TCGACGCTCG CGGTGTTCAG CATGCGCGAG CGCACCGGCA CGCACGTATT CGACGAAGCC
GCGCCGATCG ACGGCGAGCG GCATCCGTCC GACCAGGGCT ACACGACGAG CAAGTGGGTG
GGCGAGCAGC TCACGCATCT CGCGGCCGCG CGCGGCGTGC CGTGCAACGT GTTCCGCCTC
GGCCTCGTGA CGGGCGACGT GCGCCACGGT CACTACGACG AACTTCAGGC GTACTACCGG
CTGCTGAAGA GCTGCATCCT GATGGGCGCC GCGTTCGACG ATTTCCGCTA CGACCTCGTG
ATCACGCCCG TCGATTACGT CGCACGCGCG CTTGCGCATC TCGGCGCGCG GCATTCGCAA
GGCGGCCGGG TGTTCCATCT GTCGACGATG CAGGTCACGC CGATGCGCAC CGTGTTCGAG
ATGATGAACG CGCATCTGCG CACGCCGATG CGCATGCTCA CACACCGCGC GTGGATCGAC
GAGCTGCGCG TGCGCTACCG GCGCGGCGAC GTGCAATCGA TCGTGCCCGT CGTGCAATGG
ATGATGAACA TGAGCGATGC GGAGCTCGTG AAGCTCGCGC GCGAGCGCGA GGAAACGACC
TTCATCTACG ACTGCACGGC GACGCACCGC GAGCTCGAGC AAGCCGGCAT CGTCGTGCCC
GTGTTCGACG ACGCGCTGCT GCAGCGGTAT CTGCGCGGCA TGTTCAACGA CGACGCGGAC
CTGCGCGCGC TCGCCGCCCG GCTCGACGGC GGCGAGTGCG CTTCTCCCCT TCACTCCCAC
ACGTGA
 
Protein sequence
MAALTRTARA FQNPTQTIPM TASPPSSALV TAVEAAVLSL AGDVAGRAFD ASAAACPLHA 
LGFDSVQYVE LSGCLNEYYG LDLAPTLFFD VHVPRRIAEH LVARHPAALA RKHGIGAGDD
ADTAARARAA AAENGAPQPD MRAGAARPAG EPLLDTHASP GEPRGDAHEN PCDDTRGAAA
ADAHESAADI AIVGMAGIFP QSADLDAFWR HLAAGDDLIA EAPASRWDWR AGDGEPASRW
GGFIPRIEYF DAAFFGISPR EAEQMDPQQR LLMQTAWAAL EDAAVRPSDL MGSDAAVFVG
ISTSDYMALL PGADGHLAVG NAHAMLPNRL SHLLGAHGPS EAVDTACSSS LVALHRAVRA
LRRGESSVAI VGGVNVMLTT RLHRALAAAG MLSPDGRCKT FDAAANGYVR GEGIAALVLM
PLERARANGH PVRAVIKGSA VNHGGRAAFL TAPDINAQAA LIEAAYRDAG VDPATVSYIE
AHGTGTSLGD PIEVQALRQG LDACARDLAG TASHAPARCG LGSVKTNIGH LEAAAGLAGV
VKVVLAMDRR MLPPSLHCRE LNPYLKLDGS RYHVVTEPTP WPDEATPTPL RAGVSSFGFG
GSNAHVVLQS AHARPIARAS APPPPHPNEQ AGADAPAADG PRAWFIPLSA RTDAALHARA
AQLAHWLDTE PADDAWLPAL AKTLSIGREP MARRFGITCA SLDELRAQLA IALGGRATSL
ARDDARLRPH APACAAWLAG ETDPLPAAWD DATPRLRLPV YPFEGERHWP TEAAPAARFA
LAPDADGAYR IAIAPDAPLV ADHRLAGEPV LAAAAQIVIA WRAFEADALA GDAGQAGDVG
ESMESMESME SNGSSASKPA ATSADSGTAA DSRDLHDSYH SHDFRHTIDT IDTNATSAAT
PIALCDIEWL APIAIGAPTD LCITLARDAH GDIDARRGEA AHRRANGRAA RFAIAAAPAI
DTPLGRGHAT RIASAPSNAP ELDIEAIRAR CTQAVSADAC YDAFAAIGID YGPTFRPLRA
IAVGRDEALA EFDASALART TGDARIVALL DGAFQAIAGL TLAHAASLES GLLPASLARI
EFTGPLADSV RAWIREAPSD TGRRTFDIDL VTASGRSCAS LRGLALASGR SATSREAPRI
TTPGDHLFAP QWLPCATNAA GAATPSPRAG ALAIMGGTPA QRAALAATRA AAQRLIDDIA
ELDANVSHLV WLPSAPADAH APLAQCASLD GLRLVKRLLA LGAGDRAFDL TVLTVRSWTM
PGDAPAFPAH ADLAGLCGAL ANEYPHWRVR LIDLPDAAAL PADWHARSAE GGHPLLLHRH
GQWFARRLVP LAALPAPAAQ PYRPGGVYVA IGSAGGLGRV WTEHAIRACG AQVVWIGRRP
LDAQIDAHCD ALAALGPRPS YLSADASDAE SLRAARDAVL ERFGRLDGVV HTAIVLEDGG
LAQLDEARFS AALNAQVATT ANLARVFGSD PLDFILFFSS LQSAFVAAGQ SNYAAGCTFR
DAFADWLRTQ LRCAVKVVNW GYWGQTGVVA TEPYRARMAA LGIGSIEPAP AMAVVDALLA
SNVDQVGYLK TIASAAVPTL APALAARIAP RTRALAGTPP RVDATDDSAA WRDALAALER
AIALRLFAEL GALRVFGGSG APGGHAFDDG AARNSAAGKC SADDRAPDAA PPTPDAARAE
WARARAELER TALLDAHLAL VDATLDALPA ILQGSVPATS ILFPDGDLSR VEAVYQRNEQ
ADRCNRALAD AVLHLVGDAS SAQPAALAEI GAGTGGTTVP LLAALDARGA RLGRYDFTDI
SKAFLLNAEQ TFGRGRDMLR YRLFDVERPI AGQALDTGGY DIVIATNVLH ATQDIGVTLR
NAKALLKAGG HLIINELLGT HGFAHATFGL LPGWWRHRDS ARRLPGSPLL SRDGWTRALR
EAGFAVLDGG SAGAAAGQGV IVALSDGVIV QPSHADARAA SCAASRAAPG DDAGAHASAA
RPAASACSTA SPAHAPAASP IAAAPTGASL RARCVQALAQ LVARTLKMPV GKLAPDQPLG
SYGVDSILVI GLTKTLRETF GVALSNATLF EHATLNALAE FFVAEHRAAC ERVLGGDAEP
APNAPNGSNA ASAAAATRPA MPPARAGAPS PAAASAAPKP RESNVCAPPS ADDTAVAVIG
MSGRYAQADN LREFWANLRA GRHCITEVPA ERWDWRTHFD AEKGAPGRTY SRWGGFLTQI
DRFDAAFFRI APNDAEQIDP QGRLFLEESW AAIEDAGYTS DTLSADRRVG VFVGVMNGDY
PTGAQFWSIA NRVSHALDLH GPSLAVDTAC SSSLTAIHLA LDSLRSGTCD CALAGGVNLI
QSPKHLVGLA SLTMLSAGDA CRAFGAGADG FVDGEGVGVL VLKPLSRALA DGDAIHGIIR
GSMINAGGKT HGLTVPNPRA QQAVVGAALA RSGVPARAVG YIEAHGTGTA LGDPIELAGL
TRAFAEATDE LGFCALGSVK SNIGHCESAA GVAGVTKVLL QMKHRELVPT LHAHEPNPDI
DFARSPFVLQ RTLAPWPQPA LDGWPRIAGV SSFGAGGANA HVVLEEFIET RAAAGGDDAG
PAIVVLSAAT DAALRRRARQ LHAALAAGEI GDERLHDLAY TLQIGRAAMV SRFGCVAGSA
AELQAQLAAF VEGDASRGWH AHRLAGDRRG LAELDADPEL RASLVEQCVA AGKLDRLAAL
WCQGLGIDWP ALHRGRARRR MHLPTYPFDG PRYWLRDDAA HAAEPAPADG AAEDASADAP
NAANAPTPDV ATLVRRTVAQ VLGYPDVDMN ESFLSLGGDS IRAARAHRVL QRALDTRIPL
SLMLEASTLA ECAQAIDALL STQPEPASAL ACETNAGAAG APIADAAAFE SSAPPSRESA
SPPHPASPPR DARPRVHPLS SNQQQFFFLD RLNPANPAFN LPGALRVRGE WHAHALEATY
QALIDTHDVL RTRFVVRGGE PCAEVAPHRA AAIRRHDLTA LLPKHQAARV AECLTESSRE
GFALEQGEPS RLTVLELRDD DHVILLNLHH IVGDAVSVVV LLDALARAAL TGRAAAPDRA
RPQYAQWAAH ERDALPATIA RELPYWLERL RDVPPPLPLP CDRARPPVPS YRGRSVPLAF
APALITLLDA YCKAHGLSRF VVMLAAFKLA LRVLSGRDDV VVGSPYANRA EDDTADMIGS
LAYALVLRTR LGEAQTFADA VALVRRTVHG AFDHLGVPYP RLVEALNPAR HGGANPLYQI
MFNVIPMPAL PEGVEPVEVD SGWLDYDLFV RLRASSHAID GVLQFSADLF DRSTAEAIAA
YYVELLHTLL AHPSLPLASL APPAELALER TIADAMPPLR IEIASTFTDR PLAGTLRYWG
TATGQPIEPN FAPYGQLFQT LYDPSTPFHA NRHGTNVVLV RPCDWLRFDD ANADAARADL
TGDAGAAAAE RIALYADELA DALRDAAPSL AVPVLVLVLP DDAASLAARD EHTGTATEAP
AEALADARAG KPSPDTSLAP YRMLRAALAD LPSITVAHWR DVAAIYPVAD VFDPHADAAG
HVPFTSEYYA ALASYIARTA FQHASVPLDD AWNRLAAQIR DDAEHLLAAP ADGARARRAP
HAAPTNETQA TLLPIFAAAL KLDDPGIDDN FFDCGGHSIL AIGVVHQINE AFGTSLSVAD
IFMAPTVRRL AERMRDAPDG PEYVELASAA ALPDDIAPLP GPVADAPRAL LLTGATGFVG
RHLLRELIDR TSATIYCLVR APDAAQGLAR IRATLGRWSL WRDGDAARVI AVPGDLGRPR
IGLSDAARAR LVAEVDAIYH NGTSMNHLES FEMARAANVG GVIELLRIAT EGRPKTFNYV
STLAVFSMRE RTGTHVFDEA APIDGERHPS DQGYTTSKWV GEQLTHLAAA RGVPCNVFRL
GLVTGDVRHG HYDELQAYYR LLKSCILMGA AFDDFRYDLV ITPVDYVARA LAHLGARHSQ
GGRVFHLSTM QVTPMRTVFE MMNAHLRTPM RMLTHRAWID ELRVRYRRGD VQSIVPVVQW
MMNMSDAELV KLAREREETT FIYDCTATHR ELEQAGIVVP VFDDALLQRY LRGMFNDDAD
LRALAARLDG GECASPLHSH T