Gene BMAA1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA1204 
Symbol 
ID3087718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp1273749 
End bp1286387 
Gene Length12639 bp 
Protein Length4212 aa 
Translation table11 
GC content74% 
IMG OID637565100 
Productputative polyketide synthase 
Protein accessionYP_338453 
Protein GI77358767 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCGGCG CTCCTGCGCG CTTCGCGCGG CTCGGCATCG AATATGGCGC CACGCATCGC 
GTGCTGCTGC GCCTGCGCGT GGACGGCGAC GAAGCGCTCG CCGAGCTCGC GCCGGCGGCG
GACGGGATCG CGCAGCACGA ACTGCACCCC GGCACGCTCG ACGCGGCACT GCAACCGATG
CTCGCGCTGC TCGGCGAGCG GGTCGGCGAC GGCGTGCCCG TCGTGCCTTA CCGGATCGAG
CGCGCCGAAA TCCATGCGCC GACGCACGGC GCGAGATGGG CGTGGCTGCG GATGCGGCCC
GACGCGCACG AATGGATCTT CGATGTCGAT CTTTGCGACG CGCGCGGCGC GCTGTGCGTC
GCGTTGCGCG GCATCGCGGT GACCGCGTGG CGGCGGCCGG ACGAAGTGGT GCGGCTCGAG
CCGGTCTGGC GCGCGGCGCC CGTCGAGGCC GACTTGCACG ACGAGCGCGC CGGGCCCGAC
GCGCAGCGCG TGGTGTTCGT CTGCGGGGCG CACGGCGCGC CGCGCGCGTG GCCGGCGGAC
GGCATCGCGC CGGTGCGCTA CGCGGCGCTC GCCGCGGGCG CGCCGCCGGG CGAGCCCGAT
GCGCTCGCCG GCTGGTTCGA AGCGCACGCG CTCGCGCTGT TCGACGAAGT GCGCCGGCTG
CTCGCGCCCG GCATGCGGCA GGCGACGCTC GTGCAGATCG TCGTGCCCGC GGCCGGGCCC
GGCGCGATCC TGCATGCGCT CGGCGCGCTG CTGCAGACCG CGCACCTGGA GAATCCGCTG
CTGCACGGGC AGGTGATCGC GATCGACGAC ATCGACGCGC CCGACCTGCC GCGGCGGCTC
GCGCGCGACG CGCGCCGCGC GGCCGACACG CGCATCCGCT ACGTGAACGG CGAGCGCCAG
GTGGCGTGCT TCGACGAGGC GGCCGCGCCG GACGGCGCGC GCGCGTTGCC GTGGCGTCAG
GGCGGCGTGT ATCTCGTGAC CGGCGCGGCC GGCGGCCTCG CGCGGGCGCT GGCCGACGCG
ATCGCGCGCG GAATCGGCGG GGACGCGCCG CGGGCGACGC TCGTGCTCAC CGGGCGCTCG
CCCGCCCGCG ACGACATGCG TGCGCTCGTC GCGAGCCTTT GCGCGCTCGG CGCCGCCACT
GACTACCGCG TGCTGGATGT CGCCGATCGC GACGCGGTCG CGCGGATGGT CGAGGCGATC
GTCGGCGAGT TCGGCGCGCT GCACGGCGTC GTGCATTGCG CCGGCGTGCT GCGCGACAAC
TATCTGCTGC GCAAGTCGGC CGACGAGTTC GCGCAGGTAC TGGCGCCGAA GGTGCGCGGC
ACCGTGAATC TGGATCTCGC GACGCGCGAC GTGCGCAGCC TCGATTTCTT CGTGACGTTC
TCGTCGGGCG CCGGCGTCGT CGGCAATCCC GGGCAGGCGG ACTACGCGGT CGCCAACGCG
TTCATGGATG CGTTCGCGGC GCATCGCGCG TCGCTCGGCG CGGCGCGGCC CGGCGTCAGC
GTGTCGATCG CGTGGCCGAT ATGGCAGGCG GGCGGCATGC GGATCGACCG GCAGACCGAG
GCCGAACTCG AGCGCCGCCT TGCGATGCGG CCGATGCCGA CCGCGCTCGG GCTCGACGCG
CTGCATGCGT GCCTGCTGGG CGCGAGCCCG TGCCCGACGG TCGTCCACGG CGCGCGCGCG
CGAATCCTCG CGCTCGCGCG ACAGGGCTTC GCCGCGCCGC CCGCCGCGCC GCCCGGATTC
GGCGCGGCCG ATCGCGGCGC GGATGCGCCG GGCGCCGATG CTGATGCCGT GAAGACGCGG
GTGCGCGCGG CGATCGACGG CGCGCTCTCG GCCGTGCTGA AGCTGCCCGA CGCGCGGCTT
CGCGAGCCCG AATATTTCGA GAGCTACGGG ATCGATTCGA TCAACGCGAT CCGTCTCACG
GTCGAGCTGG AGCGCACGTT CGGGCCGCTG CCGAAGACGC TGTTCTTCGA ATACCGGCAC
GCCGACGAGC TCGAGCGCTA CCTGGTCTCG GTGCATGGCG CGGCCGTCGG CGCGCGAATC
CGCTCGGCCG CGCGCGGCGC GCCCGCCGGC CCGGCATGCG CGCGCGCGGG CGAGTCGGGC
GAGTCGGGCG AGCCGCCGGG CGCACCGACG CGGTCGCCGG CGAGCGAAGG CGGGGCGGCA
GGCGCGCCTC GCGCGAGCGC CGCGCCGCCG CTCGCGGAGC GCGACATCGC GGTGATCGGC
ATGGCCGGGC GCTATCCGCA GGCGGACGAT CTGCAGCAAT ACTGGGACAA CCTGCGCGAC
GGCCGCGACT GCATCGAGGA GATTCCGCCG CATCGCTGGG ACTGGCGCAA GCACTACGAT
CCGGCCCGCG GCCACGGCGC CCACCACAGC AAGTGGGGCG GCTTCATCAA CGATGTCGAC
GCATTCGATC CGCTGTTCTT CAACATCTCG CCGAAGGAAG CGGTGTCGAT GGACCCGAAG
GAGCGGCTGT TCCTCGAACA GGTATGGACC GCGATGGAGG ATGCCGGGCT GCGGCCCGAG
GATCTGCGGC GCGACGCGCA ACGCGGCACG GGCGTCTACG TCGGCCTGAT GTACGAGAAC
TATCAGTTGC TCGCGGCCGA GGCCGCGGCC GCGGGCAGCG ACGTCGGGAT GGCGGGCGGC
AGCTACGCGA GCATCGCGAA CCGCGTGTCG TTCTTCCTCG ACCTGCGCGG CCCGAGCCTT
GCCGTCGACA CGATGTGCTC GAGCTCGATG GTCGCCGTGC ATCTCGCCTG CCGCGATCTG
CTCGCGGGCG AGATCGGCGT GGCGATCGCG GGCGGCGTCA ATCTCAGCCT GCATCCGAAC
AAGTACCGGA TGCTGAGCGC CGCGCGCTTC ATGTCGGGCG ACGGGCGCTG CGCGAGCTTC
GGCAGCGGCG GCGAGGGTTA CGTTCCCGGC GAGGGCGTCG GCGTGCTGTT GCTGAAGCGG
CGCGCCGACG CCGAGCGTGA CGGCGACCGC ATCCTCGGCC TCATCAAGGC GAGCGCGATC
AATCACGGCG GCCGCTCGAA CGGCTACACG GTGCCCAACG CGGCCGCCCA GGGCGGCGTG
ATCGCGAACG CGATCCGGGC CGCCGGCATC GACGCGCGCG CGATCAACTA CGTCGAGGCG
CACGGCACGG GAACGGCGCT CGGCGATCCG ATCGAGCTGG CCGGCCTCGC GCGCGGGTTC
GCCGACAGCG GCGCGAACGG GCCGTGCCGG ATCGGCTCGG TGAAATCGAA CATCGGCCAT
TGCGAGGGCG CGGCGGGCAT CGCGGCGCTG ACGAAGGTGC TGCTGCAACT GGCGCATCGC
CAGATCGTGC CGTCGCTCCA TTCGCGCGAG CTGAATCCCG ATCTGCCGCT CGACGGCTCG
CGCTGGATCG TGAACCAGTC GCTGTGCGAT TGGGAGCGCG TCGTCGTCGA CGGCGTGCCG
CTGCCGCGCA CGGCCGGCGT GTCCGCGTTC GGCGCGGGCG GGACCAACGC GCATCTGATT
CTGTCCGAAT ACCCGGCCGA CGCGTGCGCG GCGCCGGCGG GCGTCATCGA GCCGGCGGGG
CGCGATGCGC ACGACACGCA CGACACGCAC GACACGCACG ACATGCATGA CATGCAGGAC
ATGCAGGACA TGCAGGACGC GCTCGTCGTG CCGCTGTCCG CGCGCAACGC GCAGCGGCTG
CACGCGTATG CACAACGGCT GCGCGCGTTC GTCGCCGCGC ACGCGCGCGG CGAGCGTGGC
GCGCCGCCGC GGCTCGTCGA TCTCGCCTTT ACGTATCAGC GTGGCCGCAT TGCGATGCCG
GAGCGGCTCG CGATCGTCGC GCGCTCGCTC GCGGAGCTCG AACGCGCGCT CACCGCGTAC
GTCGCCGGGC AGCGCGCGGG CGACGGCATT TACGCCGGCC GGGCGGATCG CGCGGCGGCG
GGCGACGCAC GCGGCGCGGC GTCCGAGCGA ACCGCCGCCG ATTTCGCCGA CCGGCTCGCG
GCGCGCTGGG TCGCGGGCGA GGCTGTCGAC TGGCACGCGC TGTTCGACGG GCGCGCGCCG
CGCCGGATTG CCGCGCCGAC CTATCCGTTC GAGCGCGGCC GGTATTGGAT CGGCGCATCG
CGCGCGGCGG CGGGGGCCGC CGAGGCCGCG ATGCGTGCCG GGGGCGGCGC ATCGCGCGCG
ACGACGGCGC CGCGCGCGCT TTCCGAATCA GGGGGAGAGG TGAGACAACC AGACCAGACG
ATACGGCGCG TGCGGTTGGC GCCGACTTCG AGCTTCACCG CGACGGCGCG CGCGCCGTCG
CCTACGCAGG CCGCGCAGGC CACGCAGGCC ACGCAGGCTA CGCAAGCCAC GCAAGCCTCA
CAAGCCACGC GGGCCGCCGC GGCGTTCGCG CCCGCGACGA AAGCGTCGCT CGAGGCGGCG
TTGCGCGACA GTCTCGCGAG CGCGCTGTTC GTCGGGGTCG ACGAAATCGA CCCCAGCCGG
CCGTTTAGCG AGCTGGGGCT CGATTCGATC GTCGGCGTCG AATGGATTCG CGACATCAAC
CGCCGGTACG GCGTGTCGAT CCGGACGACC GACGTCTACG ATTATCCGAG CGTCGGCGAA
TTCGCCGGCT TGCTTGAGCG GCTGCTGCGC GAATCGTCCG TGGCCGGCGC GCCGCCGGCG
CCCGCGACGG AACCGACGAT GGAGCCGGTG ACGGCGCCGC GGCCCGAAGC CGCCGGCGCG
GCCCCGTGCG AGCCGCAGCC CGACGAAGGC CGGGGCGGCG CGCCGGGCAA CGTCGAGCGG
ATCCGGCGCG AGCTGATGCG CAGTCTCGCC GACGCGCTGT TCGTCGACAT CGCCGAGATC
GACGTCGATC GGCCGTTCGC GCAGATCGGC ATGGATTCGA TCGTCGGCGT CGAGTGGATC
AAGGGAATCA ACCAGCGCTA TCGCGTCGCG CTGAAGGCGA CGGACGTCTA CGATCACCCG
ACGATCGCCA GCATCGCGGC ACTCGTCGAC GCGCGCGGCG CGAGTGCCGC AAGCACCGCA
AGCACCGCAA GCACCGCAAG CACCGCAAGC ACCGCAAGCA CCGCAAGCAC CGCAAGCACC
GCAAGCACCG CAAGCACCGC AAGCACCGCA AGCACCGCAA GCACCGCAAG CACCGCAAGC
ACCGCAAGTC CCGAGGCCGA CGTGCGGCTC GCGGCCGATG CCCGCATCGC GTCCGCCGCA
TCCGGCGCGC CGGAAAGCGA ACCCGCGCGC GCGGCGCACC CGGGCGCTGC CGCTTCGCGC
GCGCGCGCCG ACGTGGCGCC CGACGCCGGC CCGGCACCGC GCCGTCTCGA CGCGGCGGCG
CCCGACGCGC AGGCGGCGCG CGCCGAGCCC GTGCCCGAGC GGATCGCGAT CGTCGGCATG
TCGGGCCGCT ATCCGGGCGC GCCCGATCTC GACGCGTTCT GGGACAACCT CGCGGCCGGC
CGCGACGCGA TCGCCGAGAT CCCGCCGAGC CGCTGGCCCG TCGGCGCGTT CTACGATCCC
GAGCCGGGCA AGCCCGGCAA GGTTTATTGC ACGCGCATCG GCCTGCTCGA CGATGTCGAC
CGTTTCGATC CCGACTTCTT CCGGATCTCG CCGGCGGAAG CCGAAGAGAT GGATCCGCAG
CACCGGCTGT TCCTGCAGGA GGGATACCGC GCGATCGAGC AGATGGGCTG CGCGCCGGCG
TCGCTGTCGC GCCGCAAGTG CGGCGTCTAT CTGGGCGTGA TGAACCACGA GTACGGCGAG
CTCGCGATGC GCCACCGCGG CGCCGCATCC GGGATCGGCA GCAGCTACGC GATCGGCGCC
GCGCGTCTCG CGTATTACCT GAACCTGAAG GGGCCGGCGA TTCCGGTCGA CACCGCGTGC
TCGTCCGCGC TCGTCGCGAC GCACCTCGCA TGCCAGGCGC TGCGCAACGG CGAGATCGAT
CTCGCGCTCG TCGGCGGCGT GACCGTCTAT CTGACGCCCG AATCGTATGT CGCGATGTGC
GCGGCGGGCA TGCTGTCGCC CGAGGGGCGC TGCAAGACCT TCGACGACGC GGCCGACGGC
TTCGTGCCGG GCGAGGGCGT GGGCGCGCTC GTGCTGAAGC GGCTCGCCGA CGCCGAGCGC
GATCGCGATC CGATCCTCGG CGTGATCGTC GGCTCGGGCC TGAACCAGGA CGGCCGCACG
AACGGCATCA CCGCGCCGAG CGGCAGCAGC CAGACCGAGT TGCTGCGCGA TGTCTACCGC
CGGCACCGGA TCGATCCGGC CGGCATCGGT TACGTCGAGG CGCACGGCAC CGGCACGAAG
CTCGGCGATC CGATCGAGTT GACCGCGCTG TCCGCGGCGT TCGGCGATTA CACCGACCGG
CGCGGGTTCT GCGCGCTCGG CTCGGTGAAG ACCAACATCG GGCATACGTC GGCGGCGGCC
GGCGTCGCGA GCATCCACAA AGTGCTGCTG TGCCTCGCGC ATCGCGAGCT GGTGCCGACG
CTGAACTACG CGAACCCGAA CCGCCATTTC GATTTCGCCG ATTCGCCGTT CTACGTGAAC
ACCGACCGGC GCGCGTGGGA CGCCGCCGGC GACGCGCCGC GCCGCGCGGC CGTCAGCTCG
TTCGGCTTCA GCGGCACCAA CGCGCACGTC GTGATCGAGG AGTACCGCCC GGCCGCCGCC
GCCGCGCCGG ACGCGTCGCC GCCGCGCGTG ATCGTGCCGC TCTCGGCGCG GCATCCGGAG
CGGTTGCGCG CCTACGCGCG CAACCTGGCC GACTGGCTCG CGCAGGCGGC CGCGCGCGGC
GCGCCGGAGC GGCTCGCCGC GCATCTCGCG TACACGATGC AGGTCGGCCG CGACGCGATG
GCCGAACGCG TCGCGTTCGT CGCCGACGGC CGCGACGAAC TCGAGCGGCA GTTGCGCCGC
TACGCCGACA CCGGCGAGAC GAGCAACGGC GTGTACGCGG GCCGCGCCGA GCCGCACGCT
CAGGCGTCGA ACGCGCTGAT GCTCGACGAG GCGTTCGGCG CGGCGATCGA CGGATGGATG
CGCACGGGCA AGCACGAGCC GCTCGCGAAG CTGTGGGCGG GCGGCTTCGA TCTGGACTGG
GCGCGGCTTT ACGACGGCGT GCCGGCCGCC GCGATGCCGC GCCGGATCGC CGCGCCGACT
TATCCGTTCG CATCGGGGCG CTACTGGATC GATGTCGAAC CCGACGGGCG CGCCGCGGGC
CCGGATGCGG ACGCCGCTTC GCCCGAGGCC GATTCCGGCT CCGACTCCGA GCACGCACAC
GAACATGAAC ATGAACCCGC CGCGACGCTT GCATACCTGC CCGTCTGGGA GGAACTGCCG
CCCGCGCAGC CGCGCGCCGC GCCCGATGCG CAGGCGGGCG GCGTACTCGT CGTGCATCGC
GGCGGCGCGT GGGGGCTCGT CGACGCGATC GAGCGCGAGT GCGTGGACGG CCGCCATGCC
GGCGCGACCT GCATGACGCT CGATCTGTCC GGGCATGCGC CGTCGCCGGA GGGCCGCGCG
TGGCGCAACG CCGCGCCCGA CGCGGCGCGG CTCGCCGCGT GGCTCGGCGA ATTCGGCCCG
GTGCGCGCGG TATTCTTCGC GGCGGGCTGC AGCGAGGCCC GCCACGATGC GTCGGGCGCG
CACGGCTGGG CGAGCGCGCC CGATGCGCAC GGCGAGGACG AGCGCGCGCT GTTGCAGCTC
GCGCAGGCGC TGATGCGCTC GCAGGCGGCC GACGCGTCGA TCGAGTTCGT CGTGCTGTCG
CTCGATCACC ATCGCACCGA CGGCACGCCG TCGAATCCGG CGGGCGGCGG CGTCGCCGGC
ATTGCCTACG CGATCGCGCA GGGCGATCAC CGCTTCCGGG TGACGAACGT CGACGTGTCG
CTCGACGAGC TGCGCGCGGC GCGGCATGCG CCGGCGCCGC ATCCGGTGCT CGCGGCCGTG
CTGCGGCTCG CGCCGTCCGA TCGCGGCGCG CTCGTCAGGC TGCGCGCCGG CCGCGGCTAC
CGGCAGGCGT TCGTGCGGCT CGACTGGGCC GCCGAGGCGG GCGCCTCGGG GCTGAAGCAG
GGCGGCGTCT ACGTGATGCT CGGCGGCGCG GGGCGCGTCG GCCGGGCGCT GACGCGGCGG
CTCATCGAGC GCTATCGCGC GAACGTCGCG TGGATCGGCC GCAGCCCGGC CGATTCGGCG
AGCGTCGCGC ATGCGCTGCG CGCGCTCGGC CCGGCGGGCC CCGCGCCGTA TTACGCGCAG
GCCGACGCGA CCGATGCGGC GCGCGATCGA AGCCGTCCGG CAGCGCCACG GCCGCATCGA
CGGCGCGGTG TTCTGCGGGA TGGTGTTCGA CGCGAACCAC GCGATCGCGA GCGTGCCGGC
GCACCGGTTC GACGAGATTC TGGACGTCAA GGCGCGCGGC AGCCGCATCT TCTACGAGGC
GCTCGCGCAC GAGCCGCTCG ATTTTCTCTG CTACTGCTCG TCGGCGCAGT CGTTCTCGTT
CTCGGGCGCG GCGCGGCTCG GCGCGTACGC GGCGGCGACG ACGGCGGGCG ACGCGATCGT
GCGCTCGATC GCGCCCGTCG CCGCGTTTCC GGTCGGCACG ATCCACTGGG GGTTCTGGGA
AACGTCGGTC GAGGATTCCG CGCTCGGCTC GCGGCATCTC GGCGCGCTGT CCGACGACGA
GGGGTTCGCG TGCTTCGAGC GGTTCGTCGG CCAGTGCATG CGCGGCAATC CGCTGCGCGA
GGTCGTCTGC ATGCGGGCGT CGCCGGAGGT CGAGCATCTG ATGCAGGTGC TGCCGGGCGA
AACCGCGACG CTCGCCGCGC CGGGGCAGCC CGCGCAGCCG GCGCCGCTTC GCGACGCGCC
GGACGGCGCG GCCGACGTAT CGGCGGATAT CGACGCATGG CTCGCGCGGC TCACGTTCGC
GACGCTGCGC CCGATGCTCG ACGGCCCGCG CGCGCGTGCC ATGCGCGCTG GTGGGACGAG
ACGCTGCGGA TCTTCGCGGC GCGCGGCTGG TTGCGCATCG TCGACGGCGC GCCGCGCGTG
ATCGCCGAGC CCGATGCGGG CGAGCACGTC TGGCGCGACT GGGCGCGCTA CCGGTTCGAC
ACGCCCGCGG CCCGCGGCCG GCGCGCGCAG ATCGACCTCG CCGACGTGTG CGTGCGCGCG
CTGCCGGACG TGCTCGCGGG CCGGCTGCCC GCCGCCGACG TGCTGTTTCC GGGCGGCTCG
ATGGAGCGCG TCGAGGGCGT GTACCGCGAC AATCCGATCT CGGATTACTT CAACGCGGTG
CAGGCCGACG CGCTGATCCG CCATGTCCGC GCGTGGATCG ATGCCGGCCG GCGCGAGCCG
ATCCGCATTC TCGAGGTCGG CGCGGGCACG GGCGGCACCA CGGCGCTCGC GCTCGAGCGG
CTGCGGCCCT ACGCGGCCGC GATCGGCGAG TATTGCTTCA CCGACGTGTC GCAAGCGTTC
CTGCAGCACG CGCAGGCCGC GTTCGGGGCG CGGGCCGGGT ACTTGCGCAC CGCGCTGTTC
GACGTCGAGC GGCCGCTCGA CGCGCAGCGG ATGCCGGCGG GCCGCTACGA CATCGTGATC
GCGACCAACG TGCTGCATGC GACGCGGCAG GTACGCGGCG CGCTGCGCAA CGTGAAGGCG
TGCCTGCGCG CGGGCGGCGT GCTGCTGCTC AACGAGATCA GCGAGAAATC GCTGTTCGCG
CATCTGACGT TCGGCCTGCT CGAAGGCTGG TGGCTGCACG AGGATTCGTC GCTGCGCGAA
CCGGGCAGCC CCGTGCTCGC GCCCGCCACC TGGCGCCGGC TGCTCGAAGA CGAAGGCTTC
GGCGCGATCG CGTTTGCGGC GCGCGACGCG CATGCGCTCG GGCAGCAGGT CGTCTGCGCG
ACGAGCGACG GCGTGATCCG CCAGCGCGCC GGCGAACCTT CGGGCCATTC GAGCCGGCAA
GGCCATCGGA ACCGTCAGGA TCATCAGGGC CGTCAGGAGA GTTCGACCCA TGCGGGCGAG
GCCGGCGCCG CGCCGGCCGC GAGCCCCGCC GGCACGGCGC GCGAGCCTGT CGTCGCGGCG
ATCCATCGCG CGCTGCAACA GTCGCTGAAA TTGTCGGAAG CCCGGATCGG CGATCACACG
CCGTTCCTCG ACTACGGGAT CGATTCGATT CTCGGCGTGC GCTTCGTCGA TTCGCTCAAG
CAGGCGCTCG ACGTGCCGCT CAACACGGCT GTCCTGTTCG ACTATCCGAC CGTCGAGCGG
CTCGCGGATT TCATTGTGGC CACCTACGGC GCGCGGCTCG CCGCGCGCGG CGCATCCGCC
GCGCCGGCGA GCGTCGCGAC CGCCTCCGCG ACACTTGCCG CATCGACTGC ATCGACTGCA
TCAACTGCAT CAACTGCACC GACTGCGCCG ACTGCGCCGA CTGCGCCGAC TGCATCGGCG
GCACCGGCGG CACCGGCCGC GCCGAACGCA CCCGCCGCGC CGGCGATGCC CGCCGAGGCC
GTTTCGCGCG ACGCCGCCGC GCCGCGCGCC GAACCGGCGG GCGCGCGGCC GGCGGACATC
GCGGTCATCG GCATGGCCGG GCAGTTCCCG GACGCGCCCG ACGTCGACGC GTTCCGCGCG
CTGCTCGAGC ACGCGCGGGA CGGCGCGCCC GGCGTGTCGG GCGGCATGCT CGAGAATCGC
GACCGCTTCG ATCACGCGTT CTTTCACATC ACGCCCGACG AGGCCGACGC GATGCACCCG
TATCAGCGGC TCGTGCTGCA GGAATCGTGG AAGGCGCTCG AGGATGCCGG CTACAACCCG
GCCGCGCTCG CCGGCGCGCG GGTGGGCGTG TTCGTCGGCG CGGAGCCGGC CGATTACCGG
TCGACGACGT TCAGCGGCTC GTCCGACGCG CTGATCGCGT CGCGCGTGTC GTATCACCTG
AATCTGCGCG GCCCGGCGTA CGTGGTCAAC ACCGGGTGTT CGTCGGGCGC CGTCGCGATT
CATCTCGCCT GCGAGAGCCT GCGCCGCAAC GAATCGGACG TCGTGCTCGC GTGCGGCATC
TTCGCGGCGA TGGGGCCGCG CATGCTGGGC GCGCTGGGGC AGGCCGGCAT GCTGTCCGCC
GGCGGGCGGT GCCGCAGCTT CGACGCGGGC GCCGACGGCA CGGCGTTCGC CGAGGGAATC
GGCGTCGTCG CGCTCAAGCG GCTCGCGGAC GCGATCGCCG ACGGCGATCC GATCCACGGC
ATCGTGAAGG CGTCCGGCGT GAACCAGGAC GGCACCAGCA ACGGGATCAT GGCGCCCAAC
GGCGTCGCGC AGGAGGAACT GATCGTCGAT GTCTACGAGC GCTTCGGGAT CGATCCGGCC
GACATCCGCT ATGTCGAGGC CCACGGCACC GGCACGCTGT TCGGCGACGC GGTCGAGGCC
AACGCGCTCG TCAGGGCGTT TCGCCGCTTC ACCGAGCGCA GCGCGTACTG CGCGCTCGGC
ACCGTGAAGG CGACCATCGG GCATACGGCG GCCGCGGCCG GCGTGATCGG GCTGATCCGC
ATCCTGCTGT CGATGCGCGC GCGCCGGCTG CCGGGCATGC CCGGCCTCGG CCGCGCGAAC
CCGATGATCG ATCTCGACGC GTCGGCGTTC TCGCTCGGCC TCGTCAGCCG TGAATGGCCG
GCCGGCCGCG ACGGCCGGCC GAGGCTCGCC GCGTTGAACA CGTTCGGCCA CAGCGGCACC
AACGTCCATA TCGTCGTGCA GGAGCCGCCG CAGGCGCGGG CGCGGCCAGC CCGCGCGGCG
GACGGCCCGC GCGTCGCGGT GCCGCTTTCC GCGATGGACC GGGAGGCGCT GCGCCGCTAC
GCGGCGCGCC TTTGCGAGCG GCTCGAAGCG GAGGGCGCGG CGCTTTGCGT CGGCGACGTC
GCGCACACGC TGCGCGTCGC GCGCGAGCCG ATGGCGCAGC GAATCGTGCT GTTCGCGTCG
ACGACGGGCG AGCTCGCCGC GTTGCTGCGC GCGTTCGTCG ACGGACGGGA TTCGCCGTGC
CTGCTCGACG GCGCGGTGAC GGCGGCCGCG CGAGCGGCGG GCCTCGACGC GGCGCAGCTC
GCGCAGGCGG CGCGCTGGCT CGCCGGCGAG CGCGTCGACT GGCCGCCCGC CGGCGGGACG
CCGATGCGCG TGCATTTGCC AGCCTATCCG TTCGCCGGGC GGCGCTGCGG CGCGGCCGGA
TGGGCGCGCG CCGAGGCCGG CGCGTCGCGC GACTGCGCGG CGGCGGGGCC GTGCGAGCCG
CCCGCGGGCG TCGCGGCCGC CGCGATGACC GTGGCGGCCG CGACGCCCCG CGTGGACGCG
ACGCCGTCCG CCGCGGCCGA CCGCGCGCGC GGCGCGGCGC GGCCGGCCGA GTGGCTCGCG
GCGCGCGTCG CCGCGCGGCT CGGCGTGCCG GCCGCGCGCG TCGACCGGCG CCGCAGCCTG
CTGGATCTCG GACTCACGTC GCAGGACCTC GTGAGCCTCG CGGGCCAGTT GCGGGACGCG
ACGGGCGAAG CGCTGCTGCC GAGCGTGCTG TTCGACTATC CGACGATCGA ACGGCTCGCC
GCTCATCTGG CCGACACCTG TCCCGCCGCG TTCGGCGCGG CCGAGCTCGC CGAGCTCACC
GAGACCGCCG AGACCACCGA GCCCGCCGAG ACCGGCCGCG CCGCGGCGGG CGACGCGGCG
AGCGGCCCCG CGCCGGGCGT GATCGCCCTG CTGGAACGAC TCGAGGGCGG CGGCCTGAGC
CTCGAGGAGA CGATTTACTT GATCGAGAAC ACCAAATGA
 
Protein sequence
MRGAPARFAR LGIEYGATHR VLLRLRVDGD EALAELAPAA DGIAQHELHP GTLDAALQPM 
LALLGERVGD GVPVVPYRIE RAEIHAPTHG ARWAWLRMRP DAHEWIFDVD LCDARGALCV
ALRGIAVTAW RRPDEVVRLE PVWRAAPVEA DLHDERAGPD AQRVVFVCGA HGAPRAWPAD
GIAPVRYAAL AAGAPPGEPD ALAGWFEAHA LALFDEVRRL LAPGMRQATL VQIVVPAAGP
GAILHALGAL LQTAHLENPL LHGQVIAIDD IDAPDLPRRL ARDARRAADT RIRYVNGERQ
VACFDEAAAP DGARALPWRQ GGVYLVTGAA GGLARALADA IARGIGGDAP RATLVLTGRS
PARDDMRALV ASLCALGAAT DYRVLDVADR DAVARMVEAI VGEFGALHGV VHCAGVLRDN
YLLRKSADEF AQVLAPKVRG TVNLDLATRD VRSLDFFVTF SSGAGVVGNP GQADYAVANA
FMDAFAAHRA SLGAARPGVS VSIAWPIWQA GGMRIDRQTE AELERRLAMR PMPTALGLDA
LHACLLGASP CPTVVHGARA RILALARQGF AAPPAAPPGF GAADRGADAP GADADAVKTR
VRAAIDGALS AVLKLPDARL REPEYFESYG IDSINAIRLT VELERTFGPL PKTLFFEYRH
ADELERYLVS VHGAAVGARI RSAARGAPAG PACARAGESG ESGEPPGAPT RSPASEGGAA
GAPRASAAPP LAERDIAVIG MAGRYPQADD LQQYWDNLRD GRDCIEEIPP HRWDWRKHYD
PARGHGAHHS KWGGFINDVD AFDPLFFNIS PKEAVSMDPK ERLFLEQVWT AMEDAGLRPE
DLRRDAQRGT GVYVGLMYEN YQLLAAEAAA AGSDVGMAGG SYASIANRVS FFLDLRGPSL
AVDTMCSSSM VAVHLACRDL LAGEIGVAIA GGVNLSLHPN KYRMLSAARF MSGDGRCASF
GSGGEGYVPG EGVGVLLLKR RADAERDGDR ILGLIKASAI NHGGRSNGYT VPNAAAQGGV
IANAIRAAGI DARAINYVEA HGTGTALGDP IELAGLARGF ADSGANGPCR IGSVKSNIGH
CEGAAGIAAL TKVLLQLAHR QIVPSLHSRE LNPDLPLDGS RWIVNQSLCD WERVVVDGVP
LPRTAGVSAF GAGGTNAHLI LSEYPADACA APAGVIEPAG RDAHDTHDTH DTHDMHDMQD
MQDMQDALVV PLSARNAQRL HAYAQRLRAF VAAHARGERG APPRLVDLAF TYQRGRIAMP
ERLAIVARSL AELERALTAY VAGQRAGDGI YAGRADRAAA GDARGAASER TAADFADRLA
ARWVAGEAVD WHALFDGRAP RRIAAPTYPF ERGRYWIGAS RAAAGAAEAA MRAGGGASRA
TTAPRALSES GGEVRQPDQT IRRVRLAPTS SFTATARAPS PTQAAQATQA TQATQATQAS
QATRAAAAFA PATKASLEAA LRDSLASALF VGVDEIDPSR PFSELGLDSI VGVEWIRDIN
RRYGVSIRTT DVYDYPSVGE FAGLLERLLR ESSVAGAPPA PATEPTMEPV TAPRPEAAGA
APCEPQPDEG RGGAPGNVER IRRELMRSLA DALFVDIAEI DVDRPFAQIG MDSIVGVEWI
KGINQRYRVA LKATDVYDHP TIASIAALVD ARGASAASTA STASTASTAS TASTASTAST
ASTASTASTA STASTASTAS TASPEADVRL AADARIASAA SGAPESEPAR AAHPGAAASR
ARADVAPDAG PAPRRLDAAA PDAQAARAEP VPERIAIVGM SGRYPGAPDL DAFWDNLAAG
RDAIAEIPPS RWPVGAFYDP EPGKPGKVYC TRIGLLDDVD RFDPDFFRIS PAEAEEMDPQ
HRLFLQEGYR AIEQMGCAPA SLSRRKCGVY LGVMNHEYGE LAMRHRGAAS GIGSSYAIGA
ARLAYYLNLK GPAIPVDTAC SSALVATHLA CQALRNGEID LALVGGVTVY LTPESYVAMC
AAGMLSPEGR CKTFDDAADG FVPGEGVGAL VLKRLADAER DRDPILGVIV GSGLNQDGRT
NGITAPSGSS QTELLRDVYR RHRIDPAGIG YVEAHGTGTK LGDPIELTAL SAAFGDYTDR
RGFCALGSVK TNIGHTSAAA GVASIHKVLL CLAHRELVPT LNYANPNRHF DFADSPFYVN
TDRRAWDAAG DAPRRAAVSS FGFSGTNAHV VIEEYRPAAA AAPDASPPRV IVPLSARHPE
RLRAYARNLA DWLAQAAARG APERLAAHLA YTMQVGRDAM AERVAFVADG RDELERQLRR
YADTGETSNG VYAGRAEPHA QASNALMLDE AFGAAIDGWM RTGKHEPLAK LWAGGFDLDW
ARLYDGVPAA AMPRRIAAPT YPFASGRYWI DVEPDGRAAG PDADAASPEA DSGSDSEHAH
EHEHEPAATL AYLPVWEELP PAQPRAAPDA QAGGVLVVHR GGAWGLVDAI ERECVDGRHA
GATCMTLDLS GHAPSPEGRA WRNAAPDAAR LAAWLGEFGP VRAVFFAAGC SEARHDASGA
HGWASAPDAH GEDERALLQL AQALMRSQAA DASIEFVVLS LDHHRTDGTP SNPAGGGVAG
IAYAIAQGDH RFRVTNVDVS LDELRAARHA PAPHPVLAAV LRLAPSDRGA LVRLRAGRGY
RQAFVRLDWA AEAGASGLKQ GGVYVMLGGA GRVGRALTRR LIERYRANVA WIGRSPADSA
SVAHALRALG PAGPAPYYAQ ADATDAARDR SRPAAPRPHR RRGVLRDGVR REPRDRERAG
APVRRDSGRQ GARQPHLLRG ARARAARFSL LLLVGAVVLV LGRGAARRVR GGDDGGRRDR
ALDRARRRVS GRHDPLGVLG NVGRGFRARL AASRRAVRRR GVRVLRAVRR PVHARQSAAR
GRLHAGVAGG RASDAGAAGR NRDARRAGAA RAAGAASRRA GRRGRRIGGY RRMARAAHVR
DAAPDARRPA RACHARWWDE TLRIFAARGW LRIVDGAPRV IAEPDAGEHV WRDWARYRFD
TPAARGRRAQ IDLADVCVRA LPDVLAGRLP AADVLFPGGS MERVEGVYRD NPISDYFNAV
QADALIRHVR AWIDAGRREP IRILEVGAGT GGTTALALER LRPYAAAIGE YCFTDVSQAF
LQHAQAAFGA RAGYLRTALF DVERPLDAQR MPAGRYDIVI ATNVLHATRQ VRGALRNVKA
CLRAGGVLLL NEISEKSLFA HLTFGLLEGW WLHEDSSLRE PGSPVLAPAT WRRLLEDEGF
GAIAFAARDA HALGQQVVCA TSDGVIRQRA GEPSGHSSRQ GHRNRQDHQG RQESSTHAGE
AGAAPAASPA GTAREPVVAA IHRALQQSLK LSEARIGDHT PFLDYGIDSI LGVRFVDSLK
QALDVPLNTA VLFDYPTVER LADFIVATYG ARLAARGASA APASVATASA TLAASTASTA
STASTAPTAP TAPTAPTASA APAAPAAPNA PAAPAMPAEA VSRDAAAPRA EPAGARPADI
AVIGMAGQFP DAPDVDAFRA LLEHARDGAP GVSGGMLENR DRFDHAFFHI TPDEADAMHP
YQRLVLQESW KALEDAGYNP AALAGARVGV FVGAEPADYR STTFSGSSDA LIASRVSYHL
NLRGPAYVVN TGCSSGAVAI HLACESLRRN ESDVVLACGI FAAMGPRMLG ALGQAGMLSA
GGRCRSFDAG ADGTAFAEGI GVVALKRLAD AIADGDPIHG IVKASGVNQD GTSNGIMAPN
GVAQEELIVD VYERFGIDPA DIRYVEAHGT GTLFGDAVEA NALVRAFRRF TERSAYCALG
TVKATIGHTA AAAGVIGLIR ILLSMRARRL PGMPGLGRAN PMIDLDASAF SLGLVSREWP
AGRDGRPRLA ALNTFGHSGT NVHIVVQEPP QARARPARAA DGPRVAVPLS AMDREALRRY
AARLCERLEA EGAALCVGDV AHTLRVAREP MAQRIVLFAS TTGELAALLR AFVDGRDSPC
LLDGAVTAAA RAAGLDAAQL AQAARWLAGE RVDWPPAGGT PMRVHLPAYP FAGRRCGAAG
WARAEAGASR DCAAAGPCEP PAGVAAAAMT VAAATPRVDA TPSAAADRAR GAARPAEWLA
ARVAARLGVP AARVDRRRSL LDLGLTSQDL VSLAGQLRDA TGEALLPSVL FDYPTIERLA
AHLADTCPAA FGAAELAELT ETAETTEPAE TGRAAAGDAA SGPAPGVIAL LERLEGGGLS
LEETIYLIEN TK