Gene BMAA1451.1 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA1451.1 
Symbol 
ID3704282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp1563641 
End bp1577383 
Gene Length13743 bp 
Protein Length4580 aa 
Translation table11 
GC content73% 
IMG OID637565338 
Producthypothetical protein 
Protein accessionYP_338486 
Protein GI77358933 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01746] thioester reductase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCCGT TCGCGAGGAA AACGAAACAT GCTGATCAAC GGCCTTTGCA TCGAAGGGGG 
CGGCGAGCCG CTGTCGATCG TCGATCCGGC GACGGGCGAG CCGCTCGCGG CGCCCGCCGC
CGCGAGCGCG GCCGACCTCG AGCGCGCGGT CGCGGCCGCC GAAGCGGCGT TCCCCGCATG
GCGCGCGACG ACACCGGCCA CGCGCGCGAG CCTGCTGCTC GCGCTCGCCG ACGAAATCGA
CCGGCAAGCG GGCGCGCTCG CGCGAATCGA AAGCCGCAAC ACCGGCAAGC CGTTGCATCT
CGTCGTGCAG GACGAACTGC TCGCCGTCGC CGATTGCTTT CGCTTCTACG CGGGCGCGGC
GCGCACCGCG AGCGGGCCGT CCGCCGGCGA ATACGTCGAG GGCCACACGA GCATGGTGCG
GCGCGATCCC ATCGGCGTCG TCGCACAGAT CGCGCCGTGG AACTACCCGC TGATGATGGC
CGCGTGGAAG CTCGCGCCGG CGCTCGCGGC CGGCAACACG ATCGTGTTCA AGCCGTCGGA
ATGGACCCCG CTGTCGATCG TCGCGCTCGA AGCCGCGCTC GCGCGCATCT TTCCCACGGG
TGTCGTCAAC ATCGTGCTCG GCGACGGCGC GAGCGTCGGC CGCGCGCTCG CCACGCATCC
GCGCGTGCGG ATGATCTCGC TGACGGGCTC GGTCGAAGCG GGCAAGTCCG TGCTCGCCGC
GGCGGCCGGC AATCTGAAGC GCACGCATCT CGAACTCGGC GGCAAGGCTC CCGTGCTCGT
GTTCGACGAC GCCGATCTCG ACGCGGCCGT CGCCGGCATC CGCTACGCGG GCTTCTACAA
CGCCGGGCAG GACTGCACGG CGGCGACGCG CATCTACGTG CAGCGCGGCA TCTACGACAC
GCTCGCGCAG CGGCTCGCCG ATGCGGCGAG CACGCTGCGC GTCGGCCCGC CCGATCGCGC
GGATGCCGAG ATGGGGCCGC TCGTGAGCGC CGCGCACCGC GCGCGCGTCG AGCGCTTCGT
GCGCGAGGCG GCCGCGCTGC GGCACGCGCG CGTACTCACG GGCGGCGCGC CGCTGCCCGG
CCCCGGCTGC TACTACGCGC CGACCGTGAT CGCCGGCGTG CGCCACGACG ACGCGCCGAT
GCGCCGCGAA GCGTTCGGCC CGGTCGTCAC GCTCACGCCG TTCGACACCG AATCACAAGC
GCTCAGGTGG GCGAACGATT CGGAATACGG ATTGGCTTCG TCGGTATGGA CCCGCGACGC
GGCGCGCGGC ATGCGGCTCG CCGCGTGCAT CGACGCGGGC GTCACGTGGG TGAACGCGCA
TTTCACCTAC ACGGCCGACA TGCCGCACGG CGGGACCAAG CAGTCCGGCT ACGGCTCCGA
TCTATCGACG CTCGGCCTCG CCGATTACAC GCAGCCGCGC CACGTGATGT GGCGGCATTG
ACGCGCACGG CGCGCGCTTT CCAGAACCCG ACACAGACGA TTCCCATGAC CGCCTCCCCT
CCCTCCAGCG CACTCGTCAC GGCCGTCGAA GCGGCCGTCC TGTCGCTCGC CGGCGACGTC
GCCGGCCGCG CGTTCGACGC GTCGGCTGCG GCGTGCCCGC TGCACGCGCT CGGCTTCGAT
TCGGTGCAGT ACGTCGAATT GTCCGGATGC CTGAACGAAT ACTACGGGCT CGATCTCGCG
CCGACGCTGT TCTTCGACGT GCACGTGCCG CGCCGGATCG CCGAGCATCT CGTCGCGCGG
CATCCGGCGG CGCTCGCGCG CAAGCACGGC ATCGGGGCCG GGGACGACGC CGACACGGCC
GCTCGGGCCC GCGCGGCCGC GGCCGAGAAC GGCGCGCCGC AGCCGGACAT GCGAGCCGGG
GCGGCGCGGC CCGCGGGCGA GCCGCTTCTC GACACGCATG CGAGCCCCGG CGAGCCGCGC
GGCGACGCAC ACGAGAATCC ATGTGACGAC ACGCGCGGCG CGGCCGCCGC CGACGCGCAC
GAATCGGCCG CCGATATCGC GATCGTCGGC ATGGCCGGCA TCTTCCCGCA ATCGGCCGAC
CTCGACGCGT TCTGGCGGCA TCTCGCCGCG GGCGACGATC TGATCGCCGA GGCGCCGGCC
TCGCGCTGGG ATTGGCGCGC GGGCGACGGC GAGCCCGCAT CGCGCTGGGG CGGCTTCATC
CCGCGCATCG AATATTTCGA CGCCGCGTTC TTCGGCATCT CGCCGCGCGA AGCCGAGCAG
ATGGACCCGC AGCAGCGCCT GCTGATGCAG ACCGCGTGGG CGGCGCTCGA GGACGCGGCG
GTGCGCCCGT CCGATCTGAT GGGCAGCGAC GCGGCGGTGT TCGTCGGCAT CAGCACGTCT
GACTACATGG CGCTGCTGCC CGGCGCGGAC GGCCATCTCG CGGTCGGCAA CGCGCACGCG
ATGCTGCCGA ACCGGCTGTC GCACCTGCTC GGCGCGCACG GGCCGAGCGA GGCTGTCGAT
ACCGCGTGCT CGAGCTCGCT TGTCGCGCTG CATCGCGCGG TGCGCGCGCT GCGGCGCGGC
GAAAGCAGCG TCGCGATCGT CGGCGGCGTC AACGTGATGC TGACGACGCG GCTGCACCGC
GCGCTCGCCG CCGCCGGCAT GCTGAGCCCC GACGGGCGCT GCAAGACGTT CGACGCGGCG
GCGAACGGCT ACGTGCGCGG CGAGGGCATC GCGGCGCTCG TGCTGATGCC GCTCGAGCGC
GCGCGCGCGA ACGGCCACCC GGTGCGCGCG GTGATCAAGG GCAGCGCGGT CAATCACGGC
GGCCGCGCGG CGTTCCTGAC CGCGCCGGAC ATCAACGCGC AGGCCGCGCT GATCGAAGCC
GCGTATCGCG ACGCGGGCGT CGACCCCGCC ACTGTTTCGT ACATCGAAGC GCACGGCACC
GGCACGTCGC TCGGCGATCC GATCGAAGTG CAGGCGCTGC GCCAGGGCCT CGACGCCTGC
GCGCGCGACC TCGCGGGCAC CGCCTCGCAC GCGCCGGCAC GCTGCGGCCT GGGCTCGGTC
AAGACCAATA TCGGGCATCT CGAAGCGGCG GCGGGCCTCG CGGGCGTCGT CAAGGTCGTG
CTCGCGATGG ACCGGCGCAT GCTGCCGCCG AGCCTGCATT GCCGTGAACT GAATCCGTAT
CTGAAGCTCG ACGGCAGCCG TTATCACGTC GTCACGGAAC CCACGCCCTG GCCGGACGAA
GCAACGCCGA CGCCGCTGCG CGCGGGCGTC AGCTCGTTCG GGTTCGGCGG CTCGAACGCG
CACGTCGTGC TGCAATCGGC GCACGCGCGG CCGATCGCGC GAGCGAGCGC GCCCCCACCG
CCGCACCCGA ACGAACAGGC CGGTGCCGAC GCGCCCGCCG CCGACGGCCC GCGCGCGTGG
TTCATCCCGC TATCGGCGCG CACCGATGCC GCGTTGCATG CGCGCGCCGC TCAGCTCGCG
CACTGGCTCG ACACCGAGCC GGCCGACGAC GCGTGGCTGC CCGCGCTCGC GAAGACGCTG
TCGATCGGCC GCGAACCGAT GGCGCGCCGC TTCGGCATCA CGTGCGCGTC GCTCGACGAA
CTGCGCGCGC AACTCGCGAT CGCGCTGGGC GGCCGCGCAA CGTCGCTCGC GCGCGATGAC
GCCCGGCTGC GGCCGCATGC GCCCGCCTGC GCGGCGTGGC TCGCGGGCGA GACCGACCCG
CTGCCCGCCG CGTGGGATGA CGCGACGCCG CGCCTGCGGT TGCCCGTCTA CCCGTTCGAA
GGCGAGCGGC ACTGGCCGAC CGAAGCAGCG CCGGCGGCGC GCTTCGCGCT CGCGCCCGAC
GCCGACGGCG CATACCGGAT CGCGATCGCA CCCGACGCGC CGCTCGTCGC CGACCATCGG
CTCGCCGGCG AGCCGGTGCT CGCCGCCGCC GCGCAAATCG TGATCGCGTG GCGCGCGTTC
GAGGCGGACG CGCTCGCCGG CGATGCCGGC CAGGCGGGCG ACGTCGGCGA GTCGATGGAG
TCGATGGAGT CGATGGAGTC GAACGGATCG AGCGCATCGA AGCCGGCGGC AACGTCCGCC
GATTCGGGCA CCGCCGCCGA TTCACGCGAT CTGCACGATT CATACCACTC GCACGACTTC
CGCCACACGA TCGACACGAT CGACACGATC GACACGAACG CCACGAGCGC CGCGACGCCT
ATCGCGCTGT GCGACATCGA ATGGCTCGCG CCGATCGCGA TCGGCGCGCC GACCGACCTG
TGCATCACGC TCGCGCGCGA CGCCCACGGC GACATCGACG CGCGCCGCGG CGAAGCCGCC
CATCGGCGCG CAAACGGCCG CGCCGCCCGC TTCGCGATCG CGGCCGCCCC CGCCATCGAT
ACGCCGCTCG GCCGCGGACA CGCGACGCGC ATCGCGAGCG CGCCGTCGAA CGCGCCCGAG
CTCGACATCG AGGCCATCCG CGCGCGCTGC ACGCAAGCGG TCTCGGCCGA CGCGTGCTAC
GACGCGTTCG CCGCGATCGG CATCGATTAC GGCCCGACGT TCCGCCCGCT GCGCGCGATC
GCGGTCGGCC GTGACGAAGC GCTCGCCGAA TTCGACGCGT CGGCGCTCGC GCGCACGACG
GGCGACGCGC GTATCGTCGC GCTGCTCGAC GGCGCGTTCC AGGCGATCGC GGGCCTGACG
CTCGCGCACG CCGCGAGCCT CGAAAGCGGC CTGCTGCCCG CGTCGCTCGC ACGCATCGAG
TTCACCGGGC CGCTCGCGGA CAGCGTCCGC GCGTGGATTC GCGAAGCACC GAGCGACACG
GGCCGCCGCA CATTCGATAT CGACCTCGTG ACGGCGAGCG GCCGGTCGTG CGCGTCGCTG
CGCGGCCTCG CGCTCGCGTC CGGCCGAAGC GCAACGTCGC GCGAAGCGCC ACGCATCACG
ACGCCGGGCG ACCATCTGTT CGCGCCGCAA TGGCTGCCGT GCGCGACGAA CGCGGCCGGC
GCGGCAACGC CGTCGCCGCG CGCCGGCGCG CTCGCGATCA TGGGCGGCAC GCCGGCGCAG
CGCGCCGCGC TCGCGGCGAC GCGCGCGGCG GCGCAGCGCC TGATCGACGA CATCGCCGAA
CTCGACGCGA ACGTGAGCCA TCTCGTCTGG CTGCCGTCCG CGCCCGCGGA CGCACATGCG
CCGCTCGCGC AATGCGCGAG CCTCGACGGG TTGCGCCTCG TGAAGCGTTT GCTCGCGCTC
GGCGCGGGCG ATCGCGCATT CGATCTGACG GTGCTCACCG TCCGTTCGTG GACGATGCCG
GGCGACGCGC CCGCGTTTCC CGCGCACGCG GATCTCGCGG GGCTGTGCGG GGCGCTCGCG
AACGAATACC CGCACTGGCG CGTGCGGCTC ATCGATCTGC CCGACGCCGC TGCGCTGCCC
GCCGACTGGC ACGCGCGGAG CGCCGAAGGC GGCCATCCGC TGCTGCTGCA CCGGCACGGC
CAATGGTTCG CGCGCCGGCT CGTGCCGCTC GCGGCGCTGC CCGCGCCCGC CGCGCAGCCG
TATCGGCCGG GCGGCGTGTA CGTCGCGATC GGCGGCGCGG GCGGCCTCGG CCGGGTGTGG
ACCGAGCACG CGATTCGCGC CTGCGGCGCG CAAGTCGTGT GGATCGGGCG GCGGCCGCTC
GACGCGCAGA TCGATGCGCA CTGTAACGCG CTCGCCGCGC TCGGCCCGCG CCCGTCGTAT
CTGAGCGCCG ACGCGAGCGA CGCCGAGAGC TTGCGCGCCG CGCGCGATGC GGTGCTCGAA
CGCTTCGGGC GGCTCGACGG CGTCGTGCAC ACGGCGATCG TGCTGGAGGA CGGTGGCCTC
GCGCAGCTCG ACGAAGCGCG ATTCAGCGCG GCGCTGAACG CGCAGGTCGC GACGACCGCG
AACCTCGCCC GCGTGTTCGG CAGCGATCCG CTCGATTTCA TCCTGTTCTT CTCGTCGCTG
CAAAGCGCGT TCGTCGCGGC GGGCCAGAGC AATTACGCGG CCGGCTGCAC GTTCCGCGAC
GCGTTCGCCG ACTGGCTGCG CACGCAGCTC CGATGCGCGG TCAAGGTCGT GAACTGGGGC
TACTGGGGGC AGACGGGCGT GGTCGCGACC GAGCCGTACC GCGCCCGCAT GGCCGCGCTC
GGCATCGGGT CGATCGAGCC CGCCCCGGCG ATGGCGGTCG TCGACGCGCT GCTCGCCTCG
AACGTCGATC AGGTCGGCTA TCTGAAGACG ATCGCGAGCG CCGCGGTGCC GACGCTCGCG
CCCGCGCTCG CCGCGCGCAT CGCGCCGCGC ACGCGCGCGC TTGCCGGCAC GCCGCCGCGC
GTCGACGCGA CGGACGACAG CGCGGCGTGG CGGGACGCGC TCGCGGCGCT CGAACGCGCG
ATCGCGCTCC GGCTGTTCGC CGAGCTCGGC GCGCTGCGCG TGTTCGGCGG AAGCGGCGCG
CCGGGCGGCC ATGCGTTCGA CGACGGCGCG GCCCGAAACA GCGCGGCCGG CAAATGTTCG
GCCGATGACC GCGCGCCCGA CGCCGCGCCG CCGACGCCGG ACGCGGCGCG CGCGGAATGG
GCGCGCGCAC GCGCCGAGCT CGAGCGCACC GCGCTGCTCG ACGCCCATCT CGCGCTCGTC
GACGCGACGC TCGACGCGCT GCCCGCGATC CTGCAAGGCA GCGTGCCCGC CACGTCGATC
CTGTTCCCGG ACGGCGATCT GAGCCGCGTC GAAGCGGTCT ATCAGCGCAA CGAGCAGGCG
GACCGCTGCA ACCGCGCGCT CGCCGATGCG GTGCTGCACC TCGTCGGCGA CGCATCGTCC
GCGCAACCGG CCGCGCTCGC CGAAATCGGC GCGGGCACGG GCGGCACGAC CGTGCCGCTG
CTCGCGGCGC TCGACGCGCG CGGCGCGCGG CTCGGCCGCT ACGACTTCAC CGACATCTCG
AAGGCGTTCC TGCTGAACGC CGAGCAAACG TTCGGCCGGG GCCGCGACAT GCTGCGCTAC
CGGCTGTTCG ACGTCGAGCG GCCGATCGCC GGGCAGGCGC TCGACACCGG CGGCTACGAC
ATCGTGATCG CGACGAACGT GCTGCACGCG ACGCAGGACA TCGGCGTCAC GCTGCGCAAT
GCGAAGGCGC TGCTGAAGGC AGGCGGCCAT CTGATCATCA ACGAACTGCT CGGCACGCAC
GGCTTCGCGC ATGCGACGTT CGGGCTGCTG CCCGGCTGGT GGCGGCACCG CGACAGCGCG
CGCCGCCTGC CCGGCAGCCC GCTGCTGTCG CGCGACGGCT GGACGCGCGC GCTGCGCGAA
GCCGGCTTTG CGGTGCTCGA CGGCGGCTCG GCCGGCGCCG CGGCGGGGCA AGGCGTGATC
GTCGCGCTCA GCGACGGCGT GATCGTGCAG CCGTCGCACG CCGACGCGCG GGCGGCCTCA
TGTGCGGCTT CGCGCGCGGC CCCGGGCGAC GACGCCGGCG CGCACGCCAG CGCCGCGCGG
CCGGCCGCAT CGGCTTGTTC GACTGCCTCG CCCGCACACG CGCCCGCGGC TTCGCCGATC
GCCGCCGCGC CGACCGGCGC GAGCCTGCGC GCGCGCTGCG TGCAGGCGCT CGCGCAACTC
GTCGCGCGGA CGCTGAAAAT GCCGGTCGGC AAGCTCGCGC CCGATCAGCC GCTCGGCAGC
TACGGCGTCG ATTCGATTCT CGTGATCGGG CTCACGAAAA CGCTGCGCGA GACGTTCGGC
GTCGCGCTGT CGAACGCGAC GCTGTTCGAG CATGCGACGC TGAACGCGCT CGCCGAATTC
TTCGTCGCCG AACATCGCGC GGCGTGCGAA CGCGTGCTGG GCGGCGACGC GGAACCCGCG
CCGAATGCGC CGAACGGATC AAACGCCGCG AGCGCAGCGG CGGCCACGCG CCCGGCCATG
CCACCGGCCC GCGCCGGCGC CCCATCGCCC GCCGCGGCTT CGGCCGCGCC GAAGCCGCGC
GAATCGAACG TGTGCGCCCC GCCCTCCGCC GACGACACCG CCGTCGCCGT GATCGGCATG
TCCGGCCGCT ATGCGCAGGC GGACAACCTG CGCGAGTTCT GGGCGAACCT GCGCGCGGGC
CGCCATTGCA TCACCGAAGT GCCCGCCGAG CGCTGGGACT GGCGCACGCA CTTCGATGCG
GAAAAAGGCG CGCCGGGCCG CACGTACAGC CGCTGGGGCG GCTTCCTGAC GCAGATCGAC
CGCTTCGACG CCGCGTTCTT CCGAATCGCG CCGAACGACG CCGAGCAGAT CGATCCGCAA
GGCCGCCTGT TCCTCGAGGA ATCGTGGGCC GCGATCGAGG ATGCCGGCTA TACGAGCGAC
ACGCTCAGCG CGGACCGCCG GGTCGGCGTG TTCGTCGGCG TGATGAACGG CGACTATCCC
ACGGGCGCGC AGTTCTGGAG CATCGCGAAC CGCGTGTCGC ACGCGCTCGA CCTGCACGGG
CCGAGCCTCG CCGTCGACAC CGCGTGCTCG TCGTCGCTGA CCGCGATCCA TCTCGCGCTC
GACAGCCTGC GCAGCGGCAC CTGCGACTGC GCGCTCGCGG GCGGCGTCAA CCTGATTCAG
AGTCCGAAGC ATCTGGTCGG GCTCGCGTCG CTCACGATGC TCTCGGCGGG CGACGCGTGC
CGCGCGTTCG GCGCGGGCGC GGACGGCTTC GTCGACGGCG AAGGCGTCGG CGTGCTCGTG
CTCAAGCCGC TGTCGCGCGC GCTCGCCGAT GGCGACGCGA TCCACGGCAT CATCCGCGGC
AGCATGATCA ACGCGGGCGG CAAGACGCAC GGCCTCACGG TGCCGAACCC GCGCGCGCAG
CAGGCCGTCG TCGGCGCGGC GCTCGCGCGA AGCGGCGTGC CGGCGCGCGC GGTCGGCTAC
ATCGAGGCGC ACGGCACCGG CACCGCGCTC GGCGATCCGA TCGAACTCGC GGGCCTCACG
CGCGCGTTCG CCGAAGCGAC CGACGAGCTC GGCTTCTGCG CGCTCGGCTC GGTCAAATCG
AACATCGGCC ATTGCGAAAG CGCGGCGGGC GTCGCCGGCG TGACGAAGGT GCTGCTGCAG
ATGAAGCATC GCGAACTCGT GCCGACGCTG CATGCGCACG AGCCGAACCC CGACATCGAT
TTCGCGCGCT CGCCGTTCGT GCTGCAACGC ACGCTCGCGC CGTGGCCGCA GCCGGCGCTC
GACGGATGGC CCCGGATCGC GGGCGTGTCG TCGTTCGGCG CGGGCGGCGC GAACGCGCAC
GTCGTGCTCG AAGAATTCAT CGAGACGCGC GCCGCCGCCG GCGGCGACGA CGCCGGCCCC
GCGATCGTCG TGCTGTCCGC CGCGACCGAC GCAGCGCTGC GTCGCCGCGC GCGGCAATTG
CACGCCGCGC TCGCCGCCGG CGAAATCGGC GACGAGCGCC TGCACGATCT CGCGTACACG
CTGCAGATCG GCCGCGCCGC GATGGTCTCG CGCTTCGGCT GCGTCGCCGG CAGCGCCGCC
GAATTGCAGG CGCAGCTCGC CGCGTTCGTC GAAGGCGACG CATCGCGCGG CTGGCACGCG
CACCGGCTCG CCGGCGACCG CCGCGGCCTC GCCGAGCTCG ACGCCGATCC CGAACTGCGC
GCGTCGCTCG TCGAGCAATG CGTCGCGGCC GGCAAGCTCG ACCGGCTCGC GGCACTCTGG
TGCCAGGGGC TCGGCATCGA CTGGCCCGCG CTGCATCGCG GCCGCGCGCG CCGGCGCATG
CATCTGCCGA CGTACCCGTT CGACGGCCCG CGCTACTGGC TGCGCGACGA CGCGGCGCAC
GCCGCCGAGC CCGCGCCGGC CGACGGCGCC GCCGAAGACG CAAGCGCCGA CGCACCGAAT
GCAGCGAACG CGCCGACGCC CGACGTCGCA ACGCTCGTCC GTCGAACGGT GGCGCAAGTG
CTCGGCTATC CGGACGTCGA CATGAACGAA TCGTTCCTGT CGCTCGGCGG CGATTCGATC
CGCGCGGCGC GCGCGCATCG GGTGCTGCAA CGGGCGCTCG ACACGAGGAT TCCGCTCAGC
CTGATGCTGG AGGCAAGCAC GCTCGCCGAA TGCGCGCAAG CGATCGATGC ACTGCTTTCG
ACGCAACCGG AACCGGCGAG CGCGCTCGCC TGCGAAACGA ACGCGGGCGC GGCCGGCGCG
CCGATCGCCG ACGCGGCCGC GTTCGAGTCG TCGGCGCCGC CCTCCCGGGA ATCGGCCTCC
CCGCCACACC CGGCCTCCCC GCCGCGCGAC GCGCGCCCGC GCGTTCATCC GCTGTCATCG
AACCAGCAGC AATTCTTCTT CCTCGACCGG CTGAACCCGG CGAACCCGGC GTTCAACCTG
CCCGGCGCGC TGCGCGTGCG CGGCGAATGG CACGCGCACG CGCTCGAAGC CACGTATCAG
GCGCTCATCG ATACGCACGA CGTGCTGCGC ACCCGCTTCG TCGTGCGCGG CGGCGAACCG
TGCGCGGAAG TCGCGCCGCA CCGCGCGGCC GCGATTCGCC GGCACGATCT GACGGCGCTG
CTGCCGAAGC ATCAGGCCGC GCGCGTCGCC GAGTGCCTCA CCGAGTCGAG CCGCGAGGGC
TTCGCGCTCG AACAGGGCGA ACCGAGCCGG CTGACGGTAC TCGAACTGCG CGACGACGAT
CACGTGATCC TGCTGAATCT GCATCACATC GTCGGCGATG CGGTGTCCGT CGTCGTGCTG
CTCGACGCGC TCGCGCGCGC CGCGCTCACG GGCCGCGCGG CCGCGCCGGA CCGCGCGCGG
CCGCAATACG CGCAATGGGC CGCGCACGAA CGCGATGCGC TGCCGGCGAC GATCGCGCGC
GAACTGCCGT ACTGGCTCGA GCGCCTGCGC GACGTGCCGC CGCCGTTGCC GCTGCCGTGC
GACCGCGCGC GGCCGCCGGT GCCGAGCTAT CGTGGGCGCA GCGTGCCGCT CGCGTTTGCG
CCGGCGCTCA TCACGCTGCT CGACGCATAC TGCAAGGCGC ACGGGCTGTC GCGCTTCGTC
GTGATGCTCG CCGCGTTCAA GCTCGCGCTG CGCGTGCTGT CGGGCCGTGA CGACGTCGTC
GTCGGCAGCC CGTACGCGAA CCGCGCCGAG GACGACACGG CCGACATGAT CGGCAGCCTC
GCCTACGCAC TCGTGCTGCG CACGCGGCTT GGCGAAGCAC AGACGTTCGC CGATGCGGTC
GCGCTCGTGC GGCGCACCGT GCACGGCGCG TTCGACCATC TCGGCGTGCC GTATCCGCGC
CTCGTCGAGG CGCTGAATCC GGCGCGGCAC GGCGGCGCGA ACCCGCTGTA TCAGATCATG
TTCAACGTGA TCCCGATGCC CGCGCTGCCC GAGGGCGTCG AGCCCGTCGA AGTCGATTCC
GGCTGGCTCG ACTACGATCT GTTCGTGCGG CTGCGCGCGT CGAGCCACGC CATCGACGGC
GTGCTGCAAT TCAGCGCGGA TCTCTTCGAT CGTTCGACGG CCGAAGCGAT CGCCGCATAC
TACGTCGAGC TGCTGCACAC GCTGCTCGCG CATCCGTCGC TGCCGCTCGC GAGCCTCGCG
CCGCCCGCCG AGCTCGCGCT CGAACGGACG ATCGCCGACG CGATGCCGCC GCTGCGCATC
GAGATCGCGT CGACGTTCAC CGACCGCCCG TTAGCCGGCA CGCTGCGCTA CTGGGGCACC
GCGACCGGCC AGCCGATCGA GCCGAATTTC GCGCCGTACG GACAACTGTT CCAGACGCTC
TACGATCCGT CCACGCCGTT CCATGCGAAT CGTCACGGAA CGAACGTCGT GCTGGTCAGG
CCGTGCGACT GGCTGCGCTT CGACGACGCG AACGCGGACG CCGCCCGCGC CGACCTCACG
GGCGACGCCG GCGCGGCGGC CGCCGAACGC ATCGCGCTGT ACGCCGACGA ACTCGCCGAC
GCGCTGCGCG ACGCGGCGCC GTCGCTCGCG GTGCCCGTGC TCGTGCTGGT GCTGCCGGAC
GATGCCGCGT CGCTCGCGGC GCGTGACGAA CACACGGGTA CGGCAACCGA AGCGCCTGCC
GAGGCGCTCG CCGACGCACG CGCCGGCAAG CCCTCGCCCG ACACGTCGCT CGCCCCTTAC
CGCATGCTCC GCGCCGCGCT CGCGGATCTG CCGTCGATAA CGGTCGCGCA CTGGCGCGAT
GTCGCCGCGA TCTACCCGGT CGCCGACGTG TTCGATCCGC ATGCGGACGC GGCCGGCCAC
GTGCCGTTCA CGAGCGAGTA CTACGCGGCG CTCGCGAGCT ACATCGCGCG CACCGCGTTC
CAGCACGCGT CGGTGCCGCT CGACGACGCC TGGAACCGGC TCGCCGCGCA GATCCGCGAC
GACGCCGAGC ACCTGCTCGC CGCGCCGGCC GACGGCGCAC GCGCGCGCCG CGCGCCGCAC
GCCGCGCCGA CGAACGAAAC GCAGGCGACG CTCCTGCCGA TCTTCGCGGC CGCGCTGAAG
CTCGACGATC CCGGCATCGA CGACAACTTC TTCGACTGCG GCGGCCACTC GATCCTCGCG
ATCGGCGTCG TCCATCAGAT CAACGAAGCA TTCGGCACGT CGCTGTCGGT CGCGGACATC
TTCATGGCGC CGACCGTGCG CCGCCTCGCC GAGCGCATGC GCGACGCGCC GGACGGCCCC
GAGTACGTCG AGCTCGCGAG CGCGGCCGCG CTGCCCGACG ACATCGCGCC GCTGCCCGGC
CCAGTGGCCG ACGCGCCGCG CGCGCTGTTG CTCACGGGCG CGACGGGCTT TGTCGGCCGC
CATCTGCTGC GCGAGCTGAT CGATCGCACC AGCGCGACGA TCTACTGCCT CGTGCGCGCG
CCGGACGCCG CGCAGGGCCT CGCGCGGATC CGCGCGACGC TCGGGCGCTG GTCGCTGTGG
CGCGACGGCG ACGCCGCGCG CGTGATCGCG GTGCCGGGCG ATCTCGGCCG CCCTCGCATC
GGCCTGTCGG ACGCCGCGCG CGCGCGGCTC GTCGCCGAAG TCGACGCGAT CTATCACAAC
GGCACCAGCA TGAACCATCT CGAATCGTTC GAGATGGCGC GCGCGGCGAA CGTCGGCGGC
GTGATCGAGC TGCTGCGGAT CGCCACCGAA GGCCGGCCGA AGACGTTCAA CTACGTGTCG
ACGCTCGCGG TGTTCAGCAT GCGCGAGCGC ACCGGCACGC ACGTATTCGA CGAAGCCGCG
CCGATCGACG GCGAGCGGCA TCCGTCCGAC CAGGGCTACA CGACGAGCAA GTGGGTGGGC
GAGCAGCTCA CGCATCTCGC GGCCGCGCGC GGCGTGCCGT GCAACGTGTT CCGCCTCGGC
CTCGTGACGG GCGACGTGCG CCACGGTCAC TACGACGAAC TTCAGGCGTA CTACCGGCTG
CTGAAGAGCT GCATCCTGAT GGGCGCCGCG TTCGACGATT TCCGCTACGA CCTCGTGATC
ACGCCCGTCG ATTACGTCGC ACGCGCGCTT GCGCATCTCG GCGCGCGGCA TTCGCAAGGC
GGCCGGGTGT TCCATCTGTC GACGATGCAG GTCACGCCGA TGCGCACCGT GTTCGAGATG
ATGAACGCGC ATCTGCGCAC GCCGATGCGC ATGCTCACAC ACCGCGCGTG GATCGACGAG
CTGCGCGTGC GCTACCGGCG CGGCGACGTG CAATCGATCG TGCCCGTCGT GCAATGGATG
ATGAACATGA GCGATGCGGA GCTCGTGAAG CTCGCGCGCG AGCGCGAGGA AACGACCTTC
ATCTACGACT GCACGGCGAC GCACCGCGAG CTCGAGCAAG CCGGCATCGT CGTGCCCGTG
TTCGACGACG CGCTGCTGCA GCGGTATCTG CGCGGCATGT TCAACGACGA CGCGGACCTG
CGCGCGCTCG CCGCCCGGCT CGACGGCGGC GAGTGCGCTT CTCCCCTTCA CTCCCACACG
TGA
 
Protein sequence
MPPFARKTKH ADQRPLHRRG RRAAVDRRSG DGRAARGARR RERGRPRARG RGRRSGVPRM 
ARDDTGHARE PAARARRRNR PASGRARANR KPQHRQAVAS RRAGRTARRR RLLSLLRGRG
AHRERAVRRR IRRGPHEHGA ARSHRRRRTD RAVELPADDG RVEARAGARG RQHDRVQAVG
MDPAVDRRAR SRARAHLSHG CRQHRARRRR ERRPRARHAS ARADDLADGL GRSGQVRARR
GGRQSEAHAS RTRRQGSRAR VRRRRSRRGR RRHPLRGLLQ RRAGLHGGDA HLRAARHLRH
ARAAARRCGE HAARRPARSR GCRDGAARER RAPRARRALR ARGGRAAARA RTHGRRAAAR
PRLLLRADRD RRRAPRRRAD APRSVRPGRH AHAVRHRITS AQVGERFGIR IGFVGMDPRR
GARHAARRVH RRGRHVGERA FHLHGRHAAR RDQAVRLRLR SIDARPRRLH AAAPRDVAAL
TRTARAFQNP TQTIPMTASP PSSALVTAVE AAVLSLAGDV AGRAFDASAA ACPLHALGFD
SVQYVELSGC LNEYYGLDLA PTLFFDVHVP RRIAEHLVAR HPAALARKHG IGAGDDADTA
ARARAAAAEN GAPQPDMRAG AARPAGEPLL DTHASPGEPR GDAHENPCDD TRGAAAADAH
ESAADIAIVG MAGIFPQSAD LDAFWRHLAA GDDLIAEAPA SRWDWRAGDG EPASRWGGFI
PRIEYFDAAF FGISPREAEQ MDPQQRLLMQ TAWAALEDAA VRPSDLMGSD AAVFVGISTS
DYMALLPGAD GHLAVGNAHA MLPNRLSHLL GAHGPSEAVD TACSSSLVAL HRAVRALRRG
ESSVAIVGGV NVMLTTRLHR ALAAAGMLSP DGRCKTFDAA ANGYVRGEGI AALVLMPLER
ARANGHPVRA VIKGSAVNHG GRAAFLTAPD INAQAALIEA AYRDAGVDPA TVSYIEAHGT
GTSLGDPIEV QALRQGLDAC ARDLAGTASH APARCGLGSV KTNIGHLEAA AGLAGVVKVV
LAMDRRMLPP SLHCRELNPY LKLDGSRYHV VTEPTPWPDE ATPTPLRAGV SSFGFGGSNA
HVVLQSAHAR PIARASAPPP PHPNEQAGAD APAADGPRAW FIPLSARTDA ALHARAAQLA
HWLDTEPADD AWLPALAKTL SIGREPMARR FGITCASLDE LRAQLAIALG GRATSLARDD
ARLRPHAPAC AAWLAGETDP LPAAWDDATP RLRLPVYPFE GERHWPTEAA PAARFALAPD
ADGAYRIAIA PDAPLVADHR LAGEPVLAAA AQIVIAWRAF EADALAGDAG QAGDVGESME
SMESMESNGS SASKPAATSA DSGTAADSRD LHDSYHSHDF RHTIDTIDTI DTNATSAATP
IALCDIEWLA PIAIGAPTDL CITLARDAHG DIDARRGEAA HRRANGRAAR FAIAAAPAID
TPLGRGHATR IASAPSNAPE LDIEAIRARC TQAVSADACY DAFAAIGIDY GPTFRPLRAI
AVGRDEALAE FDASALARTT GDARIVALLD GAFQAIAGLT LAHAASLESG LLPASLARIE
FTGPLADSVR AWIREAPSDT GRRTFDIDLV TASGRSCASL RGLALASGRS ATSREAPRIT
TPGDHLFAPQ WLPCATNAAG AATPSPRAGA LAIMGGTPAQ RAALAATRAA AQRLIDDIAE
LDANVSHLVW LPSAPADAHA PLAQCASLDG LRLVKRLLAL GAGDRAFDLT VLTVRSWTMP
GDAPAFPAHA DLAGLCGALA NEYPHWRVRL IDLPDAAALP ADWHARSAEG GHPLLLHRHG
QWFARRLVPL AALPAPAAQP YRPGGVYVAI GGAGGLGRVW TEHAIRACGA QVVWIGRRPL
DAQIDAHCNA LAALGPRPSY LSADASDAES LRAARDAVLE RFGRLDGVVH TAIVLEDGGL
AQLDEARFSA ALNAQVATTA NLARVFGSDP LDFILFFSSL QSAFVAAGQS NYAAGCTFRD
AFADWLRTQL RCAVKVVNWG YWGQTGVVAT EPYRARMAAL GIGSIEPAPA MAVVDALLAS
NVDQVGYLKT IASAAVPTLA PALAARIAPR TRALAGTPPR VDATDDSAAW RDALAALERA
IALRLFAELG ALRVFGGSGA PGGHAFDDGA ARNSAAGKCS ADDRAPDAAP PTPDAARAEW
ARARAELERT ALLDAHLALV DATLDALPAI LQGSVPATSI LFPDGDLSRV EAVYQRNEQA
DRCNRALADA VLHLVGDASS AQPAALAEIG AGTGGTTVPL LAALDARGAR LGRYDFTDIS
KAFLLNAEQT FGRGRDMLRY RLFDVERPIA GQALDTGGYD IVIATNVLHA TQDIGVTLRN
AKALLKAGGH LIINELLGTH GFAHATFGLL PGWWRHRDSA RRLPGSPLLS RDGWTRALRE
AGFAVLDGGS AGAAAGQGVI VALSDGVIVQ PSHADARAAS CAASRAAPGD DAGAHASAAR
PAASACSTAS PAHAPAASPI AAAPTGASLR ARCVQALAQL VARTLKMPVG KLAPDQPLGS
YGVDSILVIG LTKTLRETFG VALSNATLFE HATLNALAEF FVAEHRAACE RVLGGDAEPA
PNAPNGSNAA SAAAATRPAM PPARAGAPSP AAASAAPKPR ESNVCAPPSA DDTAVAVIGM
SGRYAQADNL REFWANLRAG RHCITEVPAE RWDWRTHFDA EKGAPGRTYS RWGGFLTQID
RFDAAFFRIA PNDAEQIDPQ GRLFLEESWA AIEDAGYTSD TLSADRRVGV FVGVMNGDYP
TGAQFWSIAN RVSHALDLHG PSLAVDTACS SSLTAIHLAL DSLRSGTCDC ALAGGVNLIQ
SPKHLVGLAS LTMLSAGDAC RAFGAGADGF VDGEGVGVLV LKPLSRALAD GDAIHGIIRG
SMINAGGKTH GLTVPNPRAQ QAVVGAALAR SGVPARAVGY IEAHGTGTAL GDPIELAGLT
RAFAEATDEL GFCALGSVKS NIGHCESAAG VAGVTKVLLQ MKHRELVPTL HAHEPNPDID
FARSPFVLQR TLAPWPQPAL DGWPRIAGVS SFGAGGANAH VVLEEFIETR AAAGGDDAGP
AIVVLSAATD AALRRRARQL HAALAAGEIG DERLHDLAYT LQIGRAAMVS RFGCVAGSAA
ELQAQLAAFV EGDASRGWHA HRLAGDRRGL AELDADPELR ASLVEQCVAA GKLDRLAALW
CQGLGIDWPA LHRGRARRRM HLPTYPFDGP RYWLRDDAAH AAEPAPADGA AEDASADAPN
AANAPTPDVA TLVRRTVAQV LGYPDVDMNE SFLSLGGDSI RAARAHRVLQ RALDTRIPLS
LMLEASTLAE CAQAIDALLS TQPEPASALA CETNAGAAGA PIADAAAFES SAPPSRESAS
PPHPASPPRD ARPRVHPLSS NQQQFFFLDR LNPANPAFNL PGALRVRGEW HAHALEATYQ
ALIDTHDVLR TRFVVRGGEP CAEVAPHRAA AIRRHDLTAL LPKHQAARVA ECLTESSREG
FALEQGEPSR LTVLELRDDD HVILLNLHHI VGDAVSVVVL LDALARAALT GRAAAPDRAR
PQYAQWAAHE RDALPATIAR ELPYWLERLR DVPPPLPLPC DRARPPVPSY RGRSVPLAFA
PALITLLDAY CKAHGLSRFV VMLAAFKLAL RVLSGRDDVV VGSPYANRAE DDTADMIGSL
AYALVLRTRL GEAQTFADAV ALVRRTVHGA FDHLGVPYPR LVEALNPARH GGANPLYQIM
FNVIPMPALP EGVEPVEVDS GWLDYDLFVR LRASSHAIDG VLQFSADLFD RSTAEAIAAY
YVELLHTLLA HPSLPLASLA PPAELALERT IADAMPPLRI EIASTFTDRP LAGTLRYWGT
ATGQPIEPNF APYGQLFQTL YDPSTPFHAN RHGTNVVLVR PCDWLRFDDA NADAARADLT
GDAGAAAAER IALYADELAD ALRDAAPSLA VPVLVLVLPD DAASLAARDE HTGTATEAPA
EALADARAGK PSPDTSLAPY RMLRAALADL PSITVAHWRD VAAIYPVADV FDPHADAAGH
VPFTSEYYAA LASYIARTAF QHASVPLDDA WNRLAAQIRD DAEHLLAAPA DGARARRAPH
AAPTNETQAT LLPIFAAALK LDDPGIDDNF FDCGGHSILA IGVVHQINEA FGTSLSVADI
FMAPTVRRLA ERMRDAPDGP EYVELASAAA LPDDIAPLPG PVADAPRALL LTGATGFVGR
HLLRELIDRT SATIYCLVRA PDAAQGLARI RATLGRWSLW RDGDAARVIA VPGDLGRPRI
GLSDAARARL VAEVDAIYHN GTSMNHLESF EMARAANVGG VIELLRIATE GRPKTFNYVS
TLAVFSMRER TGTHVFDEAA PIDGERHPSD QGYTTSKWVG EQLTHLAAAR GVPCNVFRLG
LVTGDVRHGH YDELQAYYRL LKSCILMGAA FDDFRYDLVI TPVDYVARAL AHLGARHSQG
GRVFHLSTMQ VTPMRTVFEM MNAHLRTPMR MLTHRAWIDE LRVRYRRGDV QSIVPVVQWM
MNMSDAELVK LAREREETTF IYDCTATHRE LEQAGIVVPV FDDALLQRYL RGMFNDDADL
RALAARLDGG ECASPLHSHT