Gene BURPS1106A_A0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0434 
Symbol 
ID4905934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp410814 
End bp423287 
Gene Length12474 bp 
Protein Length4157 aa 
Translation table11 
GC content73% 
IMG OID640143541 
Productpolyketide synthase 
Protein accessionYP_001074477 
Protein GI126456395 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01746] thioester reductase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGCAT TGACGCGCAC GGCGCGCGCT TTCCAGAACC CGACACAGAC GATTCCCATG 
ACCGCCTCCC CTCCCTCCAG CGCACTCGTC ACGGCCGTCG AAGCGGCCGT CCTGTCGCTC
GCCGGCGACG TCGCCGGCCG CGCGTTCGAC GCGTCGGCTG CGGCGCGCCC GCTGCACGCG
CTCGGCTTCG ATTCGGTGCA GTACGTCGAA TTGTCCGGAT GCCTGAACGA ATACTACGGG
CTCGATCTCG CGCCGACGCT GTTCTTCGAC GTGCACGTGC CGCGCCGGAT CGCCGAGCAT
CTCGTCGCGC GGCATCCGGC GGCGCTCGCG CGCAAGCACG GCATCGGGGC CGGGGACGAC
GCCGACACGG CCGCTCGGGC CCGCGCGGCC GCGGCCGAGA ACGGCGCGCC GCAGCCGGAC
ATGCGAGCCG GGGCGGCGCG GCCCGCGGGC GAGCCGCTTC TCGACACGCA TGCGAGCCCC
GGCGAGCCGC GCGGCGACGC ACACGAAAAT CCATGTGACG ACACGCGCGG CGCGGCCGCC
GCCGACGCGC ATGAATCGGC CGCCGATATC GCGATCGTCG GCATGGCCGG CATCTTCCCG
CAATCGGCCG ACCTCGACGC GTTCTGGCGG CATCTCGCCG CGGGCGACGA TCTGATCGCC
GAGGCGCCGG CCTCGCGCTG GGATTGGCGC GCGGGCGACG GCGAGCCCGC ATCGCGCTGG
GGCGGCTTCA TCCCGCGCAT CGAATATTTC GACGCCGCGT TCTTCGGCAT CTCGCCGCGC
GAAGCCGAGC AGATGGACCC GCAGCAGCGC CTGCTGATGC AGACCGCGTG GGCGGCGCTC
GAGGACGCGG CGGTGCGCCC GTCCGATCTG ATGGGCAGCG ACGCGGCGGT GTTCGTCGGC
GTCAGCACGT CCGACTACAT GGCGCTGCTG CCCGGCGCGG ACGGCCATCT CGCGGTCGGC
AACGCGCACG CGATGCTGCC GAACCGGCTG TCGCACCTGC TCGGCGCGCA CGGGCCGAGC
GAGGCTGTCG ATACCGCGTG CTCGAGCTCG CTTGTCGCGC TGCATCGCGC GGTGCGCGCG
CTGCGGCGCG GCGAAAGCAG CGTCGCGATC GTCGGCGGCG TCAACGTGAT GCTGACGACG
CGGCTGCACC GCGCGCTCAC CGCCGCCGGC ATGCTGAGCC CCGACGGGCG CTGCAAGACG
TTCGACGCGG CGGCGAACGG CTACGTGCGC GGCGAGGGCA TCGCGGCGCT CGTGCTGATG
CCGCTCGAGC GCGCGCGCGC GAACGGCCAC CCGGTGCGCG CGGTGATCAA GGGCAGCGCG
GTCAATCACG GCGGCCGCGC GGCGTTCCTG ACCGCGCCGG ACATCAACGC GCAGGCCGCG
CTGATCGAAG CCGCGTATCG CGACGCGGGC GTCGACCCCG CCACTGTTTC GTACATCGAA
GCGCACGGCA CCGGCACGTC GCTCGGCGAT CCGATCGAAG TGCAGGCGCT GCGCCAGGGC
CTCGACGCCT GCGCGCGCGA CCTCGCGGGC ACCGCCTCGC ACGCGCCGGC ACGCTGCGGC
CTGGGCTCGG TCAAGACCAA TATCGGGCAT CTCGAAGCGG CGGCGGGCCT CGCGGGCGTC
GTCAAGGTCG TGCTCGCGAT GGACCGGCGC ATGCTGCCGC CGAGCCTGCA TTGCCGTGAA
CTGAATCCGT ATCTGAAGCT CGACGGCAGC CGTTATCACG TCGTCACGGA ACCCACGCCC
TGGCCGGACG AAGCAACGCC GACGCCGCTG CGCGCGGGCG TCAGCTCGTT CGGGTTCGGC
GGCTCGAACG CGCACGTCGT GCTGCAATCG GCGCACGCGC GGCCGATCGC GCGAGCGAGC
GCGCCCCCAC CGCCGCACCC GAACGAACAG GCCGGTGCCG ACGCGCCCGC CGCCGACGGC
CCGCGCGCGT GGTTCATCCC GCTATCGGCG CGCACCGATG CCGCGTTGCA TGCGCGCGCC
GCTCAGCTCG CGCACTGGCT CGACACCGAG CCGGCCGACG ACGCGTGGCT GCCCGCGCTC
GCGAAGACGC TGTCGATCGG CCGCGAACCG ATGGCGCGCC GCTTCGGCAT CACGTGCGCG
TCGCTCGACG AACTGCGCGC GCAACTCGCG ATCGCGCTGG GCGGCCGCGC AACGTCGCTC
GCGCGCGATG ACGCCCGGCT GCGGCCGCAT GCGCCCGCCT GCGCGGCGTG GCTCGCGGGC
GAGACCGACC CGCTGCCCGC CGCGTGGGAT GACGCGACGC CGCGCCTGCG GTTGCCCGTC
TACCCGTTCG AAGGCGAGCG GCACTGGCCG ACCGAAGCAG CGCCGGCGGC GCGCTTCGCG
CTCGCGCCCG ACGCCGACGG CGCATACCGG ATCGCGATCG CACCCGACGC GCCGCTCGTC
GCCGACCATC GGCTCGCCGG CGAGCCGGTG CTCGCCGCCG CCGCGCAAAT CGTGATCGCG
TGGCGCGCGT TCGAGGCGGA CGCGCTCGCC GGCGATGCCG GCCAGGCGGG CGACGTCGGC
GAGTCGATGG AGTCGATGGA GTCGATGGAG TCGATGGAGT CGAACGGATC GAGCGCATCG
AAGCCGGCGG CAACGTCCGC CGATTCGGGC ACCGCCGCCG ATTCACGCGA TCTGCACGAT
TCATACCACT CGCACGACTT CCGCCACACG ATCGACACGA TCGACACGAT CGACACGATC
GACACGAACG CCACGAGCGC CGCGACGCCT ATCGCGCTGT GCGACATCGA ATGGCTCGCG
CCGATCGCGA TCGGCGCGCC GACCGACCTG TGCATCACGC TCGCGCGCGA CGCCCACGGC
GACATCGACG CGCGCCGCGG CGAAGCCGCC CATCGGCGCG CAAACGGCCG CGCCGCCCGC
TTCGCGATCG CGGCCGCCCC CGCCATCGAT ACGCCGCTCG GCCGCGGACA CGCGACGCGC
ATCGCGAGCG CGCCGTCGGA CGCGCCCGAG CTCGACATCG AGGCCATCCG CGCGCGCTGC
ACGCAAGCGG TCTCGGCCGA CGCGTGCTAC GACGCGTTCG CCGCGATCGG CATCGATTAC
GGCCCGACGT TCCGCCCGCT GCGCGCGATC GCGGTCGGCC GTGACGAAGC GCTCGCCGAA
TTCGACGCGT CGGCGCTCGC GCGCACGACG GGCGACGCGC GTATCGTCGC GCTGCTCGAC
GGCGCGTTCC AGGCGATCGC GGGCCTGACG CTCGCGCACG CCGCGAGCCT CGAAAGCGGC
CTGCTGCCCG CGTCGCTCGC ACGCATCGAG TTCACCGAGC CGCTCGCGGA CAGCGTCCGC
GCGTGGATTC GCGAAGCACC GAGCGACACG GGCCGCCGCA CATTCGATAT CGACCTCGTG
ACGGCGAGCG GCCGGTCGTG CGCGTCGCTG CGCGGCCTCG CGCTCGCGTC CGGCCGAAGC
GCAACGTCGC GCGAAGCGCC ACGCATCACG ACGCCGGGCG ACCATCTGTT CGCGCCGCAA
TGGCTGCCGT GCGCGACGAA CGCGGCCGGC GCGGCAACGC CGTCGCCGCG CGCCGGCGCG
CTCGCGATCA TGGGCGGCAC GCCGGCGCAG CGCGCCGCGC TCGCGGCGAC GCACGCGGCG
GCGCCGCGCC TGATCGACGA CATCGCCGAA CTCGACGCGA ACGTGAGCCA TCTCGTCTGG
CTGCCGTCCG CGCCCGCGGA CGCACATGCG CCGCTCGCGC AATGCGCGAG CCTCGACGGG
TTGCGCCTCG TGAAGCGTTT GCTCGCGCTC GGCGCGGGCG ATCGCGCATT CGATCTGACG
GTGCTCACCG TCCGTTCGTG GACGATGCCG GGCGACGCGC CCGCGTTTCC CGCGCACGCG
GATCTCGCGG GGCTGTGCGG GGCGCTCGCG AACGAATACC CGCACTGGCG CGTGCGGCTC
ATCGATCTGC CCGACGCCGC TGCGCTGCCC GCCGACTGGC ACGCGCGGAG CGCCGAAGGC
GGCCATCCGC TGCTGCTGCA CCGGCACGGC CAATGGTTCG CGCGCCGGCT CGTGCCGCTC
GCGGCGCTGC CCGCGCCCGC CGCGCAGCCG TATCGGCCGG GCGGCGTGTA CGTCGCGATC
GGCGGCGCGG GCGGCCTCGG CCGGGTGTGG ACCGAGCACG CGATTCGCGC CTGCGGCGCG
CAAGTCGTGT GGATCGGGCG GCGGCCGCTC GACGCGCAGA TCGATGCGCA CTGTGACGCG
CTCGCCGCGC TCGGCCCGCG CCCGTCGTAT CTGAGCGCCG ACGCGAGCGA CGCCGAGAGC
TTGCGCGCCG CGCGCGATGC GGTGCTCGAA CGCTTCGGGC GGCTCGACGG CGTCGTGCAC
ACGGCGATCG TGCTGGAGGA CGGTGGCCTC GCGCAGCTCG ACGAAGCGCG ATTCAGCGCG
GCGCTGAACG CGCAGGTCGC GACGACCGCG AACCTCGCCC GCGTGTTCGG CAGCGATCCG
CTCGATTTCA TCCTGTTCTT CTCGTCGCTG CAAAGCGCGT TCGTCGCGGC GGGCCAGAGC
AATTACGCGG CCGGCTGCAC GTTCCGCGAC GCGTTCGCCG ACTGGCTGCG CACGCAGCTC
CGATGCGCGG TCAAGGTCGT GAACTGGGGC TACTGGGGGC AGACGGGCGT GGTCGCGACC
GAGCCGTACC GCGCCCGCAT GGCCGCGCTC GGCATCGGGT CGATCGAGCC CGCCCCGGCG
ATGGCGGTCG TCGATGCGCT GCTCGCCTCG AACGTCGATC AGGTCGGCTA TCTGAAGACG
ATCGCGAGCG CCGCGGTGCC GACGCTCGCG CCCGCGCTCG CCGCGCGCAT CGCGCCGCGC
ACGCGCGCGC TTGCCGGCAC GCCGCCGCGC GTCGACGCGA CGGACGACAG CGCGGCGTGG
CGGGACGCGC TCGCGGCGCT CGAACGCGCG ATCGCGCGCC GGCTGTTCGC CGAGCTCGGC
GCGCTGCGCG TGTTCGGCGG AAGCGGCGCG CCGGGCGGCC ATGCGTTCGA CGACGGCGCG
GCCCGAAACA GCGCGGCCGG CCAACGTTCG GCCGATGACC GCGCGCCCGA CGCCGCGCCG
TTCGACATCG ACACCGCGCT GCGTACCGGC CGCGTCGCGC CCGCCTACCG GCGCTGGCTC
GCCCATGCGT TGACGCTGAT CGCGCGGCAC GGCCCCCTTG CGTGGGACGG ACGCTCGGGC
CGCCTCGCCG AAGCGCCGCC GACGCCGGAC GCGGCGCGCG CGGAATGGGC GCGCGCACGC
GCCGAGCTCG AGCGCACCGC GCTGCTCGAC GCCCATCTCG CGCTCGTCGA CGCGACGCTC
GACGCGCTGC CCGCGATCCT GCAAGGCAGC GTGCCCGCCA CGTCGATCCT GTTCCCGGAC
GGCGATCTGA GCCGCGTCGA AGCGGTCTAT CAGCGCAACG AGCAGGCGGA CCGCTGCAAC
CGCGCGCTCG CCGATGCGGT GCTGCACCTC GTCGGCGACG CATCGTCCGC GCAACCGGCC
GCGCTCGCCG AAATCGGCGC GGGCACGGGC GGCACGACCG TGCCGCTGCT CGCGGCGCTC
GACGCGCGCG GCGCGCGGCT CGGCCGCTAC GACTTCACCG ACATCTCGAA GGCGTTCCTG
CTGAACGCCG AGCAAACGTT CGGCCGGGGC CGCGACATGC TGCGCTACCG GCTGTTCGAC
GTCGAGCGGC CGATCGCCGA GCAGGCGCTC GACACCGGCG GCTACGACAT CGTGATCGCG
ACGAACGTGC TGCACGCGAC GCAGGACATC GGCGTCACGC TGCGCAATGC GAAGGCGCTG
CTGAAGGCAG GCGGCCATCT GATCATCAAC GAACTGCTCG GCACGCACGG CTTCGCGCAT
GCGACGTTCG GGCTGCTGCC CGGCTGGTGG CGGCACCGCG ACAGCGCGCG CCGCCTGCCC
GGCAGCCCGC TGCTGTCGCG CGACGGCTGG ACGCGCGCGC TGCGCGAAGC CGGCTTCGCG
GTGCTCGACG GCGGCTCGGC CGGCGCCGCG GCGGGGCAAG GCGTGATCGT CGCGCTCAGC
GACGGCGTGA TCGTGCAGCC GTCGCACGCC GACGCGCGGG CGGCCTCATG CGCGGCTTCG
CGCGCGGCCC CGGGCGACGA CGCCGGCGCG CACGCCAGCG CCGCGCGGCC GGCCGCATCG
GCTTGTTCGA CTGCCTCGCC CGCACACGCG CCCGCGGCTT CGCCGATCGC CGCCGCGCCG
ACCGGCGCGA GCCTGCGCGC GCGCTGCGTG CAGGCGCTCG CGCAACTCGT CGCGCGGACG
CTGAAAATGC CGGTCGGCAA GCTCGCGCCC GATCAGCCGC TCGGCAGCTA CGGCGTCGAT
TCGATTCTCG TGATCGGGCT CACGAAAACG CTGCGCGAGA CGTTCGGCGT CGCGCTGTCG
AACGCGACGC TGTTCGAGCA TGCGACGCTG AACGCGCTCG CCGAATTCTT CGTCGCCGAA
CATCGCGCGG CATGCGAACG CGTGCTGGGC AACGACGCGG AACCCGCGCC GAATGCGCCG
AACGGACCAA ACGCCGCGAG CGCAGCGGCG GCCACGCGCC CGGCCATGCC ACCGGCCCGC
GCCGACGCCC CATCGCCCGC CGCGGCTTCG GCCGCGCCGA AGCCGCGCGA ATCGAACGTG
TGCGCCCCGC CTGCCGCCGA CGACACCGCC GTCGCCGTGA TCGGCATGTC CGGCCGCTAT
GCGCAGGCGG ACAACCTGCG CGAGTTCTGG GCGAACCTGC GCGCGGGCCG CCATTGCATC
ACCGAAGTGC CCGCCGAGCG CTGGGACTGG CGCACGCACT TCGATGCGGA AAAAGGCGCG
CCGGGCCGCA CGTACAGCCG CTGGGGCGGC TTCCTGACGC AGATCGACCG CTTCGACGCC
GCGTTCTTCC GAATCGCGCC GAACGACGCC GAGCAGATCG ATCCGCAAGG CCGCCTGTTC
CTCGAGGAAT CGTGGGCCGC GATCGAGGAT GCCGGCTATA CGAGCGACAC GCTCAGCGCG
GACCGCCGGG TCGGCGTGTT CGTCGGCGTG ATGAACGGCG ACTATCCCAC GGGCGCGCAG
TTCTGGAGCA TCGCGAACCG CGTGTCGCAC GCGCTCGACC TGCACGGGCC GAGCCTCGCC
GTCGACACCG CGTGCTCGTC GTCGCTGACC GCGATCCATC TCGCGCTCGA CAGCCTGCGC
AGCGGCACCT GCGACTGCGC GCTCGCGGGC GGCGTCAACC TGATTCAGAG TCCGAAGCAT
CTGGTCGGGC TCGCGTCGCT CACGATGCTC TCGGCGGGCG ACGCGTGCCG CGCGTTCGGC
GCGGGCGCGG ACGGCTTCGT CGACGGCGAA GGCGTCGGCG TGCTCGTGCT CAAGCCGCTG
TCGCGCGCGC TCGCCGATGG CGACGCGATC CACGGCATCA TCCGCGGCAG CATGATCAAC
GCGGGCGGCA AGACGCACGG CCTCACGGTG CCGAACCCGC GCGCGCAGCA GGCCGTCGTC
GGCGCGGCGC TCGCGCGAAG CGGCGTGCCG GCGCGCGCGG TCGGCTACAT CGAGGCGCAC
GGCACCGGCA CCGCGCTCGG CGATCCGATC GAACTCGCGG GCCTCACGCG CGCGTTCGCC
GAAGCGACCG ACGAGCTCGG CTTCTGCGCG CTCGGCTCGG TCAAATCGAA CATCGGCCAT
TGCGAAAGCG CGGCGGGCGT CGCGGGCGTG ACGAAGGTGC TGCTGCAGAT GAAGCATCGC
GAACTCGTGC CGACGCTGCA TGCGCACGAG CCGAACCCCG ACATCGATTT CGCGCGCTCG
CCGTTCGTGC TGCAACGCAC GCTCGCGCCG TGGCCGCAGC CGGCGCTCGA CGGATGGCCC
CGGATCGCGG GCGTGTCGTC GTTCGGCGCG GGCGGCGCGA ACGCGCACGT CGTGCTCGAA
GAATTCGTCG AGACGCGCGC CGCCGCCGGC GGCGACGACG CAGGCCCCGC GATCGTCGTG
CTGTCCGCCG CGACCGACGC AGCGCTGCGT CGCCGCGCGC GGCAATTGCA CGCCGCGCTC
GCCGCCGGCG AAATCGGCGA CGAGCGCCTG CACGATCTCG CGTACACGCT GCAGATCGGC
CGCGCCGCGA TGGCCTCGCG CTTCGGCTGC GTCGCCGGCA GCGCCGCCGA ATTGCAGGCG
CAGCTCGCCG CGTTCGTCGA AGGCGACGCA TCGCGCGGCT GGCACGCGCA CCGGCTCGCC
GGCGACCACC ACGGCCTCGC CGAGCTCGAC GCCGATCCCG AGCTGCGCGC GTCGCTCGTC
GAGCAATGCG TCGCGGCCGG CAAGCTCGAC CGGCTCGCGG CACTCTGGTG CCAGGGGCTC
GGCATCGACT GGCCCACGCT GCATCGCGGC CGCGCGCGCC GGCGCATGCA TCTGCCGACG
TACCCGTTCG ACGGCCCGCG CTACTGGCTG CGCGACGACG CGGCGCACGC CGCCGAGCCC
GCGCCGGCCG ACGGCGCCGC CGAAGACGCA AGCGCCGACG CACCGAATGC AGCGAGCGCA
GCGAACGCGC CGACGCCCGA CGTCGCAACG CTCGTCCGTC GAACGGTGGC GCAAGTGCTC
GGCTATCCGG ACGTCGACAT GAACGAATCG TTCCTGTCGC TCGGCGGCGA TTCGATCCGC
GCGGCGCGCG CGCATCGGGT GCTGCAACGG GCGCTCGACA CGAGGATTCC GCTCAGCCTG
ATGCTGGAGG CAAGCACGCT CGCCGAATGC GCGCAAGCGA TCGATGCGCT GCTTTCGACG
CAACCGGAAC CGGCGAGCGC GCTCGCCTGC GAAACGAACG CGGGCGCGGC CGGCGCGCCG
ATCGCCGACG CGGCCGCGCT CGAGTCGCCG GCGCCGCCCT CCCGGGAATC GGCCTCCCCG
CCACACCCGG CCTCCCCGCC GCGCGACGCG CGCCCGCGCG TTCATCCGCT GTCATCGAAC
CAGCAGCAGT TCTTCTTCCT CGACCGGCTG AACCCGGCGA ACCCGGCGTT CAACCTGCCC
GGCGCGCTGC GCGTGCGCGG CGAATGGCAC GCGCACGCGC TCGAAGCCAC GTATCAGGCG
CTCATCGATA CGCACGACGT GCTGCGCACC CGCTTCGTCG TGCGCGGCGG CGAACCGTGC
GCGGAAGTCG CGCCGCACCG CGCGGCCGCG ATTCGCCGGC ACGATCTGAC GGCGCTGCTG
CCGAAGCATC AGGCCGCGCG CGTCGCCGAG TGCCTCACCG AGTCGAGCCG CGAGGGCTTC
GCGCTCGAAC AGGGCGAACC GAGCCGGCTG ACGGTACTCG AACTGCGCGA CGACGATCAC
GTGATCCTGC TGAATCTGCA TCACATCGTC GGCGATGCGG TGTCCGTCGT CGTGCTGCTC
GACGCGCTCG CGCGCGCCGC GCTCACGGGC CGCGCGGCCG CGCCGGACCG CGCGCGGCCG
CAATACGCGC AATGGGCCGC GCACGAACGC GATGCGCTGC CGGCGACGAT CGAGCGCGAA
CTGCCGTACT GGCTCGAGCG CCTGCGCGAC GTGCCGCCGC CGTTGCCGCT GCCGTGCGAC
CGCGCGCGGC CGCCGGTGCC GAGCTATCGT GGGCGCAGCG TGCCGCTCGC GTTTGCGCCG
GCGCTCATCA CGCTGCTCGA CGCATACTGC AAGGCGCACG GGCTGTCGCG CTTCGTCGTG
ATGCTCGCCG CGTTCAAGCT CGCGCTGCGC GTGCTGTCGG GCCGTGACGA CGTCGTCGTC
GGCAGCCCGT ACGCGAACCG CGCCGAGGAC GACACGGCCG ACATGATCGG CAGCCTCGCC
TACGCACTCG TGCTGCGCAC GCGGCTTGGC GAAGCACAGA CGTTCGCCGA TGCGGTCGCG
CTCGTGCGGC GCACCGTGCA CGGCGCGTTC GACCATCTCG GCGTGCCGTA TCCGCGCCTC
GTCGAGGCGC TGAATCCGGC GCGGCACGGC GGCGCGAACC CGCTGTATCA GATCATGTTC
AACGTGATCC CGATGCCCGC GCTACCCGAG GGCGTCGAGC CCGTCGAAGT CGATTCCGGC
TGGCTCGACT ACGATCTGTT CGTGCGGCTG CGCGCGTCGA GCCACGCCAT CGACGGCGTG
CTGCAATTCA GCGCGGATCT CTTCGATCGT TCGACGGCCG AAGCGATCGC CGCATACTAC
GTCGAGCTGC TGCACACGCT GCTCGCGCAT CCGTCGCTGC CGCTCGCGAG CCTCGCGCCG
CCCGCCGAGC TCGCGCTCGA ACGGACGATC GCCGACGCGA TGCCGCCGCT GCGCATCGAG
ATCGCGTCGA CGTTCACCGA CCGCCCGTTA GCCGGCACGC TGCGCTACTG GGGCACCGCG
ACCGGCCAGC CGATCGAGCC GAATTTCGCG CCGTACGGAC AACTGTTCCA GACGCTCTAC
GATCCGTCCA CGCCGTTCCA TGCGAATCGT CACGGAACGA ACGTCGTGCT GGTCAGGCCG
TACGACTGGC TGCGCTTCGA CGACGCGGAC GCCGCCCGCG CCGACCTCAC GGGCGACGCC
GGCGCGGCGG CCGCCGAACG CATCGCGCTG TACGCCGACG AACTCGCCGA CGCGCTGCGC
GACGCGGCGC CGTCGCTCGC GGTGCCCGTG CTCGTGCTGG TGCTGCCGGA CGATGCCGCG
TCGCTCGCGG CGCGTGACGA ACACACGGGT ACGGCAACCG AAGCGCCCGC CGAGGCGCTC
GCCGACGCAC GCGCCGGCAA GCCCTCGCCC GACACGTCGC TCGCCCCTTA CCGCATGCTC
CGCGCCGCGC TCGCGGATCT GCCGTCGATA ACGGTCGCGC ACTGGCGCGA TGTCGCCGCG
ATCTACCCGG TCGCCGACGT GTTCGATCCG CATGCGGACG CGGCCGGCCA CGTGCCGTTC
ACGAGCGAGT ACTACGCGGC GCTCGCGAGC TACATCGCGC GCACCGCGTT CCAGCACGCA
TCGGTGCCGC TCGACGACGC CTGGAACCGG CTCGCCGCGC AGATCCGCGA CGACGCCGAG
CACCTGCTCG CCGCGCCGGC CGACGGCGCG CGCGCGCGCC GCGCGCCGCA CGCCGCGCCG
ACGAACGAAA CGCAGGCGAC GCTCCTGCCG ATCTTCGCGG CTGCGCTGAA GCTCGACGAT
CCCGGCATCG ACGACAACTT CTTCGACTGC GGCGGCCACT CGATCCTCGC GATCGGCGTC
GTCCATCAGA TCAACGAAGC ATTCGGCACG TCGCTGTCGG TCGCGGACAT CTTCATGGCG
CCGACCGTGC GCCGCCTCGC CGAGCGCATG CGCGACGCGC CGGACGGCCC CGAGTATGTC
GAGCTCGCGA GCGCGGCCGC GCTGCCCGAC GACATCGCGC CGCTGCCCGG CCCAGTGGCC
GACGCGCCGC GCGCGCTGTT GCTCACGGGC GCGACGGGCT TTGTCGGCCG CCATCTGCTG
CGCGAGCTGA TCGATCGCAC CAGCGCGACG ATCTACTGCC TCGTGCGCGC GCCGGACGCC
GCGCAGGGCC TCGCGCGGAT CCGCGCGACG CTCGAGCGCT GGTCGCTGTG GCGCGACGGC
GACGCCGCGC GCGTGATCGC GGTGCCGGGC GATCTCGGCC GCCCTCGCAT CGGCCTGTCG
GATGCCGCGC GCGCGCGGCT CGTCGCCGAA GTCGACGCGA TCTATCACAA CGGCACCAGC
ATGAACCATC TCGAATCGTT CGAGATGGCG CGCGCGGCGA ACGTCGGCGG CGTGATCGAG
CTGCTGCGGA TCGCCACCGA AGGCCGGCCG AAGACGTTCA ACTACGTGTC GACGCTCGCG
GTGTTCAGCA TGCGCGAGCG CACCGGCACG CACGTATTCG ACGAATCCGC GCCGATCGAC
GGCGAGCGGC ATCCGTCCGA CCAGGGCTAC ACGACGAGCA AGTGGGTGGG CGAGCAGCTC
ACGCATCTCG CGGCCGCGCG CGGCGTGCCG TGCAACGTGT TCCGCCTCGG CCTCGTGACG
GGCGACGTGC GCCACGGTCA CTACGACGAA CTTCAGGCGT ACTACCGGCT GCTGAAGAGC
TGCATCCTGA TGGGCGCCGC GTTCGACGAT TTCCGCTACG ACCTCGTGAT CACGCCCGTC
GATTACGTCG CACGCGCGCT TGCGCATCTC GGCGCGCGGC ATTCGCAAGG CGGCCGGGTG
TTCCATCTGT CGACGATGCA GGTCACGCCG ATGCGCACCG TGTTCGAGAT GATGAACGCG
CATCTGCGCA CGCCGATGCG CATGCTCACA CACCGCGCGT GGATCGACGA GCTGCGCGTG
CGCTACCGGC GCGGCGACGT GCAATCGATC GTGCCCGTCG TGCAATGGAT GATGAACATG
AGCGATGCGG AGCTCGTGAA GCTCGCGCGC GAGCGCGAGG AAACGACCTT CATCTACGAC
TGCACGGCGA CGCACCGCGA GCTCGAGCAA GCCGGCATCG TCGTGCCCGT GTTCGACGAC
GCGCTGCTGC AGCGGTATCT GCGCGGCATG TTCAACGACG ACGCGGACCT GCGCGCGCTC
GCCGCCCGGC TCGACGGCGG CGAGTGCGCT TCTCCCCTTC ACTCCCACAC GTGA
 
Protein sequence
MAALTRTARA FQNPTQTIPM TASPPSSALV TAVEAAVLSL AGDVAGRAFD ASAAARPLHA 
LGFDSVQYVE LSGCLNEYYG LDLAPTLFFD VHVPRRIAEH LVARHPAALA RKHGIGAGDD
ADTAARARAA AAENGAPQPD MRAGAARPAG EPLLDTHASP GEPRGDAHEN PCDDTRGAAA
ADAHESAADI AIVGMAGIFP QSADLDAFWR HLAAGDDLIA EAPASRWDWR AGDGEPASRW
GGFIPRIEYF DAAFFGISPR EAEQMDPQQR LLMQTAWAAL EDAAVRPSDL MGSDAAVFVG
VSTSDYMALL PGADGHLAVG NAHAMLPNRL SHLLGAHGPS EAVDTACSSS LVALHRAVRA
LRRGESSVAI VGGVNVMLTT RLHRALTAAG MLSPDGRCKT FDAAANGYVR GEGIAALVLM
PLERARANGH PVRAVIKGSA VNHGGRAAFL TAPDINAQAA LIEAAYRDAG VDPATVSYIE
AHGTGTSLGD PIEVQALRQG LDACARDLAG TASHAPARCG LGSVKTNIGH LEAAAGLAGV
VKVVLAMDRR MLPPSLHCRE LNPYLKLDGS RYHVVTEPTP WPDEATPTPL RAGVSSFGFG
GSNAHVVLQS AHARPIARAS APPPPHPNEQ AGADAPAADG PRAWFIPLSA RTDAALHARA
AQLAHWLDTE PADDAWLPAL AKTLSIGREP MARRFGITCA SLDELRAQLA IALGGRATSL
ARDDARLRPH APACAAWLAG ETDPLPAAWD DATPRLRLPV YPFEGERHWP TEAAPAARFA
LAPDADGAYR IAIAPDAPLV ADHRLAGEPV LAAAAQIVIA WRAFEADALA GDAGQAGDVG
ESMESMESME SMESNGSSAS KPAATSADSG TAADSRDLHD SYHSHDFRHT IDTIDTIDTI
DTNATSAATP IALCDIEWLA PIAIGAPTDL CITLARDAHG DIDARRGEAA HRRANGRAAR
FAIAAAPAID TPLGRGHATR IASAPSDAPE LDIEAIRARC TQAVSADACY DAFAAIGIDY
GPTFRPLRAI AVGRDEALAE FDASALARTT GDARIVALLD GAFQAIAGLT LAHAASLESG
LLPASLARIE FTEPLADSVR AWIREAPSDT GRRTFDIDLV TASGRSCASL RGLALASGRS
ATSREAPRIT TPGDHLFAPQ WLPCATNAAG AATPSPRAGA LAIMGGTPAQ RAALAATHAA
APRLIDDIAE LDANVSHLVW LPSAPADAHA PLAQCASLDG LRLVKRLLAL GAGDRAFDLT
VLTVRSWTMP GDAPAFPAHA DLAGLCGALA NEYPHWRVRL IDLPDAAALP ADWHARSAEG
GHPLLLHRHG QWFARRLVPL AALPAPAAQP YRPGGVYVAI GGAGGLGRVW TEHAIRACGA
QVVWIGRRPL DAQIDAHCDA LAALGPRPSY LSADASDAES LRAARDAVLE RFGRLDGVVH
TAIVLEDGGL AQLDEARFSA ALNAQVATTA NLARVFGSDP LDFILFFSSL QSAFVAAGQS
NYAAGCTFRD AFADWLRTQL RCAVKVVNWG YWGQTGVVAT EPYRARMAAL GIGSIEPAPA
MAVVDALLAS NVDQVGYLKT IASAAVPTLA PALAARIAPR TRALAGTPPR VDATDDSAAW
RDALAALERA IARRLFAELG ALRVFGGSGA PGGHAFDDGA ARNSAAGQRS ADDRAPDAAP
FDIDTALRTG RVAPAYRRWL AHALTLIARH GPLAWDGRSG RLAEAPPTPD AARAEWARAR
AELERTALLD AHLALVDATL DALPAILQGS VPATSILFPD GDLSRVEAVY QRNEQADRCN
RALADAVLHL VGDASSAQPA ALAEIGAGTG GTTVPLLAAL DARGARLGRY DFTDISKAFL
LNAEQTFGRG RDMLRYRLFD VERPIAEQAL DTGGYDIVIA TNVLHATQDI GVTLRNAKAL
LKAGGHLIIN ELLGTHGFAH ATFGLLPGWW RHRDSARRLP GSPLLSRDGW TRALREAGFA
VLDGGSAGAA AGQGVIVALS DGVIVQPSHA DARAASCAAS RAAPGDDAGA HASAARPAAS
ACSTASPAHA PAASPIAAAP TGASLRARCV QALAQLVART LKMPVGKLAP DQPLGSYGVD
SILVIGLTKT LRETFGVALS NATLFEHATL NALAEFFVAE HRAACERVLG NDAEPAPNAP
NGPNAASAAA ATRPAMPPAR ADAPSPAAAS AAPKPRESNV CAPPAADDTA VAVIGMSGRY
AQADNLREFW ANLRAGRHCI TEVPAERWDW RTHFDAEKGA PGRTYSRWGG FLTQIDRFDA
AFFRIAPNDA EQIDPQGRLF LEESWAAIED AGYTSDTLSA DRRVGVFVGV MNGDYPTGAQ
FWSIANRVSH ALDLHGPSLA VDTACSSSLT AIHLALDSLR SGTCDCALAG GVNLIQSPKH
LVGLASLTML SAGDACRAFG AGADGFVDGE GVGVLVLKPL SRALADGDAI HGIIRGSMIN
AGGKTHGLTV PNPRAQQAVV GAALARSGVP ARAVGYIEAH GTGTALGDPI ELAGLTRAFA
EATDELGFCA LGSVKSNIGH CESAAGVAGV TKVLLQMKHR ELVPTLHAHE PNPDIDFARS
PFVLQRTLAP WPQPALDGWP RIAGVSSFGA GGANAHVVLE EFVETRAAAG GDDAGPAIVV
LSAATDAALR RRARQLHAAL AAGEIGDERL HDLAYTLQIG RAAMASRFGC VAGSAAELQA
QLAAFVEGDA SRGWHAHRLA GDHHGLAELD ADPELRASLV EQCVAAGKLD RLAALWCQGL
GIDWPTLHRG RARRRMHLPT YPFDGPRYWL RDDAAHAAEP APADGAAEDA SADAPNAASA
ANAPTPDVAT LVRRTVAQVL GYPDVDMNES FLSLGGDSIR AARAHRVLQR ALDTRIPLSL
MLEASTLAEC AQAIDALLST QPEPASALAC ETNAGAAGAP IADAAALESP APPSRESASP
PHPASPPRDA RPRVHPLSSN QQQFFFLDRL NPANPAFNLP GALRVRGEWH AHALEATYQA
LIDTHDVLRT RFVVRGGEPC AEVAPHRAAA IRRHDLTALL PKHQAARVAE CLTESSREGF
ALEQGEPSRL TVLELRDDDH VILLNLHHIV GDAVSVVVLL DALARAALTG RAAAPDRARP
QYAQWAAHER DALPATIERE LPYWLERLRD VPPPLPLPCD RARPPVPSYR GRSVPLAFAP
ALITLLDAYC KAHGLSRFVV MLAAFKLALR VLSGRDDVVV GSPYANRAED DTADMIGSLA
YALVLRTRLG EAQTFADAVA LVRRTVHGAF DHLGVPYPRL VEALNPARHG GANPLYQIMF
NVIPMPALPE GVEPVEVDSG WLDYDLFVRL RASSHAIDGV LQFSADLFDR STAEAIAAYY
VELLHTLLAH PSLPLASLAP PAELALERTI ADAMPPLRIE IASTFTDRPL AGTLRYWGTA
TGQPIEPNFA PYGQLFQTLY DPSTPFHANR HGTNVVLVRP YDWLRFDDAD AARADLTGDA
GAAAAERIAL YADELADALR DAAPSLAVPV LVLVLPDDAA SLAARDEHTG TATEAPAEAL
ADARAGKPSP DTSLAPYRML RAALADLPSI TVAHWRDVAA IYPVADVFDP HADAAGHVPF
TSEYYAALAS YIARTAFQHA SVPLDDAWNR LAAQIRDDAE HLLAAPADGA RARRAPHAAP
TNETQATLLP IFAAALKLDD PGIDDNFFDC GGHSILAIGV VHQINEAFGT SLSVADIFMA
PTVRRLAERM RDAPDGPEYV ELASAAALPD DIAPLPGPVA DAPRALLLTG ATGFVGRHLL
RELIDRTSAT IYCLVRAPDA AQGLARIRAT LERWSLWRDG DAARVIAVPG DLGRPRIGLS
DAARARLVAE VDAIYHNGTS MNHLESFEMA RAANVGGVIE LLRIATEGRP KTFNYVSTLA
VFSMRERTGT HVFDESAPID GERHPSDQGY TTSKWVGEQL THLAAARGVP CNVFRLGLVT
GDVRHGHYDE LQAYYRLLKS CILMGAAFDD FRYDLVITPV DYVARALAHL GARHSQGGRV
FHLSTMQVTP MRTVFEMMNA HLRTPMRMLT HRAWIDELRV RYRRGDVQSI VPVVQWMMNM
SDAELVKLAR EREETTFIYD CTATHRELEQ AGIVVPVFDD ALLQRYLRGM FNDDADLRAL
AARLDGGECA SPLHSHT