Gene BURPS668_A0530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0530 
Symbol 
ID4886838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp483975 
End bp496343 
Gene Length12369 bp 
Protein Length4122 aa 
Translation table11 
GC content73% 
IMG OID640130471 
Productputative polyketide synthase PksM 
Protein accessionYP_001061536 
Protein GI126444438 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01746] thioester reductase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCCT CCCCTCCCTC CAGCGCACTC GTCACGGCCG TCGAAGCGGC CGTCCTGTCG 
CTCGCCGGCG ATGTCGCCGG CCGCGCGTTC GACGCGTCGG CCGCGGAGCG CCCGCTGCAC
GCGCTCGGCT TCGATTCGGT GCAGTACGTC GAATTGTCCG GATGCCTGAA CGAATACTAC
GGGCTCGATC TCGCGCCGAC GCTGTTCTTC GACGTGCACG TGCCGCGCCG GATCGCCGAG
CATCTCGTCG CGCGGCATCC GGCGGCGCTC GCGCGCAAGC ACGGCATCGG GGCCGGGGAC
GACGCCGACA CGGCCGCTCG GGCCCGCGCG GCCGCGGCCG AGAACCGCGC GCCGCAGCCG
GACATGCGAG CCGGGGCGGC GGGGCCCGCG GGCGAGCCGC TTCTCGACAC GCATGCGAGC
CCCGGCGAGC CGCGCGGCGA CGCACACGAG AATCCATGTG ACGACACGCG CGGCGCGGCC
GCCGCCGACG CGCACGAATC GGCCGCCGAT ATCGCGATCG TCGGCATGGC CGGCATCTTC
CCGCAATCGG CCGACCTCGA CGCGTTCTGG CGGCATCTCG CCGCGGGCGA CGATCTGATC
GCCGAGGCGC CGGCCTCGCG CTGGGATTGG CGCGCGGGCG ACGGCGAGCC CGCATCGCGC
TGGGGCGGCT TCATCCCGCG CATCGAATAT TTCGACGCCG CGTTCTTCGG CATCTCGCCG
CGCGAAGCCG AGCAGATGGA CCCGCAGCAG CGCCTGCTGA TGCAGACCGC GTGGGCGGCG
CTCGAGGACG CGGCGGTGCG CCCGTCCGAT CTGATGGGCA GCGACGCGGC GGTGTTCGTC
GGCGTCAGCA CGTCCGACTA CATGGCGCTG CTGCCCGGCG CGGACGGCCA TCTCGCGGTC
GGCAACGCGC ACGCGATGCT GCCGAACCGG CTGTCGCACC TGCTCGGCGC GCACGGGCCG
AGCGAGGCTG TCGATACCGC GTGCTCGAGC TCGCTTGTCG CGCTGCATCG CGCGGTGCGC
GCGCTGCGGC GCGGCGAAAG CAGCGTCGCG ATCGTCGGCG GCGTCAACGT GATGCTGACG
ACGCGGCTGC ACCGCGCGCT CGCCGCCGCC GGCATGCTGA GCCCCGACGG GCGCTGCAAG
ACGTTCGACG CGGCGGCGAA CGGCTACGTG CGCGGCGAGG GCATCGCGGC GCTCGTGCTG
ATGCCGCTCG AGCGCGCGCG CGCGAACGGC CACCCGGTGT GCGCGGTGAT CAAGGGCAGC
GCGGTCAATC ACGGCGGCCG CGCGGCGTTC CTGACCGCGC CGGACATCAA CGCGCAGGCC
GCGCTGATCG AAGCCGCGTA TCGCGACGCG GGCGTCGACC CCGCCACTGT TTCGTACGTC
GAAGCGCACG GCACCGGCAC GTCGCTCGGC GATCCGATCG AAGTGCAGGC GCTGCGCCAG
GGCCTCGACG CCTGCGCGCG CGACCTCGCG GGCACCGCCT CGCACGCGCC GGCACGCTGC
GGCCTGGGCT CGGTCAAGAC CAATATCGGG CATCTCGAAG CGGCGGCGGG CCTCGCGGGC
GTCGTCAAGG TCGTGCTCGC GATGGACCGG CGCATGCTGC CGCCGAGCCT GCATTGCCGT
GAACTGAATC CGTATCTGAA GCTCGACGGC AGCCGCTATC ACGTCGTCAC GGAACCCACG
CCCTGGCCGG ACGAAGCAAC GCCGACGCCG CTGCGCGCGG GCGTCAGCTC GTTCGGGTTC
GGCGGCTCGA ACGCGCACGT CGTGCTGCAA TCGGCGCACG CGCGGCCGAT CGCGCGAGCG
AGCGCGCCCC CACCGCCGCA CACGAACGAA CAGGCCGGTG CCGACGCGCC CGCCGCCGAC
GGCCCGCGCG CGTGGTTCAT CCCGCTATCG GCGCGCACCG ATGCCGCGTT GCATGCGCGC
GCCGCTCAGC TCGCGCACTG GCTCGACACC GAGCCGGCCG ACGACGCGTG GCTGCCCGCG
CTCGCGAAGA CGCTGTCGAT CGGCCGCGAA CCGATGGCGC GCCGCTTCGG CATCACGTGC
GCGTCGCTCG ACGAACTGCG CGCGCAACTC GCGATCGCGC TGGGCGGCCG CGCAACGTCG
CTCGCGCGCG ATGACGCCCG GCTGCGGCCG CATGCGCCCG CCTGCGCGGC GTGGCTCGCG
GGCGAGACCG ACCCGCTGCC CGCCGCGTGG GATGACGCGA CGCCGCGCCT GCGGTTGCCC
GTCTACCCGT TCGAAGGCGA GCGGCACTGG CCGACCGAAG CAGCGCCGGC GGCGCGCTTC
GCGCTCGCGC CCGACGCCGA CGGCGCATAC CGGATCGCGA TCGCACCCGA CGCGCCGCTC
GTCGCCGACC ATCGGCTCGC CGGCGAGCCG GTGCTCGCCG CCGCCGCGCA AATCGTGATC
GCGTGGCGCG CGTTCGAGGC GGACGCGCTC GCCGGCGATG CCGGCCAGGC GGGCGACGTC
GGCGAGTCGA TGGAGTCGAT GGAGTCGAAC GGATCGAGCG CATCGAAGCC GGCGGCGACG
TCCGCCGATT CGGGCACCGC CGCCGATTCA CGCGATCCGC ACGATTCATA CCACTCGCAC
GACTCCCGCC ACACGATCGA CACGAACGCC ACGAGCGCCA CGACGCCTAT CGCGCTGCGC
GACATCGAAT GGCTCGCGCC GATCGCGATC GGCGCGCCGA CCGACCTGTG CATCACGCTC
GCGCGCGACG CCCACGGCGA CATCGACGCG CGCCGCGGCG AAGCCGCCCA TCGGCGCGCA
AACGGCCGCG CCGCCCGCTT CGCGATCGCG GCCGCCCCCG CCATCGATAC GCCGCTCGGC
CGCGGACACG CGACGCGCAT CGCGAGCGCG CCGTCGAACG CGCCCGAGCT CGACATCGAG
GCCATCCGCG CGCGCTGCAC GCAAGCGGTC TCGGCCGACG CGTGCTACGA CGCGTTCGCC
GCGATCGGCA TCGATTACGG CCCGACGTTC CGCCCACTGC GCGCGATCGC GGTCGGCCGT
GACGAAGCGC TCGCCGAATT CGACGCGTCG GCGCTCGCGC GCACGACGGG CGACGCGCGT
ATCGTCGCGC TGCTCGACGG CGCGTTCCAG GCGATCGCGG GCCTGACGCT CGCGCACGCC
GCGAGCCTCG AAAGCGGCCT GCTGCCCGCG TCGCTCGCAC GCATCGAGTT CACCGGGCCG
CTCGCGGACA GCGTCCGCGC GTGGATTCGC GAAGCACCGA GCGACACGGG CCGCCGCACA
TTCGATATCG ACCTCGTGAC GGCGAGCGGC CGGTCGTGCG CGTCGCTGCG CGGCCTCGCG
CTCGCGTCCG GCCGAAGCGC AACGTCGCGC GAAGCGCCAC GCATCACGAC GCCGGGCGAC
CATCTGTTCG CGCCGCAATG GCTGCCGTGC GCGACGAACG CGGCCGGCGC GGCAACGCCG
TCGCCGCGCG CCGGCGCGCT CGCGATCATG GGCGGCACGC CGGCGCAGCG CGCCGCGCTC
GCGGCGACGC ACGCGGCGGC GCCGCGCCTG ATCGACGACA TCGCCGAACT CGACGCGAAC
GTGAGCCATC TCGTCTGGCT GCCGTCCGCG CCCGCGGACG CACATGCGCC GCTCGCGCAA
TGCGCGAGCC TCGACGGGTT GCGCCTCGTG AAGCGTTTGC TCGCGCTCGG CGCGGGCGAT
CGCGCATTCG ATCTGACGGT GCTCACCGTC CGTTCGTGGA CGATGCCGGG CGACGCGCCC
GCGTTTCCCG CGCACGCGGA TCTCGCGGGG CTGTGCGGGG CGCTCGCGAA CGAATACCCG
CACTGGCGCG TGCGGCTCAT CGATCTGCCC GACGCCGCTG CGCTGCCCGC CGACTGGCAC
GCGCGGAGCG CCGAAGGCGG CCATCCGCTG CTGCTGCACC GGCACGGCCA ATGGTTCGCG
CGCCGGCTCG TGCCGCTCGC GGCGCTGCCC GCGCCCGCCG CGCAGCCGTA TCGGCCGGGC
GGCGTGTACG TCGCGATCGG CGGCGCGGGC GGCCTCGGCC GGGTGTGGAC CGAGCACGCG
ATTCGCGCCT GCGGCGCGCA AGTCGTGTGG ATCGGGCGGC GGCCGCTCGA CGCGCAGATC
GATGCGCACT GTGACGCGCT CGCCGCGCTC GGCCCGCGCC CGTCGTATCT GAGCGCCGAT
GCGAGCGACG CCGAGAGCTT GCGCGCCGCG CGCGATGCGG TGCTCGAACG CTTCGGGCGG
CTCGACGGCG TCGTGCACAC GGCGATCGTG CTGGAGGACG GTGGCCTCGC GCAGCTCGAC
GAAGCGCGAT TCAGCGCGGC GCTGAACGCG CAGGTCGCGA CGACCGCGAA CCTCGCCCGC
GTGTTCGGCA GCGATCCGCT CGATTTCATC CTGTTCTTCT CGTCGCTGCA AAGCGCGTTC
GTCGCGGCGG GCCAGAGCAA TTACGCGGCC GGCTGCACGT TCCGCGACGC GTTCGCCGAC
TGGCTGCGCA CGCAGCTCCG ATGCGCGGTC AAGGTCGTGA ACTGGGGCTA CTGGGGGCAG
ACGGGCGTGG TCGCGACCGA GCCGTACCGC GCCCGCATGG CCGCGCTCGG CATCGGGTCG
ATCGAGCCCG CCCCGGCGAT GGCGGTCGTC GACGCGCTGC TCGCCTCGAA CGTCGATCAG
GTCGGCTATC TGAAGACGAT CGCGAGCGCC GCGGTGCCGA CGCTCGCGCC CGCGCTCGCC
GCGCGCATCG CGCCGCGCAC GCGCGCGCTT GCCGGCACGC CGCCGCGCGT CGACGCGACG
GACGACAGCG CGGCGTGGCG GGACGCGCTC GCGGCGCTCG AACGCGCGAT CGCGCGCCGG
CTGTTCGCCG AGCTCGGCGC GCTGCGCGTG TTCGGCGGAA GCGGCGCGCC GGGCGGCCAT
GCGTTCGACG ACGGCGCGGC CCGAAACAGC GCGGCCGGCC AACGTTCGGC CGATGACCGC
GCGCCCGACG CCGCGCCGTT CGACATCGAC ACCGCGCTGC GTACCGGCCG CATCGCGCCC
GCCTACCGGC GCTGGCTCGC CCATGCGTTG ACGCTGATCG CGCGGCACGG CCCCCTTGCG
TGGGACGGCC GCTCGGGCCG CCTCGCCGAA GCGCCGCCGA CGCCGGACAC GGCGCGCGCC
GAATGGGCGC GCGCACGCGC CGAGCTCGAG CGCACCGCGC TGCTCGACGC CCATCTCGCG
CTCGTCGACG CGACGCTCGA CGCGCTGCCC GCGATCCTGC AAGGCAGCGT GCCCGCCACG
TCGATCCTGT TCCCGGACGG CGATCTGAGC CGCGTCGAAG CGGTCTATCA GCGCAACGAG
CAGGCGGACC GCTGCAACCG CGCGCTCGCC GATGCGGTGC TGCACCTCGT CGGCGACGCA
TCGTCCGCGC AACCGGCCGC GCTCGCCGAA ATCGGCGCGG GCACGGGCGG CACGACCGTG
CCGCTGCTCG CGGCGCTCGA CGCGCGCGGC GCGCGGCTCG GCCGCTACGA CTTCACCGAC
ATCTCGAAGG CGTTCCTGCT GAACGCCGAG CAAACGTTCG GCCGGGGCCG CGACATGCTG
CGCTACCGGC TGTTCGACGT CGAGCGGCCG ATCGCCGGGC AGGCGCTCGA CACCGGCGGC
TACGACATCG TGATCGCGAC GAACGTGCTG CACGCGACGC AGGACATCGG CGTCACGCTG
CGCAATGCGA AGGCGCTGCT GAAGGCAGGC GGCCATCTGA TCATCAACGA ACTGCTCGGC
ACGCACGGCT TCGCGCATGC GACGTTCGGG CTGCTGCCCG GCTGGTGGCG GCACCGCGAC
AGCGCGCGCC GCCTGCCCGG CAGCCCGCTG CTGTCGCGCG ACGGCTGGAC GCGCGCGCTG
CGCGAAGCCG GCTTCGCGGT GCTCGACGGC GGCTCGGCCG GCGCCGCGGC GGGGCAAGGC
GTGATCGTCG CGCTCAGCGA CGGCGTGATC GTGCAGCCGT CGCACGCCGA CGCGCGGGCA
GCCTCGTGCG CGGCTTCGCG CGCGGCCCCG GGCGACGACG CCGGCGCGCA CGCCAGCGCC
GCGCGGCCGG CCGCATCGGC TCGCTCGACT GCCTCGCCCG CACACGCGCC CGCGGCTTCG
CCGATCGCCG CCGCGCCGAC CGGCGCGAGC CTGCGCGCGC GCTGCGTGCA GGCGCTCGCG
CAACTCGTCG CGCGGACGCT GAAAATGCCG GTCGGCAAGC TCGCGCCCGA TCAGCCGCTC
GGCAGCTACG GCGTCGATTC GATTCTCGTG ATCGGGCTCA CGAAAACGCT GCGCGAGACG
TTCGGCGTCG CGCTGTCGAA CGCGACGCTG TTCGAGCATG CGACGCTGAA CGCGCTCGCC
GAATTCTTCG TCGCCGAACA TCGCGCGGCG TGCGAACGCG TGCTGGGCGG CGACGCGGAA
CCCGCGCCGA ATGCGCCGAA CGGACCAAAC GCCGCGAGCG CAGCGGCGGC CACGCGCCCG
GCCATGCCAC CGGCCCGCGC CGGCGCCCCA TCGCCCGCCG CGGCTTCGGC CGCGCCGAAG
CCGCGCGAAT CGAACGTGTG CGCCCCGCCC TCCGCCGACG ACACCGCCGT CGCCGTGATC
GGCATGTCCG GCCGCTATGC GCAGGCGGAC AACCTGCGCG AGTTCTGGGC GAACCTGCGC
GCGGGCCGCC ATTGCATCAC CGAAGTGCCC GCCGAGCGCT GGGACTGGCG CACGCACTTC
GATGCGGAAA AAGGCGCGCC GGGCCGCACG TACAGCCGCT GGGGCGGCTT CCTGACGCAG
ATCGACCGCT TCGACGCCGC GTTCTTCCGA ATCGCGCCGA ACGACGCCGA GCAGATCGAT
CCGCAAGGCC GCCTGTTCCT CGAGGAATCG TGGGCCGCGA TCGAGGATGC CGGCTATACG
AGCGACACGC TCAGCGCGGA CCGCCGGGTC GGCGTGTTCG TCGGCGTGAT GAACGGCGAC
TATCCGACGG GCGCGCAGTT CTGGAGCATC GCGAACCGCG TGTCGCACGC GCTCGACCTG
CACGGGCCGA GCCTCGCCGT CGACACCGCG TGCTCGTCGT CGCTGACCGC GATCCATCTC
GCGCTCGACA GCCTGCGCAG CGGCACCTGC GACTGCGCGC TCGCGGGCGG CGTCAACCTG
ATTCAGAGTC CGAAGCATTT GGTCGGGCTC GCGTCGCTCA CGATGCTCTC GGCGGGCGAC
GCGTGCCGCG CGTTCGGCGC GGGCGCGGAC GGCTTCGTCG ACGGCGAAGG CGTCGGCGTG
CTCGTGCTCA AGCCGCTGTC GCGCGCGCTC GCCGATGGCG ACGCGATCCA CGGCATCATC
CGCGGCAGCA TGATCAACGC GGGCGGCAAG ACGCACGGCC TCACGGTGCC GAACCCGCGC
GCGCAGCAGG CCGTCGTCGG CGCGGCGCTC GCGCGAAGCG GCGTGCCGGC GCGCGCGGTC
GGCTACATCG AGGCGCACGG CACCGGCACC GCGCTCGGCG ATCCGATCGA ACTCGCGGGC
CTCACGCGCG CGTTCGCCGA AGCGACCGAC GAGCTCGGCT TCTGCGCGCT CGGCTCGGTC
AAATCGAACA TCGGCCATTG CGAAAGCGCG GCGGGCGTCG CCGGCGTGAC GAAGGTGCTG
CTGCAGATGA AGCATCGCGA ACTCGTGCCG ACGCTGCATG CGCACGAGCC GAACCCCGAC
ATCGATTTCG CGCGCTCGCC GTTCGTGCTG CAACGCACGC TCGCGCCGTG GCCGCAGCCG
GCGCTCGACG GATGGCCCCG GATCGCGGGC GTGTCGTCGT TCGGCGCGGG CGGCGCGAAC
GCGCACGTCG TGCTCGAAGA GTTCGTCGAG ACGCGCGCCG CCGCCGGCGG CGACGACGCC
GGCCCCGCGA TCGTCGTGCT GTCCGCCGCG ACCGACGCAG CGCTGCGTCG CCGCGCGCGG
CAATTGCACG CCGCGCTCGC CGCCGGCGAA ATCGGCGACG AGCGCCTGCA CGATCTCGCG
TACACGCTGC AGATCGGCCG CGCCGCGATG GCCTCGCGCT TCGGCTGCGT CGCCGGCAGC
GCCGCCGAAT TGCAGGCGCA GCTCGCCGCG TTCGTCGAAG GCGACGCATC GCGCGGCTGG
CACGCGCACC GGCTCGCCGG CGACCGCCAC GGCCTCGCCG AGCTCGACGC CGATCCCGAG
CTGCGCGCGT CGCTCGTCGA GCAATGCGTC GCGGCCGGCA AGCTCGACCG GCTCGCGGCA
CTCTGGTGCC AGGGGCTCGG CATCGACTGG CCCGCGCTGC ATCGCGGCCG CGCGCGCCGG
CGCATGCATC TGCCGACGTA CCCGTTCGAC GGCCCGCGCT ACTGGCTGCG CGACGACGCG
GCGCACGCCG CCGAGCCCGC GCCGGCCGAC GGCGCCGCCG AAGACGCAAG CGCCGACGCA
CCGAATGCAG CGAACGCGCC GACGCCCGAC GTCGCAACGC TCGTCCGTCG AACGGTGGCG
CAAGTGCTCG GCTATCCGGA CGTCGACATG AACGAATCGT TCCTGTCGCT CGGCGGCGAT
TCGATCCGCG CGGCGCGCGC GCATCGGGTG CTGCAACGGG CGCTCGACAC GAGGATTCCG
CTCAGCCTGA TGCTGGAGGC AAGCACGCTC GCCGAATGCG CGCAAGCGAT CGATGCGCTG
CTTTCGACGC AACCGGCGCC TGCGAGCGCG CTCGCCCTCG AAACGAACGC GGGCGCGGCC
GGCGCGCCGA TCGCCGACGC GGCCGCGCTC GAGTTGTCGG CGCCGCCCTC CCGGGAATCG
GCCTCCCCGC CACACCCGGC CTCCCCGCCG CGCGACGCGC GCCCGCGCGT TCATCCGCTG
TCATCGAACC AGCAGCAGTT CTTCTTCCTC GACCGGCTGA ACCCGGCGAA CCCGGCGTTC
AACCTGCCCG GCGCGCTGCG CGTGCGCGGC GAATGGCACG CGCACGCGCT CGAAGCCACG
TATCAGGCGC TCATCGATAC GCACGACGTG CTGCGCACCC GCTTCGTCGT GCGCGGCGGC
GAACCGTGCG CGGAAGTCGC GCCGCACCGC GCGGCCGCGA TTCGCCGGCA CGATCTGACG
GCGCTGCTGC CGAAGCATCA GGCCGCGCGC GTCGCCGAGT GCCTCACCGA GTCGAGCCGC
GAGGGCTTCG CGCTCGAACA GGGCGAACCG AGCCGGCTGA CGGTACTCGA ACTGCGCGAC
GACGATCACG TGATCCTGCT GAATCTGCAT CACATCGTCG GCGATGCGGT GTCCGTCGTC
GTGCTGCTCG ACGCGCTCGC GCGCGCCGCG CTCACGGGCC GCGCGGCCGC GCCGGACCGC
GCGCGGCCGC AATACGCGCA ATGGGCCGCG CACGAACGCG ATGCGCTGCC GGCGACGATC
GAGCGCGAAC TGCCGTACTG GCTCGAGCGC CTGCGCGACG TGCCGCCGCC GTTGCCGCTG
CCGTGCGACC GCGCGCGGCC GCCGGTGCCG AGCTATCGTG GGCGCAGCGT GCCGCTCGCG
TTTGCGCCTG CGCTCATCAC GCTGCTCGAC GCATACTGCA AGGCGCACGG GCTGTCGCGC
TTCGTCGTGA TGCTCGCCGC GTTCAAGCTC GCGCTGCGCG TGCTGTCGGG CCGTGACGAC
GTCGTCGTCG GCAGCCCGTA CGCGAACCGC GCCGAGGACG ACACGGCCGA CATGATCGGC
AGCCTCGCCT ACGCACTCGT GCTGCGCACG CGGCTTGGCG AAGCACAGAC GTTCGCCGAT
GCGGTCGCGC TCGTGCGGCG CACCGTGCAC GGCGCGTTCG ACCATCTCGG CGTGCCGTAT
CCGCGCCTCG TCGAGGCGCT GAATCCGGCG CGGCACGGCG GCGCGAACCC GCTGTATCAG
ATCATGTTCA ACGTGATCCC GATGCCCGCG CTGCCCGAGG GCGTCGAGCC CGTCGAAGTC
GATTCCGGCT GGCTCGACTA CGATCTGTTC GTGCGGCTGC GCGCGTCGAG CCACGCCATC
GACGGCGTGC TGCAATTCAG CGCGGATCTC TTCGATCGTT CGACGGCCGA AGCGATCGCC
GCATACTACG TCGAGCTGCT GCACACGCTG CTCGCGCATC CGTCGCTGCC GCTCGCGAGC
CTCGCGCCGC CCGCCGAGCT CGCGCTCGAA CGGACGATCG CCGACGCGAT GCCGCCGCTG
CGCATCGAGA TCGCGTCGAC GTTCACCGAC CGCCCGTTAG CCGGCACGCT GCGCTACTGG
GGCACCGCGA CCGGCCAGCC GATCGAGCCG AATTTCGCGC CGTACGGACA ACTGTTCCAG
ACGCTCTACG ATCCGTCCAC GCCGTTCCAT GCGAATCGTC ACGGAACGAA CGTCGTGCTG
GTCAGGCCGT GCGACTGGCT GCGCTTCGAC GACGCGGACG CGGACGCCGC CCGCGCCGAC
CTCACGGGCG ACGCCGGCGC GGCGGCCGCC GAACGCATCG CGCTGTACGC CGACGAACTC
GCCGACGCGC TGCGCGACGC GGCGCCGTCG CTCGCGGTGC CCGTGCTCGT GCTGGTGCTG
CCGGACGATG CCGCGTCGCT CGCGGCGCGT GACGAACACA CGGGTACGGC AACCGAAGCG
CCCGCCGAGG CGCTCGCCGA CGCACGCGCC GGCAAGCCCT CGCCCGACAC GTCGCTCGCC
CCTTACCGCA TGCTCCGCGC CGCGCTCGCG GATCTGCCGT CGATAACGGT CGCGCACTGG
CGCGATGTCG CCGCGATCTA CCCGGTCGCC GACGTGTTCG ATCCGCATGC GGACGCGGCC
GGCCACGTGC CGTTCACGAG CGAGTACTAC GCGGCGCTCG CGAGCTACAT CGCGCGCACC
GCGTTCCAGC ACGCATCGGT GCCGCTCGAC GACGCCTGGA ACCGGCTCGC CGCGCAGATC
CGCGACGACG CCGAGCACCT GCTCGCCGCG CCGGCCGACG GCGCACGCGC GCGCCGCGCG
CCGCACGCCG CGCCGACGAA CGAAACGCAG GCGACGCTCC TGCCGATCTT CGCGGCCGCG
CTGAAGCTCG ACGATCCCGG CATCGACGAC AACTTCTTCG ACTGCGGCGG CCACTCGATC
CTCGCGATCG GCGTCGTCCA TCAGATCAAC GAAGCATTCG GCACGTCGCT GTCGGTCGCG
GACATCTTCA TGGCGCCGAC CGTGCGCCGC CTCGCCGAGC GCATGCGCGA CGCGCCGGAC
GGCCCCGAGT ACGTCGAGCT CGCGAGCGCG GCCGCGCTGC CCGACGACAT CGCGCCGCTG
CCCGGCCCAG TGGCCGACGC GCCGCGCGCG CTGTTGCTCA CGGGCGCGAC GGGCTTTGTC
GGCCGCCATC TGCTGCGCGA GCTGATCGAT CGCACCAGCG CGACGATCTA CTGCCTCGTG
CGCGCGCCGG ACGCCGCGCA GGGTCTCGCG CGGATCCGCG CGACGCTCGA GCGCTGGTCG
CTGTGGCGCG ACGGCGACGC CGCGCGCGTG ATCGCGGTGC CGGGCGATCT CGGCCGCCCT
CGCATCGGCC TGTCGGACGC CGCGCGCGCG CGGCTCGTCG CCGAAGTCGA CGCGATCTAT
CACAACGGCA CCAGCATGAA CCATCTCGAA TCGTTCGAGA TGGCGCGCGC GGCGAACGTC
GGCGGCGTGA TCGAGCTGCT GCGGATCGCC ACCGAAGGCC GGCCGAAGAC GTTCAACTAC
GTGTCGACGC TCGCGGTGTT CAGCATGCGC GAGCGCACCG GCACGCACGT ATTCGACGAA
GCCGCGCCGA TCGACGGCGA GCGGCATCCG TCCGACCAGG GCTACACGAC GAGCAAGTGG
GTGGGCGAGC AGCTCACGCA TCTCGCGGCC GCGCGCGGCG TGCCGTGCAA CGTGTTCCGC
CTCGGCCTCG TGACGGGCGA CGTGCGCCAC GGTCACTACG ACGAACTTCA GGCGTACTAC
CGGCTGCTGA AGAGCTGCAT CCTGATGGGC GCCGCGTTCG ACGATTTCCG CTACGACCTC
GTGATCACGC CCGTCGATTA CGTCGCACGT GCGCTCGCGC ATCTCGGCGC GCGGCATTCG
CAAGGCGGCC GGGTGTTCCA TCTGTCGACG ATGCAGGTCA CGCCGATGCG CACCGTGTTC
GAGATGATGA ACGCGCATCT GCGCACGCCG ATGCGCATGC TCACACACCG CGCGTGGATC
GACGAGCTGC GCGTGCGCTA CCGGCGCGGC GACGTGCAAT CGATCGTGCC CGTCGTGCAA
TGGATGATGA ACATGAGCGA TGCGGAGCTC GTGAAGCTCG CGCGCGAGCG CGAGGAAACG
ACCTTCATCT ACGACTGCAC GGCGACGCAC CGCGAGCTCG AGCAAGCCGG CATCGTCGTG
CCCGTGTTCG ACGACGCGCT GCTGCAGCGG TATCTGCGCG GCATGTTCAA CGACGACGCG
GACCTGCGCG CGCTTGCCGC CCGGCTCGAC GGCGGCGAGT GCGCTTCTCC CCTTCACTCC
CACACGTGA
 
Protein sequence
MTASPPSSAL VTAVEAAVLS LAGDVAGRAF DASAAERPLH ALGFDSVQYV ELSGCLNEYY 
GLDLAPTLFF DVHVPRRIAE HLVARHPAAL ARKHGIGAGD DADTAARARA AAAENRAPQP
DMRAGAAGPA GEPLLDTHAS PGEPRGDAHE NPCDDTRGAA AADAHESAAD IAIVGMAGIF
PQSADLDAFW RHLAAGDDLI AEAPASRWDW RAGDGEPASR WGGFIPRIEY FDAAFFGISP
REAEQMDPQQ RLLMQTAWAA LEDAAVRPSD LMGSDAAVFV GVSTSDYMAL LPGADGHLAV
GNAHAMLPNR LSHLLGAHGP SEAVDTACSS SLVALHRAVR ALRRGESSVA IVGGVNVMLT
TRLHRALAAA GMLSPDGRCK TFDAAANGYV RGEGIAALVL MPLERARANG HPVCAVIKGS
AVNHGGRAAF LTAPDINAQA ALIEAAYRDA GVDPATVSYV EAHGTGTSLG DPIEVQALRQ
GLDACARDLA GTASHAPARC GLGSVKTNIG HLEAAAGLAG VVKVVLAMDR RMLPPSLHCR
ELNPYLKLDG SRYHVVTEPT PWPDEATPTP LRAGVSSFGF GGSNAHVVLQ SAHARPIARA
SAPPPPHTNE QAGADAPAAD GPRAWFIPLS ARTDAALHAR AAQLAHWLDT EPADDAWLPA
LAKTLSIGRE PMARRFGITC ASLDELRAQL AIALGGRATS LARDDARLRP HAPACAAWLA
GETDPLPAAW DDATPRLRLP VYPFEGERHW PTEAAPAARF ALAPDADGAY RIAIAPDAPL
VADHRLAGEP VLAAAAQIVI AWRAFEADAL AGDAGQAGDV GESMESMESN GSSASKPAAT
SADSGTAADS RDPHDSYHSH DSRHTIDTNA TSATTPIALR DIEWLAPIAI GAPTDLCITL
ARDAHGDIDA RRGEAAHRRA NGRAARFAIA AAPAIDTPLG RGHATRIASA PSNAPELDIE
AIRARCTQAV SADACYDAFA AIGIDYGPTF RPLRAIAVGR DEALAEFDAS ALARTTGDAR
IVALLDGAFQ AIAGLTLAHA ASLESGLLPA SLARIEFTGP LADSVRAWIR EAPSDTGRRT
FDIDLVTASG RSCASLRGLA LASGRSATSR EAPRITTPGD HLFAPQWLPC ATNAAGAATP
SPRAGALAIM GGTPAQRAAL AATHAAAPRL IDDIAELDAN VSHLVWLPSA PADAHAPLAQ
CASLDGLRLV KRLLALGAGD RAFDLTVLTV RSWTMPGDAP AFPAHADLAG LCGALANEYP
HWRVRLIDLP DAAALPADWH ARSAEGGHPL LLHRHGQWFA RRLVPLAALP APAAQPYRPG
GVYVAIGGAG GLGRVWTEHA IRACGAQVVW IGRRPLDAQI DAHCDALAAL GPRPSYLSAD
ASDAESLRAA RDAVLERFGR LDGVVHTAIV LEDGGLAQLD EARFSAALNA QVATTANLAR
VFGSDPLDFI LFFSSLQSAF VAAGQSNYAA GCTFRDAFAD WLRTQLRCAV KVVNWGYWGQ
TGVVATEPYR ARMAALGIGS IEPAPAMAVV DALLASNVDQ VGYLKTIASA AVPTLAPALA
ARIAPRTRAL AGTPPRVDAT DDSAAWRDAL AALERAIARR LFAELGALRV FGGSGAPGGH
AFDDGAARNS AAGQRSADDR APDAAPFDID TALRTGRIAP AYRRWLAHAL TLIARHGPLA
WDGRSGRLAE APPTPDTARA EWARARAELE RTALLDAHLA LVDATLDALP AILQGSVPAT
SILFPDGDLS RVEAVYQRNE QADRCNRALA DAVLHLVGDA SSAQPAALAE IGAGTGGTTV
PLLAALDARG ARLGRYDFTD ISKAFLLNAE QTFGRGRDML RYRLFDVERP IAGQALDTGG
YDIVIATNVL HATQDIGVTL RNAKALLKAG GHLIINELLG THGFAHATFG LLPGWWRHRD
SARRLPGSPL LSRDGWTRAL REAGFAVLDG GSAGAAAGQG VIVALSDGVI VQPSHADARA
ASCAASRAAP GDDAGAHASA ARPAASARST ASPAHAPAAS PIAAAPTGAS LRARCVQALA
QLVARTLKMP VGKLAPDQPL GSYGVDSILV IGLTKTLRET FGVALSNATL FEHATLNALA
EFFVAEHRAA CERVLGGDAE PAPNAPNGPN AASAAAATRP AMPPARAGAP SPAAASAAPK
PRESNVCAPP SADDTAVAVI GMSGRYAQAD NLREFWANLR AGRHCITEVP AERWDWRTHF
DAEKGAPGRT YSRWGGFLTQ IDRFDAAFFR IAPNDAEQID PQGRLFLEES WAAIEDAGYT
SDTLSADRRV GVFVGVMNGD YPTGAQFWSI ANRVSHALDL HGPSLAVDTA CSSSLTAIHL
ALDSLRSGTC DCALAGGVNL IQSPKHLVGL ASLTMLSAGD ACRAFGAGAD GFVDGEGVGV
LVLKPLSRAL ADGDAIHGII RGSMINAGGK THGLTVPNPR AQQAVVGAAL ARSGVPARAV
GYIEAHGTGT ALGDPIELAG LTRAFAEATD ELGFCALGSV KSNIGHCESA AGVAGVTKVL
LQMKHRELVP TLHAHEPNPD IDFARSPFVL QRTLAPWPQP ALDGWPRIAG VSSFGAGGAN
AHVVLEEFVE TRAAAGGDDA GPAIVVLSAA TDAALRRRAR QLHAALAAGE IGDERLHDLA
YTLQIGRAAM ASRFGCVAGS AAELQAQLAA FVEGDASRGW HAHRLAGDRH GLAELDADPE
LRASLVEQCV AAGKLDRLAA LWCQGLGIDW PALHRGRARR RMHLPTYPFD GPRYWLRDDA
AHAAEPAPAD GAAEDASADA PNAANAPTPD VATLVRRTVA QVLGYPDVDM NESFLSLGGD
SIRAARAHRV LQRALDTRIP LSLMLEASTL AECAQAIDAL LSTQPAPASA LALETNAGAA
GAPIADAAAL ELSAPPSRES ASPPHPASPP RDARPRVHPL SSNQQQFFFL DRLNPANPAF
NLPGALRVRG EWHAHALEAT YQALIDTHDV LRTRFVVRGG EPCAEVAPHR AAAIRRHDLT
ALLPKHQAAR VAECLTESSR EGFALEQGEP SRLTVLELRD DDHVILLNLH HIVGDAVSVV
VLLDALARAA LTGRAAAPDR ARPQYAQWAA HERDALPATI ERELPYWLER LRDVPPPLPL
PCDRARPPVP SYRGRSVPLA FAPALITLLD AYCKAHGLSR FVVMLAAFKL ALRVLSGRDD
VVVGSPYANR AEDDTADMIG SLAYALVLRT RLGEAQTFAD AVALVRRTVH GAFDHLGVPY
PRLVEALNPA RHGGANPLYQ IMFNVIPMPA LPEGVEPVEV DSGWLDYDLF VRLRASSHAI
DGVLQFSADL FDRSTAEAIA AYYVELLHTL LAHPSLPLAS LAPPAELALE RTIADAMPPL
RIEIASTFTD RPLAGTLRYW GTATGQPIEP NFAPYGQLFQ TLYDPSTPFH ANRHGTNVVL
VRPCDWLRFD DADADAARAD LTGDAGAAAA ERIALYADEL ADALRDAAPS LAVPVLVLVL
PDDAASLAAR DEHTGTATEA PAEALADARA GKPSPDTSLA PYRMLRAALA DLPSITVAHW
RDVAAIYPVA DVFDPHADAA GHVPFTSEYY AALASYIART AFQHASVPLD DAWNRLAAQI
RDDAEHLLAA PADGARARRA PHAAPTNETQ ATLLPIFAAA LKLDDPGIDD NFFDCGGHSI
LAIGVVHQIN EAFGTSLSVA DIFMAPTVRR LAERMRDAPD GPEYVELASA AALPDDIAPL
PGPVADAPRA LLLTGATGFV GRHLLRELID RTSATIYCLV RAPDAAQGLA RIRATLERWS
LWRDGDAARV IAVPGDLGRP RIGLSDAARA RLVAEVDAIY HNGTSMNHLE SFEMARAANV
GGVIELLRIA TEGRPKTFNY VSTLAVFSMR ERTGTHVFDE AAPIDGERHP SDQGYTTSKW
VGEQLTHLAA ARGVPCNVFR LGLVTGDVRH GHYDELQAYY RLLKSCILMG AAFDDFRYDL
VITPVDYVAR ALAHLGARHS QGGRVFHLST MQVTPMRTVF EMMNAHLRTP MRMLTHRAWI
DELRVRYRRG DVQSIVPVVQ WMMNMSDAEL VKLAREREET TFIYDCTATH RELEQAGIVV
PVFDDALLQR YLRGMFNDDA DLRALAARLD GGECASPLHS HT