Gene BMASAVP1_0167 details


Gene Information

Locus tag: BMASAVP1_0167
Symbol:
ID: 4678025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei SAVP1
Kingdom: Bacteria
Replicon accession: NC_008784
Strand:
Start bp: 158299
End bp: 169755
Gene Length: 11457 bp
Protein Length: 3818 aa
Translation table: 11
GC content: 74%
IMG OID: 639842697
Product: putative polyketide synthase PksJ
Protein accession: YP_989781
Protein GI: 121596690
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
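
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 158299
end_bp = 169755
gene_length_bp = 11457
protein_length_aa = 3818

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. The last codon is assumed
# to be the stop codon, which does not contribute a residue.
codons, remainder = divmod(gene_length_bp, 3)

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

All three checks agree with the values listed in the record under these assumptions.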


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGAAGA TCGACGACAT CGATCGCGCG GACGGCTTTT TCGAGGTGGG CGGCAATTCG 
CTGTCGGTGG CGCTGATCGC CTCGCGCGTC GGCGCCGAGT TCGGCCTCGC GCGGCTCGGC
GCCGGCGCGT TCTTCCGCTA TCCGACGGTC GCCGCGCTGG CCGCCCATCT GGGCGCGCGG
CTGCGCGGCG ACGCGGGCGC GGCCGAGGGC GCGGACGGCG CCGACGCGCG CGCATCCCGC
CCCGCGCAGC CGCGGGCGGC CGGGCCCGCG GCGCGACTGC CCGCAGCGCT CGACGACGCG
ATCGCGATCA TCGGCATCTC GTGCCAGTTT CCCGGCGCGC AAGACCATCG CGCGTTCTGG
CGCAATCTGC GCGACGGGAA ATCGGGCGCG CGGTTCTATT CGGAAGACGA ACTGCGCGCG
GCCGGCGTGC CGGACACGCT GATCCGCGAC CGGCACTACG TGCCGATGCA GCAGACGATC
GAAGGCAAGG ACCTGTTCGA CCGGCACTTC TTCCGGCTGA CGATGAAGGA TGCGCAACTG
ATGGACCCGC AATTCCGTCT GTTGCTGCAG CACGCGTGGA AGGCGATCGA GGACGCCGGC
TGCACGCGCG AGCGGATCGC CGACGCCGGC GTATACATGT CGGCGTCGAA CAGCTACTAC
CAGGCGATGC TGCGCGCGGC CGGCACGATC GACGCGTCCG ACGAGTATCA GGCGTGGCTG
CTCGCGCAGG GCGGCACGAT TCCGACGCGC ATCTCGTACG AGCTCGGCCT GACGGGGCCC
AGCCTCTTCA TCCATTCGAA CTGCTCGTCC GGGCTCGTGT CGCTTTCCGT CGCGGCGAAG
TCGCTGCTGC AGCGGGAAAG CCGCTGCGCG CTCGTCGGCG CGGCGACGGT GCTGCCGGAT
GCGGACATCG GCTACGTGTA CCAGCCGGGG CTCAACCTGT CGAGCGACGG CCGCTGCCGG
ACCTTCGACG AAAACGCCGA CGGGCTCACC TCCGGCGAAG GCGTCGCCGT GCTGCTCGTC
AAGCGCGCGC GCGACGCGAT CGACGACGGC GACCCGATCT ACGCGCTGCT GCGCGGCATC
GCCGTGAACA ACGACGGCGC GGACAAGGTC GGCTTCTACG CGCCGAGCGT CGGCGGCCAG
GCCGACGTGA TCCGCAAGGT GCTCGATGCG ACCGGCATCC ATCCCGAGAC GATCGGCTAC
GTCGAGGCGC ACGGCACCGG CACGAAGCTC GGCGATCCGG TGGAGGTGGC GGCGCTCACC
GACGCGTATC GCCGCCATAC CGCGCGCACC GGATTCTGCG CGATCGGCTC GGTGAAGCCG
AACATCGGCC ACCTGGATAC CGTCGCCGGG CTGTCGGGGT GCATCAAGGT CGCGCTGAGC
CTGCGGCACG GCGAGATCGC GCCGTCGATC AACTACGAGA AGCCGAACCG CGAGATCGAT
TTCGCGCACT CGCCGTTCTA CGTCGTCGAC CGATTGACGC GCTGGCCCGC GCGCGAGCCG
GGGGCGCCGC GCCGCGCGGC GCTCAGCTCG TTCGGCATCG GCGGCACCAA CGCGCATTTG
ATCCTCGAGG CGTTCGAGCG CGACGAGCCG CCCGCCGGGA TGCGCGCGCC GGCCGCCCAC
GCGGCGCGCG TGATCGCGCT GTCGGCGCGC ACCGAAGAGC GCGTGCGCGC GCAGGCGAGC
CAGTTGCTCG CGTTCCTCGA GCAGGAAGCC GGCGCGCTGC CGGACTTCGA CGGTTTCGCG
TTCACGCTGC AGGTGGGCCG CGAGGCGATG CGCGAGCGCG TCGCGTTCGT CGCCGACGGC
TACGACGCGC TCGCCGCCGC GCTCGCGCGT TTCCTGCGCG GCGAGCCGGA CGCGGCCGCG
TGTTTCACCG GCGCGCGCGG GGGCGATTCG ACGCTCGCGG CGTTGCTCGA CGATACCGGC
GATACCGGCG ATACGGCCGC GCACGGGTTG ATCGCCGCGT GGTGCGAGCA AGGCAAGGTC
GCGAAGATCG CGGCGCTCTG GGCGCACGGT GTGAACGTCG ATTGGCGCCG GCTTTACGGC
GCGCGCGCGC CGGTGCGTGT GAGTCTGCCC ACCTATCCGT TCGCGCCGGA GCGTTGCGAG
GGCGTCGCGC GCCGCCGGGC CGCCGCGCCG GCGCCGCGCC GCGCGGGCGT CGAGACGGCC
GCGGCGCGGC TGCATCCGCT CGTTCACGAC GACCGCTCGG ACGGGGCGCG CCGGCGGTTC
GCCGCGACGT ATTCCGGCGA GGAATTCTTC CTGGCCGATC ATCTGATCCG CGGCAAGCGG
ATCCTGCCCG GCGTCGCGTA TCTCGAGATG GCGCGCATGG CGGCCGTCCG GGCGCATGGC
GACGGCGCGC TGAGCCTGCA CGACGTGGTC TGGATGACGC CCATCGTCGT CGACGGGCCG
TGCGAGGTCG AGCTGAGCCT GGAGGCCGCC GAGCGTGCCG AGGCCGAGGG GGCCGCCGAG
GCGGCCGCCG GCGTGCGCAC GATGCGGTTC AACGTGACCT CGGGCGGCGG CGCCGGCGCG
CGCCGCACGA ACAGCCAGGG GACGATTCGC CTCGCGCCCG GCGCGGCCGC GCCCGCCGCC
GCGCGCGTCG ATGTCGCGGC GCTCCTCGCG CGCTGCACGC GCGAGATCGG GGCGCAGCGG
TTCTACACGT TCCTCGACAG CGGCGGCGGC CATTACGGGC CGACGTTCCG GAGCGTCGCG
GCGCTGCATC AGGGCGAGCG CGAGGTGCTC GCGCGGCTCG CGCTGCCGGA GTCCGTCGCG
CACGCCGATG CGTTCGTGCT GCATCCGAGC ATGATGGACG CCGCGTTCCA GATCGCCGAC
AGCCTGATCC TGCAACCGCG CGCGAACGGC GGCTGTCTGC CGTTCTTCGT GAAGGAGCTC
GTCGTGCGAC GCCGGCCGGG CCGCGACGCG TGGGTCCACG TTCGCCTCGC CGGCGGCGAT
GCGACGCTTG CCCGCTACGA CATCGATCTG ATCGATCCCG ACGGCACCGT CTGCGTGTCG
ATGCGCGAAT TCAGCGCGCG CGCGGAGACG GCGGGCGGCA GCGGCCGGCC GAACACGTAC
CGCGCCGCCG AATGGCGCGC CGCGGAGCGC GACGGGAACG AGCTGAGCGA GCTGAACGAG
CTGAACGAGG GGAACGAGGG GAACGAGGGG AACGAGGGGA ACGAGGGGAA CGAGGGGAAC
GAGGGGAACG AGGGGAACGA GCGGCGTCGC GCCGCGCCGC GGGTGGCGGT GCTCGACGCA
TCGCCGCGTC TTGCGCACGC GCTGCGCGGC ATCGGCGTCG ACGCGCTCTG GCTGCCGGCC
GACGCGGCGC ACGCGGCGCG CGGGCCGGCG CTGCGGGATC TCGACGCGGC GCTGCACGCC
GGCGCGGCGC GCGATCTGCT CGTGCTCGCC GACGAACGGC GCGAGCTCGA CGACGACGCG
TTGCGCCCGT GGCTCGACGG CGCGCCGCAC GCCGGGGGCG CGCGGCGGGC GCTCGTGTCG
ATCGCGGGGC TGGCCGATGC CGACGCGCGC GCCGTGGCGG ACATCGTCGA GCGCGAGCGG
CATGGCCGCG CCGCCGACGT CCGCTACGAC GCCGGCGGCG CGCGCAGCGT GCGCGGCTTC
GCCGACGCGG CCGTCGCGCG CTGGCTGCTC GACACGGACG CGCTGCGCTC GGGCGGCGTG
TACTGGATCG CCGGCGCGAA CGGCCCGCTC GGCGCGAGCC TTGCCTGCCA CCTCGCGACC
GTGGAGCGCG CGACCGTGGT GCTGACCGAC GCGCACGCGA TCGATGCGGC CCGGCTCGCC
TGCCTCGACG GGTATCGCGC CGGCGGCGCG CGCCTCGAGT TCATCGAAGG CGACGCCGCG
CGAGACGGCG CGGCGCTCGC GCAGCGGATC CGCGCGCGTC ACGGGCGCAT CGACGGCGTG
CTGCACTGTG CGCAACACGC GTCGGCGCCG ACGCTCGCGG CGCTGGCCGC GCTCGACCGC
GCGACGCGCG CCGACGCGCT CGATTGCTTC GTCGCCTGCG AGGCGCGGGA TGCCGATCCG
GATCACGATC CGGCGGCCGC GCTCGTGGCG CGATTCGTCG AGCGGCGCCA CGCGCGCGTG
CAAGCGGGAC TCGGCGGGGG GCGGACGGTG GCGATCGCGG CGCACGCGGC GCTGCCGTGG
CCGGACGACG CGCCGCTGCT GCGCGCGGGC GGTATCGCGA GCCAGCCCGC GCTGGCGATC
GTGCAGGCGC TGCATCATGC GTTGCGCTCG GACGAGGCGA TGCTCGCCGT CGGCTGGGGG
GCGTCGGCCG GCGGCGTCGA TGCGGGCGCG TCGAAGGCGG CGAACGCATC GAACGCATCG
AACGCATCAA ACGCGTCGAA CACATCGAAC GCGCCGAGCG TCGCCGCGGA TCTTGCCGCG
CCCGCCGAGC CGAACGCGCG GATTCCCGCG CGCGCGCGGG CCACGCCGGA CACGATCGCG
GCATGCCTGA AGGCCGTGAT CGCCGACGTG ATCCGGGCGG ACGTCGACGA AATCGACGCG
CGCCAGCACT TCGGCGAATA CGGACTCGAC TCGCTGTCGC TGACGTCGGT CAGCAACCGG
CTCAATGACG CATACCGCCT CGATGCTTCG CCGGCGGGCG CGCTGAATCC GACGCTGTTC
TTCGAATACC CGAGCGTCGA GCGGATGGCG GCTTATCTCG CCGAGCATCA CGCCGCGCGC
TTCGCCGACG CGTCGGCCGC ACCCGGCGCC GACGGGGCGG CCGAGTGCGC GCCGCGGCCC
GAGGCCGCGC TCAACGCCGA GGTCGAACCT GGGAACGGGG CCGCGCCCGC GCCCGAGCCC
GAGGTCGGGT TCAGGGCCGG GTTCGAGCCG GTCGCGCCGC CGATGCCGCG CATCGAGCCC
ACCGCATCGA CGCCGCCGGA CCAACCGGCG CCGCAACCCG GCGGTGCATG GCACGCCGGG
CGCGGCGCGT GCCCGGCGGC CGACGACGTC GCGATCATCG GCATCAGCGG CCGCTTTCCC
GGCGCGCGCG ACGTGGCCGA ATTCGGCCGC AATCTGTTCG ACGGCCGGGA CTGCATCGGC
GAGATTCCCG CGGACCGCTG GGACTGGCGC GCGTACCTCG GCGATCCGCA GCACGAGGCC
GGCAAGACGA ACAGCAAGTG GGGCGGCTTC ATCGACGGCA TCGCGGAATT CGATCCGCTG
TTCTTCAGCC TCTCGCCGAA GGAGGCCTAT CTGCTCGATC CCGCGCACCG GCTGCTGCTG
ATGCACGCGT GGTGGGCGAT CGAGGACGCC GGCTACAACC CCGCCGCGCT CGCCGGCAGC
CGGACCGCGC TGTTCGCGGG CATCGCGCAG AGCGGCTACG CGGATTTGCG CAGGCAAGCC
GGCGAGGGGA TCGAGGGCAA CTCGTTCCTC GGGGTCGTGC CGTCGATCGC GCTGAACCGG
ATCAGCCACC TGCTCGATCT GCACGGCCCG AGCGAGCCGG TCGAGACGGC CTGTTCGTCG
TCGCTCGTCG CGATGCACCG CGCGCTCGTC AGCCTGCGCT GCGGCGACGC CGACATGGCG
CTCGTCGGCG GCGTGCAGAC GATCCTGTCG CCGCACGCGC ATATCGGGTT CGGCAAGGCG
GGCATGCTCG CGACCGACGG CCGCTGCAAG GCGTTCTCGA GCCGCGCCGA CGGCTTCGTG
CGCGGCGAGG GCATCGGCAT GCTGTTCCTG AAGCGGCTCG GCGACGCGCG GCGCGACGGC
GACGCGATCT ACGGCGTGAT CCGCGGCAGC GCGGTCAATC ACGACGGCCG GTCGAGCTCG
CTCACCGCGC CGAACCCGGC CGCGCAGCGC GACGTGATCG TGCAGGCGCA CATGCGAGCC
GGCGTCGACC CGCGCAGCAT CGGTTACATC GAGGCGCACG GCACCGGCAC GAAGCTCGGC
GATCCGATCG AGATCAACGC GCTCACGCAG GCGCTCGACA CGCTGCTGCG CGCGCAGCGC
GAGGAAGGCG CCGCCTACGT TCCCGGCGCG TGCGCGATCG GCTCCGTGAA GAGCAACATC
GGCCATCTGG AGCTGGCCGC CGGCGTGTCC GGCGTGATCA AGGTGCTGCT GCAGATGGCG
AACGGGCGGC TCGCGAAGAG CCTGCATTGC GACGAGCTCA ATCCGTACAT CACGCTCGAC
GGCGGGCCGT TGCGCGTCGT CGGCGCGAAC GCCGCGTGGC CGCGTCCCGT CGATCGCGAC
GGCCGCGAGC AGCCGCGCCG CGCGGGCGTG AGCTCGTTCG GCATCGGCGG CGTGAACGCG
CACGTCGTGC TCGAGGAGTA TCCCGAGGCC GACGCGCGCG CGCGCGACGA CGGGCAGCCG
GCCGCCGTGC CGCTGTCCGC GCGGGATTCG CAGCGGCTCG CCGATTACGC GAGCGCATTG
CTCGCGTTCG TGCGCGAGCG GCGCGAGGCG GCCGCGCATG CGCCGCCGCC GCGGCTGTCG
GATCTCGCCT ATACGCTGCA GGTGGGCCGC GAGGCGATGC GCGAGCGTGT CGGCTTCGTC
GTCACGTCGC TCGCGCAACT CGAGGCGCGG CTTGCCGCGT TCGTCGCGGG CGAGCCGGCG
GGCGACGGCG TCTACCGCGG CAGCGTCCGC CCGGCGCGCG GTGAACGCGC GGCCGACGCG
GACGGCCTTG ACAGGCTCGT CGACATCTGG CTCGCGAGCC GCAAGCATGA GGCGCTGCTC
GGTGCGTGGG TGAAGGGCGC GGCGATCGAC TGGGCGAGAC TTCACGCGGG CGGCGCGCCG
CGCCGCGTCC ATCTGCCCGG CTATCCGTTC GCGCGCGAGC GCTACTGGAT CGCCGAGCCC
GCGCCGGCGA CGGGCGCGCC CGGCGAGCCC GCGCCGCCGC GCATGCCGAC GCAGCCGCAC
GGGCCGACGT CCGACGGCCG CGCCGAATCG CGCCATCCGT TGCGGCGCGA CGCCGCCGAC
GGCCGGTTCC TGCTCGATCT CGACGGCGAC GAGGCCTTTC TCGCCGACCA TCGGGTGGAC
GGACGCCGCG TGCTGCCGGG CGTCGCGCAC CTGGAGATCG CGTACGAGGC CGCGCGGCGC
ACGTTCGGCC CGGCCGATGC GATCCGGATC CGGAACCTCG GCTGGATCAG GCCGATCGTC
GCCGACGGCG CGCTGCGCAT CGGCGTCGAA CTGAGCATGA CCGGCGCCGC CGAAGGCGCG
TTCCGCCTCT ACACGACGGA CCCGCAACAT GGGCGGCTCA CGCACAGCGA AGGCGCGATC
GGTCGCGCCG ACGTCGCGCA GTCGGCGCGC GCGCTCGATC TCGGCGCGCT GCGCGACGCG
TTCGCGACGG CCGAGCGCGT CGATCCGGCC GTCTGGTACG ACGGCTTCTC GCGGGCCGGC
ATCGATTACG GCCCGAGCCA CCGCTGCCTC GAAACATGCG CCGTCGGCCC GGCCGGCGTG
CTCGCGCGGG TGCGCCTGCC GGCCGCCGAG GCGCGCGCGG CGCGGCCGTT CACCTTGCAT
CCGGGCCTGA TGGACGCGGT GCTGCAGGCG GCGATCGGCC TGCGCAAGCG CGCGGGCGGC
GCGCCGCGCG GCACGCCGTA TCTGCCGTTC GCGCTCGACG CGGTCGAGAT TCTCGGCGGC
TGCGGCGAGG CGGCGTGGGC ATGGCTGCGC CCGTCGCCGC GCGACGCGGC CGACGCTTCG
GCGTCGCGCG GCGACGCGGG CAAGCCGGCC GCCGAGCGCA TCGATATCGA TGTGTGCGAC
GACGCGGGCC GGATCAGCGT GACGCTTCGC GGGCTCACGT CGCGCCCGCT CGCGCGCCGG
ACGGCGCCGG CTCCCGAGGC CGGGAACCCG GCCGGTGAAG TCGGCGAGGT GGCCGACGCC
ACCGATGCCG ACGCCGCTGA AGTCCGCGAA ATCTCCGACG TCTCCGACGT TTCCGACGTC
TCCGACGTTT CCGACGTTTC CGACGTCGCG CCGCTCGCCG ACGGCGACGT CGGCCTGCTC
GCGCGAACCG CGGTGTGGAG CGCGCTGACG CCGGCGCAGT GGCTCGCGGA TCCGGCGTCG
CGCCCGCGCG CCGGCGCGCG CGTGTTCGTG CTCGGCGGCA CCGCCGCGCA GCGGCGCGAG
ATCGCGCGGA TTCATCCCGG CTGCGAACCG CTTGAGGCGA ATGCGGCCGA CGACGGCGGC
GACGGCGCGG ACCAACAGGC GCACGTCGAC GCGCTGCGGC GGCGGCTCGC CGAGGGCGCG
CCGATCGACC AGCTCGTCTG GATCGCGCCG CCGGAGCCGG CCGCCGACGC GCGCGCCGGG
CTGCGCGGCG ACGCGATCGT CGCCGCGCAG GAGCACGGGG TGCTGCAACT GTTCCGGATC
GTCAAGCTGC TGCTCGCGGC GGGCTACGGC GGCAAGCCGC TCGACTGGAC GATCGTCACG
CGCGAAACGC ACGCGACGAG CGGCATCGAC GAGCCGTCGC CGACGCACGC GGGCGTGCAT
GGGTTCGTCG GCTCGATGGC GAAGGAGTAC CGGAACTGGC GTGTCCGCCT GCTCGACATG
CCCGCGCGCG AGGCGTGGCC GATCGACGCG ATGTTCTCGA CGCGCTTCGA TCCGCGCGGC
GATGCGCTCG CCTATCGGCG CGGCCGCTGG CTCGCCCGCG AGCTGGCCGC GATCGACGCG
TTGCCCGACG GCGGCTGCCA TGTGAAGGCG GGCGGCGTCT ACGTGGTGAT CGGCGGCGCG
GGCGGGATCG GCGAAGTCTG GAGCCGCTGG ATGATGGAGC GCTATCAGGC GCGGATCGTC
TGGATCGGGC GCCGCGACGA GGACGAGCAA ATCCGCCGCA AGCGCGAGCG GCTCGCGCGC
TACGGCACGC CGCCCGTCTA CCTGCGCGCG GACGCGAGCG AGCGCGCGTC GCTCGCGGCG
GCGCGCGAGC GGATCGCCGC GCTGCGCTGG GACGGCCGCG CGCTGCCGAC GAGCGGCGTC
GTGCATTCCG CGATCGTGCT GGCGGACGCG AGCCTCGCGA CGATGGACGA GGCGCGCTTT
CTGGCCGCGT GGCGATCGAA GGCGGATGTC GGCGTGCGCG TCGCCGAGGT CTTCGGCGGC
GATCCGCTCG ATTTCATGCT GTTCTTCTCG TCGATCACGT CGTTCGGCAA GACGGCCGGA
CAGGCGAACT ACGCGGCGGG TTGCGCGTTC AAGGATGCGT TCGCCGCGCA TCTCGGCCGC
ACGCTGCCGT ATCCCGTCAA GGTGATGAAC TGGGGCTACT GGGGCAGCGT CGGCGTGGTC
AGCGACGAAA CCTATCGCCG GCGCATGGCG AGCGCGGGCT TCGGCTCGAT CGAGCCCGAC
GAGGGCATGT CGGCGCTGGA GCGGCTGCTC GCCAGCCGCG TCGGCCAGAT CGCGGTGCTC
AAGACGCTGC GGCCGAACCT CGTCGGCGAC TCGCGCGCGG ACCGGATCCG GCATTACCCC
GGCCGCGACT GGCCGGACGC GGCGCCCGCG CCGGCGACGG CCGCGCTGCA GGCGGCGCTC
GCGGCGCGCG CCGGGCGCTG GCACGCGCAG GCGTCGGCGC TCGCGCTCGG CAATCCCGAG
CTGGAGACGC TGATTGCGCG CGGCCTGCTC GCGGGCGTCC TTCCGTATCT CGACGCGCCG
GGCTCGGTCG ACGCGCGCCA TGCGCGGTGG TTCGACGAAA GCCGGGCGAT GCTGCACGGG
TTCGGCTATC TCGCGCGCGA CGGCGCGGGC GACGCGCCTT CCTGGTCGCT CACCGACGCC
GGCCGCGCGG CGGCGCCGCA CGTCTGGCAA GACTGGGAGC GGCACGCGCT CGCGTGGCAC
GACGACGAGC GGCGCGTGCC GATGCGGCTC GCGCACGTCT GCCTGCGCGC GCTGCCCGAG
CTTCTCGGCG GCAAGCGGCG CGCGACCGAC GTGATGTTCC CGGGCTCCAG CATGGCGCTC
GTCGAGGGGC TGTACAAGAG CAATCGCAAG GCCGATCTGT TCAACGACGT CGTGCACGAC
GCGGTGCTGT CGTATGCGCG CGCGCTCGGG CGCGCGCTCG ACATCGTCGA GGTGGGCGCG
GGCACGGGCG GAACGACGGA CGGCCTGCTG CGCAAGCTCG TCGAGCAAGG GATCGCGGTG
CGCGAATACC GGTATACGGA TCTGTCGCAC GCTTTTCTGC TGCATGCGCG CGAGCATTAC
GCGCCGCGCG CGCCGTTCCT GACGACCGGG ATCTTCGACG TCGACAAGCC GATCGCCGCG
CAGCGCGTGC CGGGCGGCCG CTATGACGTC GCGGTCGCGA CCAACGTGCT GCACGCGACG
CGCGACGTCC GGCGCGCGCT GCGCAACGTG AAGGCGACGC TGCGCGCGGG CGGCCTGCTG
ATCCTGAACG AACTGAGCGT CAAGTCGCTG TTCAGCCATG TGACGTTCGG GCTGCTGGAC
GGCTGGTGGA TGTACGAGGA CGCCGATTTG CGGATACCCG GCTCGCCCGG CATCGATTCG
TCGACGTGGC GGCGCGTGCT GGCGGAAGAG GGCTTCGAGT ATGTGTTCTT CCCCGCGCAA
GGGCTGCATG CACACGGCCA GCAAGTCATC GTCGCGCAGA GCGACGGCGT GGTCCGGCAG
CCGCGCGCGG CCGCCGCGCC GGGGGCCGGC GCGGCCGCGT CGCCTTCGGG CGGCACGCAA
GCGGCGGTGC CGGCGCGCCG GGCGGCCGCG GCATCCGGCG CGCCGCGCGT GGAGGCGATT
CCGCCGGCGG CCGTTGCGCC CGCGGCCTTC GATGCCGCCA CCGCGGCTCC TCCCGGCACC
GCTGCCGCTG CCGCGACCGC GGTGCCGGCG GACGGCCGAT CCGCGCTCGC CCACGCAAGT
TCGCCGGCCG CCTCGCCGCC GCAGCCGGGC GACGCGCCCG CGCCCGAACG AATGCATGCG
TATTTGCGCG ACAAGCTCTC GCAAGTGCTG AAGCTGCCGC CGGAGCGCAT CGAGCCAGAT
GCATCGTTCG CGAGCTACGG CGTCGATTCG ATCATGGCGA TGGCGTTGAT CGCGGCGCTC
GAAAAGGAGC TGGGCAGCTT GCCGAAGACG CTGTTCTTCG AGCACGAAAC GATCGAGGAA
CTGGGCGCGT ATCTGCTGGA GCGTTGCGAG CCGATGCCTT CGGGCGTGGA GCCGGCGACG
GTGGGGGCGG ACGATCGCGC CGCGTATTCC GGCGCGAGGC CGCACGCCTG GCCCGCGTCG
CCCACGGAGC CCACCGTGCC CACCGAGCCC ACCGCGTCGC CCGCCTCATC CGCCCCGCCG
GCCGCCTCGC CGCCGCAGCC GGGCGACGCG CACGCGCCCG AACGAATGCA TGCGTATCTG
CGCGACAAGC TCTCGCAAGT GCTGAAGCTG CCGCCGGAGC GCATCGAGCC AGATGCATCG
TTCGCGAGCT ACGGCGTCGA TTCGATCATG GCGATGGCGT TGATCACGGC GCTCGAAAAG
GAGCTGGGCA GCTTGCCGAA AACGCTGTTC TTCGAGCACG AATCGATCGA GGAACTGGGC
GAGTACCTGC TGGAGCGGCA AGGACAAGAG AGGGCGTGCC ATGCAAGCAA CGTTTAA
 
Protein sequence
MLKIDDIDRA DGFFEVGGNS LSVALIASRV GAEFGLARLG AGAFFRYPTV AALAAHLGAR 
LRGDAGAAEG ADGADARASR PAQPRAAGPA ARLPAALDDA IAIIGISCQF PGAQDHRAFW
RNLRDGKSGA RFYSEDELRA AGVPDTLIRD RHYVPMQQTI EGKDLFDRHF FRLTMKDAQL
MDPQFRLLLQ HAWKAIEDAG CTRERIADAG VYMSASNSYY QAMLRAAGTI DASDEYQAWL
LAQGGTIPTR ISYELGLTGP SLFIHSNCSS GLVSLSVAAK SLLQRESRCA LVGAATVLPD
ADIGYVYQPG LNLSSDGRCR TFDENADGLT SGEGVAVLLV KRARDAIDDG DPIYALLRGI
AVNNDGADKV GFYAPSVGGQ ADVIRKVLDA TGIHPETIGY VEAHGTGTKL GDPVEVAALT
DAYRRHTART GFCAIGSVKP NIGHLDTVAG LSGCIKVALS LRHGEIAPSI NYEKPNREID
FAHSPFYVVD RLTRWPAREP GAPRRAALSS FGIGGTNAHL ILEAFERDEP PAGMRAPAAH
AARVIALSAR TEERVRAQAS QLLAFLEQEA GALPDFDGFA FTLQVGREAM RERVAFVADG
YDALAAALAR FLRGEPDAAA CFTGARGGDS TLAALLDDTG DTGDTAAHGL IAAWCEQGKV
AKIAALWAHG VNVDWRRLYG ARAPVRVSLP TYPFAPERCE GVARRRAAAP APRRAGVETA
AARLHPLVHD DRSDGARRRF AATYSGEEFF LADHLIRGKR ILPGVAYLEM ARMAAVRAHG
DGALSLHDVV WMTPIVVDGP CEVELSLEAA ERAEAEGAAE AAAGVRTMRF NVTSGGGAGA
RRTNSQGTIR LAPGAAAPAA ARVDVAALLA RCTREIGAQR FYTFLDSGGG HYGPTFRSVA
ALHQGEREVL ARLALPESVA HADAFVLHPS MMDAAFQIAD SLILQPRANG GCLPFFVKEL
VVRRRPGRDA WVHVRLAGGD ATLARYDIDL IDPDGTVCVS MREFSARAET AGGSGRPNTY
RAAEWRAAER DGNELSELNE LNEGNEGNEG NEGNEGNEGN EGNEGNERRR AAPRVAVLDA
SPRLAHALRG IGVDALWLPA DAAHAARGPA LRDLDAALHA GAARDLLVLA DERRELDDDA
LRPWLDGAPH AGGARRALVS IAGLADADAR AVADIVERER HGRAADVRYD AGGARSVRGF
ADAAVARWLL DTDALRSGGV YWIAGANGPL GASLACHLAT VERATVVLTD AHAIDAARLA
CLDGYRAGGA RLEFIEGDAA RDGAALAQRI RARHGRIDGV LHCAQHASAP TLAALAALDR
ATRADALDCF VACEARDADP DHDPAAALVA RFVERRHARV QAGLGGGRTV AIAAHAALPW
PDDAPLLRAG GIASQPALAI VQALHHALRS DEAMLAVGWG ASAGGVDAGA SKAANASNAS
NASNASNTSN APSVAADLAA PAEPNARIPA RARATPDTIA ACLKAVIADV IRADVDEIDA
RQHFGEYGLD SLSLTSVSNR LNDAYRLDAS PAGALNPTLF FEYPSVERMA AYLAEHHAAR
FADASAAPGA DGAAECAPRP EAALNAEVEP GNGAAPAPEP EVGFRAGFEP VAPPMPRIEP
TASTPPDQPA PQPGGAWHAG RGACPAADDV AIIGISGRFP GARDVAEFGR NLFDGRDCIG
EIPADRWDWR AYLGDPQHEA GKTNSKWGGF IDGIAEFDPL FFSLSPKEAY LLDPAHRLLL
MHAWWAIEDA GYNPAALAGS RTALFAGIAQ SGYADLRRQA GEGIEGNSFL GVVPSIALNR
ISHLLDLHGP SEPVETACSS SLVAMHRALV SLRCGDADMA LVGGVQTILS PHAHIGFGKA
GMLATDGRCK AFSSRADGFV RGEGIGMLFL KRLGDARRDG DAIYGVIRGS AVNHDGRSSS
LTAPNPAAQR DVIVQAHMRA GVDPRSIGYI EAHGTGTKLG DPIEINALTQ ALDTLLRAQR
EEGAAYVPGA CAIGSVKSNI GHLELAAGVS GVIKVLLQMA NGRLAKSLHC DELNPYITLD
GGPLRVVGAN AAWPRPVDRD GREQPRRAGV SSFGIGGVNA HVVLEEYPEA DARARDDGQP
AAVPLSARDS QRLADYASAL LAFVRERREA AAHAPPPRLS DLAYTLQVGR EAMRERVGFV
VTSLAQLEAR LAAFVAGEPA GDGVYRGSVR PARGERAADA DGLDRLVDIW LASRKHEALL
GAWVKGAAID WARLHAGGAP RRVHLPGYPF ARERYWIAEP APATGAPGEP APPRMPTQPH
GPTSDGRAES RHPLRRDAAD GRFLLDLDGD EAFLADHRVD GRRVLPGVAH LEIAYEAARR
TFGPADAIRI RNLGWIRPIV ADGALRIGVE LSMTGAAEGA FRLYTTDPQH GRLTHSEGAI
GRADVAQSAR ALDLGALRDA FATAERVDPA VWYDGFSRAG IDYGPSHRCL ETCAVGPAGV
LARVRLPAAE ARAARPFTLH PGLMDAVLQA AIGLRKRAGG APRGTPYLPF ALDAVEILGG
CGEAAWAWLR PSPRDAADAS ASRGDAGKPA AERIDIDVCD DAGRISVTLR GLTSRPLARR
TAPAPEAGNP AGEVGEVADA TDADAAEVRE ISDVSDVSDV SDVSDVSDVA PLADGDVGLL
ARTAVWSALT PAQWLADPAS RPRAGARVFV LGGTAAQRRE IARIHPGCEP LEANAADDGG
DGADQQAHVD ALRRRLAEGA PIDQLVWIAP PEPAADARAG LRGDAIVAAQ EHGVLQLFRI
VKLLLAAGYG GKPLDWTIVT RETHATSGID EPSPTHAGVH GFVGSMAKEY RNWRVRLLDM
PAREAWPIDA MFSTRFDPRG DALAYRRGRW LARELAAIDA LPDGGCHVKA GGVYVVIGGA
GGIGEVWSRW MMERYQARIV WIGRRDEDEQ IRRKRERLAR YGTPPVYLRA DASERASLAA
ARERIAALRW DGRALPTSGV VHSAIVLADA SLATMDEARF LAAWRSKADV GVRVAEVFGG
DPLDFMLFFS SITSFGKTAG QANYAAGCAF KDAFAAHLGR TLPYPVKVMN WGYWGSVGVV
SDETYRRRMA SAGFGSIEPD EGMSALERLL ASRVGQIAVL KTLRPNLVGD SRADRIRHYP
GRDWPDAAPA PATAALQAAL AARAGRWHAQ ASALALGNPE LETLIARGLL AGVLPYLDAP
GSVDARHARW FDESRAMLHG FGYLARDGAG DAPSWSLTDA GRAAAPHVWQ DWERHALAWH
DDERRVPMRL AHVCLRALPE LLGGKRRATD VMFPGSSMAL VEGLYKSNRK ADLFNDVVHD
AVLSYARALG RALDIVEVGA GTGGTTDGLL RKLVEQGIAV REYRYTDLSH AFLLHAREHY
APRAPFLTTG IFDVDKPIAA QRVPGGRYDV AVATNVLHAT RDVRRALRNV KATLRAGGLL
ILNELSVKSL FSHVTFGLLD GWWMYEDADL RIPGSPGIDS STWRRVLAEE GFEYVFFPAQ
GLHAHGQQVI VAQSDGVVRQ PRAAAAPGAG AAASPSGGTQ AAVPARRAAA ASGAPRVEAI
PPAAVAPAAF DAATAAPPGT AAAAATAVPA DGRSALAHAS SPAASPPQPG DAPAPERMHA
YLRDKLSQVL KLPPERIEPD ASFASYGVDS IMAMALIAAL EKELGSLPKT LFFEHETIEE
LGAYLLERCE PMPSGVEPAT VGADDRAAYS GARPHAWPAS PTEPTVPTEP TASPASSAPP
AASPPQPGDA HAPERMHAYL RDKLSQVLKL PPERIEPDAS FASYGVDSIM AMALITALEK
ELGSLPKTLF FEHESIEELG EYLLERQGQE RACHASNV
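
The gene sequence starts with GTG while the protein sequence starts with M. Under translation table 11, GTG can act as an initiator codon and is read as methionine at the start of a CDS, although it encodes valine elsewhere. The sketch below illustrates this on the first 30 bases only. The codon map is deliberately partial (just the codons in that prefix), the set of start codons is a subset of those allowed by table 11, and the function name `translate_prefix` is hypothetical.

```python
# First line of the gene sequence and first block of the protein sequence,
# copied from this record.
gene_prefix = "GTGCTGAAGA TCGACGACAT CGATCGCGCG"
protein_prefix = "MLKIDDIDRA"

# Partial codon map: only the codons that occur in gene_prefix.
# This is NOT the full translation table 11.
PARTIAL_CODONS = {
    "GTG": "V", "CTG": "L", "AAG": "K", "ATC": "I",
    "GAC": "D", "GAT": "D", "CGC": "R", "GCG": "A",
}

# Subset of the initiator codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna):
    """Translate a short, in-frame CDS prefix using the partial map above."""
    seq = dna.replace(" ", "")
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine.
            residues.append("M")
        else:
            residues.append(PARTIAL_CODONS[codon])
    return "".join(residues)


translated = translate_prefix(gene_prefix)
print(translated == protein_prefix)
```

For the full 11457 bp sequence a complete codon table would be needed; this sketch only covers the prefix shown.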