Gene BURPS1710b_A1856 details


Gene Information

Locus tag: BURPS1710b_A1856
Symbol: albI
ID: 3692377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2249523
End bp: 2261894
Gene Length: 12372 bp
Protein Length: 4123 aa
Translation table: 11
GC content: 73%
IMG OID: 637732110
Product: polyketide non-ribosomal peptide synthase
Protein accession: YP_337013
Protein GI: 76818113
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR01746] thioester reductase domain
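
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Gene length for 1-based, inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2249523, 2261894)
print(length)                     # 12372
print(protein_length_aa(length))  # 4123
```

Both results agree with the Gene Length and Protein Length fields of this record.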


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCCT CCCCTCCCTC CAGCGCACTC GTCACGGCCG TCGAAGCGGC CGTCCTGTCG 
CTCGCCGGCG ACGTCGCCGG CCGCGCGTTC GACGCGTCGG CTGCGGCGCG CCCGCTGCAC
GCGCTCGGCT TCGATTCGGT GCAGTACGTC GAATTGTCCG GATGCCTGAA CGAATACTAC
GGGCTCGATC TCGCGCCGAC GCTGTTCTTC GACGTGCACG TGCCGCGCCG GATCGCCGAG
CATCTCGTCG CGCGGCATCC GGCGGCGCTC GCGCGCAAGC ACGGCATCGG GGCCGGGGAC
GACGCCGACA CGGCCGCTCG GGCCCGCGCG GCCGCGGCCG AGAACGGCGC GCCGCAGCCG
GACATGCGAG CCGGGGCGGC GCGGCCCGCG GGCGAGCCGC TTCTCGACAC GCATGCGAGC
CCCGGCGAGC CGCGCGGCGA CGCACACGAA AATCCATGTG ACGACACGCG CGGCGCGGCC
GCCGCCGACG CGCATGAATC GGCCGCCGAT ATCGCGATCG TCGGCATGGC CGGCATCTTC
CCGCAATCGG CCGACCTCGA CGCGTTCTGG CGGCATCTCG CCGCGGGCGA CGATCTGATC
GCCGAGGCGC CGGCCTCGCG CTGGGATTGG CGCGCGGGCG ACGGCGAGCC CGCATCGCGC
TGGGGCGGCT TCATCCCGCG CATCGAATAT TTCGACGCCG CGTTCTTCGG CATCTCGCCG
CGCGAAGCCG AGCAGATGGA CCCGCAGCAG CGCCTGCTGA TGCAGACCGC GTGGGCGGCG
CTCGAGGACG CGGCGGTGCG CCCGTCCGAT CTGATGGGCA GCGACGCGGC GGTGTTCGTC
GGCGTCAGCA CGTCCGACTA CATGGCGCTG CTGCCCGGCG CGGACGGCCA TCTCGCGGTC
GGCAACGCGC ACGCGATGCT GCCGAACCGG CTGTCGCACC TGCTCGGCGC GCACGGGCCG
AGCGAGGCTG TCGATACCGC GTGCTCGAGC TCGCTTGTCG CGCTGCATCG CGCGGTGCGC
GCGCTGCGGC GCGGCGAAAG CAGCGTCGCG ATCGTCGGCG GCGTCAACGT GATGCTGACG
ACGCGGCTGC ACCGCGCGCT CGCCGCCGCC GGCATGCTGA GCCCCGACGG GCGCTGCAAG
ACGTTCGACG CGGCGGCGAA CGGCTACGTG CGCGGCGAGG GCATCGCGGC GCTCGTGCTG
ATGCCGCTCG AGCGCGCGCG CGCGAACGGC CACCCGGTGC GCGCGGTGAT CAAGGGCAGC
GCGGTCAATC ACGGCGGCCG CGCGGCGTTC CTGACCGCGC CGGACATCAA CGCGCAGGCC
GCGCTGATCG AAGCCGCGTA TCGCGACGCG GGCGTCGACC CCGCCACTGT TTCGTACATC
GAAGCGCACG GCACCGGCAC GTCGCTCGGC GATCCGATCG AAGTGCAGGC GCTGCGCCAG
GGCCTCGACG CCTGCGCGCG CGACCTCGCG GGCACCGCCT CGCACGCGCC GGCACGCTGC
GGCCTGGGCT CGGTCAAGAC CAATATCGGG CATCTCGAAG CGGCGGCGGG CCTCGCGGGC
GTCGTCAAGG TCGTGCTCGC GATGGACCGG CGCATGCTGC CGCCGAGCCT GCATTGCCGT
GAACTGAATC CGTATCTGAA GCTCGACGGC AGCCGTTATC ACGTCGTCAC GGAACCCACG
CCCTGGCCGG ACGAAGCAAC GCCGACGCCG CTGCGCGCGG GCGTCAGCTC GTTCGGGTTC
GGCGGCTCGA ACGCGCACGT CGTGCTGCAA TCGGCGCACG CGCGGCCGAT CGCGCGAGCG
AGCGCGCCCC CACCGCCGCA CACGAACGAA CAAGCCGGTG CCGACGCGCC CGCCGCCGAC
GGCCCGCGCG CGTGGTTCAT CCCGCTATCG GCGCGCACCG ATGCCGCGTT GCATGCGCGC
GCCGCTCAGC TCGCGCACTG GCTCGACACC GAGCCGGCCG ACGACGCGTG GCTGCCCGCG
CTCGCGAAGA CGCTGTCGAT CGGCCGCGAA CCGATGGCGT GCCGCTTCGG CATCACGTGC
GCGTCGCTCG ACGAACTGCG CGCGCAACTC GCGATCGCGC TGGGCGGCCG CGCAACGTCG
CTCGCGCGCG ATGACGCCCG GCTGCGGCCG CATGCGCCCG CCTGCGCGGC GTGGCTCGCG
GGCGAGACCG ACCCGCTGCC CGCCGCGTGG GATGACGCGA CGCCGCGCCT GCGGTTGCCC
GTCTACCCGT TCGAAGGCGA GCGGCACTGG CCGACCGAAG CAGCGCCGGC GGCGCGCTTC
GCGCTCGCGC CCGACGCCGA CGGCGCATAC CGGATCGCGA TCGAACCCGA CGCGCCGCTC
GTCGCCGACC ATCGGCTCGC CGGCGAGCCG GTGCTCGCCG CCGCCGCGCA AATCGTGATC
GCGTGGCGCG CGTTCGAGGC GGACGCGCTC GCCGGCGATG CCGGCCAGGC GGGCGACGTC
GGCGAGTCGA TGGAGTCGAT GGAGTCGAAC GGATCGAGCG CATCGAAGCC GGCGGCGACG
TCCGCCGATT CGGACACCGC CGCCGATTCA CGCGATCCGC ACGATTCATA CCACTCGCAC
GACTCCCGCC ACACGATCGA CACGAACGCC ACGAACGCCG CGAACGCCAC GACGCCTATC
GCGTTGCGCG ACATCGAATG GCTCGCGCCG ATCGCGATCG GCGCGCCGAC CGACCTGTGC
ATCACGCTCG CGCGCGACGC CCACGGCGAC ATCGACGCGC GCCGCGGCGA AGCCGCCCAT
CGGCGCGCAA ACGGCCGCGC CGCCCGCTTC GCGATCGCGG CCGCCCCCGC CATCGATACG
CCGCTCGGCC GCGGACACGC GACGCGCATC GCGAGCGCGC CGTCGGACGC GCCCGAGCTC
GACATCGAGG CCATCCGCGC GCGCTGCACG CAAGCGGTCT CGGCCGACGC GTGCTACGAC
GCGTTCGCCG CGATCGGCAT CGATTACGGC CCGACGTTCC GCCCGCTGCG CGCGATCGCG
GTCGGCCGTG ACGAAGCGCT CGCCGAATTC GACGCGTCGG CGCTCGCGCG CACGACGGGC
GACGCGCGTA TCGTCGCGCT GATCGACGGC GCGTTCCAGG CGATCGCGGG CCTGACGCTC
GCGCACGCCG CGAGCCTCGA AAGCGGCCTG CTGCCCGCGT CGCTCGCACG CATCGAGTTC
ACCGGGCCGC TCGCGGACAG CGTCCGCGCG TGGATTCGCG AAGCACCGAG CGACACGGGC
CGCCGCACAT TCGATATCGA CCTCGTGACG GCGAGCGGCC GGTCGTGCGC GTCGCTGCGC
GGCCTCGCGC TCGCGTCCGG CCGAAGCGCA ACGTCGCGCG AAGCGCCACG CATCACGACG
CCGGGCGACC ATCTGTTCGC GCCGCAATGG CTGCCGTGCG CGACGAACGC GGCCGGCGCG
GCAACGCCGT CGCCGCGCGC CGGCGCGCTC GCGATCATGG GCGGCACGCC GGCGCAGCGC
GCCGCGCTCG GGGCGACGCA CGCGGCGGCG CCGCGCCTGA TCGACGACAT CGCCGAACTC
GACGCGAACG TGAGCCATCT CGTCTGGCTG CCGTCCGCGC CCGCGGACAC ACATGCGCCG
CTCGCGCAAT GCGCGAGCCT CGACGGGTTG CGCCTCGTGA AGCGTTTGCT CGCGCTCGGC
GCGGGCGATC GCGCATTCGA TCTGACGGTG CTCACCGTCC GTTCGTGGAC GATGCCGGGC
GACGCGCCCG CGTTTCCCGC GCACGCGGAT CTCGCGGGGC TGTGCGGGGC GCTCGCGAAC
GAATACCCGC ACTGGCGCGT GCGGCTCATC GATCTGCCCG ACGCCGCTGC GCTGCCCGCC
GACTGGCACG CGCGGAGCGC CGAAGGCGGC CATCCGTTGC TGCTGCACCG GCACGGCCAA
TGGTTCGCGC GCGGGCTCGT GCCGCTCGCG GCGCTGCCCG CGCCCGCCGC GCTGCCGTAT
CGGCCGGGCG GCGTGTATGT CGCGATCGGC GGCGCGGGCG GCCTCGGCCG GGTGTGGACC
GAGCACGCGA TTCGCGCCTG CGGCGCGCAA GTCGTGTGGA TCGGGCGGCG GCCGCTCGAC
GCGCAGATCG ATGCGCACTG TGACGCGCTC GCCGCGCTCG GCCCGCGCCC GTCGTATCTG
AGCGCCGACG CGAGCGACGC CGAGAGCTTG CGCGCCGCGC GCGATGCAGT GCTCGAACGC
TTCGGGCGGC TCGACGGCGT CGTGCACACG GCGATCGTGC TGGAGGACGG TGGCCTCGCG
CAGCTCGACG AAGCGCGATT CAGCGCGGCG CTGAACGCGC AGGTCGCGAC GACCGCGAAC
CTCGCCCGCG TGTTCGGCAG CGATCCGCTC GATTTCATCC TGTTCTTCTC GTCGCTGCAA
AGCGCGTTCG TCGCGGCGGG CCAGAGCAAT TACGCGGCCG GCTGCACGTT CCGCGACGCG
TTCGCCGACT GGCTGCGCAC GCAGCTCCGA TGCGCGGTCA AGGTCGTGAA CTGGGGCTAC
TGGGGGCAGA CGGGCGTGGT CGCGACCGAG CCGTACCGCG CCCGCATGGC CGCGCTCGGC
ATCGGGTCGA TCGAGCCCGC CCCGGCGATG GCGGTCGTCG ATGCGCTGCT CGCCTCGAAC
GTCGATCAGG TCGGCTATCT GAAGACGATC GCGAGCGCCG CGGTGCCGAC GCTCGCGCCC
GCGCTCGCCG CGCGCATCGC GCCGCGCACG CGCGCGCTTG CCGGCACGCC GCCGCGCGTC
GACGCGACGG ACGACAGCGC GGCGTGGCGG GACGCGCTCG CGGCGCTCGA ACGCGCGATC
GCGCGCCGGC TGTTCGCCGA GCTCGGCGCG CTGCGCGTGT TCGGCGGAAG CGGCGCGCCG
GGCGGCCATG CGTTCGACGA CGGCGCGGCC CGAAACAGCG CGGCCGGCCA ACGTTCGGCC
GATGACCGCG CGCCCGACGC CGCGCCGTTC GACATCGACA CCGCGCTGCG TACCGGCCGC
GTCGCGCCCG CCTACCGGCG CTGGCTCGCC CATGCGTTGA CGCTGATCGC GCGGCACGGC
CCCCTTGCGT GGGACGGACG CTCGGGCCGC CTCGCCGAAG CGCCGCCGAC GCCGGACGCG
GCGCGCGCGG AATGGGCGCG CGCACGCGCC GAGCTCGAGC GCACCGCGCT GCTCGACGCC
CATCTCGCGC TCGTCGACGC GACGCTCGAC GCGCTGCCCG CGATCCTGCA AGGCAGCGTG
CCCGCCACGT CGATCCTGTT CCCGGACGGC GATCTGAGCC GCGTCGAAGC GGTCTATCAG
CGCAACGAGC AGGCGGACCG CTGCAACCGC GCGCTCGCCG ATGCGGTGCT GCACCTCGTC
GGCGACGCAT CGTCCGCGCA ACCGGCCGCG CTCGCCGAAA TCGGCGCGGG CACGGGCGGC
ACGACCGTGC CGCTGCTCGC GGCGCTCGAC GCGCGCGGCG CGCGGCTCGG CCGCTACGAC
TTCACCGACA TCTCGAAGGC GTTCCTGCTG AACGCCGAGC AAACGTTCGG CCGGGGCCGC
GACATGCTGC GCTACCGGCT GTTCGACGTC GAGCGGCCGA TCGCCGAGCA GGCGCTCGAC
ACCGGCGGCT ACGACATCGT GATCGCGACG AACGTGCTGC ACGCGACGCA GGACATCGGC
GTCACGCTGC GCAATGCGAA GGCGCTGCTG AAGGCAGGCG GCCATCTGAT CATCAACGAA
CTGCTCGGCA CGCACGGCTT CGCGCATGCG ACGTTCGGGC TGCTGCCCGG CTGGTGGCGG
CACCGCGACA GCGCGCGCCG CCTGCCCGGC AGCCCGCTGC TGTCGCGCGA CGGCTGGACG
CGCGCGCTGC GCGAAGCCGG CTTCGCGGTG CTCGACGGCG GCTCGGCCGG CGCCGCGGCG
GGGCAAGGCG TGATCGTCGC GCTCAGCGAC GGCGTGATCG TGCAGCCGTC GCACGCCGAC
GCGCGGGCGG CCTCATGCGC GGCTTCGCGC GCGGCCCCGG GCGACGACGC CGGCGCGCAC
GCCAGCGCCG CGCGGCCGGC CGCATCGGCT TGTTCGACTG CCTCGCCCGC ACACGCGCCC
GCGGCTTCGC CGATCGCCGC CGCGCCGACC GGCGCGAGCC TGCGCGCGCG CTGCGTGCAG
GCGCTCGCGC AACTCGTCGC GCGGACGCTG AAAATGCCGG TCGGCAAGCT CGCGCCCGAT
CAGCCGCTCG GCAGCTACGG CGTCGATTCG ATTCTCGTGA TCGGGCTCAC GAAAACGCTG
CGCGAGACGT TCGGCGTCGC GCTGTCGAAC GCGACGCTGT TCGAGCATGC GACGCTGAAC
GCGCTCGCCG AATTCTTCGT CGCCGAACAT CGCGCGGCAT GCGAACGCGT GCTGGGCAAC
GACGCGGAAC CCGCGCCGAA TGCGCCGAAC GGACCAAACG CCGCGAGCGC AGCGGCGGCC
ATGCGCCCGG CCATGCCACC GGCCCGCGCC GGCGCCCCAT CGCCCGCCGC GGCTTCGGCC
GCGCCGAAGC CGCGCGAATC GAACGTGTGC GCCCCGCCCG CCGCCGACGA CACCGCCGTC
GCCGTGATCG GCATGTCCGG CCGCTATGCG CAGGCGGACA ACCTGCGCGA GTTCTGGGCG
AACCTGCGCG CGGGCCGCCA TTGCATCACC GAAGTGCCCG CCGAGCGCTG GGACTGGCGC
ACGCACTTCG ATGCGGAAAA AGGCGCGCCG GGCCGCACGT ACAGCCGCTG GGGCGGCTTC
CTGACGCAGA TCGACCGCTT CGACGCCGCG TTCTTCCGAA TCGCGCCGAA CGACGCCGAG
CAGATCGATC CGCAAGGCCG CCTGTTCCTC GAGGAATCGT GGGCCGCGAT CGAGGATGCC
GGCTATACGA GCGACACGCT CAGCGCGGAC CGCCGGGTCG GCGTGTTCGT CGGCGTGATG
AACGGCGACT ATCCGACGGG CGCGCAGTTC TGGAGCATCG CGAACCGCGT GTCGCACGCG
CTCGACCTGC ACGGGCCGAG CCTCGCCGTC GACACCGCGT GCTCGTCGTC GCTGACCGCG
ATCCATCTCG CGCTCGACAG CCTGCGCAGC GGCACCTGCG ACTGCGCGCT CGCGGGCGGC
GTCAACCTGA TTCAGAGTCC GAAGCATCTG GTCGGGCTCG CGTCGCTCAC GATGCTCTCG
GCGGGCGACG CGTGCCGCGC GTTCGGCGCG GGCGCGGACG GCTTCGTCGA CGGCGAAGGC
GTCGGCGTGC TCGTGCTCAA GCCGCTGTCG CGCGCGCTCG CCGATGGCGA CGCGATCCAC
GGCATCATCC GCGGCAGCAT GATCAACGCG GGCGGCAAGA CGCACGGCCT CACGGTGCCG
AACCCGCGCG CGCAGCAGGC CGTCGTCGGC GCGGCGCTCG CGCGAAGCGG CGTGCCGGCG
CGCGCGGTCG GCTACATCGA GGCGCACGGC ACCGGCACCG CGCTCGGCGA TCCGATCGAA
CTCGCGGGCC TCACGCGCGC GTTCGCCGAA GCGACCGACG AGCTCGGCTT CTGCGCGCTC
GGCTCGGTCA AATCGAACAT CGGCCATTGC GAAAGCGCGG CGGGCGTCGC GGGCGTGACG
AAGGTGCTGC TGCAGATGAA GCATCGCGAA CTCGTGCCGA CGCTGCATGC GCACGAGCCG
AACCCCGACA TCGATTTCGC GCGCTCGCCG TTCGTGCTGC AACGCACGCT CGCGCCGTGG
CCGCAGCCGG CGCTCGACGG ATGGCCCCGG ATCGCGGGCG TGTCGTCGTT CGGCGCGGGC
GGCGCGAACG CGCACGTCGT GCTCGAAGAG TTCGTCGAGA CGCGCGCCGC CGCCGGCGGC
GACGACGCAG GCCCCGCGAT CGTCGTGCTG TCCGCCGCGA CCGACGCAGC GCTGCGTCGC
CGCGCGCGGC AATTGCACGC CGCGCTCGCC GCCGGCGAAA TCGGCGACGA GCGCCTGCAC
GATCTCGCGT ACACGCTGCA GATCGGCCGC GCCGCGATGG TCTCGCGCTT CGGCTGCGTC
GCCGGCAGCG CCGCCGAATT GCAGGCGCAG CTCGCCGCGT TCGTCGAAGG CGACGCATCG
CGCGGCTGGC ACGCGCACCG GCTCGCCGGC GACCGCCACG GCCTCGCCGA GCTCGACGCC
GATCCCGAAC TGCGCGCGTC GCTCGTCGAG CAATGCGTCG CGGCCGGCAA GCTCGACCGG
CTCGCGGCAC TCTGGTGCCA GGGGCTCGGC ATCGACTGGC CCACGCTGCA TCGCGGCCGC
GCGCGCCGGC GCATGCATCT GCCGACGTAC CCGTTCGACG GCCCGCGCTA CTGGCTGCGC
GACGACGCGG CGCACGCCGC CGAGCCCGCG CCGGCCGACG GCGCCGCCGA AGACGCAAGC
GCCGACGCAC CGAATGCAGC GAACGCGCCG ACGCCCGACG TCGCAACGCT CGTCCGTCGA
ACGGTGGCGC AAGTGCTCGG CTATCCGGAC GTCGACATGA ACGAATCGTT CCTGTCGCTC
GGCGGCGATT CGATCCGCGC GGCGCGCGCG CATCGGGTGC TGCAACGGGC GCTCGACACG
AGGATTCCGC TCAGCCTGAT GCTGGAGGCA AGCACGCTCG CCGAATGCGC GCAAGCGATC
GATGCACTGC TTTCGACGCA ACCGGAACCG GCGAGCGCGC TCGCCTGCGA AACGAACGCG
GGCGCGGCCG GCGCGCCGAT CGCCGACGCG GCCGCGCTCG AGTCGTCGGC GCCGCCCTCC
CGGGAATCGG CCTCCCCGCC ACACCCGGCC TCCCCGCCGC GCGACGCGCG CCCGCGCGTT
CATCCGCTGT CATCGAACCA GCAGCAATTC TTCTTCCTCG ACCGGCTGAA CCCGGCGAAC
CCGGCGTTCA ACCTGCCCGG CGCGCTGCGC GTGCGCGGCG AATGGCACGC GCACGCGCTC
GAAGCCACGT ATCAGGCGCT CATCGATACG CACGACGTGC TGCGCACCCG CTTCGTCGTG
CGCGGCGGCG AACCGTGCGC GGAAGTCGCG CCGCACCGCG CGGCCGCGAT TCGCCGGCAC
GATCTGACGG CGCTGCTGCC GAAGCATCAG GCCGCGCGCG TCGCCGAGTG CCTCACCGAG
TCGAGCCGCG AGGGCTTCGC GCTCGAACAG GGCGAACCGA GCCGGCTGAC GGTACTCGAA
CTGCGCGACG ACGATCACGT GATCCTGCTG AATCTGCATC ACATCGTCGG CGATGCGGTG
TCCGTCGTCG TGCTGCTCGA CGCGCTCGCG CGCGCCGCGC TCACGGGCCG CGCGGCCGCG
CCGGACCGCG CGCGGCCGCA ATACGCGCAA TGGGCCGCGC ACGAACGCGA TGCGCTGCCG
GCGACGATCG AGCGCGAACT GCCGTACTGG CTCGAGCGCC TGCGCGACGT GCCGCCGCCG
TTGCCGCTGC CGTGCGACCG CGCGCGGCCG CCGGTGCCGA GCTATCGTGG GCGCAGCGTG
CCGCTCGCGT TTGCGCCGGC GCTCATCACG CTGCTCGACG CATACTGCAA GGCGCACGGG
CTGTCGCGCT TCGTCGTGAT GCTCGCCGCG TTCAAGCTCG CGCTGCGCGT GCTGTCGGGC
CGTGACGACG TCGTCGTCGG CAGCCCGTAC GCGAACCGCG CCGAGGACGA CACGGCCGAC
ATGATCGGCA GCCTCGCCTA CGCACTCGTG CTGCGCACGC GGCTTGGCGA AGCACAGACG
TTCGCCGATG CGGTCGCGCT CGTGCGGCGC ACCGTGCACG GCGCGTTCGA CCATCTCGGC
GTGCCGTATC CGCGCCTCGT CGAGGCGCTG AATCCGGCGC GGCACGGCGG CGCGAACCCG
CTGTATCAGA TCATGTTCAA CGTGATCCCG ATGCCCGCGC TACCCGAGGG CGTCGAGCCC
GTCGAAGTCG ATTCCGGCTG GCTCGACTAC GATCTGTTCG TGCGGCTGCG CGCGTCGAGC
CACGCCATCG ACGGCGTGCT GCAATTCAGC GCGGATCTCT TCGATCGTTC GACGGCCGAA
GCGATCGCCG CATACTACGT CGAGCTGCTG CACACGCTGC TCGCGCATCC GTCGCTGCCG
CTCGCGAGCC TCGCGCCGCC CGCCGAGCTC GCGCTCGAAC GGACGATCGC CGACGCGATG
CCGCCGCTGC GCATCGAGAT CGCGTCGACG TTCACCGACC GCCCGTTAGC CGGCACGCTG
CGCTACTGGG GCACCGCGAC CGGCCAGCCG ATCGAGCCGA ATTTCGCGCC GTACGGACAA
CTGTTCCAGA CGCTCTACGA TCCGTCCACG CCGTTCCATG CGAATCGTCA CGGAACGAAC
GTCGTGCTGG TCAGGCCGTA CGACTGGCTG CGCTTCGACG ACGCGGACGC CGCCCGCGCC
GACCTCACGG GCGACGCCGG CGCGGCGGCC GCCGAACGCA TCGCGCTGTA CGCCGACGAA
CTCGCCGACG CGCTGCGCGA CGCGGCGCCG TCGCTCGCGG TGCCCGTGCT CGTGCTGGTG
CTGCCGGACG ATGCCGCGTC GCTCGCGGCG CGTGACGAAC ACACGGGTAC GGCAACCGAA
GCGCCCGCCG AGGCGCTCGC CGACGCACGC GCCGGCAAGC CCTCGCCCGA CACGTCGCTC
GCCCCTTACC GCATGCTCCG CGCCGCGCTC GCGGATCTGC CGTCGATAAC GGTCGCGCAC
TGGCGCGATG TCGCCGCGAT CTACCCGGTC GCCGACGTGT TCGATCCGCA TGCGGACGCG
GCCGGCCACG TGCCGTTCAC GAGCGAGTAC TACGCGGCGC TCGCGAGCTA CATCGCGCGC
ACCGCGTTCC AGCACGCATC GGTGCCGCTC GACGACGCCT GGAACCGGCT CGCCGCGCAG
ATCCGCGACG ACGCCGAGCA CCTGCTCGCC GCGCCGGCCG ACGGCGCGCG CGCGCGCCGC
GCGCCGCACG CCGCGCCGAC GAACGAAACG CAGGCGACGC TCCTGCCGAT CTTCGCGGCT
GCGCTGAAGC TCGACGATCC CGGCATCGAC GACAACTTCT TCGACTGCGG CGGCCACTCG
ATCCTCGCGA TCGGCGTCGT CCATCAGATC AACGAAGCAT TCGGCACGTC GCTGTCGGTC
GCGGACATCT TCATGGCGCC GACCGTGCGC CGCCTCGCCG AGCGCATGCG CGACGCGCCG
GACGGCCCCG AGTATGTCGA GCTCGCGAGC GCGGCCGCGC TGCCCGACGA CATCGCGCCG
CTGCCCGGCC CAGTGGCCGA CGCGCCGCGC GCGCTGTTGC TCACGGGCGC GACGGGCTTT
GTCGGCCGCC ATCTGCTGCG CGAGCTGATC GATCGCACCA GCGCGACGAT CTACTGCCTC
GTGCGCGCGC CGGACGCCGC GCAGGGCCTC GCGCGGATCC GCGCGACGCT CGAGCGCTGG
TCGCTGTGGC GCGACGGCGA CGCCGCGCGC GTGATCGCGG TGCCGGGCGA TCTCGGCCGC
CCTCGCATCG GCCTGTCGGA TGCCGCGCGC GCGCGGCTCG TCGCCGAAGT CGACGCGATC
TATCACAACG GCACCAGCAT GAACCATCTC GAATCGTTCG AGATGGCGCG CGCGGCGAAC
GTCGGCGGCG TGATCGAGCT GCTGCGGATC GCCACCGAAG GCCGGCCGAA GACGTTCAAC
TACGTGTCGA CGCTCGCGGT GTTCAGCATG CGCGAGCGCA CCGGCACGCA CGTATTCGAC
GAATCCGCGC CGATCGACGG CGAGCGGCAT CCGTCCGACC AGGGCTACAC GACGAGCAAG
TGGGTGGGCG AGCAGCTCAC GCATCTCGCG GCCGCGCGCG GCGTGCCGTG CAACGTGTTC
CGCCTCGGCC TCGTGACGGG CGACGTGCGC CACGGTCACT ACGACGAACT TCAGGCGTAC
TACCGGCTGC TGAAGAGCTG CATCCTGATG GGCGCCGCGT TCGACGATTT CCGCTACGAC
CTCGTGATCA CGCCCGTCGA TTACGTCGCA CGTGCGCTCG CGCATCTCGG CGCGCGGCAT
TCGCAAGGCG GCCGGGTGTT CCATCTGTCG ACGATGCAGG TCACGCCGAT GCGCACCGTG
TTCGAGATGA TGAACGCGCA TCTGCGCACG CCGATGCGCA TGCTCACACA CCGCGCGTGG
ATCGACGAGC TGCGCGTGCG CTACCGGCGC GGCGACGTGC AATCGATCGT GCCCGTCGTG
CAATGGATGA TGAACATGAG CGATGCGGAG CTCGTGAAGC TCGCGCGCGA GCGCGAGGAA
ACGACCTTCA TCTACGACTG CACGGCGACG CACCGCGAGC TCGAGCAAGC CGGCATCGTC
GTGCCCGTGT TCGACGATGC GCTGCTGCAG CGGTATCTGC GCGGCATGTT CAACGACGAC
GCGGACCTGC GCGCGCTCGC CGCCCAGCTC GACGGCGGCG AGTGCGCTTC TCCCCTTCAC
TCCCACACGT GA
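
The gene sequence above is listed in space-separated blocks of 10 bases, 60 per line. The following is a minimal sketch of how such a listing could be joined into one string and its GC content computed; the helper names are hypothetical, and the example uses only the first line of the listing rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGACCGCCT CCCCTCCCTC CAGCGCACTC GTCACGGCCG TCGAAGCGGC CGTCCTGTCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the complete listing to `clean_sequence` and then to `gc_percent` would give a value to compare against the GC content field of this record.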
 
Protein sequence
MTASPPSSAL VTAVEAAVLS LAGDVAGRAF DASAAARPLH ALGFDSVQYV ELSGCLNEYY 
GLDLAPTLFF DVHVPRRIAE HLVARHPAAL ARKHGIGAGD DADTAARARA AAAENGAPQP
DMRAGAARPA GEPLLDTHAS PGEPRGDAHE NPCDDTRGAA AADAHESAAD IAIVGMAGIF
PQSADLDAFW RHLAAGDDLI AEAPASRWDW RAGDGEPASR WGGFIPRIEY FDAAFFGISP
REAEQMDPQQ RLLMQTAWAA LEDAAVRPSD LMGSDAAVFV GVSTSDYMAL LPGADGHLAV
GNAHAMLPNR LSHLLGAHGP SEAVDTACSS SLVALHRAVR ALRRGESSVA IVGGVNVMLT
TRLHRALAAA GMLSPDGRCK TFDAAANGYV RGEGIAALVL MPLERARANG HPVRAVIKGS
AVNHGGRAAF LTAPDINAQA ALIEAAYRDA GVDPATVSYI EAHGTGTSLG DPIEVQALRQ
GLDACARDLA GTASHAPARC GLGSVKTNIG HLEAAAGLAG VVKVVLAMDR RMLPPSLHCR
ELNPYLKLDG SRYHVVTEPT PWPDEATPTP LRAGVSSFGF GGSNAHVVLQ SAHARPIARA
SAPPPPHTNE QAGADAPAAD GPRAWFIPLS ARTDAALHAR AAQLAHWLDT EPADDAWLPA
LAKTLSIGRE PMACRFGITC ASLDELRAQL AIALGGRATS LARDDARLRP HAPACAAWLA
GETDPLPAAW DDATPRLRLP VYPFEGERHW PTEAAPAARF ALAPDADGAY RIAIEPDAPL
VADHRLAGEP VLAAAAQIVI AWRAFEADAL AGDAGQAGDV GESMESMESN GSSASKPAAT
SADSDTAADS RDPHDSYHSH DSRHTIDTNA TNAANATTPI ALRDIEWLAP IAIGAPTDLC
ITLARDAHGD IDARRGEAAH RRANGRAARF AIAAAPAIDT PLGRGHATRI ASAPSDAPEL
DIEAIRARCT QAVSADACYD AFAAIGIDYG PTFRPLRAIA VGRDEALAEF DASALARTTG
DARIVALIDG AFQAIAGLTL AHAASLESGL LPASLARIEF TGPLADSVRA WIREAPSDTG
RRTFDIDLVT ASGRSCASLR GLALASGRSA TSREAPRITT PGDHLFAPQW LPCATNAAGA
ATPSPRAGAL AIMGGTPAQR AALGATHAAA PRLIDDIAEL DANVSHLVWL PSAPADTHAP
LAQCASLDGL RLVKRLLALG AGDRAFDLTV LTVRSWTMPG DAPAFPAHAD LAGLCGALAN
EYPHWRVRLI DLPDAAALPA DWHARSAEGG HPLLLHRHGQ WFARGLVPLA ALPAPAALPY
RPGGVYVAIG GAGGLGRVWT EHAIRACGAQ VVWIGRRPLD AQIDAHCDAL AALGPRPSYL
SADASDAESL RAARDAVLER FGRLDGVVHT AIVLEDGGLA QLDEARFSAA LNAQVATTAN
LARVFGSDPL DFILFFSSLQ SAFVAAGQSN YAAGCTFRDA FADWLRTQLR CAVKVVNWGY
WGQTGVVATE PYRARMAALG IGSIEPAPAM AVVDALLASN VDQVGYLKTI ASAAVPTLAP
ALAARIAPRT RALAGTPPRV DATDDSAAWR DALAALERAI ARRLFAELGA LRVFGGSGAP
GGHAFDDGAA RNSAAGQRSA DDRAPDAAPF DIDTALRTGR VAPAYRRWLA HALTLIARHG
PLAWDGRSGR LAEAPPTPDA ARAEWARARA ELERTALLDA HLALVDATLD ALPAILQGSV
PATSILFPDG DLSRVEAVYQ RNEQADRCNR ALADAVLHLV GDASSAQPAA LAEIGAGTGG
TTVPLLAALD ARGARLGRYD FTDISKAFLL NAEQTFGRGR DMLRYRLFDV ERPIAEQALD
TGGYDIVIAT NVLHATQDIG VTLRNAKALL KAGGHLIINE LLGTHGFAHA TFGLLPGWWR
HRDSARRLPG SPLLSRDGWT RALREAGFAV LDGGSAGAAA GQGVIVALSD GVIVQPSHAD
ARAASCAASR AAPGDDAGAH ASAARPAASA CSTASPAHAP AASPIAAAPT GASLRARCVQ
ALAQLVARTL KMPVGKLAPD QPLGSYGVDS ILVIGLTKTL RETFGVALSN ATLFEHATLN
ALAEFFVAEH RAACERVLGN DAEPAPNAPN GPNAASAAAA MRPAMPPARA GAPSPAAASA
APKPRESNVC APPAADDTAV AVIGMSGRYA QADNLREFWA NLRAGRHCIT EVPAERWDWR
THFDAEKGAP GRTYSRWGGF LTQIDRFDAA FFRIAPNDAE QIDPQGRLFL EESWAAIEDA
GYTSDTLSAD RRVGVFVGVM NGDYPTGAQF WSIANRVSHA LDLHGPSLAV DTACSSSLTA
IHLALDSLRS GTCDCALAGG VNLIQSPKHL VGLASLTMLS AGDACRAFGA GADGFVDGEG
VGVLVLKPLS RALADGDAIH GIIRGSMINA GGKTHGLTVP NPRAQQAVVG AALARSGVPA
RAVGYIEAHG TGTALGDPIE LAGLTRAFAE ATDELGFCAL GSVKSNIGHC ESAAGVAGVT
KVLLQMKHRE LVPTLHAHEP NPDIDFARSP FVLQRTLAPW PQPALDGWPR IAGVSSFGAG
GANAHVVLEE FVETRAAAGG DDAGPAIVVL SAATDAALRR RARQLHAALA AGEIGDERLH
DLAYTLQIGR AAMVSRFGCV AGSAAELQAQ LAAFVEGDAS RGWHAHRLAG DRHGLAELDA
DPELRASLVE QCVAAGKLDR LAALWCQGLG IDWPTLHRGR ARRRMHLPTY PFDGPRYWLR
DDAAHAAEPA PADGAAEDAS ADAPNAANAP TPDVATLVRR TVAQVLGYPD VDMNESFLSL
GGDSIRAARA HRVLQRALDT RIPLSLMLEA STLAECAQAI DALLSTQPEP ASALACETNA
GAAGAPIADA AALESSAPPS RESASPPHPA SPPRDARPRV HPLSSNQQQF FFLDRLNPAN
PAFNLPGALR VRGEWHAHAL EATYQALIDT HDVLRTRFVV RGGEPCAEVA PHRAAAIRRH
DLTALLPKHQ AARVAECLTE SSREGFALEQ GEPSRLTVLE LRDDDHVILL NLHHIVGDAV
SVVVLLDALA RAALTGRAAA PDRARPQYAQ WAAHERDALP ATIERELPYW LERLRDVPPP
LPLPCDRARP PVPSYRGRSV PLAFAPALIT LLDAYCKAHG LSRFVVMLAA FKLALRVLSG
RDDVVVGSPY ANRAEDDTAD MIGSLAYALV LRTRLGEAQT FADAVALVRR TVHGAFDHLG
VPYPRLVEAL NPARHGGANP LYQIMFNVIP MPALPEGVEP VEVDSGWLDY DLFVRLRASS
HAIDGVLQFS ADLFDRSTAE AIAAYYVELL HTLLAHPSLP LASLAPPAEL ALERTIADAM
PPLRIEIAST FTDRPLAGTL RYWGTATGQP IEPNFAPYGQ LFQTLYDPST PFHANRHGTN
VVLVRPYDWL RFDDADAARA DLTGDAGAAA AERIALYADE LADALRDAAP SLAVPVLVLV
LPDDAASLAA RDEHTGTATE APAEALADAR AGKPSPDTSL APYRMLRAAL ADLPSITVAH
WRDVAAIYPV ADVFDPHADA AGHVPFTSEY YAALASYIAR TAFQHASVPL DDAWNRLAAQ
IRDDAEHLLA APADGARARR APHAAPTNET QATLLPIFAA ALKLDDPGID DNFFDCGGHS
ILAIGVVHQI NEAFGTSLSV ADIFMAPTVR RLAERMRDAP DGPEYVELAS AAALPDDIAP
LPGPVADAPR ALLLTGATGF VGRHLLRELI DRTSATIYCL VRAPDAAQGL ARIRATLERW
SLWRDGDAAR VIAVPGDLGR PRIGLSDAAR ARLVAEVDAI YHNGTSMNHL ESFEMARAAN
VGGVIELLRI ATEGRPKTFN YVSTLAVFSM RERTGTHVFD ESAPIDGERH PSDQGYTTSK
WVGEQLTHLA AARGVPCNVF RLGLVTGDVR HGHYDELQAY YRLLKSCILM GAAFDDFRYD
LVITPVDYVA RALAHLGARH SQGGRVFHLS TMQVTPMRTV FEMMNAHLRT PMRMLTHRAW
IDELRVRYRR GDVQSIVPVV QWMMNMSDAE LVKLAREREE TTFIYDCTAT HRELEQAGIV
VPVFDDALLQ RYLRGMFNDD ADLRALAAQL DGGECASPLH SHT
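
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch, assuming the codon assignments of translation table 11, which match the standard code for internal codons. It does not handle alternative start codons, which is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(blocked_dna: str) -> str:
    """Translate a blocked CDS listing; a trailing stop codon is dropped."""
    dna = "".join(blocked_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and the final 12 bases of the gene sequence listing.
print(translate_cds("ATGACCGCCT CCCCTCCCTC CAGCGCACTC"))  # MTASPPSSAL
print(translate_cds("TCCCACACGT GA"))                     # SHT
```

The two outputs match the first and last residues of the protein sequence listing.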