Gene BURPS1710b_A0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0001 
SymbolpksN 
ID3693926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp751 
End bp15375 
Gene Length14625 bp 
Protein Length4874 aa 
Translation table11 
GC content74% 
IMG OID637730255 
Productpolyketide synthase 
Protein accessionYP_335160 
Protein GI76817822 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGCTGC ACGTGCACCA CCTGGTGCTC GACGGGCAGT CGCTGCTGCT GCTGATCGGC 
ACGCTGCTCG ACGCATACCG CGCGCTCGTC GACGGCGTCG AGCCCGCGCC GCGCGCGCCG
GCCGCCACGC ACGACGATTT CGTCGCCGAG GAGCGCGCGC TTCTCGACAG CGACGAGGGC
GCGCGCCGCA TCGCGTACTG GCGGCGGCAG CTCGACGCGC TGCCGCCCGC GCTCGAGCTG
CCCGCGTCGG CGCCGGCCGC GGCCGAGCGC GCGGCGGGCG ACGCGTGGCA TGCGGTGCCG
CTCGACGCGG CGAGGTCGGC GCGCGTCGCG GCGTTCGTCC AGTCGAACCA TCTCGGCGCC
GCCGCGTTCT TCCTCGGCAT GTTCAAGCTG CTGCTGCATC GCTATACCGG CGAGCCCGAC
ATCGTCGTCG GCATGCCGGC CGACGCGCGG CCGTCGCAGC GTTACCGCGA CGCGCTCGGC
TTCTTCGTCA ACATGCTGCC GCTGCGCACG CGCTTGGCCG GCGAGACCGC CGTCGTGGCG
ATGCTCGAGC GCGTGCAGCG CGAGCTCGTC GACGCCATGG CGATGCAGTA TCCGTTCGGC
GCGCTGGTTC GCGAGCTCGG CCTGCAGGGC GCGGAGGACG GCGCGCCGAT GTACCGGATC
GCGTTCATGT ATCAGGATTT TCTCGCGCGC CTGCGCTTCA GCGACGACGT CGAACCGATC
GGCGAGATTC GTCAGGCGGG CGAGTATGAG CTCGTGCTCG AGGTGATCGA AGGAGCGGCG
CCCGGCGGCC CCGCGCGTTT CGCGCTGAAC TGGAAGTACG ACGGCGCGCG GTATCGCGCG
GCCGCCGTGG AGGCGATGGC GCGCCACTAT CTGACGCTGC TCGACGGCGT GCTCGCGGCG
CCCGCCGCGC GGGTGGCCGA TTGCCCGATG CTGCCCGCGG CCGAGCGCGA ACGGCTGCTC
GCGCTTGGCC GCGGCCCGCG CGCCGACCAT GCGCGCGAGC GGCGCGTGCA CGACCTGATC
GATGCGCGCG CGCAGCAGGC CCCGCACGCG ATCGCGGTGT CCTGCGGCGG CCGCTCGCTC
GACTATGCGC GACTGAAGGC CGACAGCGAC GCGCTCGCGC AGCGCCTGCG CGCGTGCGGC
ATCGGCGCGG GCGACTTCGT CGCGGTGCGG CTCGACCGCT CGACGGCGCT CGTCGTCGGC
CTGCTCGCGG TGCTGAAGGC GGGCGCGGCA TACGTGCCGC TCGATCCCGA CTATCCGGAC
GACTGGGCGG CGCAAATGCT CGGCGATTGC CGCCCGGCCG CGATCCTGAC CCGCGCCGCG
CTCGCGGCGG GCGCGCACGC GCTCGCGCGG CGCGTCGCGG CCGACGGCCC GCCCGCCGTC
ATCGCGCTCG ACGACGCCGC CGACGCCGAT ACCCACGCCG CCGACGGCGC ACGCGCGGCC
GCGATCGCCG CCGCGCGGCA GGCCGCGGCG AGCCGCGCGC ACGCCGCGCG GGCGGCCGAT
CTCGCCTACG TGATCTACAC GTCGGGCAGC ACGGGCGCGC CGAAGGGCGT GATGGTCACG
CATCGCGCGC TGACCAACTT CCTCGCGTCG ATGGCGCGCC GCCCGGGCCT GCACGCGCGC
GACACGCTGC TCGCCGTCAC CACGTACTGT TTCGATATCG CGGCCCTCGA GCTGTTCCTG
CCGCTCGTGC AGGGCGCGCA CTGCGTGATC TGCGACAGCG CGTCGGCGCG CGACGGCGGC
CGGCTGCGCG AGCTGATCGA CGCGGCGCGC CCGACGGTGA TGCAGGCGAC GCCGTCCACG
TGGGAGATGC TGCTGCATGC CGGCTGGCGC AACGCGCGGC GCATGCGCGT GCTGTGCGGC
GGCGACACGC TGCCGGACGC CGTCAAGGCC CGGCTGCTCG AGGACGGCGG CGAAGTCTGG
AACCTGTACG GCCCGACGGA GACGACGATC TGGTCGATGG TCGCGCCGGT GACGGCGGAA
CGGCCGACCT CGATCGGCGC GCCGATCGAC AACACGCGAA TCCGGATCGT CGATGCGTAC
GGCAATCCGG TGCCGATCGG CGTGCCGGGC GAGCTGTGCA TCGCGGGCGA CGGGCTCGCC
GCGGGCTACC TGAACCGGCC CGACGAAACG GCGGCGCGGT TCGTCGATGC GCTGCCCGAC
GTGGACGGCG AGGCGCGCGA GCGCCATTAC CGCACCGGCG ACCTCGCGCG CTGGCGCGAG
GACGGCGAAG TCGAGCATCT CGGGCGCATG GATTTCCAGG TGAAGATTCG CGGCCATCGC
GTCGAAGTGC ACGACATCGA GCGGCATCTC GCGCGGCATC CGGCGATCCG GGCGGCCGCG
GTGGTCGCGC GGCGGCACGC GGGCGGCGAT CAGCTCGTCG CCTACTACGT GCGCGGCGAC
GCCGCCGGGC ACGGCGGCGC GGACGACGCG CCGGCGCTGG CCGCCGAGCT GCGCGGCCAT
CTGGCCGGCG CGCTGCCGGA CTACATGATT CCCGCGCTGT TCCTGCCGAT CGACGCGCTG
CCGATGACGC ACAACGGCAA GCTGAACCGC AAGGCGCTCG CGAGCCGCGG CATCCGGCTG
CGCGTCGCGT CGTCGGGCGA GCGCCGCGCC GCGCCGCCGC GCGCGCCGGC CGCCGCCGAT
ATCGAGGCCC GCCTGCTCGC GATCTGCCGC GAGGTGCTGA AGATCGACGA CATCGATCGC
GCGGACGGCT TTTTCGAGGT GGGCGGCAAT TCGCTGTCGG TGGCGCTGAT CGCCTCGCGC
GTCGGCGCCG AGTTCGGCCT CGCGCGGCTC GGCGCCGGCG CGTTCTTCCG CTATCCGACG
GTCGCCGCGC TGGCCGCCCA TCTGGGCGCG CGGCTGCGCG GCGACGCGGG CGCGGCCGAG
GGCGCGGACG GCGCGGACGC CGGCCCGGCC GGCGCCGACG CGCGCGCATC CCGCCCCGCG
CAGCCGCGGG CGGCCGGGCC CGCGGCGCGA CTGCCCGCGG CGCTCGACGA CGCGATCGCG
ATCATCGGCA TCTCGTGCCA GTTTCCCGGC GCGCAAGACC ATCGCGCGTT CTGGCGCAAT
CTGCGCGACG GGAAATCGGG CGCGCGGTTC TATTCGGAAG ACGAACTGCG CGCGGCCGGC
GTGCCGGACA CGCTGATCCG CGACCGGCAC TACGTGCCGA TGCAGCAGAC GATCGAAGGC
AAGGACCTGT TCGACCGGCA CTTCTTCCGG CTGACGACGA AGGATGCGCA ACTGATGGAC
CCGCAATTCC GTCTGTTGCT GCAGCACGCG TGGAAGGCGA TCGAGGACGC CGGCTGCACG
CGCGAGCGGA TCGCCGACGC CGGCGTATAC ATGTCGGCGT CGAACAGCTA CTACCAGGCG
ATGCTGCGCG CGGCCGGCAC GATCGACGCG TCCGACGAGT ATCAGGCGTG GCTGCTCGCG
CAGGGCGGCA CGATTCCGAC GCGCATCTCG TACGAACTCG GCCTGACGGG GCCCAGCCTC
TTCATCCATT CGAACTGCTC GTCCGGGCTC GTGTCGCTTT CCGTCGCGGC GAAGTCGCTG
CTGCAGCGGG AAAGCCGCTG CGCGCTCGTC GGCGCGGCGA CGGTGCTGCC GGATGCGGAC
ATCGGCTACG TGTACCAGCC GGGGCTCAAC CTGTCGAGCG ACGGCCGCTG CCGGACCTTC
GACGAAAACG CCGACGGGCT CACCTCCGGC GAAGGCGTCG CCGTGCTGCT CGTCAAGCGC
GCGCGCGACG CGATCGACGA CGGCGACCCG ATCTACGCGC TGCTGCGCGG CATCGCCGTG
AACAACGACG GCGCGGACAA GGTCGGCTTC TACGCGCCGA GCGTCGGCGG CCAGGCCGAC
GTGATCCGCA AGGTGCTCGA TGCGACCGGC ATCCATCCCG AGACGATCGG CTACGTCGAG
GCGCACGGCA CCGGCACGAA GCTCGGCGAT CCGGTGGAGG TGGCGGCGCT CACCGACGCG
TATCGCCGCC ATACCGCGCG CACCGGATTC TGCGCGATCG GCTCGGTGAA GCCGAACATC
GGCCACCTGG ATACCGTCGC CGGGCTGTCG GGGTGCATCA AGGTCGCGCT GAGCCTGCGG
CACGGCGAGA TCGCGCCGTC GATCAACTAC GAGAAGCCGA ACCGCGAGAT CGATTTCGCG
CACTCGCCGT TCTACGTCGT CGACCGATTG ACGCGCTGGC CCGCGCGCGA GCCGGGGGCG
CCGCGCCGCG CGGCGCTCAG CTCGTTCGGC ATCGGCGGCA CCAACGCGCA TTTGATCCTC
GAGGCGTTCG AGCGCGACGA GCCGCCCGCC GGAATGCGCG CGCCGGCCGC CCGCGCGGCG
CGCGTGATCG CGCTGTCGGC GCGCACCGAA GAGCGCGTGC GCGCGCAGGC GAGCCAGTTG
CTCGCGTTCC TCGAGCAGGA AGCCGGCGCG CTGCCGGACT TCGACGGTTT CGCGTTCACG
CTGCAGGTGG GCCGCGAGGC GATGCGCGAG CGCGTCGCGT TCGTCGCCGA CGGCTACGAC
GCGCTCGCCG CCGCGCTCGC GCGTTTCCTG CGCGGCGAGC CGGACGCGGC CGCGTGTTTC
ACCGGCGCGC GCGGGGGCGA TTCGACGCTC GCGGCGTTGC TCGACGATAC CGGCGATACC
GGCGATACGG CCGCGCACGG GTTGATCGCC GCGTGGTGCG AGCAAGGCAA GGTCGCGAAG
ATCGCGGCGC TCTGGGCGCA CGGTGTGAAC GTCGATTGGC GCCGGCTTTA CGGCGCGCGC
GCGCCGGTGC GTGTGAGTCT GCCCACCTAT CCGTTCGCGC CGGAGCGTTG CGAGGGCGTC
GCGCGCCGCC GCGCCGCCGC GCCGGCGCCG CGCCGCGCGG GCGTCGAGAC GGCCGCGGCG
CGGCTGCATC CGCTCGTTCA CGACGACCGC TCGGACGGGG CGCGCCGGCG GTTCGCCGCG
ACGTATTCCG GCGAGGAATT CTTCCTGGCC GATCATCTGA TCCGCGGCAA GCGGATCCTG
CCCGGCGTCG CGTATCTCGA GATGGCGCGC ATGGCGGCCG TCCGGGCGCA TGGCGACGGC
GCGCTGAGCC TGCACGACGT GGTCTGGATG ACGCCCATCG TCGTCGACGG GCCGTGCGAG
GTCGAGCTGA GCCTGGAGGC CGCCGAGCGT GTCGAGGCCG AGGGGGCCGC CGAGGCGGCC
GCCGGCGTGC GCACGATGCG GTTCAACGTG ACCTCGGGCG GCGGCGCCGG CGCGCGCCGC
ACGAACAGCC AGGGGACGAT TCGCCTCGCG CCCGGCGCGG CCGCGCCCGC CGCCGCGCGC
GTCGATGTCG CGGCGCTCCT CGCGCGCTGC ACGCGCGAGA TCGGGGCGCA GCGGTTCTAC
ACGTTCCTCG ACAGCGGCGG CGGCCATTAC GGGCCGACGT TCCGGAGCGT CGCGGCGCTG
CATCAGGGCG AGCGCGAGGT GCTCGCGCGG CTCGCGCTGC CGGAGTCCGT CGCGCACGCC
GATGCGTTCG TGCTGCATCC GAGCATGATG GACGCCGCGT TCCAGATCGC CGACAGCCTG
ATCCTGCAAC CGCGCGCGAA CGGCGGCTGT CTGCCGTTCT TCGTGAAGGA ACTCGTCGTG
CGACGCCGGC CGGGCCGCGA CGCGTGGGTC CACGTTCGCC TCGCCGGCGG CGATGCGACG
CTTGCCCGCT ACGACATCGA TCTGATCGAC CCCGACGGCA CCGTCTGCGT GTCGATGCGC
GAATTCAGCG CACGCGCGGA GACGGCGGGC GGCAGCGGCC GGCCGAACAC GTACCGCGCC
GCCGAATGGC GCGCCGCGGA GTGCGACGGC GAGCGCGACG GGAACGAGCT GAGCGAGCTG
AACGAGCTGA ACGAGGGGAA CGAGCGGCGT CGCGCCGCGC CGCGGGTGGC GGTGCTCGAC
GCATCGCCGC GTCTTGCGCA CGCGCTGCGC GGCATCGGCG TCGACGCGCT CTGGCTGCCG
GCCGACGCGG CGCACGCGGC GCGCGGGCCG GCGCTGCGGG ATCTCGACGC GGCGCTGCAC
GCCGGCGCGG CGCGCGATCT GCTCGTGCTC GCCGACGAAC GGCGCGAGCT CGACGACGAC
GCGTTGCGCG CGTGGCTCGA CGGCGCGCCG CACGCCGGGG GCGCGCGGCG GGCGCTCGTG
TCGATCGCGG GGCTGGCCGA CGCCGACGCG CGCGCCGTGG CGGACATCGT CGAGCGCGAG
CGGCATGGCC GCGCCGCCGA CGTCCGCTAC GACGCCGGCG GCGCGCGCAG CGTGCGCGGC
TTCGCCGACG CGGCCGTCGC GCGCTGGCTG CTCGACACGG ACGCGCTGCG CTCGGGCGGC
GTGTACTGGA TCGCCGGCGC GAACGGCCCG CTCGGCGCGA GCCTTGCCTG CCACCTCGCG
ACCGTGGAGC GCGCGACCGT GGTGCTGACC GACGCGCACG CGATCGATGC GGCCCGGCTC
GCCTGCCTCG ACGGGTATCG CGCCGGCGGC GCGCGCCTCG AGTTCATCGA AGGCGACGCC
GCGCGAGACG GCGCGGCGCT CGCGCAGCGG ATCCGCGCGC GTCACGGGCG CATCGACGGC
GTGCTGCACT GTGCGCAACA CGCGTCGGCG CCGACGCTCG CGGCGCTGGC CGCGCTCGAC
CGCGCGACGC GCGCCGACGC GCTCGATTGC TTCGTCGCCT GCGAGGCGCG GGATGCCGAT
CCGGATCACG ATCCGGCGGC CGCGCTCGTG GCGCGATTCG TCGAGCGGCG CCACGCGCGC
GTGCAAGCGG GACTCGGCGG GGGGCGGACG GTGGCGATCG CGGCGCACGC GGCGCTGCCG
TGGCCGGACG ACGCGCCGCT GCTGCGCGCG GGCGGTATTG CGAGCCAGCC CGCGCTGGCG
ATCGTGCAGG CGCTGCATCA TGCGTTGCGC TCGGACGAGG CGATGCTCGC CGTCGGCTGG
GGGGCGTCGG CCGGCGGCGT CGATGCGGGC GCGTCGAAGG CGGCGAACGC ATCGAACGCA
TCGAACGCAT CGAACGCATC GAACGCATCG AACGCGTCGA ACGCGTCGAA CGCGTCGAAC
GCATCGGACG CATCGAACGC ATCAAACGCG TCGAACACAT CGAACACATC GAACACATCG
AACGCGCCGA GCGTCGCCGC GGATCTTGCC GCGCCCGCCG AGCCGAACGC GCGGATTCCC
GCGCGCGCGC GGGCCACGCC GGACACGATC GCGGCATGCC TGAAGGCTGT GATCGCCGAC
GTGATCCGGG CGGACGTCGA CGAAATCGAC GCGCGCCAGC ACTTCGGCGA ATACGGGCTC
GACTCGCTGT CGCTGACGTC GGTCAGCAAC CGGCTCAATG ACGCATACCG CCTCGATGCT
TCGCCGGCGG GCGCGCTGAA TCCGACGCTG TTCTTCGAAT ACCCGAGCGT CGAGCGGATG
GCGGCTTATC TCGCCGAGCA TCACGCCGCG CGCTTCGCCG ACGCGTCGGC CGCACCCGGC
GCCGACGGGG CGGCCGAGTG CGCGCCGCGG CCCGAGGCCG CGCTCAACGC CGAGGTCGAA
CCTGGGAACG GGGCCGCGCC CGCGCCCGAG CCCGAGGTCG GGTTCAGGGC CGGGTTCGAG
CCGGTCGCGC CGCCGATGCC GCGCATCGAG CCCGCCGCAT CGACGCCGCC GGACCAACCG
GCGCCGCAAC CCGGCGGTGC ATGGCACGCC GGGCGCGGCG CGCGCCCGGC GGCCGACGAC
GACGTCGCGA TCATCGGCAT CAGCGGCCGC TTTCCCGGCG CGCGCGACGT GGCCGAATTC
GGCCGCAATC TGTTCGACGG CCGGGACTGC ATCGGCGAGA TTCCCGCGGA CCGCTGGGAC
TGGCGCGCGT ACCTCGGCGA TCCGCAGCAC GAGGCCGGCA AGACGAACAG CAAGTGGGGC
GGCTTCATCG ACGGCATCGC GGAATTCGAT CCGCTGTTCT TCAGCCTCTC GCCGAAGGAG
GCCTATCTGC TCGATCCCGC GCACCGGCTG CTGCTGATGC ACGCGTGGTG GGCGATCGAG
GACGCCGGCT ACAACCCCGC CGCGCTCGCC GGCAGCCGGA CCGCGCTGTT CGCGGGCATC
GCGCAGAGCG GCTACGCGGA TTTGCGCAGG CAAGCCGGCG AGGGGATCGA GGGCAACTCC
TTCCTCGGGG TCGTGCCGTC GATCGCGCTG AACCGGATCA GCCACCTGCT CGATCTGCAC
GGCCCGAGCG AGCCGGTCGA GACGGCCTGT TCGTCGTCGC TCGTCGCGAT GCACCGCGCG
CTCGTCAGCC TGCGCTGCGG CGACGCCGAC ATGGCGCTCG TCGGCGGCGT GCAGACGATC
CTGTCGCCGC ACGCGCATAT CGGGTTCGGC AAGGCGGGCA TGCTCGCGAC CGACGGCCGC
TGCAAGGCGT TCTCGAGCCG CGCCGACGGC TTCGTGCGCG GCGAGGGCAT CGGCATGCTG
TTCCTGAAGC GGCTCGGCGA CGCGCGGCGC GACGGCGACG CGATCTACGG CGTGATCCGC
GGCAGCGCGG TCAATCACGA CGGCCGGTCG AGCTCGCTCA CCGCGCCGAA CCCGGCCGCG
CAGCGCGACG TGATCGTGCA GGCGCACATG CGAGCCGGCG TCGACCCGCG CAGCATCGGT
TACATCGAGG CGCACGGCAC CGGCACGAAG CTCGGCGATC CGATCGAGAT CAACGCGCTC
ACGCAGGCGC TCGACACGCT GCTGCGCGCG CAGCGCGAGG AAGGCGCCGC CTACGTTCCC
GGCGCGTGCG CGATCGGCTC CGTGAAGAGC AACATCGGCC ATCTGGAGCT GGCCGCCGGC
GTGTCCGGCG TGATCAAGGT GCTGCTGCAG ATGGCGAACG GGCGGCTCGC GAAGAGCCTG
CATTGCGACG AGCTCAATCC GTACATCACG CTCGACGGCG GGCCGTTGCG CGTCGTCGGC
GCGAACGCCG CGTGGCCGCG TCCCGTCGAT CGCGACGGCC GCGAGCAGCC GCGCCGCGCG
GGCGTGAGCT CGTTCGGCAT CGGCGGCGTG AACGCGCACG TCGTGCTCGA GGAGTATCCC
GAGGCCGACG CGCGCGCGCG CGACGACGGG CAGCCGGCCG CCGTGCTGCT GTCCGCGCGG
GATTCGCAGC GGCTCGCCGA TTACGCGAGC GCATTGCTCG CGTTCGTGCG CGAGCGGCGC
GAGGCGGCCG CGCATGCGCC GCCGCCGCGG CTGTCGGATC TCGCCTATAC GCTGCAGGTG
GGCCGCGAGG CGATGCGCGA GCGTGTCGGC TTCGTCGTCA CGTCGCTCGC GCAACTCGAG
GCGCGGCTTG CCGCGTTCGT CGCGGGCGAG CCGGCGGGCG ACGGCGTCTA CCGCGGCAGC
GTCCGCCCGG CGCGCGGCGA ACGCGCGGCC GACGCGGACG GCCTCGACAG GCTCGTCGAC
ATCTGGCTCG CGAGCCGCAA GCATGAGGCG CTGCTCGGTG CGTGGGTGAA GGGCGCGGCG
ATCGACTGGG CGAGACTTCA CGCGGGCGGC GCGCCGCGCC GCGTCCATCT GCCCGGCTAT
CCGTTCGCGC GCGAGCGCTA CTGGATCGCC GAGCCCGCGC CGGCGACCGG CGCGCCCGGC
GAGCCCGCGC CGCCGCGCAT GCCGACGCAG CCGCACGGGC CGACGTCCGA CGGCCGCGCC
GAATCGCGCC ATCCGTTGCG GCGCGACGCC GCCGACGGCC GGTTCCTGCT CGATCTCGAC
GGCGACGAGG CCTTTCTCGC CGACCATCGG GTGGACGGAC GCCGCGTGCT GCCGGGCGTC
GCGCACCTGG AGATCGCGTA CGAGGCCGCG CGGCGCACGT TCGGCCCGGC CGATGCGATC
CGGATCCGGA ACCTCGGCTG GATCAGGCCG ATCGTCGCCG ACGGCGCGCT GCGCATCGGC
GTCGAACTGA GCGTGACCGG CGCCGCCGAA GGCGCGTTCC GCCTCTACAC GACGGACCCG
CAACATGGGC GGCTCACGCA CAGCGAAGGC GCGATCGGCC GCGCCGACGT CGCGCAGTCG
GCACGCGCGC TCGATCTCGG CGCGCTGCGC GACGCGTTCG CGACGGCCGA GCGCGTCGAT
CCGGCCGTCT GGTACGACGG CTTCTCGCGC GCCGGCATCG ATTACGGCCC GAGCCACCGC
TGCCTCGAAA CATGCGCCGT CGGCCCGGCC GGCGTGCTCG CGCGGGTGCG CCTGCCGGCC
GCCGAGGCGC GCGCGGCGCG GCCGTTCACC TTGCATCCGG GCCTGATGGA CGCGGTGCTG
CAGGCGGCGA TCGGCCTGCG CAAGCGCGCG GGCGGCGCGC CGCGCGGCAC GCCGTATCTG
CCGTTCGCGC TCGACACGGT CGAGATTCTC GGCGGCTGCG GCGAGGCGGC GTGGGCATGG
CTGCGCCCGT CGCCGCGCGA CGCGGCCGAC GCTTCGGCGT CGCGCGGCGA CGCGGGCAAG
CCGGCCGCCG AGCGCATCGA TATCGATGTG TGCGACGACG CGGGCCGGAT CAGCGTGACG
CTTCGCGGGC TCACGTCGCG CCCGCTCGCG CGCCGGACGG CGCCGGCTCC CGAGGCCGGG
AACCCGGCCG GTGAAGTCGG CGAGGTGGCC GACGCCACCG ATGCCGACGC CGCTGAAGTC
CGCGAAATCT CCGACGTCTC CAACGTCTCC AACGTCTCCG ACGTCTCCGA CGTCTCCGAC
GTCTCCGACG TCGCGCCGCT CGCCGACGGC GACGTCGGCC TGCTCGCGCG AACCGCGGTG
TGGAGCGCGC TGACGCCGGC GCAGTGGCTC GCGGATCCGG CGTCGCGCCC GCGCGCCGGC
GCGCGCGTGT TCGTGCTCGG CGGCACCGCC GCGCAGCGGC GCGAGATCGC GCGGATTCAT
CCCGGCTGCG AACCGCTTGA GGCGAATGCG GCCGACGACG GCGGCGACGG CGCGGACCAA
CAGGCGCACG TCGACGCGCT GCGGCGGCGG CTCGCCGAGG GCGCGCCGAT CGACCAGCTC
GTCTGGATCG CTCCGCCGGA GCCGGCCGCC GACGCGCGCG CCGGGCTGCG CGGCGACGCG
ATCGTCGCCG CGCAGGAGCA CGGGGTGCTG CAACTGTTCC GGATCGTCAA GCTGCTGCTC
GCGGCGGGCT ACGGCGGCAA GCCGCTCGAC TGGACGATCG TCACGCGCGA AACGCACGCG
ACGAGCGGCG TCGACGAGCC GTCGCCGACG CACGCGGGCG TGCATGGGTT CGTCGGCTCG
ATGGCGAAGG AGTACCGGAA CTGGCGTGTC CGCCTGCTCG ACATGCCCGC GCGCGAGGCG
TGGCCGATCG ACGCGATGTT CTCGACGCGC TTCGATCCGC GCGGCGATGC GCTCGCCTAT
CGGCGCGGCC GCTGGCTCGC CCGCGAGCTG GCCGCGATCG ACGCGTTGCC CGACGGCGGC
TGTCATGTGA AGACGGGCGG CGTCTACGTG GTGATCGGCG GCGCGGGCGG GATCGGCGAA
GTCTGGAGCC GCTGGATGAT GGAGCGCTAT CAGGCGCGGA TCGTCTGGAT CGGGCGCCGC
GACGAGGACG AGCAAATCCG CCGCAAGCGC GAGCGGCTCG CGCGCTACGG CACGCCGCCC
GTCTACCTGC GCGCGGACGC GAGCGAGCGC GCGTCGCTCG CGGCGGCGCG CGAGCGGATC
GCCGCGCTGC GCTGGGACGG CCGCGCGCTG CCGACGAGCG GCGTCGTGCA TTCCGCGATC
GTGCTGGCGG ATGCGAGCCT CGCGACGATG GACGAGGCGC GCTTTCTGGC CGCGTGGCGA
TCGAAGGCGG ATGTCGGCGT GCGCGTCGCC GAGGTCTTCG GCGGCGATCC GCTCGATTTC
ATGCTGTTCT TCTCGTCGAT CACGTCGTTC GGCAAGACGG CCGGACAGGC GAACTACGCG
GCGGGTTGCG CGTTCAAGGA CGCGTTCGCC GCGCATCTCG GCCGCACGCT GCCGTATCCC
GTCAAGGTGA TGAACTGGGG CTACTGGGGC AGCGTCGGCG TGGTCAGCGA CGAAACCTAT
CGCCGGCGCA TGGCGAGCGC GGGCTTCGGC TCGATCGAGC CCGACGAGGG CATGTCGGCG
CTGGAGCGGC TGCTCGCCAG CCGCGTCGGC CAGATCGCGG TGCTCAAGAC GCTGCGGCCG
AACCTCGTCG GCGACTCGCG CGCGGACCGG ATCCGGCATT ACCCCGGCCG CGACTGGCCG
GACGCGGCGC CCGCGCCGGC GACGGCCGCG CTGCAGGCGG CGCTCGCGGC GCGCGCCGGG
CGCTGGCACG CGCAGGCGTC GGCGCTCGCG CTCGGCAATC CCGAGCTGGA GACGCTGATC
GCGCGCGGCC TGCTCGCGGG CGTCCTTCCG TATCTCGACG CGCCGGGCTC GGTCGACGCG
CGCCATGCGC GGTGGTTCGA CGAAAGCCGG GCGATGCTGC ACGGGTTCGG CTATCTCGCG
CGCGACGGCG CGGGCGACGC GCCTTCCTGG TCGCTCACCG ACGCCGGTCG CGCGGCGGCG
CCGCACGTCT GGCAAGACTG GGAGCGGCAC GCGCTCGCGT GGCACGACGA CGAGCGGCGC
GTGCCGATGC GGCTCGCGCA CGTCTGCCTG CGCGCGCTGC CCGAGCTTCT CGGCGGCAAG
CGGCGCGCGA CCGACGTGAT GTTCCCGGGC TCCAGCATGG CGCTCGTCGA GGGGCTGTAC
AAGAGCAATC GCAAGGCCGA TCTGTTCAAC GACGTCGTGC ACGACGCGGT GCTGTCGTAT
GCGCGCGCGC TCGGGCGCGC GCTCGACATC GTCGAGGTGG GCGCGGGCAC GGGCGGAACG
ACGGACGGCC TGCTGCGCAA GCTCGTCGAG CAAGGGATCG CGGTGCGCGA ATACCGGTAT
ACGGATCTGT CGCACGCTTT TCTGCTGCAT GCGCGCGAGC ATTACGCGCC GCGCGCGCCG
TTCCTGACGA CCGGGATCTT CGACGTCGAC AAGCCGATCG CCGCGCAGCG CGTGCCGGGC
GGCCGCTATG ACGTCGCGGT CGCGACCAAC GTGCTGCACG CGACGCGCGA CGTCCGGCGC
GCGCTGCGCA ACGTGAAGGC GACGCTGCGC GCGGGCGGCC TGCTGATCCT GAACGAACTG
AGCGTCAAGT CGCTGTTCAG CCATGTGACG TTCGGGCTGC TGGACGGCTG GTGGATGTAC
GAGGACGCCG ATTTGCGGAT ACCCGGCTCG CCCGGCATCG ATTCGTCGAC GTGGCGGCGC
GTGCTGGCGG AAGAGGGCTT CGAGTATGTG TTCTTCCCCG CGCAAGGGCT GCATGCACAC
GGCCAGCAAG TCATCGTCGC GCAGAGCGAC GGCGTGGTCC GGCAGCCGCG CGCGGCCGCC
GCGCCGGGGG CCGGCGCGGC CGCGTCGCCT TCGGGCGGCA CGCAAGCGGC GGTGCCGGCG
CGCCGGGCGG CCGCGGCATC CGGCGCGCCG CGCGTGGAGG CGATTCCGCC GGCGGCCGTT
GCGCCCGCGG CCTTCGATGC CGCCACCGCG GCTCCTCCCG GCACCGCTGC CGCTGCCGCG
ACGGCGGTGC CGGCGGACGG CCGATCCGCG CTCGCCCACG CAAGTTCGCC GGTCGCCTCG
CCGCCGCAGC CGGGCGACGC GCCCGCGCTC GAACGAATGC ATGCGTATTT GCGCGACAAG
CTCTCGCAAG TGCTGAAGCT GCCGCCGGAG CGCATCGAGC CAGACGCATC GTTCGCGAGC
TACGGCGTCG ATTCGATCAT GGCGATGGCG TTGATCGCGG CGCTCGAAAA GGAGCTGGGC
AGCTTGCCGA AGACGCTGTT CTTCGAGCAC GAAACGATCG AGGAACTGGG CGCGTATCTG
CTGGAGCGTT GCGAGCCGAT GCCTTCGGGC GTGGAGCCGG CGACGGTGGG GGCGGACGAT
CGCGCCGCGT ATTCCGGCGC GAGGCCGCAC GCCTGGCCCG CGTCGCCCAC GGAGCCCGAC
GAGCCCACCG AGCCCACCGC ATCGCCCGCC TCATCCGCCT CGCCGGTCGC CTCGCCGCCG
CAGCCGGGCG ACGCGCCCGC GCTCGAACGA ATGCATGCGT ATTTGCGCGA CAAGCTCTCG
CAAGTGCTGA AGCTGCCGCC GGAGCGCATC GAGACGGACG CATCGTTCGC GAGCTACGGC
GTCGATTCGA TCATGGCGAT GGCGTTGATC ACGGCGCTCG AAAAGGAACT GGGCAGCCTG
CCGAAGACGC TGTTCTTCGA GCACGAAACG ATCGAGGAAC TGGGCGCGTA TCTGCTGGAG
CGTTGCGAGC CGATGCCTTC GGGCGTCGAG CCGGCGACGG TGGGGGCGGA CGATCGCGCC
GCGTATTCCG GCGCGAGGCC GCACGCCTGG CCCGCGTCGC CCACGGAGCC CGACGAGCCC
ACCGTGCCCA CCGAGCCCAC CGCATCGCCC GCCTCATCCG CCCCGCCGGC CGCCTCGCCG
CCGCAGCCGG GCGACGCGCA CGCGCCCGAA CGAATGCATG CGTATCTGCG CGACAAGCTC
TCGCAAGTGC TGAAGCTGCC GCCGGAGCGC ATCGAGACGG ACGCATCGTT CGCGAGCTAC
GGCGTCGATT CGATCATGGC GATGGCGTTG ATCACGGCGC TCGAAAAGGA GCTGGGCAGC
CTGCCGAAGA CGCTGTTCTT CGAGCACGAA ACGATCGAGG AACTGGGCGA GTACCTGCTG
GAGCGGCAAG GACAAGAGAG GGCGTGCCAT GCAAGCAACG TTTAA
 
Protein sequence
MSLHVHHLVL DGQSLLLLIG TLLDAYRALV DGVEPAPRAP AATHDDFVAE ERALLDSDEG 
ARRIAYWRRQ LDALPPALEL PASAPAAAER AAGDAWHAVP LDAARSARVA AFVQSNHLGA
AAFFLGMFKL LLHRYTGEPD IVVGMPADAR PSQRYRDALG FFVNMLPLRT RLAGETAVVA
MLERVQRELV DAMAMQYPFG ALVRELGLQG AEDGAPMYRI AFMYQDFLAR LRFSDDVEPI
GEIRQAGEYE LVLEVIEGAA PGGPARFALN WKYDGARYRA AAVEAMARHY LTLLDGVLAA
PAARVADCPM LPAAERERLL ALGRGPRADH ARERRVHDLI DARAQQAPHA IAVSCGGRSL
DYARLKADSD ALAQRLRACG IGAGDFVAVR LDRSTALVVG LLAVLKAGAA YVPLDPDYPD
DWAAQMLGDC RPAAILTRAA LAAGAHALAR RVAADGPPAV IALDDAADAD THAADGARAA
AIAAARQAAA SRAHAARAAD LAYVIYTSGS TGAPKGVMVT HRALTNFLAS MARRPGLHAR
DTLLAVTTYC FDIAALELFL PLVQGAHCVI CDSASARDGG RLRELIDAAR PTVMQATPST
WEMLLHAGWR NARRMRVLCG GDTLPDAVKA RLLEDGGEVW NLYGPTETTI WSMVAPVTAE
RPTSIGAPID NTRIRIVDAY GNPVPIGVPG ELCIAGDGLA AGYLNRPDET AARFVDALPD
VDGEARERHY RTGDLARWRE DGEVEHLGRM DFQVKIRGHR VEVHDIERHL ARHPAIRAAA
VVARRHAGGD QLVAYYVRGD AAGHGGADDA PALAAELRGH LAGALPDYMI PALFLPIDAL
PMTHNGKLNR KALASRGIRL RVASSGERRA APPRAPAAAD IEARLLAICR EVLKIDDIDR
ADGFFEVGGN SLSVALIASR VGAEFGLARL GAGAFFRYPT VAALAAHLGA RLRGDAGAAE
GADGADAGPA GADARASRPA QPRAAGPAAR LPAALDDAIA IIGISCQFPG AQDHRAFWRN
LRDGKSGARF YSEDELRAAG VPDTLIRDRH YVPMQQTIEG KDLFDRHFFR LTTKDAQLMD
PQFRLLLQHA WKAIEDAGCT RERIADAGVY MSASNSYYQA MLRAAGTIDA SDEYQAWLLA
QGGTIPTRIS YELGLTGPSL FIHSNCSSGL VSLSVAAKSL LQRESRCALV GAATVLPDAD
IGYVYQPGLN LSSDGRCRTF DENADGLTSG EGVAVLLVKR ARDAIDDGDP IYALLRGIAV
NNDGADKVGF YAPSVGGQAD VIRKVLDATG IHPETIGYVE AHGTGTKLGD PVEVAALTDA
YRRHTARTGF CAIGSVKPNI GHLDTVAGLS GCIKVALSLR HGEIAPSINY EKPNREIDFA
HSPFYVVDRL TRWPAREPGA PRRAALSSFG IGGTNAHLIL EAFERDEPPA GMRAPAARAA
RVIALSARTE ERVRAQASQL LAFLEQEAGA LPDFDGFAFT LQVGREAMRE RVAFVADGYD
ALAAALARFL RGEPDAAACF TGARGGDSTL AALLDDTGDT GDTAAHGLIA AWCEQGKVAK
IAALWAHGVN VDWRRLYGAR APVRVSLPTY PFAPERCEGV ARRRAAAPAP RRAGVETAAA
RLHPLVHDDR SDGARRRFAA TYSGEEFFLA DHLIRGKRIL PGVAYLEMAR MAAVRAHGDG
ALSLHDVVWM TPIVVDGPCE VELSLEAAER VEAEGAAEAA AGVRTMRFNV TSGGGAGARR
TNSQGTIRLA PGAAAPAAAR VDVAALLARC TREIGAQRFY TFLDSGGGHY GPTFRSVAAL
HQGEREVLAR LALPESVAHA DAFVLHPSMM DAAFQIADSL ILQPRANGGC LPFFVKELVV
RRRPGRDAWV HVRLAGGDAT LARYDIDLID PDGTVCVSMR EFSARAETAG GSGRPNTYRA
AEWRAAECDG ERDGNELSEL NELNEGNERR RAAPRVAVLD ASPRLAHALR GIGVDALWLP
ADAAHAARGP ALRDLDAALH AGAARDLLVL ADERRELDDD ALRAWLDGAP HAGGARRALV
SIAGLADADA RAVADIVERE RHGRAADVRY DAGGARSVRG FADAAVARWL LDTDALRSGG
VYWIAGANGP LGASLACHLA TVERATVVLT DAHAIDAARL ACLDGYRAGG ARLEFIEGDA
ARDGAALAQR IRARHGRIDG VLHCAQHASA PTLAALAALD RATRADALDC FVACEARDAD
PDHDPAAALV ARFVERRHAR VQAGLGGGRT VAIAAHAALP WPDDAPLLRA GGIASQPALA
IVQALHHALR SDEAMLAVGW GASAGGVDAG ASKAANASNA SNASNASNAS NASNASNASN
ASDASNASNA SNTSNTSNTS NAPSVAADLA APAEPNARIP ARARATPDTI AACLKAVIAD
VIRADVDEID ARQHFGEYGL DSLSLTSVSN RLNDAYRLDA SPAGALNPTL FFEYPSVERM
AAYLAEHHAA RFADASAAPG ADGAAECAPR PEAALNAEVE PGNGAAPAPE PEVGFRAGFE
PVAPPMPRIE PAASTPPDQP APQPGGAWHA GRGARPAADD DVAIIGISGR FPGARDVAEF
GRNLFDGRDC IGEIPADRWD WRAYLGDPQH EAGKTNSKWG GFIDGIAEFD PLFFSLSPKE
AYLLDPAHRL LLMHAWWAIE DAGYNPAALA GSRTALFAGI AQSGYADLRR QAGEGIEGNS
FLGVVPSIAL NRISHLLDLH GPSEPVETAC SSSLVAMHRA LVSLRCGDAD MALVGGVQTI
LSPHAHIGFG KAGMLATDGR CKAFSSRADG FVRGEGIGML FLKRLGDARR DGDAIYGVIR
GSAVNHDGRS SSLTAPNPAA QRDVIVQAHM RAGVDPRSIG YIEAHGTGTK LGDPIEINAL
TQALDTLLRA QREEGAAYVP GACAIGSVKS NIGHLELAAG VSGVIKVLLQ MANGRLAKSL
HCDELNPYIT LDGGPLRVVG ANAAWPRPVD RDGREQPRRA GVSSFGIGGV NAHVVLEEYP
EADARARDDG QPAAVLLSAR DSQRLADYAS ALLAFVRERR EAAAHAPPPR LSDLAYTLQV
GREAMRERVG FVVTSLAQLE ARLAAFVAGE PAGDGVYRGS VRPARGERAA DADGLDRLVD
IWLASRKHEA LLGAWVKGAA IDWARLHAGG APRRVHLPGY PFARERYWIA EPAPATGAPG
EPAPPRMPTQ PHGPTSDGRA ESRHPLRRDA ADGRFLLDLD GDEAFLADHR VDGRRVLPGV
AHLEIAYEAA RRTFGPADAI RIRNLGWIRP IVADGALRIG VELSVTGAAE GAFRLYTTDP
QHGRLTHSEG AIGRADVAQS ARALDLGALR DAFATAERVD PAVWYDGFSR AGIDYGPSHR
CLETCAVGPA GVLARVRLPA AEARAARPFT LHPGLMDAVL QAAIGLRKRA GGAPRGTPYL
PFALDTVEIL GGCGEAAWAW LRPSPRDAAD ASASRGDAGK PAAERIDIDV CDDAGRISVT
LRGLTSRPLA RRTAPAPEAG NPAGEVGEVA DATDADAAEV REISDVSNVS NVSDVSDVSD
VSDVAPLADG DVGLLARTAV WSALTPAQWL ADPASRPRAG ARVFVLGGTA AQRREIARIH
PGCEPLEANA ADDGGDGADQ QAHVDALRRR LAEGAPIDQL VWIAPPEPAA DARAGLRGDA
IVAAQEHGVL QLFRIVKLLL AAGYGGKPLD WTIVTRETHA TSGVDEPSPT HAGVHGFVGS
MAKEYRNWRV RLLDMPAREA WPIDAMFSTR FDPRGDALAY RRGRWLAREL AAIDALPDGG
CHVKTGGVYV VIGGAGGIGE VWSRWMMERY QARIVWIGRR DEDEQIRRKR ERLARYGTPP
VYLRADASER ASLAAARERI AALRWDGRAL PTSGVVHSAI VLADASLATM DEARFLAAWR
SKADVGVRVA EVFGGDPLDF MLFFSSITSF GKTAGQANYA AGCAFKDAFA AHLGRTLPYP
VKVMNWGYWG SVGVVSDETY RRRMASAGFG SIEPDEGMSA LERLLASRVG QIAVLKTLRP
NLVGDSRADR IRHYPGRDWP DAAPAPATAA LQAALAARAG RWHAQASALA LGNPELETLI
ARGLLAGVLP YLDAPGSVDA RHARWFDESR AMLHGFGYLA RDGAGDAPSW SLTDAGRAAA
PHVWQDWERH ALAWHDDERR VPMRLAHVCL RALPELLGGK RRATDVMFPG SSMALVEGLY
KSNRKADLFN DVVHDAVLSY ARALGRALDI VEVGAGTGGT TDGLLRKLVE QGIAVREYRY
TDLSHAFLLH AREHYAPRAP FLTTGIFDVD KPIAAQRVPG GRYDVAVATN VLHATRDVRR
ALRNVKATLR AGGLLILNEL SVKSLFSHVT FGLLDGWWMY EDADLRIPGS PGIDSSTWRR
VLAEEGFEYV FFPAQGLHAH GQQVIVAQSD GVVRQPRAAA APGAGAAASP SGGTQAAVPA
RRAAAASGAP RVEAIPPAAV APAAFDAATA APPGTAAAAA TAVPADGRSA LAHASSPVAS
PPQPGDAPAL ERMHAYLRDK LSQVLKLPPE RIEPDASFAS YGVDSIMAMA LIAALEKELG
SLPKTLFFEH ETIEELGAYL LERCEPMPSG VEPATVGADD RAAYSGARPH AWPASPTEPD
EPTEPTASPA SSASPVASPP QPGDAPALER MHAYLRDKLS QVLKLPPERI ETDASFASYG
VDSIMAMALI TALEKELGSL PKTLFFEHET IEELGAYLLE RCEPMPSGVE PATVGADDRA
AYSGARPHAW PASPTEPDEP TVPTEPTASP ASSAPPAASP PQPGDAHAPE RMHAYLRDKL
SQVLKLPPER IETDASFASY GVDSIMAMAL ITALEKELGS LPKTLFFEHE TIEELGEYLL
ERQGQERACH ASNV