Gene BURPS1710b_A2618 details

Gene Information

Locus tag: BURPS1710b_A2618
Symbol: onnB
ID: 3694476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 3150670
End bp: 3164289
Gene Length: 13620 bp
Protein Length: 4539 aa
Translation table: 11
GC content: 73%
IMG OID: 637732872
Product: OnnB
Protein accession: YP_337767
Protein GI: 76819270
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
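
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3150670
end_bp = 3164289
gene_length_bp = 13620
protein_length_aa = 4539

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the reported gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks agree with the reported values.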


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGATC GCGATCGAGA CGAATCCGGA ACGAGAGGGC TCTGGAGCAT GAACGGATTG 
AAAGAACGTG TGGGATCGGG GTATGTCGCC ATTCCCATCC TTGTCGCCTT GCGCGGGGCG
CGGGCGCCCG GAGCGGCCGG CGATCGCGAT GCCGCGTGGA CGGAGCGGAT CGCCGCCGCG
GCGCGGCTCG ACGCCGACGG CGCGGCGGCG TTGCGCGAAT GGCTGAACGA TGCGGCGCCG
GCCGCAGGCG CGGACGATGC GCGCCTCGCG CGCGGCGACG CGATTCCCAC GGACATTCTC
GAACCCTACC GGTTCGAGCC CGATGCGCGC GCGTTCTTCG ACGACGCCTT TCGCGAATGT
CTGAACCGCT GGAGCCGCCG CCTGCTCGAG CAGGACGGCG CGACGTGGTT GAGCGACGTC
GTGCTCGTGC CGTTTCTCGT TGCCGCCGCG CGTGCGCGCG AGTCGGCGCC GTCGTGCGAG
GCGCGCTCCA TCCGTGCGCT CGCGGATGCG TTCGGCGCGG ATTTGCGCCG CCTGCTCGAC
GCGTCGCGCG TGCTCGACGA CGGCGATATC GGCGATATCG GCGCGATGCG CGCGTCGCTC
GAATGCGTGA CGTCGTTTCG CGACGGGCTG CTGCACGCGG ATCGCCGTCT GTCGAACGGT
GCGCCGGCCG CGTGCCGGCC GCAGGTTTGC GCCGCCGCGT ACGAGGATCT CGAGGCGGCG
ATCCTGCGGC GCTTCGATGC GAGCGCGAAC GCGCGAGCCG GCAGGCGGCC GAGATATCTG
GCGTTCGTCG GCGGCGAGGA TCGCGATCTG CCGGAGCGGC TCGTCGAGCG GATCGTGCGG
CAGGGCGAGC CGCGCAACGG CGGCGGCACC GGCGTCGAGC CGCCCCGTAT CGTGATCCCG
CGTGCGGACG CCGCACCGGC GGACGATTCC GCGCAACCGG ATCCGGCGGC CGCGTTCGGC
GTCGACGCGA TCGACGATGT GCTGCCTGTC TGCGTGCTGT CCGGCGTGGA CGACGCGGCC
GCGATCCGGC GGCTGCGCGC GTTGCTCGTT CGCGGCGCGC ACGCGGCGGT CGTGCTCGCG
TGCCACGGCT TGCCGGCGGC GAGCGCGGCT CTGCCCGACG CATGGGCCGC CGCCCGGCCG
GCGCGCATGC TGCGGCTGCT CGGCTTCGGC GCCGCGCGCG ACGCGCAGGC GTTTCTCGTC
GACATGGCGG CCGACGGCCT GTTCGCGCGC ACGCCGCCGA TCGGCTATCC GCGCGCATCG
CGCGCGCGAT GCGCGACGCT CGGCGAGTTC GAGGCGCGCG ACTATCGGGT GCGCCCCGCG
GCCCCGCACG ATCTGCCCGC GCTGCAAGCG CTCGAGCTCG CGTGCTGGCC CGCCGCGCTG
CGCATGCCGG AGGCCACGCT CGCCGCGCGC GTCGGTCGGC ATGCCGCCGG CCAGTTCGTG
CTCGAACTCG ATGCGCGCGA CGCGGCCGCC GGGCCGCGCC GGCTGGCCGG CGTGATCTAT
TCGCAGCGCA TCGCGTCGGT GCGGGCGTTG GACGGCGTCG ACGCGGATAC GGTCGACCGC
CTGCACGAAG ACGGCGGGCC GGTGATCCAG TTGCTCGCCG TCAATGTCGA TCCGGCGTGC
CAGAGCCGCC GGCTGGGCGA CCAGTTGCTC GAATTCATGT TGCAGCGCTG CGCGGCGCTC
GCGGACGTCG AATCGCTCGT CGCGGTCACG CTTTGCCGCG ATTTCCACAA GCACGCGTCG
ATGCCGATCG ACGACTATCT GCGGCTGCGC AACGCTTTCG GCTTTCTTGC CGATCCGATC
CTGCGGTTTC ACGAACTGCA CGGCGCGCGC ATCGAACGGC CGATGCCGGG CTACCGGCCG
CGCGACGTGC GCAATGCGGG GTTCGGCGTG CTCGTGTCCT ACGATCTGGC GCGCCGGGCC
CGCAACGAGG CGGGGGCGCC GGCCGCCTCG GACGCGCCGG CCGCGCCGGA CGGCGCGCCC
GCGCGCGCCG ACGGCGGGCA CGCGGGCGCC GCCGCGCCGC CGCGCCCCGA GCGGGACGCG
GCGACGGCCG CGGACCCGGA CGCGCTGGAT GCTTTCATCG AGGCCGAAAT CCGCCGCATC
GTGGGCGGCG GCGCCGAGCT CGCCTACGCC CGCGATCTGC CGCTGATGCA GCTCGGGCTC
GACTCGGTCG GCCTGCTCGA ACTCGCGGAG GCGCTCGCGC TGCGGTGCGG CGTCGCGCTG
CCGGCGACGT TCCTGTTCCA GCACAACACG CCGGCGCGCA TCGTCGCGTA TTTCGACGCG
AGCCGGCACG CGCCGCCGCG GGCGGGATGC GACACAGGCT GCGAGACGGC CGGCGCCGCC
GCTGCCGCCG CACGGCCGTC GTCGCGCGGA TGCCCGGCGA CGGAGCCCGG CGCCGCGCGC
GAGCCCGCGC AGGCGGCGCC GTTTGCGCCG GACGGCATCG CGATCGTCGG CATCGCGTGC
CGGCTGCCCG GCGGGCTCGA CACGCCGGAA GCGTTCTGGG ACGCGCTGAA GGCGGGCGCC
TGCGTGGTCG GCGAACTGCC CGGTGATCGC TGGACGTGGC CCGCGGATAT CGATCCCGGC
GCGCGCCATC GCGGAATCGA TCGCGGCGGC TTTCTCGACG ACATCCGGTC GTTCGACGCG
GGCCTGTTCC GGCTGTCCCC GAAAGAAGTC GCGACGATGG ACCCGCAGCA GCGGATTCTG
CTCGAGCTCG CCTGGGAGGC GATCGAGCGC GCCGGGCATT GCGCGGACGC CGTCGCCGGC
AGCCGCACCG GCGTGTATGT CGGCGCGAGC GGATCGGACT ACCGCCTGCT GCTCGAACGG
GCGGGCACGG GCGTCGACGC GCATGTCGCG ACCGGCGCGT CGATGGCCGT GATCGCGAAC
CGGATCTCGT ATACGTACGA TCTTCGCGGC CCGAGCATCC AGGTCGATAC CGCATGCTCG
AGCTCGCTCG TCGCGCTGCA TCAGGCGGTG CAGGCGCTGC GCGCCGGCGA GTGCGATCAG
GCGCTCGTCG GCGGCGTCAA CGTGATCTGC CATCCGGGCA ACACGATCGC GTACTACAAG
GCCGGCATGC TGTCGCCGCA GGGCCGCTGC AAGACGCTCG ACGATGCCGC CGACGGTTAC
GTGCGCTCGG AAGGCGCGGT GATGCTGATG CTCCGGCGTC TCGAGCAGGC CGTCGCGGAC
GGCGATCCCA TCCATGCGGT GATTCGCGGC AGCGCCTGCA ACCACGGCGG CCTGACGGGC
GGGTTGACCG TGCCGCATCC CGACCGGCAG GCGGACCTGC TGCGCGCCGC ATGGGCCGCG
GCGCGCGTGT CGGCCGACGA CATCGGCTAT CTCGAGATGC ACGGCACCGG CACGCGGCTC
GGCGATCCGA TCGAGGTGCG CGGCTTGGCC GATGCGTTCG GCGCGCGCGA CGACGCGGCG
GCCCGCGGCA CGTGCGGAAT CGGCTCGGTG AAGAGCAATC TCGGGCATCT GGAGGCGGCG
GCCGGGCTGG CCGGCGTGCT CAAGACGGTG CTCGCGCTGA AGCACCGCGA GGTGCCGGCG
ACGATCCATT TCTCGCGGCT GAACGCGCAG ATCAGCCTTG CGCGCACGCC GTTCGCGGTC
GTCGACACGC ATCGCGCGTG GCCGGCGCGC GGCGGCGCGC GCCGGCTCGC GGGCGTGAGC
AGCTTCGGCT CCGGCGGCGC CAACGCCCAC GTGGTGCTCG AGGAATATCC GTCCGAAGCG
CCGCCGCGCG CGGCGGCGGG TGACGCGCTG TTCGTGCTCT CCGCGCACAG CCGCGAGCAG
CTCGCCGAAT ACGCGCGGCG CGTGCTCGCC TACTGCGAAC GCCGGCTACA GTCGGGCGAT
GACGCGCCGG CCGCCGCCGC CGTCGCGCAT GCGTTGCAGC GCCGCCAGGC GATGGCGTGG
CGGCTCGCGT TCGTCGCCGC GTCGCTCGAG GAAGCGGTGC GGCGCCTGCG CGCGTTCGCG
GCGGGCGCCG CGCAGCCCGG CACGTTCGTC GGCAGCGGCG CGCCGAAGAC GTCCGTCGCG
GATTTCGTCA ATCAGAACCC CGACGTGCAG CAGGTCGTGT CCGCATGGCT GCGCGAGCGG
CAGCTCGCGA AGCTCGCGCG CTACTGGGCG GACGGCGTGC GGATCGCGAA CTGGAGCGCG
CTGTACGACG CGCGGCCCGC GTGCGTGCCG CTGCCGTGCT ATCCGTTCGC GCGCGAGCGC
CACTGGATCG CGGCGCGGCC CGCCGAGGCG AGCGAGGCGA CCGCCGCCGC CGCGCCGCCG
GCCGCGCCCG ACGACGCCTA CCGCGCCGCG CCGCGGCTCG AGGCCGAATC GCGCGACGCC
GCCGGCGCGT CGTTCCGCGT GCGGCTCGTC GGCGACGCGA GTTTCCTCAC CGACCATCGC
CTGCGTGGGC GGAAGATCCT GCCGGGCGTC GTCCACTTCG AACTGGCGCA TGCCGCATGG
GCGGCGCTCG CGCGCGCGGA CGCGCCCGCG ATCGAATTCC GCGACCTCGC GTGGACCCAT
CCCGTCGATG TCGCGACGCC CGAGCGCGTG CTCGGCGTGC GGCTGCGGCG CGTCGCGGCC
CGGCCCGGCG CCCACGCGTA CGAAGCGTAC TCGCCGCCCG ATGCGGCGGG CGGCGGCGAG
CGCGTGCATG CGCGCGGCAC CGTGCTCGAC GTGGCCGGGC CGCCCGAGCC GGCGCTCGAT
CTGGACGCCT TGCGCGCGCG GTTCGACGGC GAGCCCGGCG CGCACGAGCG CGACGCGCAC
GACTGTCATC GCGCGTTCGA GCGGATGGGC TTCGGCTACG GGCCCGCGCA TCGGGGCCTG
CGCGGCCTGC GCTGGCGCGG CGGCGCACGC GGCGCGACGG AAGTGCTCGC GCACATCGTG
CTGCCCGATT GCATCGCGGA CGCGCGCGAG CGCCACCGGC TGCCGCCCGG CTTGCTCGAC
GCGGCGGTGC AGGCTGCGAT GGCGGCCGCG ATCGGCCGCG ACGCGCTCGG CGCGTCGCCC
GCCTGCGTGC CGTTCTCGCT CGATCGCCTC GTCTGCGCCG GGTCTTGCCC GGCGCGCGCG
TGGGTCTGGG CGCGTCGGCG CGACGGCGCC CGGGCGGCGC TCGCGCCCGT CGATCTCGAT
GTCTGCGACG ACCGCGGCCG CGTCTGGGCG AGCTTTCGCG CGCTCACGTT CCGGCCGTTG
CGCGAGGCGG CCGGCCGCGA TGTCGCGGCG CCGCGGCTGT TCCGGCCGCG GTGGGCCGCG
CGGCCGCTTT CCGGCGAGCG CGCGAGCGCC GGCGCGGCCG ATGTCGCGCA CTGGCTGGTG
CTCTGCGGTT TCGACGAAGG CGCGGCGCTG CGGCGCGATC TCGCGACGCT GCGGGCGCGC
TTGCCGGACA CGTCGATCGT CGCGATCGAT TCCGATGCGG CGACGCTCGA AAGCCGGTTC
GCGGACAGCG CGGGGCAACT GTTCGAGCTG TTTCGCGAGC TCGCGCTTAG CGGCGCGACG
CCGCGCGCGG CGGTGCAGGT GCTGGTGCCG GCGGACGGGC CCGAGCGCGC GTTCGCCGCG
CTGTCGGCGC TCGTGCGCGG CGCGCGCCTC GAGCATCCGT CGTGGTCGGT GCAACTGCTG
TCGCTCGAGC GCGCGACGCA GGCCGCCGAC GCGGCGGCGC GCGCGCTGGA GAATCGCGGC
GACGATGCGG ACTTCGTGCG CTACCGGCGC GCGCGCCGCG AGTCGCTCGT CTTCGAGGCG
CTGCCGGCAG GTCGCGACGA ACCGCCGCGC CCGTGGAAGG AGGCGGGCGT CTACCTGATC
ACGGGGGGCG CGGGCGGGCT GGGGCTCGCG TTCGCGAAGG ACATCGCGGC TCACGTGCGC
CGGGCGACGA TCGTGCTCGC CGGCCGCGCG GCGGCGCCGG ACGCGGCGCT GTCGGCGTCG
CTCGGCGCGA TCGCGCGGCC CGCCGGCGTC GATATTCGCT ACCGGTCCGT CGATGTCGGC
GACGCGCCGG CCGTCGCGGC GCTCGTCGGC GATCTGCTGC GCGAGCACGG CCGCTTGTCG
GGCGTGATTC ACGCGGCCGG CGTCACGCGC GACGCGCTGC TGGTGCGCAA GCCGCGCGCC
GAGTTCGACG CGGTGCTGCG GCCGAAGGTG GCCGGCGCGG CCGCGCTGTA TGCCGCCACG
CAGGACATCG ACCTCGATTT TCTCGTGTCG TTTTCGTCGA TCGTCGGGGT GACCGGCAAC
CTCGGCCAGA CGGATTACGG CGCCGCCAAC GCGTTTCTCG ACGCGCTTGC CGGCTTGCGC
GCGCAGATGG GCGGCGAGCG CCGCGCGCGC GTGCTGTCGA TCGCGTGGCC GCTGTGGCGC
GACGGCGGCA TGGGGCGCGA GCCCGAGGTC GCGGCGCATT TCGAGCGAGC CTTCGCGCTC
GCGCCGATGG ACACGGCGGC GGGCATCGAC GCGTTCCACC GGGCGCTCGC GTCGGACGCG
GCGTACGTCG TCGTGGCGAA CGGCGGCGCC GACTGGACGC CCGAGCGCGC GATCGCGCGT
GCGCTGTCGG CGCGTGCGCG GCCCGGCGCG CCGGAGCCCG CCGAGCGCGC GCATGGGCCG
GCGCGCGAGG CGGCAGCGGC GGCGTGTGAG GCGGTTGCGG TGTCGGCTTC GTCATTGTCG
TCGCCGTCGC CGTCGTCGGC ATCGGCATCG GCATCGGTAT CGGCTGAATG CGCGCCGACC
GACGACGGCG CGGCGCGGCG GCACGCGGTC GTCGCGTATC TGACGCGGCA GATCGGCGCG
GTGCTCGGCC ATGCGCCGGA CAGCCTCGAC ATCGATGCGC CGTTCACGAG CTACGGGATG
GATTCGATTC TCGCGCTCGA CACGACGCGT GCGATCGAGA CGGATCTCGG CAGTCTGTCG
AAGACGCTCT TCTTCGAGCA CGAAAACGTC CGGCAGTTGA GCGCCTACCT GCTCGACGAG
CACGCGGACC GGCTCGCCGC CTGCGCATGG TTCGCGCAGG CCGGCGAGCC CGCGCGTGCG
GCGGGGCCGG CGCTGGCGGC GGGCGGCGAG CCGACGCGCG AGACGGCCGC GCCTTCGGCC
GCGGCCGACA CCACGGCCGA CACCACGGCC GGCATCGCGA TCGAAACCTC GACCGAAACC
CCGACCGCAG CGACAACCCA GGCCGGCGCG GCCCCCGAGG GCCGCTATCG GCGCGTCGCG
AAGGCCGCGC TGCCCGCCGA CGGGCAACTG GCCGCGGCCG TGGCCGCGAT CGGCGGCGCG
GCCGCGACGA AGGGCGTCGC GCTCTTCGAG ATCTGGCCGG AGCTGTTCGT CGATTCGGCC
GGCCACGGCT ACTGTCATCT GCTCGTCGAC GGCGGCGTGC TGTTCGCCGC GCAGCACGCG
GGCGACGCGC GCCATCGCGC GGCGTTGTTC GCCGCGCTGC TCGCGTACTG CGATCGGCAC
GGCTATGCAT TCGGCTATCT GGATCTGTCC GAGGGCCGCA AGCCGGACCT CGAAGCGCAA
TGCGGCCTGC TCGCCGCGCC GGTCGGCGTC GTGCAGATCG TCGAGGCGAT CGCGTCGTTC
TCGCTCGCGG GCGGCCGGAT GCGCCGGCTG CGCTACATGG TCGAACGGTT CCGCAAGGCG
GGCGCATGCC GGGTTGTCGA GTATCGCGCG CCGGACCCGG ACGTCGCGCG CGAGATCCGG
CGCGTCATCT GCGCGTGGAG CGACGCGAAG AAGGTCGTCA ACAACGTCGA CATCGTGCTC
GGCGAGATGG CCTCCGGCAG CCTGCACGAG CGCTATCGCG TGTTCCTGAC CTACCTCGAC
GACGTGCTCC AGAACGTGAT CATGATCGCG CGGGACGGCG ACGGCTACCT GATGGACCAG
GAATACTACG TCGCCGACAT GCCGCTCGGC GGCACCGAGT ACGCGGTGAC GGAGATCCTC
GCCGCGCTCG CCGCGGAGGG CCGCGAGCGG TTCAGCCTCG GCCTGACCTG GGGCCTCTTC
GACACCGGCG AAGGCTCGAG CGATCCGGCG GCGGACGCGT TCCTCGCGTC CACGCAGACT
CAACTGCGGC GTATCTTCGA GCGCGGCGCG GCGAACCGGC AGTACAAGAG CAAGTACGGC
ACCCGCGATC ACGCGGTCTA CCTGTACCGC CGGCCCGGCA AGCCGGAGCC CGCGATCGTC
GGCTGCCTGA GCCAGTTCTA CCGGAAAGGC CTCACGCATC ACGAAGTACG GCGCCTCGCC
GGCCTGGCCG ATGCGCCGGC GCCCGCGCCC GCGCCGGTCG CCGCGCCCGC GACAGTGCCG
CAGGCCACGC CCGCACCCGC GCCGGTGGCT ACGCCCACTG CCGACGAACG CGCGTACGAC
GTGACCCGCA TCGATGCGGC GACGATCCGT GTCGATCTCG TCAGCGATTC ATGGGCGCAC
GTCGACTATC CGTTCATGCG CGCGCGCGCC GCGACGCTCG ATCGGCACGC GCCGCCCGCG
CGCGGCGGCG ATCCCGGCCG CGCGGTCGCC GAGCTGCTCG GATTCGCGCA ATGCCTGCTG
ACGACGTCGG GGCGCGCGGC CGAGCACCTG TTTTTCCGCG CGCGGCGCTC GGCGCGCACG
CGCGTGCCGC AGAACCTGTT GTTCGAATCG ACGCTGCACA ACCTCGTGAA AAGCGGTTTC
GAGCCCGTCG AGCTTCCCGA CGCGCGCGCG CTCGATCCCG ACTCGCGCGA CCTGTTCCGC
GGCGGCATCG ATCTCGCCGC GCTCGATCGC GAGCTGCAGG CGCACGCGGA CGCGACCGCG
ATGGTGATGC TGGAGCTCTG CAACAACGCG TCGGGCGGCT ATCCCGTCGC GCTCGCGCAG
ATTCGCGCGA TCGCCGCCGC GTGCCGGCGG CACGGCGTGC CGCTCGTGAT GGACGTCACG
CGGATCGTGA AGAACGCGGA GCTGATCCGG CGCGGCGAGG CCGGCTACGC GCAGCGCGGG
CTGTGGGAGA TCGTGCGCGA GATAGCGGAC CACGCCGATG CGGTCGTCGG CAGCCTGTGC
AAGGACTTCG GCCTGGGCGC GGGCGGCTTG CTCGCCGCGC GGGACGCGCG CGTCGTCGCG
AACGCGGCGG GCATCGCGCG CCTCGAGGGC GGGCTGCCCG GGCCCGCCGA GCTTCGCCGG
ATCGCCGCGG CGTTCGACGA TCGCGCGTAT CTGGAGCGGG AGATCGGGCG CCAGCTCGAT
TTCGCGCGCG ATCTGCACAT CGAACTCGAG CGATGCGCGG TGCCGGTCGT GCAGCCGGGT
GCCGGCCACT GCGTGCTCGT GCGCGTCGAT CAGCTCGCGC CGCCGGGCGG CAGCGCGCCT
TCGCGCGGCG CGTATCTGCG GCTGCTGGCC GAGCGCTACG GCGTTCGCGG CGGCTTGCAC
CTCGTCGGCA ATCTGCGCGA TAGTCATCTG AACGCGTGCG TGCGGCTCGC GCTGCCGCTC
GGCTTCGACG ATCCGCGCGG GCCGGGCGCG CTTGCCGCGG CACTCGCCGC GGCGCGGGAC
GGGCGCGATC ACGCGCTGGA CGACCTGATG CGTGCGCCGC GCGCGCGCGC GGCGCACGGC
GGGCGATGCG CGGACGGCAT CGCGATCATC GGTCTGTCCG GCCGCTACCC GGACGCGCCG
ACGCTCGACG CGTTCTGGCG CAATCTCGTG TCCGGGCGCC GTTCGATCTC GGAGATTCCG
GCCGAGCGCT GGGACTGGCG CGATCATTAC GAGCGCGATC CGGACACGGC CGTCGCGCAC
GGCAAGTCGT ACGGCAAGTG GGGCGGCTTC CTCGACGGCT TCAGCGCGTT CGATCCGCTG
TTCTTCCAGA TCGCGCCGCG CGAGGCCGAG TTCATCGATC CGCAGGAGCG TCTGTTTCTC
GAGGCCTGCT GGCACGCGCT GGAGGACGCC GGTTGTCCGC CGTCGGCACT CACGCGCGCG
CAGCGGGCGA AGGCCGGCGT GTTCGGCGGC ATGACGAAGC AGGGCTTCAA TCTGTACGGC
GCCGGAGGCG CGCAGCCGTA TCAGAGCACG TCGCTCGCCG CGCTCGTGAA CCGCGTGTCG
CACTGCTTCG ATTTCAACGG GCCCGCCGTC GCGTTCGACA GCCACTGCGC GTCGGCGCTC
GTCGCGATCC ACGAGGCGTG CCAGTATCTG CGCCGCGAGC CGGAGGGCAT CGCGATCGCC
GGCGCGGTCA ATCTGAACCT TCACCCGTCC AATTATCAGC AGCTCTCGAA GATGCAGGTG
CTGGCGAGCG GCGCCGAGAG CGCGTCGTTC GCGAGCGGCG GGCTCGGCTA CGTGCCGGGC
GAGGGCGTCG GCGCGGTCGT CCTGAAGGAT TACCGGCGCG CGCTCGAGGA CGGCGATCCG
ATCTACGGCG TGATACGCGG CAGCGCCGTC AACCAGAACG GCCGGATGAA CCGCTTCGGG
ATGCCGAGCC AGAAGCAGCA GGAGGCGGTG GTGCGGGCGG CGCTCGCGCA GGCCGGCGTC
GATCCGCGCA GCATCACTTA CGTCGAGGCG TCCGCGCACG GTTCGGCGGT GGGCGACGCG
ATCGAGATGG CCGCGCTCAC GCGCGTGTTC GGCGCGCGCG AGCGCGCCGA CGGCCGCTAC
CGGATCGGCT CGGTCAAGCC GAACATCGGG CACGGCGAGG CCGTCTCCGG CATGTCGCAA
CTGACGAAGG TGCTGCTGTC GTTGCGGCAC GGGCAACTGC CGCCGACGCT CGTGTGCGGC
GCGCCGAATC CCGACATCGA TTTCGACGCA TTGCCGTTCG AGCTGAATAC CTCGCTCACC
GACTGGGCGC GCGCGCGTGT CGATTCGGAG CGGGTGCCGC GCCGCGCGGG CATCACGTCG
ACGGGCGCGA GCGGGCTGAA CGCGCACCTC GTGCTCGAGG AGCACGCAGC GCCCGCCGTG
CCCGCGCAGG CCGGGCCGGG CGAAGCCGAC GCGCGGGCGC ACGTGTTCGT GCTGTCGGCG
CGGGATCGCG CGCGGCTCGA CGACTACGCG CGCGACTGGA TCGCGTTCCT GAACGACGAT
CCGCAACGGG ATCTGGCGGC GATCGCCTAC ACGCTGCAGG TCGGCCGGGA GCCGATGGCC
TGCCGGCTCG CCGTCGTCGC CGCCGATTGC CGGGATCTGG CCGGCAAGCT CGCGCGCTGG
CGCGAGGCGG CGCACGCCGA TTGCGACGAC GTGTTTCACG GCGAGGCGCG CGCGGCCGCC
GGCAAGCCGC ATCGCGAGGC CGCGCGGGAC GCGCGCGAGC CGCGCGACGT CGCGCGGGCG
TGGGTGGGCG GCGCCGTCGT CGACTGGGCG GCGCGGCATG CGGGTGCGCG GCCCGCGCGG
GTGGCGGGCC TGCCGGGCTA TCCGTTCGAG CGGCGCTCGT ATTGGCCGGG CGCGGCCGCG
GCGCCGGCAA CGGCGCGAGC GGCCGCGGCA AGCGACGCGT CGGAAGCGGC CCGAGCGCTT
GAAGCACGCG AGGCGCATGA GGCGACTCGG GCGACTCGGA TGACTCGAGC GGCCCGAGAA
TCCGAGGCAC GGCAAGCGCC TGAAGCGCCT TCAATGTCTG AAGCGACCGA AGCGACCGAA
GTGGCCGAAG CGCGAGACGC ACGGCCGGGC GCGGTCGCCG ACGACGCGGC CGCGCGGCTC
GAGGCGGCGT TCCTGCCGCG TTTCATCGAG CTGGTCGCGG ACGTGTTCCG GTTGCCCGCG
GGCGAGCTCG ATGCCGACCG GCCGCTCGAC GAATACGGCA TCAACTCGTT TCTGATCAAG
GTGCTGAACG TGCGTTTCGC GGACATCGTG GGGCGCGTGT CGAGCACGCT GCCGTTCGAA
TATCGGACGG CGGGCGAGAT GGCGCGCCAT TTCCTCACCG CGCATCGCGA CGCGTGCGCC
GCATGGGTCG CCTTCGACGG CGCGGCGTCG CCCGGCGCCG CTGATGCATC GTCCGCGCCG
CCGGTGCCGG CTGCGTCGGC ATCGGCGGCA TCGGCGACAC CGGCGACACC GGCGACACAG
GCGACACAGG CGACCGGGCC GACCGGCGAG CCGTCACGCG CGTCGGCCGG CGTGCCGTCG
GGCGCGTCGT CGAGCATCAA GCGGCCGGGG GCGACGTGGG ACGAGCCGAT CGCGATCGTC
GGCGTCAGCG GGCGCTACCC GCAGGCGCGC GATCTCGACG CGTTCTGGGA CAACCTGATG
CGCGGGCGCG ACAGCATCAC CGAGATTCCG CCGGAGCGCT GGCCGCTCGA CGGCTTCTAC
GACGAGGACC GGGAGCGCGC GATCGGCGCG AGCCGCAGCT ATGCGAAATG GGGCGGCTTC
ATCGACGGTT TCGCCGAGTT CGATCCGCAG TTCTTCAATC TGTCGCCGCG CGAGGCGAGC
AACATGGACC CTCAGGAGCG CATTTTCCTG CAGGCGTGCT GGGAAGCGCT CGAAGACGCG
GCGTACACGC GCGCGCGCAT CGCGCGCGAG CACGGCGGAC GGCTCGGGGT GTTCGCCGGC
ATCACGCGCG CCGAGTTCTG CTTGTACGGC GCGGGCAATC TGAAGCAGGG CAAGGCGCCG
TTCACGTCGT TCTGCTCGCT CGTGAACCGC GTGTCGTACT TTCTCGACGC GAACGGCCCG
AGCATCCCGA TCGACACGAT GTGCTCGTCA TCGCTCGTCG CCGTGCACGA GGCCTGCGAC
AAGCTGCGTC TCGGCGAGTG CGAGGTGGCG CTCGCGGGCG GCGTGAACCT GTCGCTGCAT
CCGTACATGT ACGTGAGCCT GAGCGCGCAG CGGATGCTGT CGTCCGACGG CCGCTGCAAG
AGTTTCGGCC TCGGCGGCAA CGGCTATGTG CCGGGCGAGG GCGTCGGCGT GATCGTGCTC
AAGCCGCTGT CGCGCGCGCT CGCGGACGGC GACCGCATTC ACGCGACGAT TCGCGCGACC
AGCATCAACC ACGGCGGCAA GACCAACGGC TACACGGTGC CGAACCCGAT CGCGCAGCAG
AACGTGATTC GCAGCGCGCT CGATCGCGCC GGCGTGCACG CGCGCGCGGT GAGCTATGTC
GAGGCGCACG GCACCGGCAC CGAGCTCGGC GATCCGATCG AGATCGCCGG GTTGTCGGGC
GCGTTCCGGC GCGATACGTC CGATCGCGGC TTCTGCGCGA TCGGCTCGGT CAAGTCGAAC
ATCGGCCATC TCGAGGCCGC CTCCGGCCTC GCGGGGCTCA CCAAGGTGCT GCTGCAGATG
AAGCACGGCC TGCTGGTGCC GAGCCTGCAC GCGAGCGAGC TCAATCCGAA CATCGACTTT
CCGGCCTCGC CGTTCGTCGT CAACCGCGAG ACGAGGGCCT GGGAGCGGCC CGTGATCGAC
GGCCGCGAAC ATCCGCGCAT CGCCGGCGTG TCCTCGTTCG GCGCGGGCGG CACGAACGCG
CACGTGATCC TCGAGGAGCC GCCCCGGCAG GCGTCGCCCG CGCGCGCGCC CACGCCGGCG
GGCGCGCCGG CGCTGATCGT GCTGTCCGCG AAAAAGCCGG AGCAACTGCG CCGCTACGCG
AGCGAGCTGC TCGCGCGCCT GCGCGACGCG GACTATCGCG CGCGCGTCGA CGCGGACGGG
CTCCGGTCGC TCGCATACAC GCTGCAAGTC GGACGCGAGG CGATGGACGA ACGCCTCGCC
GTCATCGCGG ATTCGGTTCA GGCGCTGGAG GGCAAGCTCC GGCAGTTCGT CGACGGCAAG
ACCGATATCC AGGATCTCCA CGTATCACGA GTCGGGCGGA GCGCTCACCA TGTCATTTGA
 
Protein sequence
MHDRDRDESG TRGLWSMNGL KERVGSGYVA IPILVALRGA RAPGAAGDRD AAWTERIAAA 
ARLDADGAAA LREWLNDAAP AAGADDARLA RGDAIPTDIL EPYRFEPDAR AFFDDAFREC
LNRWSRRLLE QDGATWLSDV VLVPFLVAAA RARESAPSCE ARSIRALADA FGADLRRLLD
ASRVLDDGDI GDIGAMRASL ECVTSFRDGL LHADRRLSNG APAACRPQVC AAAYEDLEAA
ILRRFDASAN ARAGRRPRYL AFVGGEDRDL PERLVERIVR QGEPRNGGGT GVEPPRIVIP
RADAAPADDS AQPDPAAAFG VDAIDDVLPV CVLSGVDDAA AIRRLRALLV RGAHAAVVLA
CHGLPAASAA LPDAWAAARP ARMLRLLGFG AARDAQAFLV DMAADGLFAR TPPIGYPRAS
RARCATLGEF EARDYRVRPA APHDLPALQA LELACWPAAL RMPEATLAAR VGRHAAGQFV
LELDARDAAA GPRRLAGVIY SQRIASVRAL DGVDADTVDR LHEDGGPVIQ LLAVNVDPAC
QSRRLGDQLL EFMLQRCAAL ADVESLVAVT LCRDFHKHAS MPIDDYLRLR NAFGFLADPI
LRFHELHGAR IERPMPGYRP RDVRNAGFGV LVSYDLARRA RNEAGAPAAS DAPAAPDGAP
ARADGGHAGA AAPPRPERDA ATAADPDALD AFIEAEIRRI VGGGAELAYA RDLPLMQLGL
DSVGLLELAE ALALRCGVAL PATFLFQHNT PARIVAYFDA SRHAPPRAGC DTGCETAGAA
AAAARPSSRG CPATEPGAAR EPAQAAPFAP DGIAIVGIAC RLPGGLDTPE AFWDALKAGA
CVVGELPGDR WTWPADIDPG ARHRGIDRGG FLDDIRSFDA GLFRLSPKEV ATMDPQQRIL
LELAWEAIER AGHCADAVAG SRTGVYVGAS GSDYRLLLER AGTGVDAHVA TGASMAVIAN
RISYTYDLRG PSIQVDTACS SSLVALHQAV QALRAGECDQ ALVGGVNVIC HPGNTIAYYK
AGMLSPQGRC KTLDDAADGY VRSEGAVMLM LRRLEQAVAD GDPIHAVIRG SACNHGGLTG
GLTVPHPDRQ ADLLRAAWAA ARVSADDIGY LEMHGTGTRL GDPIEVRGLA DAFGARDDAA
ARGTCGIGSV KSNLGHLEAA AGLAGVLKTV LALKHREVPA TIHFSRLNAQ ISLARTPFAV
VDTHRAWPAR GGARRLAGVS SFGSGGANAH VVLEEYPSEA PPRAAAGDAL FVLSAHSREQ
LAEYARRVLA YCERRLQSGD DAPAAAAVAH ALQRRQAMAW RLAFVAASLE EAVRRLRAFA
AGAAQPGTFV GSGAPKTSVA DFVNQNPDVQ QVVSAWLRER QLAKLARYWA DGVRIANWSA
LYDARPACVP LPCYPFARER HWIAARPAEA SEATAAAAPP AAPDDAYRAA PRLEAESRDA
AGASFRVRLV GDASFLTDHR LRGRKILPGV VHFELAHAAW AALARADAPA IEFRDLAWTH
PVDVATPERV LGVRLRRVAA RPGAHAYEAY SPPDAAGGGE RVHARGTVLD VAGPPEPALD
LDALRARFDG EPGAHERDAH DCHRAFERMG FGYGPAHRGL RGLRWRGGAR GATEVLAHIV
LPDCIADARE RHRLPPGLLD AAVQAAMAAA IGRDALGASP ACVPFSLDRL VCAGSCPARA
WVWARRRDGA RAALAPVDLD VCDDRGRVWA SFRALTFRPL REAAGRDVAA PRLFRPRWAA
RPLSGERASA GAADVAHWLV LCGFDEGAAL RRDLATLRAR LPDTSIVAID SDAATLESRF
ADSAGQLFEL FRELALSGAT PRAAVQVLVP ADGPERAFAA LSALVRGARL EHPSWSVQLL
SLERATQAAD AAARALENRG DDADFVRYRR ARRESLVFEA LPAGRDEPPR PWKEAGVYLI
TGGAGGLGLA FAKDIAAHVR RATIVLAGRA AAPDAALSAS LGAIARPAGV DIRYRSVDVG
DAPAVAALVG DLLREHGRLS GVIHAAGVTR DALLVRKPRA EFDAVLRPKV AGAAALYAAT
QDIDLDFLVS FSSIVGVTGN LGQTDYGAAN AFLDALAGLR AQMGGERRAR VLSIAWPLWR
DGGMGREPEV AAHFERAFAL APMDTAAGID AFHRALASDA AYVVVANGGA DWTPERAIAR
ALSARARPGA PEPAERAHGP AREAAAAACE AVAVSASSLS SPSPSSASAS ASVSAECAPT
DDGAARRHAV VAYLTRQIGA VLGHAPDSLD IDAPFTSYGM DSILALDTTR AIETDLGSLS
KTLFFEHENV RQLSAYLLDE HADRLAACAW FAQAGEPARA AGPALAAGGE PTRETAAPSA
AADTTADTTA GIAIETSTET PTAATTQAGA APEGRYRRVA KAALPADGQL AAAVAAIGGA
AATKGVALFE IWPELFVDSA GHGYCHLLVD GGVLFAAQHA GDARHRAALF AALLAYCDRH
GYAFGYLDLS EGRKPDLEAQ CGLLAAPVGV VQIVEAIASF SLAGGRMRRL RYMVERFRKA
GACRVVEYRA PDPDVAREIR RVICAWSDAK KVVNNVDIVL GEMASGSLHE RYRVFLTYLD
DVLQNVIMIA RDGDGYLMDQ EYYVADMPLG GTEYAVTEIL AALAAEGRER FSLGLTWGLF
DTGEGSSDPA ADAFLASTQT QLRRIFERGA ANRQYKSKYG TRDHAVYLYR RPGKPEPAIV
GCLSQFYRKG LTHHEVRRLA GLADAPAPAP APVAAPATVP QATPAPAPVA TPTADERAYD
VTRIDAATIR VDLVSDSWAH VDYPFMRARA ATLDRHAPPA RGGDPGRAVA ELLGFAQCLL
TTSGRAAEHL FFRARRSART RVPQNLLFES TLHNLVKSGF EPVELPDARA LDPDSRDLFR
GGIDLAALDR ELQAHADATA MVMLELCNNA SGGYPVALAQ IRAIAAACRR HGVPLVMDVT
RIVKNAELIR RGEAGYAQRG LWEIVREIAD HADAVVGSLC KDFGLGAGGL LAARDARVVA
NAAGIARLEG GLPGPAELRR IAAAFDDRAY LEREIGRQLD FARDLHIELE RCAVPVVQPG
AGHCVLVRVD QLAPPGGSAP SRGAYLRLLA ERYGVRGGLH LVGNLRDSHL NACVRLALPL
GFDDPRGPGA LAAALAAARD GRDHALDDLM RAPRARAAHG GRCADGIAII GLSGRYPDAP
TLDAFWRNLV SGRRSISEIP AERWDWRDHY ERDPDTAVAH GKSYGKWGGF LDGFSAFDPL
FFQIAPREAE FIDPQERLFL EACWHALEDA GCPPSALTRA QRAKAGVFGG MTKQGFNLYG
AGGAQPYQST SLAALVNRVS HCFDFNGPAV AFDSHCASAL VAIHEACQYL RREPEGIAIA
GAVNLNLHPS NYQQLSKMQV LASGAESASF ASGGLGYVPG EGVGAVVLKD YRRALEDGDP
IYGVIRGSAV NQNGRMNRFG MPSQKQQEAV VRAALAQAGV DPRSITYVEA SAHGSAVGDA
IEMAALTRVF GARERADGRY RIGSVKPNIG HGEAVSGMSQ LTKVLLSLRH GQLPPTLVCG
APNPDIDFDA LPFELNTSLT DWARARVDSE RVPRRAGITS TGASGLNAHL VLEEHAAPAV
PAQAGPGEAD ARAHVFVLSA RDRARLDDYA RDWIAFLNDD PQRDLAAIAY TLQVGREPMA
CRLAVVAADC RDLAGKLARW REAAHADCDD VFHGEARAAA GKPHREAARD AREPRDVARA
WVGGAVVDWA ARHAGARPAR VAGLPGYPFE RRSYWPGAAA APATARAAAA SDASEAARAL
EAREAHEATR ATRMTRAARE SEARQAPEAP SMSEATEATE VAEARDARPG AVADDAAARL
EAAFLPRFIE LVADVFRLPA GELDADRPLD EYGINSFLIK VLNVRFADIV GRVSSTLPFE
YRTAGEMARH FLTAHRDACA AWVAFDGAAS PGAADASSAP PVPAASASAA SATPATPATQ
ATQATGPTGE PSRASAGVPS GASSSIKRPG ATWDEPIAIV GVSGRYPQAR DLDAFWDNLM
RGRDSITEIP PERWPLDGFY DEDRERAIGA SRSYAKWGGF IDGFAEFDPQ FFNLSPREAS
NMDPQERIFL QACWEALEDA AYTRARIARE HGGRLGVFAG ITRAEFCLYG AGNLKQGKAP
FTSFCSLVNR VSYFLDANGP SIPIDTMCSS SLVAVHEACD KLRLGECEVA LAGGVNLSLH
PYMYVSLSAQ RMLSSDGRCK SFGLGGNGYV PGEGVGVIVL KPLSRALADG DRIHATIRAT
SINHGGKTNG YTVPNPIAQQ NVIRSALDRA GVHARAVSYV EAHGTGTELG DPIEIAGLSG
AFRRDTSDRG FCAIGSVKSN IGHLEAASGL AGLTKVLLQM KHGLLVPSLH ASELNPNIDF
PASPFVVNRE TRAWERPVID GREHPRIAGV SSFGAGGTNA HVILEEPPRQ ASPARAPTPA
GAPALIVLSA KKPEQLRRYA SELLARLRDA DYRARVDADG LRSLAYTLQV GREAMDERLA
VIADSVQALE GKLRQFVDGK TDIQDLHVSR VGRSAHHVI
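
The sequences above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tool, and the sketch assumes GC content is defined as the share of G and C bases over the whole sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCACGATC GCGATCGAGA CGAATCCGGA ACGAGAGGGC TCTGGAGCAT GAACGGATTG"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after removing the spaces
print(round(gc_fraction(seq), 2))  # GC fraction of this one line only
```

Passing the full gene listing through `clean_sequence` should give a string whose length can be compared with the reported gene length, and `gc_fraction` on that string can be compared with the reported GC content, provided the definition assumed here matches the one the database uses.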