Gene BURPS1710b_A0272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0272 
Symbol 
ID3693571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp401914 
End bp414633 
Gene Length12720 bp 
Protein Length4239 aa 
Translation table11 
GC content72% 
IMG OID637730526 
Productthiotemplate mechanism natural product synthetase 
Protein accessionYP_335431 
Protein GI76818390 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCATC GTTTGCCCCA CGTCGAGCGC GGCGACGCCG ACGAACCGTC CATCGAAACC 
GCCGCCGGCG AGCCGCCGCC GCTCGCGCTG CCGGCGAGCG TCGCGCAACG CCGGCTCTGG
TTCGTCGAGA ACGGCGACGT GCGCGCGTCG ACTTACAACG TGCCGGCCGC CTTCACCTTG
ACGGGGCCGC TCGACGACGC CGTGCTCGAG CGGGCGTTGG CGTTCATGCA GCAACGGCAT
CCCGCGCTGC GCTCGCGGTT TCGCACGCGC GACGGCGAAT TGCGCATCGA GCTCGCGCCG
CAGCCGGCGC CGTTGGCGCG GCAGGACCTC GGCGCGCTCG ATGCCGACGT GCGCGCGCGG
ACGGCCGAGC GGCTCTGCGC GAACCACGCG AACCGCCGGT TCGATCTCGA GCGGGACGCG
CCGATCCGCT GCTTGCTGCT CAGGCTCGGC GAAAACGAGC ATGTGCTGGC GGTGAACGTG
CATCACATCG TGTTCGACGA CTGGTCGATC CGGATCTTCT TCCGGGAGCT CGGCGCCGTC
TATGGCGCGC TGCTCGCGGG CGCGACGCCC GATCTGCCGG CGCTCGACTA CGCGGCGGCC
GTCGCGGCGA GCGTGCCCGC GGCCGCCCGG CACGCCGCCG CGCGTCAATA CTGGGCGCGC
GCGATGTCGG GCGCGCCGAC GCTTCACAAG CTGCCCACCG ACCGGCCGCG TCCGGCCGAG
CCGCGCATGC GCGGCGCGGT GCACAAGCAC GTGTTCGCGC GCCGGCATGC CGAGGGCATC
CGCGCGCTTT GCCGCCGTGC CGGCGTGACG CCCTACATGC TGGGCGTCGC CGCGTTCGCC
GCGCTGCTGC ATCGCTATTC GGGCGAGGAC GAGATCGTGA TCGGCAGCCC GTTCGCGAAC
CGCGTGACGC AGGCGCAGCA GAGCCTGATC GGCTTCTTCA TCAACCTGAT CCCGTTGCGC
GTGCGTTTCG ATGCCGGCGT CAATTTTCTC GACCTGCTCG CGCAGGTGCG CGAGACGTCG
TTCGATGCAT TCGAGCACGC GGTGCTGCCG TTCGACCAGA TCGTCGATGC GATCCGTCCG
CCGCGTTCGT CGAGCCACGC GCCGGTGTTC CAGATCATGT TCGACTACCT GAAGAGCGGC
GGCATGCTCG AGCTCGACGG CGTCGGCGTG ACCGGCTCGC TCGTCCACAC GGGCACGGCG
AAGTACGACC TGACGGTCTC GATGGAGGAA GGCCCCGACG AGCTGGCGGC GATCGTCGAA
TACGACACGG ATCTGTTCGA CGCGGGCACG ATCGCCCGCC TGGGCGGGCA TTTCGAGCGA
TTGCTCGAGA ACGTGCTGGC TTCGCCCGCC GCGCCGATCG CCGAGGGCTC GCTGCTGCCC
GCCGACGAGC TGCGGCAGGT GCGCCGCTTC ACGCGGCCCG ACGAGCCGTA CGCGCACATT
CCGTTCTCGC CGATGCCGCA GCGGATTCGC GAAGCGGCGC GGCGCGCGCC GCACGCGGTG
GCGATCGTGC ACGGCGACGC GCGGATGACG TACGAGACGC TCGATCGCCG CTCGGATGCG
CTCGCGCGCG CGTTGCGGGC GCGCGGCGTC GGCAGGGGGA GCCGGGTGGC GTCGCTGCAA
TCGTATTCGG AGAAGATCGT CGTCGCCTAC CTGGGCATCC TGAAGGCGGG CGCCGCGTAT
TTGCCGCTCG ATCCGGCGGA CCCGAGACGG CTGGAAAAAA TCGAGGACGC CGCGCCCGCG
ATGATCGTGA CGGCGCGGCG CGATCTCGAG GACGTGCCGC AGGCGTTGCG CGCCCGCACG
CTGACGATCG ACGACCCGAT CGAATGCGGG AAAGCGCCCG ACGCGGTGAA CGACGCGGTG
AACGACGTCG CGACCGACGT CGTGACCGAC GCAACGGCGC GCGACGCCGA ACTCGACTTC
GCGACGCTTG CCGAAGCCGA TCCGGCCTAT GTGATCTATA CGTCCGGATC GACCGGCAAG
CCGAAAGGCG TCGAGGTCTC GCACGGCAGC CTCAACGTGT CGTATCACGG CTGGCATCGC
GCCTATCGGT TCGGCAAGCC CGGCCATCCG GTCACGCTGC AGCTCGCCGG CATGACGTTC
GATCTGGGCA TCGGCGACGT GAGCCGCACG CTCGCATGCG GCGGCACGCT CGTCATGCCG
CCGCGCGACG GGCTGCTCGA CGCCGGCCGG CTGCACGCGC TGATGCGCGC CGAACGCGTG
TCGTTCGGCG ATTTCCCGCC GGTGATCCTG CGCGAGCTGA TTCGCCATTG CAACGAGACG
GGCGACCGGC TCGACATGCT CGACACGCTC GTGTGCGGCG CCGACGTATG GTTCGGGCAC
GAGCTGCGCG CGGCGCGCGC GCTGTGCGTG CCGCACGCGC GCGTGCTCGG TTCGTACGGC
GTGACCGAGG CGGCGATCGA CAGTTCGTAC TTCGATCCCG ATCTGCACGC GCTCGCGCCC
GATAGCGTCG TGCCGCTCGG GCGGCCGTTG CCGAGCTGCG AATTGCTGAT CGTCGATCCG
CTGCTGCAGA TGACGCCGAT CGGCGTGCCG GGCGAATTGC TCGTCGCGGG CCCTGCGGTG
GCGACGCGCT ACCTGAACAA CGACGCGCTG ACCGCGCAAA AATTCCTCCG CGGCCGCGTG
GACGAGCACG GGCGCGTGAT CGCAGGCGAC GGCCAAACGC GCTTTTATCG CACCGGCGAT
ATCTGCCGGT TCCTCGAAGA CGGCACGATC GATTTCCTGG GGCGCCGCGA CAACCAGATC
AAGATCCGCG GCTTTCGCGT CGAGCTGGGC GAGGTCGAGG GCGTGCTCGC CGCGCATCCG
GACGTGCGCC AATGCGCGGT CGTCGTGCGC GACGAGGCGT CGGGCGACCC GTCGCTCGCG
GCGTTCGTCG TGAGCGACGC GCCGATCGCG GCGCTGCGCG GCTATCTGCG CGGGCGTCTG
CCCGCGTACA TGCTGCCCGC CGCGATCGAA CGGCTGGGCG ACATGCCGCT CACGGCGAGC
GGCAAGATCG ACCGGAACCG GCTCAAGGCC TGGCCGCTGA GCGCGCCGGA CGTCCCGCCG
CCCGACGCGG CGACCGACGT CGAGCGCCGC CTGCTCGCGC TGTGGGAAAA CTTGCTGTCG
GCGCGCGTGC CGAGCGTGCA CGAGAACTTC TTCCAGTGCG GCGGACATTC GCTCGCGGCC
GCGCGCCTCG CCTCGAGCAT CAGTCAGGCG TTCGACGTCT CGATCGGCGT GTCGAGCGTC
TTCAATCATC CGAGCGTCGC CGAGCAGGCG CGGCTCGTCG AAGCGCTCGC GTCCGCGCGC
GCGCCGCGCG ACGCCCGGCA GACGGCGCAC GCGGACGCCG CGGAGCCGGC GGGCGACGAC
GGGCTGCTGT CCTATTCGCA GCAGAGCCTG TGGCTGACGG CGAAGCGCAC GCCGGACGAT
TTCAGCTACA ACATCCCGGT GACGTGGCGC CTCGACGGCC CGCTCGACGC GCACGCGCTG
GAGCGGGCGA TCAACGATGT GGTCGCGCGC CACGATGCGT TGCGCACGGT GTTTTCGTCG
GACGTGCGCA CGGTCGTCGG CCCGCATCGC GAATCGTCGC AGGAGCCGAC GCAGCGGGTG
CTCGATACGC TGACGATCGC GTTGCGGCGG GTGGCCGTCG CACCGGACGA CGCGGCGAGC
CTGCCCGCGC GGCTGCGCGA GGCGCACAGC CGCGCGTTCG ATCTGAACGC GGGGCCGCTG
CTGCGCGCGG TGCTGTTCGA GATCGCGCCG ACGCATCACG TGCTCGACGT GACGATCCAC
CATATCGTGA TCGATGGGCC GTCGTTCGGC CTGTTCTGGC GCGATCTGCA GACCGCGTAC
CGTGCTCGCG TCGCGGGCGA GGCGCCCGGC TGGCAGCGCC CGGCGCGGCG CCATGCGGAT
TTCGTGAGCC GGCAGCGGCA GGCGCTGCGC GGCGAAGCGG CCGCGCGGCA GCTCGCATAT
TGGCGCGAGC AACTGAGTGG CTTGCCCGCG GCGCTGCCGC TGCCCGACGC GGTGCTGGCG
GCGCACGCGC CCGGCAGCGC CCGATCGCTG ACGTTCGAGA TGCCCGATGA CGTCGCCGCG
GGCCTCGCCG CGCTCGCGCG GCGCACGAAC GGCTCGCCGT TCATCGTCTA CCTCGCGCTT
TTCGCCGCCG CGCTGCGGCA GCAGACCGGC GAAGCGGACT TCGCGATCGG CACGCCGCTC
TCGCTGCGCC CGCACGAAGG GTTCGCCGAC GTGCTCGGCT TCTTCGCGAA CACGATGCCG
CTGCGCATGC GCCTGCACGG GCTCGACACC TTCGAGCGCG TGCTGCGGTA CGTGCGCGAA
CAGTGCCTCG CGCTCTACGA GAACGGCGAC GTGCCGTTCG AGTATCTCGT CCAGGCGCTG
AAGCCCGCGC GGGCCGCGCG GCGCAATCCG GTGTTCCAGA CGATCTTCTC GTGCGAATTC
GACGACGAGC GGCTGCAGTT GACGGGCGTC GACGCGCATG CGCTCGCGCT CGACGCGTAC
ACGGCGAAGC TCGATCTCGA AATGGCGATC AACGTGAGCG GCGGCCGCGT GGTGTGCCAT
CTGATGTCGC GCCCCGGATC GTTCGATGCC GATGCGTTGA GCTCGATCCG GCATCACTTC
CTGCGGACGG CGTGCAGCGC GACGCGCGCG CACGCGGAGC GCGAGGCGGC AGAGGCGCGC
GCGACGCACG AGGTGCATGA GGTGCATGAG GTGCACGAGG TGCATGAGAT GCATGACGTG
CATGAGGCGC GCGCGCAACT CGCCGATGAC GCGCCGGGCG CGCGCCGGCC GTCGGCGGAC
GACCTGTTCG GCCTCTTCGC GCGCAGCGCG GCGCGCCATG CGCAGCGCGT CGCGCTCGAC
AGCCCGACGC TGCGCGCGAG CTACGCGCAA CTGGCCGAGC GCGTGTCGGC CGCCGCGCGC
GCGCTCGCGG CGCACGGCGT GCGGCGCGGC GATCGGGTCG GGATCTTCGT CGGCCACCAT
CCGCACAACG TGACGGCGAT GCTCGCGATC GCGCGCGTCG GCGCCGCGTT CGTGCCGATG
GACCCCGAGC ACAAGCCGCA GTGGAACCGG CATATCGTGG ACGACGCGGC GCTGACGGCG
CTCGTCGGCG GCGCGTGGAC CGCGGATGCG GCGCGCGGCT TCGGGCTGCC CGTCGTCGAT
CTCGACGCGC CGCCGCCGCC CGCATCGGAG CTCGCCGACG CGCCCGCGGC GGGCGGCGCG
CACCCGGACG ATTGCGCGTA CGTGATCTAC ACGTCGGGCT CGACGGGCCG GCCGAAGGGC
GTCGCCGTCT CGCACGCGAG CGTGTGCCAC AACGTGCGCG CGATGGCCGA GATCATGCGC
ATCGGGCCGC AGTCGAGAAT GGCGCAGTAC GTGTCGCCGG TGTTCGACGT CGTGCTGGGC
GAGATCTTTC CGGCGCTCGC CGCAGGCGCG GCGATCGTGT TCGCCGAGCG CCGCCGGCCG
CTGCCGGGCC AGGCGCTGGT GGATTGGCTC GACGCGCAGC GCGTGTCGCA TGTGTGGATC
GTGCCGTCCG CGCTGGCGAT GGTGCCCGAG GCCGCGCTGC CGGCGCTCGA GGTGCTGATC
GTGGCCGGCG AAGCCTGCCC GCGCGAGCTC GCGCAGCGAT GGGCGGCCGG ACGCCGGCTG
CTGAACGGCT ACGGGCCCAC GGAGGCCGCG ATCGTCGTGT CGCTGACCGA TTACCACGCG
CAGCGCGAGC GCCTGATCCT GAGGCCGATG GGCGGCGCGC GGCTGCACGT GCTCGACGAA
GCGCTGCGCC CGGCGCCCGC CGGCGCGGCG GGCGAGCTGT TCATCGGCGG CGCGTGCGTC
GCGCAGGGCT ACCTCGGGCA GCCGGCGCGC ACCGCGCAGG CGTTCGTCGC CGATCCGTTC
GACGCCGAGC CGGGCGCGCG CATGTATCGC ACGGGCGACG TGGTGCGCCG GCTCGACGAC
GGCGCGATCC AGTTCATCGG CCGCGTCGAT CGCCAGGTGA AGATCCGGGG CTTTCGCATC
GAGCTCGACG CGGTGCGCGC CGCGCTGATG GAAGTGCCCG GCGTGCAGGC GGCGGAGGCG
CTCGCGCAGC CGGACGCGAG CGGGCAGCCG CTGCTCGTCG GGTATGTCGT CGCGCGCCGC
GCGAAGGCCG AGCTGCTCGA CGCGCTGCGC GGCAAGGTGC CGGACGCGAT GGTGCCCTCG
ACGCTCGTGT TCCTCGATGC GCTGCCGACC GGCAGCACGG GCAAGACGGA TCTGAAGGCG
CTGAAGGCGC TGAAGACGGG CGACGCGGCG CGCCCCGCCG CGGCGGCGGC CGACATGCCG
CGTGCCGCGT CGCAGGGGCG CACGCTGCAT CGCGTGCGCG AGATCTGGCG CACGCTGCTC
GAACGCGACG ACATCGGCGA CGACGAAAAC TTCTTCGACG CGGGCGGGCA TTCGCTGCGC
GCGGTCGCGC TGCACCAGCG CATCACCGAA GCGTTCGGCG ATGTGATCGC GCTCACCGAT
CTCTTCGAGC ATCCGACGAT CGGCGCGCTC GCCGCGCATC TCGACGCGTT CGCGCCGCGC
GACGGCGAAG CGGCCGACGA CGCCGCCGGC GCTGCCGCGC GCGCGCCCGC CGACGGTGTG
CTCGACACCG ACGCGATCGC CGTGATCGGC CTCGCCGGCC GCTTTCCCGA CGCGCCGGAC
CTCGACCGTT TCTGGGAGCG GCTGCTCGCC GGCTACGAGG CGGGCCGCAC GCTGAGCGAC
GCGGAACTCG ACGCGCACGG CGTGCCGGCC GAGCTGTATC GCAATCCGCA TTTCGTCCGC
CGCTTCAAGG AGCTCGAAGG CAAGGCGGAG TTCGACGCCG GCTTTTTCGG CTATTCGCCC
CGCGAGGCGC AGGTGATGGA CCCGCAGCAG CGGATCTTCC TCGAGCTCGC GTGGCAGGCG
CTCGAGCAAG CCGGCTATGG CGATCGCGGC CGCGTGCGCT CGGTCGGCGT GTTCGCGAGC
GCCGCGTTCA ACTATTACCT CGTGCAGAAC GTGATGCCGA ATGCCGAGCG GCTGCGGCTC
GAGCCGGGCC AGTGGCTGAT CGGCAACGAC AAGGATTTCA TCGCGACCCG CACCGCGTAC
AAGCTCAATC TGCTCGGGCC GGCGCTGAGC GTCGGCACCG CGTGCTCGTC GTCGCTGATG
GCGGTCCATC TGGCTTGCGC GAGCTTGCGC AACGGCGAGG CGCAGATGGC GCTCGCCGGC
GCGGTCGCGC TCGATCCCGA TCAGGTCGGC TATCTGTACG CCGAAGGCGG GATCATGTCG
CCGGACGGGC GCTGCCGGCC GTTCGACGCC GCCGCGGCCG GCACCGCAGG CGGCAGCGGC
GGCGGCGTGG TGCTGCTCAA GCGGCTCGAC GCGGCGCTGC GCGACGGCGA CACCGTGTAC
GCGGTGATCA AGGGCTCGGC GGCGAACAAC GACGGCGCGG ACAAGGTGAG CTACACCGCG
CCGAGCGTCG CCGGCCAGAC GGCCGTGATC CGCGACGCGC TGCGCGCGGC GCGCGTGTCG
GCCGACAGCA TCGGCTACGT CGAAGCGCAC GGCACGGGCA CGCCGCTCGG CGATCCGATC
GAGGTGCGCG CGCTCGCGCA GGCCTTCGCC GAGGCAGCCG CGCCGGGCGC GCTGGCGAAC
GGCCGGTGCG GGATCGGCTC GATCAAGGGC AACATCGGCC ATCTCGACGC GGCGGCCGGC
ATCGCGGGGT TCATCAAGGC GGTTCTCGCG CTGCATCGCG AAGCGATTCC GCCGAGCATC
AACTGCGAGA CGCCTAACGC GCGAATCGGC TTCGACAAGA CGCCGTTCAG CGTCGTGCGC
GAAGCCCGCG CGTGGCCGAG AACGGCGACG CCGCGCCGCG CGGGCGTCAG CTCGTTCGGC
GTCGGCGGCA CGAACGTCCA CGTGGTGCTG GAGGAGGCGC CGCGCGTGCG CGCGGGCGAA
TCGGCCGAGC CTTCGCGCTG GCAGTTGCTG CCGGTTTCGG CGCGTTCGCC GAGCGCGCTG
CGCGAGCAAT GGCGGCAACT GCGCGACGCG CTCGCGCACG CGCGGCCGCG CGTGCAGGAC
GTCGCCCATA CGCTGCAGGT CGGGCGCACC GCGTTCGAGC ATCGCGGCTT CGCCGTCGTC
GACGCGGCCG CCGACGCGCC CGCCCAGCTC GACGCCGCCG GCTCGCCGCC GGCGTTCGAG
CGTCGCGCTG CGCCGCCGGT GGTGTTCATG TTTCCCGGCC AAGGCAGCCA GTATCCCGGC
ATGGGCGCGG CGCTGTATCG GAGCGGCGGC GTATTCCAGG CCGAAGTCGA TCGCTGCGCG
CAGTTGCTTC GCGCGCATCT GGACCGGGAC GTCCGCTCGC TGATGTTCGA CGCGGGTGCG
TCGCTGCTGA GGGAAACGCG CTATACGCAG CCGGCGCTGT TTGCGATCGA ATACGCGCTC
GCGCGGCAAT GGCTCGCATG GGGCGTCACG CCGCACGCGA TGATCGGCCA TAGCGTCGGC
GAGCTCGTCG CGGCCGCCGT CGGCGAGACG CTCGCGCTGC CCGACGCGCT GGCGCTCGTC
GTCGCGCGGG CCGACGCGAT GCAGCGCCAG CCGCCGGGCG CGATGCTCGC GGTGCTGGCC
GATGCGCGCG AGCTCGCGGC GCTCAACGGC CCGGGCTGCG AGATCGCGGC GATCAACGGC
CCCGAGCAAT ACGTGCTCGC CGGCGACGCC GCCCGAATCG CCGCGCTCGA AGACGCGTGC
ATCGCGGCCG GCGTCGCGTG CCAGCGCCTC GCGACCTCGC ATGCGTTCCA TTCGTCGGCG
ATGGACGGCG CCGCGCGCGA GATCGATCGC GCGAGCGAGC GCATCGTGCG CCGGGCCGCG
CGCATTCCGT TGATCTCGAA TCGCAGCGGG CGCTGGCTGA ACGAGCAAGA CCTGCGGGAC
GCCGGTTACT GGGGCGAGCA CGTGCGCCAG CCCGTGCAAT TTCACGCGGG GGTGCGCACG
CTGCTCGATG CGCTCGACGC GCCGATATTC GTCGAAGTCG GGCCGGGGCG CGCGCTGGGC
AATCTGATCG GCGGCTGGGC GGGGCTCGGG CCGCAGCGGA TCGTCGCGAC GCTGCCGCAC
GCGCGCGAGC GCAGGGACGA CATGGCGGCG GCGTTGCGAG GCGTCGGCAC GCTGTGGGCG
CAGGGCGTCG ACGTGAACTG GGCCGGCCTG CATGCGCCGG GCGCGGCGCG GCGCATCGCG
CTGCCGACCT ATCCGTTCGA GCGTACGCGC CACTGGATCG AGCGGCCGGC GGGCGCGCGC
GCGGCGCCGG CGCGCGAAGC CGACGGCGTG CCGATGCGGC GCGGCGAGGA CGCGGCCGAC
GGCAGCCTCA CCGTGTCGTT TGCGCTGCAC GAGCGGCTCT GGTTTCTCGA TGAACACCGG
ATCTTCGACG GCGCGCCGGT GCTGCCCGGC ACCGCCTGCA TCGAACTCGT GCGGCGCGCG
TATTCGCTCG TGCGCCCGGG CGCGGCGGTG ACGATGCGCG ACGTCTATTT TCCGACGGCG
CTGATCCTGT CGACGGACGA ATCGCGCAAC GTGCGCGTCG TGTTCCTGCC GCCGGAACGC
GTGTCGGACG GCGCGGCCGC CGGGGGCGAC CTCGCGTTCG TGCTCGAATC GAACGACGCG
AACGGGCCCG CCGGATGGAC GCCGCACGCG AGCGGGCGCA TCGGGGACGA TCCGCCGGCG
TGCGCCGCGC CCGCTTCGCT CGCCGCGCTC GCCGCGCCCG CTTCGCTCGC CGCGCTCGCC
GCGCTCGACG CTCCGGCCGC GCTGCGCGAG CAATGGGGGC TCACGGCGCT GGACGACGTC
GCCGCGCTGT CCGCGCAGGC GTTCGCCGAT TACGGGGCGC GCTGGCGCGG CGTCGACGCG
CTCTGGCTCG GCGAACGGGC CGGGCTCGCG CGGCTCAGGC TGCCGGCCGC CGGCAGCGGC
GACTTGCCCG ATTTCGCGCT GCATCCGGCC ATGCTCGACG TCGCCACGGC GTTCCTGCCC
GCCTGCCTGC GCCCGCGCGA CGCGTCGGTG CCGTTCCGCT ACGAATCGAT CCGCATGCAC
CGGCCGCTGC GCGCCGATTG CTACAGCTTC GCCGTCGAGA CCGCGCCGAA CGTGTACGAC
GTCACGCTGT TCGCATGGGA CGAGGCCGCG CGGCGCGCCG ACGTGCTCGT CGCGATCGGC
GGCTTCGCGC GCCGCGAGCC GGCGCACCGG ACGCGCGACG TCGCGCAGTG GTGCCGCACG
GTGAGCTGGC GCGACGCGCC CGCGGCGCGC GCGTTGCCGC CCGAGCGCTG GCTCGTGTTC
GGCGACGAAT GGTTCGCGCT CGCGCCCGCC GGCAGCGTGC TCGTGCGCGA GGACGACGCG
TTTCGCGCGC ACGGCGACAA CGGCTATGGC GTGCGCCCGG GCGAGAAGGT CGATTGCGAC
CGGCTGATCG CGCGGCTCGC CGAGCAAGGC GGCGTGCCGG CGCACGTCGT CTACGGCTGG
GCGCAGACGG ACGTCGATCG CGCGTTCGCC GGGCTCGCCG CGCTGCTGCA GGCGCTGGGC
GAGCATCCCG CCGATTTTCG CGTGTCGCTC GTGACGAAGG GCGCGCGCTC GGCGCGCACG
CTGGACGCAT GCGCAGCCGC CGCGCCGGCG GGCTTGCTCA AGGCGGTGCG CTGGGAATAT
CCGCGCATCG TGTGCCGCCA CATCGATATC GACGATGCGA GCGACGCGAC GATCGACGCG
TTGCGCGCGG AGCTGTCGTC CGAGCCCGCC ACGCCGCCCG GCGCGCCGCC CGAGCTGCCG
AGCAGCATCG CGCTCGCCGG CGCGCGCCGC GAGGCGCCCG GCTTCGCGGC GCTGCCGGCC
GTCGCGCGCG ACGACGTCCT GCGCGACGGC GGTGCGTATC TGATCACGGG CGGCGCGAGC
GGCATCGGCC TCGAGCTGGC GGCGCACATC GCGTCGCGGC GGCGGGACGT GAGGCTGGCG
TTGCTGAGCC GCTCGCCGCA TGACGAAAAC GCCGCGCGGT TCGCCGCGCT GGACGAGGCC
GCCGCGAGCG TGCTGCGGTT GACGGCCGAC GTCGCGCACG CCGCGCAGCT CGCCGACGCG
CTGCGCACGG TGCGCGCGCG CTTCGGGCGC ATCGACGGCG TCATCCACGC GGCCGGCGTC
GAGGCGAGCG GCCTGCTCGA AACCGGCACG CCCGACGCAT GGCGGCGCGT GATGGCGGCG
AAGGTTCACG GCGCGCGACA CCTGTTCGAC CAACTGGCCG GCGATCCGCC CGACTTCATC
GTGCTCTGCT CGTCGCTCGC CGCGGTCGTC GGCGGCCTCG GGCAGGCCGA CTACGCGGCG
GCGAACGGTT ACATGGACGC GCTCGCGCAG CACTGGCGCC AACGCGGCGT CGCGGCCATC
GCGATCGATT GGGATACGTG GTCGGACACG GGCATGGCGT TCGACCACGC GGCGCGCACG
CGCCGCTCGA ATGACCGCCC GGGCGCGCTG CCCGGCCTCG CGAACCGCGA AGGGCGGGCG
CTCTTCGATC TCGCGCTCGC GCACGACGCG TCGCGCATCG TTATCAGCAA GCGGGGCTTC
GAGCAGGACC GGCGCGACGC GCCCACGCGC GCGCGGCGCG CGGCCGCCCC GGGCGACGCG
CAGGCGGCGC TCGTCGCGCT GTGGCAGGAA CTGCTGGGCG TCGAGCAGGT GGGCGTCGAC
GACGACTTCT TCGATCTGGG CGGCCATTCG CTGCTCGCGA CGCAGTTGAT TTCCCGCGTG
CGCGATCAGT ACGCGCGCAG CCCGACGCTC GGCGAATTCC TGGAGGAGCC GACGATCGCG
CGGCTACTGC GCGCGATCGA CCATACGGGC GGCGACACGG GCGGCGACAT GAGCGGCGAC
ACCGCCGGCG ACGCGCCCGA CGTCGACGAG ACGCTGCGCT ATTGCGTGGT GCCGATGGTG
AAGGCCGGCA GCGGCGCGCC GTTCTTCTGC ATTCCCGGCA TGGGCGGCAA CATCACGCAG
TTGCTGCCGC TCGCGAACGC GCTCGGCGCG GACCGGCCGG TGATCGGCCT GCAATACCTC
GGCCTCGACG GCAAGCATGC GCCGCATGCG TCGGTCGAGG CAATCGCGGC GCACTACGTG
CGCTGCATCC GCAGCGTGCA GCCCGCGGGG CCGTACTTCC TCGGCGGGCA TTCGCTCGGC
GGCAAGATCG CCTATGAAGT CGCACGCCGG CTGCATGCGC AGGGCGACGC GATCGGCCTC
GTCGCGATGT TCGATTCCGC CGCGCCGCCG TATTCGTTCG TCGCGCATCA GGACGATTTC
GCGATCGCCA GCATGATTCT CGGCGTCTTC GCGTACTACG CGGGCAAGAT GGAGATGATG
GCCGGCATCG ACGACGCGCG CTTGCGCGAC GCGCCGCGCG AGCGGTTGCT CGCGTTCATG
GGCGAGCGGC TCGCGCAGTT CGGCGTGATT CAGTCGCAAA GCGATACGAG CGCGATTCGC
GGGCTCTTCA ACGTGTATCG CGCGGCGGCC GATTTTTCCG CGCGCTATGC GCCGCCGCAC
GAGCACCTGC CGCTGCCGAT ACTGCTCGTC AAGGCGACCG AGCCGATGCC CGACGGCATC
AAGCTGCCCG AAATCCGCGA GACGCCGGCG TGGGGCTGGG AAAACTTCAC GCGGCTGCCG
GTGCGCACGT GCGAAGTCGC GGGCAACCAT TACAGTTGCC TGATGGACGG CTACGTCGAG
CGCATCGCCG ATGCGCTGCG CGATGCGCTG GCGTCGGCGC GGCAAGCGAT CGAGGCGTGA
 
Protein sequence
MNHRLPHVER GDADEPSIET AAGEPPPLAL PASVAQRRLW FVENGDVRAS TYNVPAAFTL 
TGPLDDAVLE RALAFMQQRH PALRSRFRTR DGELRIELAP QPAPLARQDL GALDADVRAR
TAERLCANHA NRRFDLERDA PIRCLLLRLG ENEHVLAVNV HHIVFDDWSI RIFFRELGAV
YGALLAGATP DLPALDYAAA VAASVPAAAR HAAARQYWAR AMSGAPTLHK LPTDRPRPAE
PRMRGAVHKH VFARRHAEGI RALCRRAGVT PYMLGVAAFA ALLHRYSGED EIVIGSPFAN
RVTQAQQSLI GFFINLIPLR VRFDAGVNFL DLLAQVRETS FDAFEHAVLP FDQIVDAIRP
PRSSSHAPVF QIMFDYLKSG GMLELDGVGV TGSLVHTGTA KYDLTVSMEE GPDELAAIVE
YDTDLFDAGT IARLGGHFER LLENVLASPA APIAEGSLLP ADELRQVRRF TRPDEPYAHI
PFSPMPQRIR EAARRAPHAV AIVHGDARMT YETLDRRSDA LARALRARGV GRGSRVASLQ
SYSEKIVVAY LGILKAGAAY LPLDPADPRR LEKIEDAAPA MIVTARRDLE DVPQALRART
LTIDDPIECG KAPDAVNDAV NDVATDVVTD ATARDAELDF ATLAEADPAY VIYTSGSTGK
PKGVEVSHGS LNVSYHGWHR AYRFGKPGHP VTLQLAGMTF DLGIGDVSRT LACGGTLVMP
PRDGLLDAGR LHALMRAERV SFGDFPPVIL RELIRHCNET GDRLDMLDTL VCGADVWFGH
ELRAARALCV PHARVLGSYG VTEAAIDSSY FDPDLHALAP DSVVPLGRPL PSCELLIVDP
LLQMTPIGVP GELLVAGPAV ATRYLNNDAL TAQKFLRGRV DEHGRVIAGD GQTRFYRTGD
ICRFLEDGTI DFLGRRDNQI KIRGFRVELG EVEGVLAAHP DVRQCAVVVR DEASGDPSLA
AFVVSDAPIA ALRGYLRGRL PAYMLPAAIE RLGDMPLTAS GKIDRNRLKA WPLSAPDVPP
PDAATDVERR LLALWENLLS ARVPSVHENF FQCGGHSLAA ARLASSISQA FDVSIGVSSV
FNHPSVAEQA RLVEALASAR APRDARQTAH ADAAEPAGDD GLLSYSQQSL WLTAKRTPDD
FSYNIPVTWR LDGPLDAHAL ERAINDVVAR HDALRTVFSS DVRTVVGPHR ESSQEPTQRV
LDTLTIALRR VAVAPDDAAS LPARLREAHS RAFDLNAGPL LRAVLFEIAP THHVLDVTIH
HIVIDGPSFG LFWRDLQTAY RARVAGEAPG WQRPARRHAD FVSRQRQALR GEAAARQLAY
WREQLSGLPA ALPLPDAVLA AHAPGSARSL TFEMPDDVAA GLAALARRTN GSPFIVYLAL
FAAALRQQTG EADFAIGTPL SLRPHEGFAD VLGFFANTMP LRMRLHGLDT FERVLRYVRE
QCLALYENGD VPFEYLVQAL KPARAARRNP VFQTIFSCEF DDERLQLTGV DAHALALDAY
TAKLDLEMAI NVSGGRVVCH LMSRPGSFDA DALSSIRHHF LRTACSATRA HAEREAAEAR
ATHEVHEVHE VHEVHEMHDV HEARAQLADD APGARRPSAD DLFGLFARSA ARHAQRVALD
SPTLRASYAQ LAERVSAAAR ALAAHGVRRG DRVGIFVGHH PHNVTAMLAI ARVGAAFVPM
DPEHKPQWNR HIVDDAALTA LVGGAWTADA ARGFGLPVVD LDAPPPPASE LADAPAAGGA
HPDDCAYVIY TSGSTGRPKG VAVSHASVCH NVRAMAEIMR IGPQSRMAQY VSPVFDVVLG
EIFPALAAGA AIVFAERRRP LPGQALVDWL DAQRVSHVWI VPSALAMVPE AALPALEVLI
VAGEACPREL AQRWAAGRRL LNGYGPTEAA IVVSLTDYHA QRERLILRPM GGARLHVLDE
ALRPAPAGAA GELFIGGACV AQGYLGQPAR TAQAFVADPF DAEPGARMYR TGDVVRRLDD
GAIQFIGRVD RQVKIRGFRI ELDAVRAALM EVPGVQAAEA LAQPDASGQP LLVGYVVARR
AKAELLDALR GKVPDAMVPS TLVFLDALPT GSTGKTDLKA LKALKTGDAA RPAAAAADMP
RAASQGRTLH RVREIWRTLL ERDDIGDDEN FFDAGGHSLR AVALHQRITE AFGDVIALTD
LFEHPTIGAL AAHLDAFAPR DGEAADDAAG AAARAPADGV LDTDAIAVIG LAGRFPDAPD
LDRFWERLLA GYEAGRTLSD AELDAHGVPA ELYRNPHFVR RFKELEGKAE FDAGFFGYSP
REAQVMDPQQ RIFLELAWQA LEQAGYGDRG RVRSVGVFAS AAFNYYLVQN VMPNAERLRL
EPGQWLIGND KDFIATRTAY KLNLLGPALS VGTACSSSLM AVHLACASLR NGEAQMALAG
AVALDPDQVG YLYAEGGIMS PDGRCRPFDA AAAGTAGGSG GGVVLLKRLD AALRDGDTVY
AVIKGSAANN DGADKVSYTA PSVAGQTAVI RDALRAARVS ADSIGYVEAH GTGTPLGDPI
EVRALAQAFA EAAAPGALAN GRCGIGSIKG NIGHLDAAAG IAGFIKAVLA LHREAIPPSI
NCETPNARIG FDKTPFSVVR EARAWPRTAT PRRAGVSSFG VGGTNVHVVL EEAPRVRAGE
SAEPSRWQLL PVSARSPSAL REQWRQLRDA LAHARPRVQD VAHTLQVGRT AFEHRGFAVV
DAAADAPAQL DAAGSPPAFE RRAAPPVVFM FPGQGSQYPG MGAALYRSGG VFQAEVDRCA
QLLRAHLDRD VRSLMFDAGA SLLRETRYTQ PALFAIEYAL ARQWLAWGVT PHAMIGHSVG
ELVAAAVGET LALPDALALV VARADAMQRQ PPGAMLAVLA DARELAALNG PGCEIAAING
PEQYVLAGDA ARIAALEDAC IAAGVACQRL ATSHAFHSSA MDGAAREIDR ASERIVRRAA
RIPLISNRSG RWLNEQDLRD AGYWGEHVRQ PVQFHAGVRT LLDALDAPIF VEVGPGRALG
NLIGGWAGLG PQRIVATLPH ARERRDDMAA ALRGVGTLWA QGVDVNWAGL HAPGAARRIA
LPTYPFERTR HWIERPAGAR AAPAREADGV PMRRGEDAAD GSLTVSFALH ERLWFLDEHR
IFDGAPVLPG TACIELVRRA YSLVRPGAAV TMRDVYFPTA LILSTDESRN VRVVFLPPER
VSDGAAAGGD LAFVLESNDA NGPAGWTPHA SGRIGDDPPA CAAPASLAAL AAPASLAALA
ALDAPAALRE QWGLTALDDV AALSAQAFAD YGARWRGVDA LWLGERAGLA RLRLPAAGSG
DLPDFALHPA MLDVATAFLP ACLRPRDASV PFRYESIRMH RPLRADCYSF AVETAPNVYD
VTLFAWDEAA RRADVLVAIG GFARREPAHR TRDVAQWCRT VSWRDAPAAR ALPPERWLVF
GDEWFALAPA GSVLVREDDA FRAHGDNGYG VRPGEKVDCD RLIARLAEQG GVPAHVVYGW
AQTDVDRAFA GLAALLQALG EHPADFRVSL VTKGARSART LDACAAAAPA GLLKAVRWEY
PRIVCRHIDI DDASDATIDA LRAELSSEPA TPPGAPPELP SSIALAGARR EAPGFAALPA
VARDDVLRDG GAYLITGGAS GIGLELAAHI ASRRRDVRLA LLSRSPHDEN AARFAALDEA
AASVLRLTAD VAHAAQLADA LRTVRARFGR IDGVIHAAGV EASGLLETGT PDAWRRVMAA
KVHGARHLFD QLAGDPPDFI VLCSSLAAVV GGLGQADYAA ANGYMDALAQ HWRQRGVAAI
AIDWDTWSDT GMAFDHAART RRSNDRPGAL PGLANREGRA LFDLALAHDA SRIVISKRGF
EQDRRDAPTR ARRAAAPGDA QAALVALWQE LLGVEQVGVD DDFFDLGGHS LLATQLISRV
RDQYARSPTL GEFLEEPTIA RLLRAIDHTG GDTGGDMSGD TAGDAPDVDE TLRYCVVPMV
KAGSGAPFFC IPGMGGNITQ LLPLANALGA DRPVIGLQYL GLDGKHAPHA SVEAIAAHYV
RCIRSVQPAG PYFLGGHSLG GKIAYEVARR LHAQGDAIGL VAMFDSAAPP YSFVAHQDDF
AIASMILGVF AYYAGKMEMM AGIDDARLRD APRERLLAFM GERLAQFGVI QSQSDTSAIR
GLFNVYRAAA DFSARYAPPH EHLPLPILLV KATEPMPDGI KLPEIRETPA WGWENFTRLP
VRTCEVAGNH YSCLMDGYVE RIADALRDAL ASARQAIEA