Gene BMAA1643 details


Gene Information

Locus tag: BMAA1643
Symbol:
ID: 3087135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei ATCC 23344
Kingdom: Bacteria
Replicon accession: NC_006349
Strand:
Start bp: 1780637
End bp: 1790623
Gene Length: 9987 bp
Protein Length: 3328 aa
Translation table: 11
GC content: 71%
IMG OID: 637565524
Product: putative peptide synthetase
Protein accession: YP_106216
Protein GI: 53716405
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
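
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length includes the terminal stop codon. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 1780637, 1790623

gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length, protein_length)
```

Under those assumptions the computed values match the listed Gene Length (9987 bp) and Protein Length (3328 aa).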


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCGGCCG CCGAACGCAC ACAGGTACTG CACGGCTGGA ACGAAACAGG CCGCGCGTAT 
GCGCGGGATG CCTGCCTCCA TCAGTTGTTC GAAGCCCAGG TATCGCGAAC GCCGGAAGCG
GCGGCGGTGA TCTGCGGCGA CGAGACGCTG AGCTATACGG ACCTCGACGC GCGTGCGAAT
CGCCTCGCGC ACTACCTGCG CGGACAAGGC GTCGGGCCGG ACACGCGCGT GGGGCTGGCG
CTCGGGCGCG GCGTCGAGAT GATGACGGGA TTGTTGGCGA TCCTGAAGGC GGGCGGCGCA
TATGTGCCGC TGGACCTAGG CTATGCGTCG GAGCGCTTGC GCGCGATCCT GGACGACAGC
CGGCCGGCGA TCGTGCTGGC GGACGCGGCG GGGCGTGCGG CGCTGGATGC GCTGGCCGGT
GCGCCGCCGA TTGCGGACCT GCACGCGGAT GCGTCGCGCT GGAGCGCGTT GCCTTCGACG
CCGCCGCGTG TCGAAGGCCT GACGCCGCGC CACCTTGCCT ATGTGATCTA CACGTCGGGC
TCGACGGGAC AGCCGAAGGG GGTGATGGTC GAGCACGCAA GCGTGGTGAA TCTGTGGCGT
GCGCTGGACG AGGCGATCTA CCGCACGCAC CCGAGCGCAC GGCGCGTGAG CCTGAACGCA
TCGATCGCGT TCGATTCGCT GGTCAAGCAG TGGGTGCAGT TGCTGTCGGG GCGGACGCTG
GTGGTGGTGC CGGAGCCGGT GCGCTTCGAC GGGAGGCGTC TGCTCGATGC GATCGGGCGA
GACCGGATCG ACGTCTTCGA TTGCACGCCG TCGCAACTGG CGTTGATCGA GGGGGCGCGA
GGGCCGGAGG ACGAAGCCTA TCCGCAAGTG ACGCTGGTGG GGGGCGAGGC GATCGGCGAA
GGGATGTGGT CGGAGTTGGC GAGCGCATCG AGCCGGACGT ACTACAACGT GTATGGTCCG
ACGGAATGCA CGGTGGATGC GACGCTCGCG CGGATCACGG CGGAGCATGC GCCGCACATC
GGCGGGCCGC TGGCGAACGT GCGGGCCTAT GTGTTGAACG AGCGGTTGAG CCCGGCGCCG
GTGGGCGTGC GCGGGGAGCT CTACATCGGC GGGGCGGGTG TTGCGCGCGG GTACTTGAAC
CGGCCGGAGC TGACGCGGGA GCGGTTCATC GACGATCCGT TCGTGGCGGG TGGGCGGCTG
TATAAGACGG GGGATCTGGC GCGTTGGCGT ACGGACGGGA GCCTGGAGTA TCTGGGGCGA
AACGACTTCC AGGTGAAGAT ACGCGGTTTC CGGATCGAGT TGGGGGAGAT CGAGGCGCAG
TTGGCGAAGG TGACGGGGGT GCGCGAAGTG GTCGTGCTTG CGCGAGATTC GGCAGCGGCG
GTGCACGATA GCGCGACGGA ACACGCAACT CCGAATGCGC TTTCGCCCTC GCCCGAGACC
TCAACCGCGA CCGCGACGGC AACAGCAACG GCAACAGCGA CAGAGAAACG CCTCGTCGCG
TACTACACGG GTGACGCCGA TGTCGCGGCA TTGAGAGCGC AAGCCGCGCA GCACTTGCCG
AGCTACATGG TGCCGTCGGC GTATGTGCGG CTGGACGCGT GGCCGCTGAC ACCGAACGGC
AAGCTGGACC GGCGCGCATT GCCCGAGCCG GCAGACGACG CATACGCCCG CGCCGAATAC
GAAGCGCCGC AGGGCGCAAA AGAAGAAGCA CTGGCCGCGA TCTGGCGGGA CTTGCTGCAA
GTCGATCGAA TCAGCCGCCA CGACAACTTC TTCCAGTTGG GCGGTCATTC GCTGCAGGCG
ATCAGCCTGG GCGACATGAT GCGCGAGCGC GGCTTGCACG CCGACGTGCG CACGCTGTTC
AACGCCGAGA CGCTGGCCGC GCTCGCCGCG CAATCGGGCA CGGACAGCAT CGATGTCGAC
GTCCCGCCGA ACCTGATCCC CGTCGGCGCC GCGCGAATCA CGCCCGACAT GCTGCCGCTC
GTCGCGCTGA CCCAGCCGCA GATCGACGCG ATCGCGCAGC AGGTAGACGG CGGCGCGACG
AACGTGCAGG ACATCTATCC GCTCGCGCCG CTGCAGGAAG GCATGCTGTT CCATCATCTG
CTGCACACGC AGGGCGACCT CTATCTGGAA CCGCATCTGC TCGCGTTCCG CACGCGCGAG
CGGCTCGAGC GGTTCCTGTC CGCGCTGCAA TGCGTGATCG ACCGCCACGA CGTGCTGCGC
ACCGGCTTCT TCTGGGAAGG CGTCCCCCAG CCGGTGCAGG TGGTGTGGCG GCGCGCGCGG
CTGCCGGTCG AATACGTCGA GCTGCCGGAC GGCCACGGCG ATGTCGCGAG CCAGCTCGAA
GCCCGCTGCG ATCCGCGCCG CCATCGCATC GACATCGGCC GCGCGCCGCT CGTGCACTGC
CACGTCGCGC ACGATGCGCG CAACGACCGC TGGGTGCTCG GCGTGCTGAC GCACCATCTG
GTCAGCGACC ATACGACGCT CGCGCTCCTC GCCGAAGAAG CGCAGGCGTT CGAGCAGGGC
CGCGGCGATG CGCTGCCGCC TGCGGTGCCG TTTCGCAACT TCGTCGCGCA CGCGCGCCTC
GGCACGAGCG AGCGCGAGCA CGAAGCGTTC TTCCGCGAGA TGCTGGGCGA CGTCGACGAG
CCGACCGCGC CGTTCGGCCT GCTCGACGTG CAGGGCGACG GCAGCGCGAT CGTCGAGCAC
CGGCGCGCGC TCGCGCCCGG GTTGTCCCGC TCGGTGCGCG CGCACGCGCG GCGTCTCGGC
GTGAGCGCGG CGAGCGTGAT GCACGTCGCA TGGAGCCTCG TGCTCGCGCG GACGGCGAAC
CGGCGCGACG TGGTGTTCGG CACGGTGCTG TTCGGCCGCA TGCAAGGCGG CGCGCACGCG
CATAGAACGA TGGGCCTCTT CATGAACACG CTGCCCGTGC GGATCGCGCT CGACGAGTCG
GATGTCGAGA CGAGCCTGAT CGCGACGCAC GATCGCCTCG CGCGGCTGCT GCGCCACGAG
CACGCGCCCC TCGCGCTCGC GCAGCGCTGC AGCGCGGTAC CCGCGCAGGC GCCGCTGTTC
ACGTCGCTGC TGAACTACCG CTATTCGCCG CACGAAGAGC AGGGCGACGC AACGGACGAC
GACGTTCAGT TCATCGCCGC GCGCGAGCGC AACAACTACC CGCTGACGAT GATCGTCGAC
GACACGGGCG AGGGCTTCGC GCTCACGGCG CAGGTGGACG CCTCGATCGA CGCGGCGCGC
GTCTGCGCGT TCATGCATAC GGCGCTCGAG CAGCTCGTGC GCGCGCTCGA CGACGCGCGC
GGCGCGGTGC TCGCCGAGCT CGACGTCCTG CCCGCCGACG AGCATCGGTG CGTGGTGTCG
GCCTGCAACG ATACGGATGC CGAACTGCCG GGCGTCGACT TCGTCGATCG CCGGTTCGAG
GCGCAGGCGG CGCGCACGCC CGAGGCGATC GCCGTCGCGT GCGGCGCGCA CGCGCTCAGC
TACGCGGCGC TGAACCGGCG TGCGAATCGC CTCGCGCACT ATCTGCGCGC GCACGGCGCG
GGCCCGGAGC GCGTCGTCGC GCTCGCGCTC GAGCGCTCGG TCGACATGAT GGTCGGGCTG
CTCGGCATCC TGAAATCGGG CAGCGCCTAT CTGCCGCTCG ATCCCGCGTA TCCGGCCGAG
CGGCTCGCGT ACATCGTCGA CGACGCGCGC CCCGCGCTGC TGTTGACTGA AGCCGCGCTG
CGGGACGACT GGCGAGACGC CGGCGCACCC GTCGTGCTGC TCGACGCGGA CGGGCCGGCG
ATCGACGCGT GTCCGGATCA CAACCCGGAC GCCGCCGCCG GCCGGGATGC GCGCACACTG
TCGTCGCTCG CGTACGTGAT CTACACATCG GGTTCGACGG GGCGCCCGAA GGGCGTGATG
ATCGAGCATC GAAATCTCGC GAACTTGCTC GGCGCGATGG GCGAGCAGCC CGGCATCGGC
GCGCACGACG TGCTGCTCGC GGTGACCTCG CTGTCGTTCG ACATCGCCGC GCTGGAGCTC
TTCCTGCCGC TGCTGCACGG CGCGCGCGCG GTGATCGCGG CGCGCGACGA CGCGGCCGAT
CCGGCGCGGC TCGCGCATCT GATCGAAAGC AGCGGGGCGA GCCTGATGCA GGCGACGCCT
TCGACGTGGC GCATGCTGGC GCAGCACGGC TGGCCGCGAT CGGCGCGGCC GCTGACGCTG
CTGTGCGGCG GCGAGGCGCT GCCGCCCGCG CTCGCCGAGC GGCTGCTCGC GCATGTCCCC
GCGATTTGGA ACCTGTACGG GCCGACGGAG ACCACGGTAT GGTCGACCGT GCGGCGCGTG
ACGACGCCCG TCGTCGACAT CGGAGGGCCG ATCGCCAACA CGCAGGTCTA CGTGCTCGAC
GAGCGGCTGC GCCCCGCGCC GATCGGCGTC TCGGGCGAGC TGTACATCGG CGGCGCGGGC
GTCGCGCGCG GCTATCTGAA CCGCCCCGAG CTCACGCGCG AGCGCTTCGT CGACGATCCG
TTCCGGCGCG GCGGGCGGCT GTACCGCACC GGCGATCTGG CGCGGCGGCG CGCGGACGGC
AACCTCGAGT ACCTCGGCCG CAACGATTTC CAGGTGAAGA TTCGCGGCTT CCGGATCGAG
CTCGGCGAAA TCGAGGCGCA GCTCGCGAAG GCGCACGGCG TGCAAGGCGT GGCGCTTGCC
GCGCGCGACA CGCCCACGGC AGACAAGCGG CTCGTCGCGT ACTACGTCGG CGACGCGAGC
GCCGCGGCGC TGCGCGAGCA CGCGGCCGCG CGATTGCCGG CGTACATGGT GCCGGCGGCC
TATGTGCGGC TCGCCGCGTG GCCGCTGACG CCGAACGGCA AGCTCGACCG CGCGGCGCTG
CCCGCGCCGG ACGACGAAGC GTACGCGCGC GCCGAATACG AAGCGCCGCG GGGCGAGCAC
GAGTGCAAGC TCGCGGCGAT CTGGCGGGCC GTGCTGCAGG TCGAACGGAT CGGCCGTCAC
GACGATTTCT TCGAGCTGGG CGGCCATTCG CTGCTCGCGG TGCGCGCGAT CACGGCGATG
CGCGATGCGT TCGGCAGCGA CACGAGCCTG CGCGACCTGT TCGCGCGGCC CGTGCTGAAA
GATCTCGCCG AACACGCGAG CACGGCCGCG CGTGCGCGCG ACGCGGCGAT CCCGAAGGCC
GCGCGCGGCG AGCCCGCGCC GATGTCGTTC GCGCAGCAGC GGCTGTGGTT CCTCGCGCGG
ATGGGCGGGC TCGGCGATGC GTATCACATG CCGATCGCCG TGAGGCTGCG CGGCGCGCTC
GACGTCGACG CGCTGCAGCG CGCGCTGAGC CGAATCGTGT CGCGCCACGA TGCGCTGCGC
ACGACGTTCG CGCTCGAAGG CGAGCAACCG GTTCAGCGCG TGCACGCGGA TGATGGCGCG
GGGCTGCGCT TGCGCATCGA CGATCTGCGC GGGTGCGCCG ACGCCGGCGC GCGGCGCGCG
CGGATCCTGG CCGGGCAGGC GAGCGAGCCG TTCGATCTGG CGCGCGGGCC GCTGGTTCGC
GGCGCGCTCG TGCGCGAGGC CGACGACGTG CACACGCTAT GCGTGACGAT CCATCACATC
GTGTCGGACG GCTGGTCGAT CGACGTGTTC TGCCGCGAGC TGAGCGAGTT GTATCGCGCA
TTCGCCGGCG GGCAGCCCGA CCCGTTGCCG CCGCTGCCGG TGCAGTACGC CGATTACGCG
GCATGGCAGC AACGCGGCAT CGGCGGTGCG GCGCTGCACG CGCAGGCCGA ATACTGGCGC
GATGCGCTCG CGGGCGCGCC GACGCTGCTC GAACTGCCGA CGGACCGGCC GCGTCCGCCG
CAGCCCGACT ATGCGGGCGC GACGGTCGGG CTCGCGCTCG ACGCGCCGCT GACGGCGGGC
TTGCGCGCGC TCGCGCGGCG TCACGGCGCG ACGCTCTTCA TGACCGTGTT CGCCGCGTGG
AGCGTGCTGC TGTCGCGCCT GTCGCGGCAA ACCGACGTGG TGATCGGCAC GCCGAGCGCG
AACCGCGGCC ATGCGCAGAT CGAGGGCTTG ATCGGCTTTT TCGTCAACAC GATCGCGCTG
CGCGTGGACC TCGACGGCGC GCCGACCGTG GCCGAGCTGC TCGCGCGCGT GAAGGCGCGC
ACGCTCGCCG CGCAGCAGCA TCAGGACATT CCGTTCGAGC ATGTGGTCGA GCGGGTGCAG
CCGGCGCGCA GTCTCTCGCA TAGCCCGGTG TTTCAGGCGA TGTTCGCGTG GCAGCACGCG
TCGCGCGGCG AGATGCGGCT CGAAGGGCTG CGCGCGGAGC CGCTCGACGA CGCGGCGCGC
ACGATCGCGA AGTTCGATCT GACGCTGTCG CTGCGCGAGA GCGGCGATGC GATCGACGGC
GGTCTCGAAT ACGCGAGCGC GCTGTTCGAG CGCGCGACGA TCGAGCGCTT CGCCGGCTAC
CTGCGGCGTT TGCTGGAAGG GATGGTCGCC GATGACACGC AGCGCGTCGA TGCATTGCCG
ATGCTGTCGC GCGACGAACG GCGCGATCTG ATCGAGCGCC GGAACGCGAC CGCGCGGCCG
TATCCGGCGA ACAGCGGCGT GCATCGGCTG TTCGAGGCGC AGGCGGCGCG CACGCCCGAT
GCGACCGCGA TCGTCGACGG GGCGACGACG CTCGACTATC GCGCGCTCGA TGCGCGCGCG
AACCGCATCG CACACGCGCT CGCGCACGCC GGCGTGCGCG CGGGCGATCG CGTCGCGCTG
CATCTCGAGC CGTCGATCGG GCTCGTCGCG GCGCAGCTCG CGGTGCTCAA GCTCGGCGCC
GCCTACGTGC CCGTCGATGT CGGCAATCCG CCCGCGCGCA AGGCGTTCGT CGCGCAAGAC
AGCGGCGCGC GGCTCGTGCT CGGCGACGCG GCGCTCGACT GGCCGGCGGC GGCCGGCGTG
CCGCAGCGCG ATCTGGCGGC GCTGCTTGCC GGGCCGTGGC CGTCGGACGC GCCCGCTCGC
GCGCCGCAGT GCGGCGGCGA CACACCGGCA TACGTGATGT ACACGTCGGG CTCGAGCGGG
CAGCCGAAGG GCGTGCTCGT CACGCATCGC GGCATCGCGC GGCTGGCGGT GAACAGCGGT
TATGCGACGT TCGACGCGTC GGACCGGTTC GCGTTCGCAT CGAACCCGGC GTTCGACGCG
TCGACGTTCG AAGTGTGGAC GGCGCTTCTC AACGGCGCGA GCATCGGCAT CGTGAAGCGC
GACGATCTGC TCGATCTCGG CGCGCTCGCC GGCAAGCTGT CGTCGATCGG CGTCACCTGC
CTGTTCCTCA CGACGGCGCT GTTCAACCGG TGCGTGTCGT TCGATCCGGC GATGTTCGCG
CGGCTGCGCT GCGTGATCTC GGGCGGCGAG CGCGCCGATC CGGCGGTCTA CCGGAAGGTG
ATGGAAGCGG GCCCGCCGCG CCATCTGCTG AACGCGTACG GCCCGACCGA GACCACCACG
TTCGCCGCGG TCTGGGAAGC CGAGCCGCGC ACGCTCGCCG CGCAGGCCGC GCCGATCGGG
CGGCCGATCG GCAATACGTC GGTCTACGTG CTCGACGCGT ACGGCGCGCC GGTGCCCGTC
GGCGTGACGG GCGAGATCCA CATCGGCGGC CCGGGTGTCG CGCAAGGCTA CCTGAACCGA
CCGGCGCTTT CGGCCGAGCG CTTCGTGCGC GATCCGTTCG TCGGCGGCGA CGCGCGGATG
TACCGCACGG GCGACCTCGG CCGATGGCGG CCCGATGGCA TGCTCGACTG CATCGGCCGC
GCCGACTTCC AGGTGAAGAT TCGCGGCTTT CGGATCGAGC TCGGCGAAAT CGAAGCGTGC
CTGCTCGAAC ACGGCGCGCT CGCGCAGGCG GCGGTGCTCG CGCGCGACGA CGGCGGCGAC
GGCGGCAAGA CGCTCGTCGC GTATTACGTG CCGCGCGCGG GGCACGAGGA TGGCGCGCCC
GCGCTGCGCG CGCATCTGGC CGCACGCCTG CCCGAATACA TGGTGCCCGC CGCGTACGTG
CGGCTGCCGG CGATGCCGCT CACGCCCAAC GGCAAGCTCG AGCGCCGCGC GCTGCCCGCC
CCCGACGAGC GATCGTACGT GCGGCGCGAC TACGCGGCGC CGCAGGGCGA GATCGAGACG
ACACTCGCGC GGATCTGGGC GGAGCTGTTC GGCATCGAGC GCGTCGGCCG GCACGACGGC
TTCTTCGAAC TCGGCGGGCA TTCGCTGCTC GCGGTGCGGA TGGTCGCGCG CGTGCACGAT
GTGCTGGGCG TCGAGGTGCC GCTGCGCGCG CTGTTCGCCG ATCCGGTGCT GCACGTGTTC
GCGTCGGCGG TCGCGCGCGC GTCGACGCGC CAGGCGTCGT CGAATCTCGT CGCGTTCCGC
AGCGCGGGCA CGGCCGCGCC GCTCTTCTTC ATTCATTCGG GGCTCGGCGA GATCGGCTTC
GTCGGCGATC TGCTGCCCGG CATCGCGCCG GAGATTCCGG TGTACGGCTT CGCGGCGGTC
GGTTTCCTCG CGGGCGAGAC GCCGCACGCG ACGATCGAGG AGATGGCCGC GCAATATGTC
GACGCGATGC GGCGCGTGCA GCCGCATGGG CCGTATCGGC TCGCCGGGTG GTGCGCGGGC
GGCAACATCG CGTTCGAAAT GGCCCATCAG CTGATCGCGG CCGACGAGAC GGTCGAGTTC
CTCTGCATGA TCGATTCGCC GACATCCGCG CCGATCGACC GCTCGGTCAC CGCGTGCGTG
CTCGCGCGCA TTCCCGACGA CATTCCGGAG GCGTTGCGCA CGCGGCTTCA TGCGCTCGGC
GATGCCTTCG ACGTGCGCGG CATGCTGCAC GCGTGCCAGG CGGCGGGCAT GCTGCCGATC
GATCTGCCGA CCGGGCTGAT GGAGCGGCAC GTCGCGGTGC AATACGCGAT CAAGCATGCG
AAGCTGAACT ACGTGCCGCC GCGTCTGCCC GTCGACGTGA TTCACTTCGT CGCGCAGGAC
GAGCCGATGT GGCGCAACGG CTGGGCGATG GACGGCTGGC ACGACGTCGC GGACCGGGTG
ATCTGCCTGC CCGCGAGCGG CGACCACATG ACGATGGTGG CGGCGCCGCA CGCGGAGCAA
CTGGGGCGGC GCATCACGGA TGCGCTCGCC GTGCACGGCG GGCCGCGCGC GGATGGCGCG
GAGCGCGGCT ACGCGCCGCG CATCGCGATC CAGACGGCCC CGCGCGACGC GCGCGCGCCG
ACGCTCTTCT GCATTCCGGG CGCCGGCGCG AGCGTGACGA CGTTCTCGAC GCTCGCGCGG
CATCTGCCGG CGACGTTCTC CGTGGACGGG CTGCAGCCGC GCGGCCTGTG CGGGACGATG
GTGCCGTATC TCGACGTCGA GACGGCCGCG CGCGCGTACC TGAGAAGCAT CCGGAAAGCC
GCGCCGCGCG GGCCGTACCA CCTCGTCGGC CATTCGTTCG GCGGCTGGGT GGCCTACGAG
ATCGCGTGCC GGCTGCAGGA GCAGGGCGAG CGCGTCGCGA CGCTGATGCT GCTCGACACC
GAGCGGCCCG GCGCGACCGA CATCGTGCGC GGGCGCAAGA CGCGCGTGGA CGCGCTCGCG
AAGCTCGTCG AGCTGTACGA GATGCATCTG GGCCGCCCGC TCGGTGTGAG CCGCGACGAT
CTGGCCGCGC TCGCGCACGA TGCGCAGATC GAGCATCTGC GCGCGGCGCT CGTGCGCGCG
AAGATCCTGC CGCCGTCCGT GCATCCGAAC GTGCTGCTTG GCGTCGTGCG GGTGCTCGAG
ATGAACGTGA ACACGCCGTA TCGGCCCGCG GGTCTCTACG CGGGGACGAT GCACGTCGTG
CTGATTGCGA ACGCGAAAGC GGACGCGGAC CTCGACGCGT GGCGCGACGA GCAGGCCGAG
CAGTGGCGCG GCCTCGCGGA CGACGTGCGG ATCGTGCGCG CGGGCGGCAA TCACATGACG
ATGCTGCAGC CGCCGCACGC GGCGTCGATC GCGGCGCTGC TCGAGCGCAC GGCCGGCGCG
CCCGCGCGGC TCGCGCAGGT GCACTAG
 
Protein sequence
MPAAERTQVL HGWNETGRAY ARDACLHQLF EAQVSRTPEA AAVICGDETL SYTDLDARAN 
RLAHYLRGQG VGPDTRVGLA LGRGVEMMTG LLAILKAGGA YVPLDLGYAS ERLRAILDDS
RPAIVLADAA GRAALDALAG APPIADLHAD ASRWSALPST PPRVEGLTPR HLAYVIYTSG
STGQPKGVMV EHASVVNLWR ALDEAIYRTH PSARRVSLNA SIAFDSLVKQ WVQLLSGRTL
VVVPEPVRFD GRRLLDAIGR DRIDVFDCTP SQLALIEGAR GPEDEAYPQV TLVGGEAIGE
GMWSELASAS SRTYYNVYGP TECTVDATLA RITAEHAPHI GGPLANVRAY VLNERLSPAP
VGVRGELYIG GAGVARGYLN RPELTRERFI DDPFVAGGRL YKTGDLARWR TDGSLEYLGR
NDFQVKIRGF RIELGEIEAQ LAKVTGVREV VVLARDSAAA VHDSATEHAT PNALSPSPET
STATATATAT ATATEKRLVA YYTGDADVAA LRAQAAQHLP SYMVPSAYVR LDAWPLTPNG
KLDRRALPEP ADDAYARAEY EAPQGAKEEA LAAIWRDLLQ VDRISRHDNF FQLGGHSLQA
ISLGDMMRER GLHADVRTLF NAETLAALAA QSGTDSIDVD VPPNLIPVGA ARITPDMLPL
VALTQPQIDA IAQQVDGGAT NVQDIYPLAP LQEGMLFHHL LHTQGDLYLE PHLLAFRTRE
RLERFLSALQ CVIDRHDVLR TGFFWEGVPQ PVQVVWRRAR LPVEYVELPD GHGDVASQLE
ARCDPRRHRI DIGRAPLVHC HVAHDARNDR WVLGVLTHHL VSDHTTLALL AEEAQAFEQG
RGDALPPAVP FRNFVAHARL GTSEREHEAF FREMLGDVDE PTAPFGLLDV QGDGSAIVEH
RRALAPGLSR SVRAHARRLG VSAASVMHVA WSLVLARTAN RRDVVFGTVL FGRMQGGAHA
HRTMGLFMNT LPVRIALDES DVETSLIATH DRLARLLRHE HAPLALAQRC SAVPAQAPLF
TSLLNYRYSP HEEQGDATDD DVQFIAARER NNYPLTMIVD DTGEGFALTA QVDASIDAAR
VCAFMHTALE QLVRALDDAR GAVLAELDVL PADEHRCVVS ACNDTDAELP GVDFVDRRFE
AQAARTPEAI AVACGAHALS YAALNRRANR LAHYLRAHGA GPERVVALAL ERSVDMMVGL
LGILKSGSAY LPLDPAYPAE RLAYIVDDAR PALLLTEAAL RDDWRDAGAP VVLLDADGPA
IDACPDHNPD AAAGRDARTL SSLAYVIYTS GSTGRPKGVM IEHRNLANLL GAMGEQPGIG
AHDVLLAVTS LSFDIAALEL FLPLLHGARA VIAARDDAAD PARLAHLIES SGASLMQATP
STWRMLAQHG WPRSARPLTL LCGGEALPPA LAERLLAHVP AIWNLYGPTE TTVWSTVRRV
TTPVVDIGGP IANTQVYVLD ERLRPAPIGV SGELYIGGAG VARGYLNRPE LTRERFVDDP
FRRGGRLYRT GDLARRRADG NLEYLGRNDF QVKIRGFRIE LGEIEAQLAK AHGVQGVALA
ARDTPTADKR LVAYYVGDAS AAALREHAAA RLPAYMVPAA YVRLAAWPLT PNGKLDRAAL
PAPDDEAYAR AEYEAPRGEH ECKLAAIWRA VLQVERIGRH DDFFELGGHS LLAVRAITAM
RDAFGSDTSL RDLFARPVLK DLAEHASTAA RARDAAIPKA ARGEPAPMSF AQQRLWFLAR
MGGLGDAYHM PIAVRLRGAL DVDALQRALS RIVSRHDALR TTFALEGEQP VQRVHADDGA
GLRLRIDDLR GCADAGARRA RILAGQASEP FDLARGPLVR GALVREADDV HTLCVTIHHI
VSDGWSIDVF CRELSELYRA FAGGQPDPLP PLPVQYADYA AWQQRGIGGA ALHAQAEYWR
DALAGAPTLL ELPTDRPRPP QPDYAGATVG LALDAPLTAG LRALARRHGA TLFMTVFAAW
SVLLSRLSRQ TDVVIGTPSA NRGHAQIEGL IGFFVNTIAL RVDLDGAPTV AELLARVKAR
TLAAQQHQDI PFEHVVERVQ PARSLSHSPV FQAMFAWQHA SRGEMRLEGL RAEPLDDAAR
TIAKFDLTLS LRESGDAIDG GLEYASALFE RATIERFAGY LRRLLEGMVA DDTQRVDALP
MLSRDERRDL IERRNATARP YPANSGVHRL FEAQAARTPD ATAIVDGATT LDYRALDARA
NRIAHALAHA GVRAGDRVAL HLEPSIGLVA AQLAVLKLGA AYVPVDVGNP PARKAFVAQD
SGARLVLGDA ALDWPAAAGV PQRDLAALLA GPWPSDAPAR APQCGGDTPA YVMYTSGSSG
QPKGVLVTHR GIARLAVNSG YATFDASDRF AFASNPAFDA STFEVWTALL NGASIGIVKR
DDLLDLGALA GKLSSIGVTC LFLTTALFNR CVSFDPAMFA RLRCVISGGE RADPAVYRKV
MEAGPPRHLL NAYGPTETTT FAAVWEAEPR TLAAQAAPIG RPIGNTSVYV LDAYGAPVPV
GVTGEIHIGG PGVAQGYLNR PALSAERFVR DPFVGGDARM YRTGDLGRWR PDGMLDCIGR
ADFQVKIRGF RIELGEIEAC LLEHGALAQA AVLARDDGGD GGKTLVAYYV PRAGHEDGAP
ALRAHLAARL PEYMVPAAYV RLPAMPLTPN GKLERRALPA PDERSYVRRD YAAPQGEIET
TLARIWAELF GIERVGRHDG FFELGGHSLL AVRMVARVHD VLGVEVPLRA LFADPVLHVF
ASAVARASTR QASSNLVAFR SAGTAAPLFF IHSGLGEIGF VGDLLPGIAP EIPVYGFAAV
GFLAGETPHA TIEEMAAQYV DAMRRVQPHG PYRLAGWCAG GNIAFEMAHQ LIAADETVEF
LCMIDSPTSA PIDRSVTACV LARIPDDIPE ALRTRLHALG DAFDVRGMLH ACQAAGMLPI
DLPTGLMERH VAVQYAIKHA KLNYVPPRLP VDVIHFVAQD EPMWRNGWAM DGWHDVADRV
ICLPASGDHM TMVAAPHAEQ LGRRITDALA VHGGPRADGA ERGYAPRIAI QTAPRDARAP
TLFCIPGAGA SVTTFSTLAR HLPATFSVDG LQPRGLCGTM VPYLDVETAA RAYLRSIRKA
APRGPYHLVG HSFGGWVAYE IACRLQEQGE RVATLMLLDT ERPGATDIVR GRKTRVDALA
KLVELYEMHL GRPLGVSRDD LAALAHDAQI EHLRAALVRA KILPPSVHPN VLLGVVRVLE
MNVNTPYRPA GLYAGTMHVV LIANAKADAD LDAWRDEQAE QWRGLADDVR IVRAGGNHMT
MLQPPHAASI AALLERTAGA PARLAQVH