Gene BURPS1710b_2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2086 
Symbol 
ID3689692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2260515 
End bp2270378 
Gene Length9864 bp 
Protein Length3287 aa 
Translation table11 
GC content76% 
IMG OID637728543 
Productputative siderophore related no-ribosomal peptide synthase 
Protein accessionYP_333482 
Protein GI76808735 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01720] non-ribosomal peptide synthase domain TIGR01720
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAGCT TTCCGACTGC GCTGCATCAT CGAATCCACG CGCTTGCGCG ACTCGACCCC 
GACGCGCCCG CGCTCGCGTC GTTCGCGCCC GACACCGTGC GTCTCACGCG CGGCGAACTC
GACGAGCGCG CCGCGCGCCT CGCCGCGCAA CTGCGCGCGG CGGGCGTGGG CGCGGAGGTG
CCGGTCGGCG TGTGCGTCGC GCGCTCGTGC GACCTGTTCG TCGCGCTGCT CGCGGTGATG
AAGGCGGGCG GCGCGTTCGT CGCGCTCGAT CCGCGCCATC CGGCCGCGCG GCTCGACTGG
GTCGCGCGCG ACGCGGGGCT CGCGCACGGC ATCGTCGACG CATCGGCCGA CGCCGCGATG
CGCGCGCGCT TCGCGCGGTG CTTCGACGTC GGCGGCGTCG CCGCGGCGGA CCCGGCCGCG
CCGCGCGAGC ACGGCGGCGA CGTGCATCCG CGCGCGGCCG CGTACATGAT CTATACGTCC
GGCTCGACGG GCACGCCGAA GGCGGTCGTC GTCGAGCACG GCCCGCTCGC CGCGCATGGC
GACGCGCTCG CCGAATCGCT GCCGATCGGG CCCGACGATC GCGTGCTGCA TTTCGCGTCG
GTGAACTTCG ACGTCGCGAT CGAGGCGTGG CTCGTGCCGC TCGCGGTCGG CGCCAGCGTC
GTCATCAGCG ATCCGCCGCC GTTCACGCCC GACGCCGCGC ACGCGCTGAT CTCGCGCGAG
CGCGTGACGA ACACGACGCT GCCGCCCGCG TACCTGCGCG AGTTCGCGGC CGTGTGCGCG
CGCGAAGGCG TGCCGCCGTC GCTGCGCGTG CTGCTGTTCG GCGGCGAGGC GATGTCGCAG
GACGCGTTCG ACGAGATCCG TCGCGTGTTC CCCGCGATTC GCCTCGTCAA CGGCTACGGG
CCGACCGAAA CCGTGATCTC GCCGATGCTC TGGCCGGTCG CGCCGGGCAC GACGCCCGCG
CTCGACGCGG GCAACGGCTA CGCGTCGCTG CCGATCGGCT GGCCGATCGG CCGGCGCGTC
GCGCGCGTCG AGCGCGCCGA CGGCACGGTC GCGCGCGGCG AAGCGGGCGA GCTGCTGCTC
GGCGGCGCGT GCCTCGCGCG CGGCTATCAC GGCCGCGCCG CGCTGACGGC CGAGCGCTTC
CTGCCGGATC CGGCGGGCGA GCCCGGCGCG CGGATCTACC GCACGGGCGA CCTCGCGCGC
GAGCGCGCGG ACGGCGCGTT CGACTATCTC GGCCGGATCG ACGACCAGGT GCAGGTGCGC
GGCGTGCGCG TCGAGCCGGC CGAGATCGCC GCGTGCCTGC TCACGCATCC GGGCGTGCGC
GACGCGGGCG TGCTCGCCGA GACGGCGGGC GGGCGCACGC AACTGATCGC GTGCGTCGCG
CTCGCGGCCC CGCCGCGCGA AACGCCGTCC GGCGGCGACG CGCGCGGCGA TGCGCCGCCG
GACGACGACG CGCTGCGCGC GCATGTGGCC GCGCATCTGC CGGCCGCGTG GCTGCCGCAC
CGGTTCGTGC GCTTCGACAA GCTGCCGTAC ACGCTGAACG GCAAGCTCGA CCGCGTCGCG
CTGCGCGACG CGGTCGCGGC GCGGCCGGCC GAGGCCGCCG CGGATGTCGA CGCGTCGCGC
ACCGACACCG AGCGCCGCGT CGCCGCGCTC TGGCAGCGGC TCCTGAACGA CCCGGCGCCG
ATCGGCCGCG TCGACCGCTT CTTCGCGCGC GGCGGCGATT CGCTCGCCGC GATGCAACTG
CAGACGGCGA TCCGGCTCGA CTGGCGCGTG AACCTGCGGC TCGACACGCT CTTCGACGAC
GCGCCGCTCG CCGAGCTCGC CGCGCGCATC GATGCGGCCG AGCGCGAAGC GCGGCAGCCG
GCCGCGATCG GGACGACGGC GCGGCGCGCG ATCGCGGCCG ATGCAATCGA CGCGCGCGCG
ACGGGCCAGA TGGGCAGGAC GGGCGCGGCG GGTGAGGCGA CCGAGGCGGC CGGCGCGCTC
GAGCGTCCCG CTTCGTTCGC GCAGCAACGC TTCTGGGTGC TCGCGCGCAC GCAGGACGCC
GGGGCCGCGT ATCACGTGTC GTTCCACTGG GACATCGACG GCGCGCTCGA CCTCGGCACG
CTGCAGCGCG CGCTCGACAT GCTGATCGCG CGCCACGAGG CGTGGCGCAC GACGCTCGTC
GAGAACGACG ACGGCGTCGT CCTCCAGCGC ATCCATGCGG CGCTGCCGGT GCGGATCGCC
GCCGTCGACC TGCGCGGCGA GGCCGGCGCG TCGCGCGCCG CGCGTCTTGC CGAGCTGACC
GAGCGCCATG CGGGCGCGCC GTTCGATCTG TCCGACGGCC CACTCGTGCG CGCGCTGCTC
GTCACGCTCG CCGACGGCGC GCAGCGTTTC CTGCTGACGA CGCACCACGC GGTGAGCGAC
GGCTGGAGCT CGCGCTGCGC GTTCGCCGAG CTGACGGCCG CCTACGCGGC ACTCGCCGAA
GGCCGCGCGC CCGAGCTGCC GGCGCTGCCG ATCCAGTATG CGGATTACGC GCAATGGCAG
CGCGACGCGC TCGATGCGCA CGAGACCGCG CGCGAGCTCG CCTACTGGCG CGGCGCGCTC
GACGGCGCGC CCGCGCCGCT CGCGCTGCCG CTCGACCGCC CGCGCGCGGC CGAGCGCGAC
TATCGCGGCG GCCGGCTCGC GCGGCGTCTG TCGCCCGCCG CGTCGGACGC GGTGCGCGCC
GCCGCGCGGC GGCTGCATGC GTCGCCGTTT ACGGTGCTGC TCGCCGCGTT CGACGCGTGG
CTCTTTCGCC TGACGGGCGA GCGCGATCTC GTCGTCGCCG CGCCGATCGC GCAGCGCGCG
CGGCCGGAAA CCGCGCCGCT CGTCGGCCTG TTCCTCAACA CGCTCGCGCT GCGCGCGCGC
GTGTCGCCCG CGCAGTCGTT CGAATCGCTC GTCGCCTCGG TGCGCGACGC CGCGTTCGGC
GCGTTCGCGC ATCAGGACGT GCCGTTCGAC AAGGTGATCG ACGCTGTGAA GCCCCCCGTG
CGGCGCGGCG ACGAGTGGCT GCGCGTGAAG TTCGCGCAGC AGTTCGATCT GGAGTGCCGC
GCGTCGCTGC CGGGCGCGGC CGTGCGGATG GCGCCGGGCC TCGACACGGC CGCGCGCTTC
GATTACGCGC TCGATTTCAC CGACGACCCG CGCGGCATCG AATTCGTCGC TGCTTACGCG
CGGGACGGGA TCGACGAGGC GACCGCGCAC GCCTGGCTCG ACAGCTTCGC CGCGCTCGTC
GAGGACGCGG TGCGCGAGCC GCGCCGCCCG ATCGCCGCGC TCGCGGTTGC GCAGGCCGGC
GCGCCGTGCG CGCTCATCGC CGGCCGGCCG CTCGCGACCG CGCCGGACGT GCTCGCGCTC
TTCGCGCGCG AGGCGGCCGA GCATCCGCAT CGCGTCGCGC TCGCGGATGC CGACACGCGG
CTCACGTTCG CCGAGCTCGA CGACGCGTCG AACCGCGTCG CGCTCGCGCT GCGGCGCGAC
GCCGCGCGCG ACGAAGCGCC CGGCGCGGAA ACGCCCGTCG CGATCTGCAT CGAGCGTTCG
GCGCGCTTCG TCGTCGCGTT CCTCGGCGTG CTGAAGTCGG GCGCGTACGC GGTGCCGCTC
GATCCGGCGT CGCCGTGCGA GCGGATCGCC GCGGCGCTCG CCGCGTGCGG CGCGCGCCGG
ATGCTCGCGG CAGGCGCGCT CGACGCGCTC GGCGAGTTCG ACGGCGTCGC GGTGCAGGAC
ATCGACGCAT GCGCGCACGA CGCGTCGCTC GCGAACGCGG CCGCGCCGCG CGTGCCGCCG
CAGCCGGAGC AGGCCGCGTA TCTGATTTTC ACGTCGGGCT CGACGGGCGC GCCCAAGGGC
GTCGTCGTGC CGCATCGCGC GCTCGCCGAC TACGTCGCCG GGATGCTCGA CGAGCTCGCG
TTCGCGCCGC ACGCGTCGAT GGCGATGGTG TCGACGGTCG CGGCCGACCT CGGCCACACG
GCGCTGTTCG GCGCGCTGTG CTCGGGCCGC ACGCTGCATC TGCTGCCCGC GCAAGCGGCG
TTCGATCCTG ACCGCTTCGC GCACGAGATG GCGACGCGCG AGGTCGGCGT GCTGAAGATC
GTGCCGAGCC ATCTGCACGC GCTGCTCGAC GCGCAGCGCG CGGCCGACGT GCTGCCCGCG
CACGCGCTCG TCACGGGCGG CGAGGCGCTG CCTTGGGCGC TCGTCGAGCG CATCGCCGCG
CTGAAGCCGG ACTGCCGCGT GATCAATCAC TATGGGCCGA CCGAGGCGAG CGTCGGCGCG
CTCGTGTGCG ACACGTCCGC ACCCGCGCAG GCGGACCTGC GCGCGGCCGC CGCGTCGTCG
CCCGGCGAAG CGGCGCGCGG CGTGCCGCTC GGCCGGCCGC TGCCGAACGC GCACGCATGC
GTGCTCGACG CGTACGGCTC GAGCGTGCCC GTCGGCGCGA TCGGCGAACT GTACCTCGGC
GGCCCGGGCC TCGCGCGCGG CTATCTGGGC CGCGCGGCGG CGAGCGCCGA GCGCTTCGTT
CCGCACCCGC ACGTGGCGGG CGCGCGCGTC TACCGGACGG GCGACCGCGT GCGGCTGCGC
GCCGACGGCC GGCTCGATTT CCTCGGGCGT CTCGACGACC AGGTGAAGAT CCGCGGCTAC
CGGGTCGAGC CCGGCGAGGT GAGCGCCGCG CTGCGCGCGC TGCCCGGCGT CGCGCAAGCA
GAAACGCTCG CGCTCGAGCA CGAAGGGCGG CTGCGCCTCG CCGCGTTCGC GACGCCCGAG
GCCGGCGCGC GGATCGCGGC CGATGCGCTG CGCGACGCGC TCGCCGCGCG CCTGCCCGAC
TACATGGTGC CCGCCGCGCT CGTCGTGCTC GACGCGCTGC CCGTGACCGC GAACGGCAAG
ATCGATCGCG CGGCGCTGCG CGCGCGCGCG GCGGCGCCCG CGCCGGCGAC GGCGGGCGAC
GAGGACGCGC CGCAAGGCCC GATCGAGGCG ACGCTCGCCG AGGTCTGGCG CGACGTGCTG
AAGGCGGCGC GCGTCGGCCG CCACGACAAC TTCTTCGAGC TCGGCGGCGA TTCGATTCTC
GTGCTGCAGG TGATCGCGCG CGCGCGCAAG CGCGGCGTCA AGTTCACGCC GAAGCAGTTG
TTCGACGGCC CGACGCTCGC CGAGCTCGCG CGCGTCGCCG TGGCGATCGA GGCCGACGCG
CCGGCGAGCG GCGCGGCGCA TGGCGCGGCC ATCGGCGCGA ATGCCGCCGC CGCCCGCCGC
GACGAAGCGG TGCTCACGCC CGCGCAGCTG CGCTTCTTCG CGCTCGATAT TCCGCGCCGC
GGCCACTGGA ATCAGTCGAT CGCGCTCGAC GTCGCCGGCG CGTTCGATTT CGACGCCTTC
GCGCGCGCGT TCGACGCGTT GCTCACGCAC CACCCGGTAT TCCGCGAACG CTTCGCGCCG
ACGGGCGACG GCGGCGGGTG GCAGCGCTCG GCCGCGCCGC GCGCGTTCGA CACGCTGCCG
CTCGCGGCCG CCGCCGCGCG CGACGAAGCG GATGCGCTCG CGCAGTTCGA CGCGCTGCAA
GCCACGCTCG ACCTGACGCA CGGCCCGCTC GCGTGCGCGT TCGCCGCGGT GCTGCCGAGC
GGCGCGACGA AGCTGTATCT GGCGATCCAC CACGCGATCG TCGACGGCGT GTCGTGGCGC
GTGCTGCTCG ACGATCTCGA CGCCGCGTAC CGCGCCGCGT GCGAGCGCCG CGCGGTGCGG
CTCGGGCCGA CGGGCGCGAG CGCGTCCGAA TGGGCGGCGC GTCTTGCGCG CGCGGCCCGC
GATCCGGCCG GGCCGTTCGC GGGCGAGCTG CCGTACTGGG CGGCGCTCGC CGCGCCGCAC
GACGATCTGC GGCCGGATCG CCCGGATGCG GCGGCGACCA ACGCGCACGC CGACGTCGTG
ATCCAGACGC TCGACGCGGC GCTCACGCGC GAAGTGCTCA CCGACGCGAA CGCCGCGTAC
CGCACGCAGG CGGTCGAGCT GCTGATCGCG GCGCTCGTCG CCGCGCTCGG CCAACACACG
GGCGCGGCCG CGTGCCGGCT CGAGCTGGAG GGGCACGGCC GCGAGGCGCT CTTCGACGAA
CTCGACGCGA GCCGCACGCT CGGCTGGCTG ACGAGCCACT ATCCGGTGGC GTTCGCGGTG
GAGGCGACGC CCGCCGCGAC GCTCGCCGGC GTGAAGGATG CGCTGCGCGC GGTGCCGAAC
AAGGGTCTCG GCTTCGGCGT GCTGCGCCAC TACGGCGACG ACGCGACGCG CGCCGCGCTC
GCGCGGGTCG CCCGCCCGCG CGTGACGTTC AACTACCTGG GCCAGTTCGA CGCGCCGCGC
GACGCCGCGC TCGTGCCGCG CTTCGGCGGC GCGGGCCGCG AGCGCGATCC GGCGGGGCCG
CTCGGCAACG CGCTCGCGAT CCACGCGTAT GTCGGCGCGA ACGGCGAGCG CGCGCTGAAG
GTGCACTGGG TGTACGGCGC GACGCAATTC GACCGCGCGA CGATCGACGC GCTCGCCGCG
CGGTTCGACG CCGCGCTGCG CGCGCTCGCC GCCGCGTGCC GCGCGCGCGT CGCCGAGCGC
GGCGCGGGCG CGACGCCCGG CGACTATCCG CTCGCGCGCG CGGGCGGCCT CACGCAGGCG
GCGCTCGACC GGCTGCCGTT CGATGCGCGC GCGATCGACG ACATCTATCC GCTGTCGCCG
ATGCAGCAGG GCATACTGTT CCATTCGCTG TTCGCGCCGG AGCGCGCGAC GTACGTGAAC
CAGCTCGTCG CGACGCTCGT CGATCCCGAC GTCGAGCGGC TGCGCGCCGC GTTCGACGCG
GCCGTGCCGC GCCACGACAT CCTGCGCACC GGCTTCGCCG CGCACGAGGC GGCGCCGATG
CAGATCGTCC ACCGCCACGC GCGCATGCCG GTCGAGATCG TCGACTGGCG CGGCGCGCAT
GCGTCGCCCG CGGCGCTCGA CACGGCGCTC GACGCCTGGC TCGCCGCCGA CCGCGCGCGC
GGCTTCGATC TCGCGGCGCC GCCGCTGATG CGCGTGACGC TGATCCGCAC GGACGACGCC
GACTGGCGGC TCGTCTGGAC CCGCCACCAT CTGCTGCTCG ACGGCTGGAG CACCGCGCGC
CTGTTCGCGG ACGTGCTGCG CGACTACATC GAGCCGCCGC GCGCGAATCC GTTCGCCGCG
CCGGCGCGCA CGCGCTACCG CGATTTCATC GCGTGGCTCG CGCGGCGCGA TGCGCAGGCG
GACCGCGCGT TCTGGCTCGG CCGCCTCGCA CGGCTCGACG AGCCGACGCA CGTCGCCGAG
CGCGCGGCCG CGCATGAGGC GGCCGGCCGC GCGAACTGGC GCGCGACGCT GCCCGCGGCG
GACACCGCGC GCATCGGCGA AGCGGCGCGC CGCATGAAAG TGACCGTCAA CACGATCGTG
CAGGGCGCGT GGGCGCTCGC GCTGCAGCGC ATCACGCACC GCCGCGCGGT CGCGTTCGGC
GCGACGGTCG CGGGACGCCC GCACGCGCTG CCCGACGTCG ACACCGTGCT CGGCCTGTTC
ATCAACACGC TGCCCGTGAT CACCGCGCCG TTGCCGCAGC TCGCCGCGCG CGACTGGCTC
GCGAGCCTGC AGCGCGACAA CGCGGCGGCG CTCGAGCACG CGCACACGCC GCTCTACGAG
ATCCAGCAGT GGGCGGGCCT GGGCGGCGCG CTGTTCGACA CGCTTGTCGT GTTCGAGAAC
TACCCGGTCG ACGAAGCGTG GCAGGGGCGC GATGCGCGCG CGCTGCAAAA GCGCGACCTG
CGCAACATCG AGGCCACCGA TTTCGCGGTG ACGCTCGTGA TCGAGGCGGG CGACACGCTC
GCGATCGACT ACGGCTACGA TCCCGCGCGC ATCGGCCCGG CGCGCGTCGA GGCGCTGCAC
CGCGCGTTCG CCGCGTGCAT CGCGGGGCTC GTCGACCATC CGGACGCGCC GCTCGGCACG
ATCTCGTGCG CGAGCGCCGA CGATCTCGCG CTGATCGCGC GCGCCAACGC GACCGAGCTC
GACTGGCCCG CCGCGCAGCG CGCGCCGCTG TTCGCGCAAT TCGAAGCGGC CGCGCGCGCG
CGGCCCGACG CGATCGCGCT CGAATGCTTT GCCTCCTCCG ACGGCGGCGA CGGCGCGCGC
GCGCAGATGC GCTACGGCGA GCTCGACGCG AAGGCGGACC GCGTCGCGGC GGCGCTCGCC
GCATCCGGCG TGCGGCCCGA TTCGGTCGTC GCGCTGTGCG TCGAGCGCTC GTTCGACATG
GTCGTCGCGC TCGTCGGCAC GATGAAGGCG CGCGCCGCGT ACCTGCCCGT CGATCCCGAC
TATCCGGCCG AGCGGATCGC GTATCTGCTC GGCGACGCGA AGCCGCCCGT GGTGATCACG
CAGGCGCATC TGCGCGCGCG CGTCGACGCG GCGCTCGCGG GCGCGGATGC CGCCGTCGTC
ACCGTCGACG AACTGCTCGC GCGCGCGGCC GGCGCGGAAC CCGAAGCCGA GCGCGTCGCG
GCGGCGGCCG ACGTCGCGCC CGGGCAGCTC GCGTACCTGA TCTACACGTC CGGCTCGACC
GGCCAGCCGA AGGGCGCGGG CAACACGCAC GGCGCGCTCG CGAACCGGAT CGCCTGGATG
CAGCGCGCGT ACCGGCTCGC GCCCGACGAC GTCGTGCTGC ACAAGACGCC GTTCGGCTTC
GACGTGTCGG TGTGGGAGTT CGTCTGGCCG CTCGCCGTCG GCGCGAAGCT CGCGATCGCG
GCGCCGGGCG ATCACCGCGA TCCGGCGCGC CTCGTCGCCG CGATCGACGC GCATCGCGTG
ACGACGCTGC ACTTCGTGCC GTCGATGCTT GCCGCGTTCG TCGCGTATCT CGACGATTTC
GGCGCGGCCG CGCGCTGCGC GAGCGTGCGC ACGATCGTCG CGAGCGGCGA GGCGCTCGCG
CCCGAGCTCG TCGCGCGCGT CGCCGCGCTG CTGCCGCACG CGCAGCTGCA CAACCTGTAC
GGCCCGACCG AGGCGGCGAT CGACGTGTCG CACTGGCGCT GCACGGCCGA CGACGCCGCG
GCCGACGCGG TGCCGATCGG CCACCCGATC GCGAACCTGC GACTGCACGT GCTCGACGCG
GCGCTGCACC CGGCGCCCGT CGGCGCGACG GGGGAACTGT ACCTGGGCGG CGCCGGGCTC
GCGCGCGGCT ACCTGGGCCG CGCGGCGCTG ACGGCCGAGC GCTTCGTGCC CGATCCGTTC
GTGCCGGGCG CGCGCCTGTA CCGCACGGGC GACCTCGCGC GGCGGCGCGC GGACGGCGCG
CTCGACTATC TCGGCCGCCT CGACACGCAG GTGAAGCTGC GCGGCCAGCG CATCGAGCTC
GGCGAGATCG AGGCGCTGCT GCGCGCGACG GACGGCGTGC GCGACGCGGT CGTGATCGTG
CGCGACGAGC GGCTCGTCGG CTACGTCGCG TGCGCGACGC CCGCCGGGTT CGACGCGGCC
GCGCAGATCG AGCGGCTGCG CGCGCGACTG CCCGCCTACA TGGTGCCCGC GCAACTCGTC
GCGCTCGATG CGCTGCCCGT CACGCCGAAC GGCAAGTGCG ATCGCCGTGC GCTGCCGGCG
CCCGTGTTCG ACGCGCGCGT CGTCGACGCG CCGCGAACCG CCACCGAGCG CGCGCTCGCG
GCGATCTGGC AGCGCGTGCT GACGCTGCCG CAGCTCGGCC GCGACGACGA TTTCTTCGCG
CTCGGCGGCC ATTCGCTGCT CGCCGCGCAG GCGAACGCGC AGGCGAACCT GCAGTGGTCG
CTCACGCTGC CGCTGCGCAC GATCTTCGAC GAGCGCACGC TCGCGCGCTG CGCGGCGGCG
ATCGACCGCG CGCGCGACGC CGGCCGCGAG CGCGACGCCG CGGGCGCGAT CGACGCGCTG
CTCGGCGAGC TCGAAGCCCA GTAA
 
Protein sequence
MTSFPTALHH RIHALARLDP DAPALASFAP DTVRLTRGEL DERAARLAAQ LRAAGVGAEV 
PVGVCVARSC DLFVALLAVM KAGGAFVALD PRHPAARLDW VARDAGLAHG IVDASADAAM
RARFARCFDV GGVAAADPAA PREHGGDVHP RAAAYMIYTS GSTGTPKAVV VEHGPLAAHG
DALAESLPIG PDDRVLHFAS VNFDVAIEAW LVPLAVGASV VISDPPPFTP DAAHALISRE
RVTNTTLPPA YLREFAAVCA REGVPPSLRV LLFGGEAMSQ DAFDEIRRVF PAIRLVNGYG
PTETVISPML WPVAPGTTPA LDAGNGYASL PIGWPIGRRV ARVERADGTV ARGEAGELLL
GGACLARGYH GRAALTAERF LPDPAGEPGA RIYRTGDLAR ERADGAFDYL GRIDDQVQVR
GVRVEPAEIA ACLLTHPGVR DAGVLAETAG GRTQLIACVA LAAPPRETPS GGDARGDAPP
DDDALRAHVA AHLPAAWLPH RFVRFDKLPY TLNGKLDRVA LRDAVAARPA EAAADVDASR
TDTERRVAAL WQRLLNDPAP IGRVDRFFAR GGDSLAAMQL QTAIRLDWRV NLRLDTLFDD
APLAELAARI DAAEREARQP AAIGTTARRA IAADAIDARA TGQMGRTGAA GEATEAAGAL
ERPASFAQQR FWVLARTQDA GAAYHVSFHW DIDGALDLGT LQRALDMLIA RHEAWRTTLV
ENDDGVVLQR IHAALPVRIA AVDLRGEAGA SRAARLAELT ERHAGAPFDL SDGPLVRALL
VTLADGAQRF LLTTHHAVSD GWSSRCAFAE LTAAYAALAE GRAPELPALP IQYADYAQWQ
RDALDAHETA RELAYWRGAL DGAPAPLALP LDRPRAAERD YRGGRLARRL SPAASDAVRA
AARRLHASPF TVLLAAFDAW LFRLTGERDL VVAAPIAQRA RPETAPLVGL FLNTLALRAR
VSPAQSFESL VASVRDAAFG AFAHQDVPFD KVIDAVKPPV RRGDEWLRVK FAQQFDLECR
ASLPGAAVRM APGLDTAARF DYALDFTDDP RGIEFVAAYA RDGIDEATAH AWLDSFAALV
EDAVREPRRP IAALAVAQAG APCALIAGRP LATAPDVLAL FAREAAEHPH RVALADADTR
LTFAELDDAS NRVALALRRD AARDEAPGAE TPVAICIERS ARFVVAFLGV LKSGAYAVPL
DPASPCERIA AALAACGARR MLAAGALDAL GEFDGVAVQD IDACAHDASL ANAAAPRVPP
QPEQAAYLIF TSGSTGAPKG VVVPHRALAD YVAGMLDELA FAPHASMAMV STVAADLGHT
ALFGALCSGR TLHLLPAQAA FDPDRFAHEM ATREVGVLKI VPSHLHALLD AQRAADVLPA
HALVTGGEAL PWALVERIAA LKPDCRVINH YGPTEASVGA LVCDTSAPAQ ADLRAAAASS
PGEAARGVPL GRPLPNAHAC VLDAYGSSVP VGAIGELYLG GPGLARGYLG RAAASAERFV
PHPHVAGARV YRTGDRVRLR ADGRLDFLGR LDDQVKIRGY RVEPGEVSAA LRALPGVAQA
ETLALEHEGR LRLAAFATPE AGARIAADAL RDALAARLPD YMVPAALVVL DALPVTANGK
IDRAALRARA AAPAPATAGD EDAPQGPIEA TLAEVWRDVL KAARVGRHDN FFELGGDSIL
VLQVIARARK RGVKFTPKQL FDGPTLAELA RVAVAIEADA PASGAAHGAA IGANAAAARR
DEAVLTPAQL RFFALDIPRR GHWNQSIALD VAGAFDFDAF ARAFDALLTH HPVFRERFAP
TGDGGGWQRS AAPRAFDTLP LAAAAARDEA DALAQFDALQ ATLDLTHGPL ACAFAAVLPS
GATKLYLAIH HAIVDGVSWR VLLDDLDAAY RAACERRAVR LGPTGASASE WAARLARAAR
DPAGPFAGEL PYWAALAAPH DDLRPDRPDA AATNAHADVV IQTLDAALTR EVLTDANAAY
RTQAVELLIA ALVAALGQHT GAAACRLELE GHGREALFDE LDASRTLGWL TSHYPVAFAV
EATPAATLAG VKDALRAVPN KGLGFGVLRH YGDDATRAAL ARVARPRVTF NYLGQFDAPR
DAALVPRFGG AGRERDPAGP LGNALAIHAY VGANGERALK VHWVYGATQF DRATIDALAA
RFDAALRALA AACRARVAER GAGATPGDYP LARAGGLTQA ALDRLPFDAR AIDDIYPLSP
MQQGILFHSL FAPERATYVN QLVATLVDPD VERLRAAFDA AVPRHDILRT GFAAHEAAPM
QIVHRHARMP VEIVDWRGAH ASPAALDTAL DAWLAADRAR GFDLAAPPLM RVTLIRTDDA
DWRLVWTRHH LLLDGWSTAR LFADVLRDYI EPPRANPFAA PARTRYRDFI AWLARRDAQA
DRAFWLGRLA RLDEPTHVAE RAAAHEAAGR ANWRATLPAA DTARIGEAAR RMKVTVNTIV
QGAWALALQR ITHRRAVAFG ATVAGRPHAL PDVDTVLGLF INTLPVITAP LPQLAARDWL
ASLQRDNAAA LEHAHTPLYE IQQWAGLGGA LFDTLVVFEN YPVDEAWQGR DARALQKRDL
RNIEATDFAV TLVIEAGDTL AIDYGYDPAR IGPARVEALH RAFAACIAGL VDHPDAPLGT
ISCASADDLA LIARANATEL DWPAAQRAPL FAQFEAAARA RPDAIALECF ASSDGGDGAR
AQMRYGELDA KADRVAAALA ASGVRPDSVV ALCVERSFDM VVALVGTMKA RAAYLPVDPD
YPAERIAYLL GDAKPPVVIT QAHLRARVDA ALAGADAAVV TVDELLARAA GAEPEAERVA
AAADVAPGQL AYLIYTSGST GQPKGAGNTH GALANRIAWM QRAYRLAPDD VVLHKTPFGF
DVSVWEFVWP LAVGAKLAIA APGDHRDPAR LVAAIDAHRV TTLHFVPSML AAFVAYLDDF
GAAARCASVR TIVASGEALA PELVARVAAL LPHAQLHNLY GPTEAAIDVS HWRCTADDAA
ADAVPIGHPI ANLRLHVLDA ALHPAPVGAT GELYLGGAGL ARGYLGRAAL TAERFVPDPF
VPGARLYRTG DLARRRADGA LDYLGRLDTQ VKLRGQRIEL GEIEALLRAT DGVRDAVVIV
RDERLVGYVA CATPAGFDAA AQIERLRARL PAYMVPAQLV ALDALPVTPN GKCDRRALPA
PVFDARVVDA PRTATERALA AIWQRVLTLP QLGRDDDFFA LGGHSLLAAQ ANAQANLQWS
LTLPLRTIFD ERTLARCAAA IDRARDAGRE RDAAGAIDAL LGELEAQ