Gene BURPS668_A1479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1479 
Symbol 
ID4886522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1389036 
End bp1406783 
Gene Length17748 bp 
Protein Length5915 aa 
Translation table11 
GC content73% 
IMG OID640131418 
Productputative polyketide synthase PksL 
Protein accessionYP_001062475 
Protein GI126442426 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTTG ACCAGACCCT CATCGCCGAC CTGCTCGATC AATGGCTCGG CGGGCAATCT 
CCGGACTGGG GCCGGCTGTA TCCCGCCGGC CGGCCGCCGA TCGAGCACGC GCCGACCTAT
CCGTTCGCGC GCAACGTCTA CTGGGTGCAT GCGGCGGATC GCACGCCCGA GCCGGGCCTC
GCCGCCGCGC CGGCGCGAAG CCGCGATGCG GCGCCGCCGC TTGAGGCCGG GACGATGCTG
ACCGCGGCGA TTGCGGTGGT TGGGGCGACT GCGGCGGCGA CTGCAACGAC TGCGGTGACT
GCGAGGGCCG CGACGACTGT GACGAACGCG ACGAACGCGA TAGACGCGAT AGACGCGATA
GACGCGACGA ATGCGACGAA TGCGACGGCC GCGACGACTG CGGCGGCTGC AACGACTGCG
GTGACGGCAT CGGGCGCAAC GGGCGCGTCG GCGGCCATCG TGCTGAAATC GGCCGCCGAC
GAGGCGCGGC GGCATTTCAC GCCGAGCCGC GAGCGCGCGC CCGTCGTACT GCGCGCGCTC
GCGCCGGCCG CCGCCGATGT CGCCGATGTC ACCGACGTCG CGGCGAGCCG CGCGGCGCAG
ACTCAGGTGT TCGACGGGGC GGCGGTCGAG GCCGCGCGGC TCGCGCCCGA CGCCATCGGA
GCCGCGTTGA TCGACAGCCT CGCGCGCAGA TTGCTCGTGA GCCCGCAAAG CATCGGCGCG
CGCGACGCGT TCGACGCCCT CGGCGTCGAT TCGCTGATCG GGCAGGAATG GCTTCGCGAG
TTGAACCGGA GTTACGGGAC GTCGATCGAC GGCGCGACGC TCGCAGAGTG CGGCCATATC
GGCGCGCTTG CGTCGCGGAT CGCCGGGGCG CGTGCCGACG CGGCCGCGCC GGCATGGGTG
CCGCGCGCGG AGCCGCCGCA GCGGGCGGCG CAGTGCGCGC CCGACGCGGG CACCGTATCG
GCTTCTGTTT TGGCTTCGAT CTCGACCGCG GCTTCGATCA CGGCTTCGAC TTCGACCTTG
GCTTCGACTT CGACTTCGGC TTCGACTCCG GCGTCGGCTT CGACCTTGGC TTCGGCTTCG
ACTCCGGCCT CGGCTTCGAC CTTGGCTTCG GCTTCGACCT CGACTTCGAC TTCGGCTTCA
GCCTCGGCCG CGGCCGCGGC CGCAACGACG CACGCGGGCG ATCGGTCCGC GCCGCGCATC
ACGCGCGATG CGCTCGTGCA TGCGCTGGCG GCCAGCCTCG CGAAGGCGCT TTACATGGAC
GTCGCCGACA TCGACATCGA ACAGCCGTTC ATGGAGATGG GACTCGATTC GATCGTCGGT
GTCGAGTGGG TCCACCAGGT GAACCGCGCC TACGGAATCG GCATCAACGC GATTCAGGTC
TACGACTATC CGAACATCGT GACGTTCGCG GGGCTCGTCG AATCGCTGAC GAACGCGGGC
GGCGCGGCGG GCGGGGATGT CGCGTCCGGT GTCGGCGGCT CCGACTCAAA CGCTGATACT
GACCGCGACC GCGACCGCGA CCGCGATGTC GATGTCGATG TCGACGGCGA TGTCAACATC
GACGGCGTGC TTACGCGGCC GACGGGGGCG GCGGAGCCGG CGAGGGCGGC TGCCGCGGCC
ACATCGCCGG CGGCGGCGGA AATCGCCGCG GCAACCGCGG CCGCCGCCTC GAGCGCCATC
GCCGCCCCGC CGACGACGGC CGAGACCATG GCGGCAGCGT CGGCGGCCTG CGCGCCGCGT
GGCGAACTGC GGCGGGAAGT GATCGACAGC CTGGCCGCCG CGCTGTACAT GGACGCCTCG
GAGATCGACG TGCTTCAGCC GTTCGTCGAG ATGGGGCTCG ATTCGATCAT CGGCGTCGAG
TGGATCCATG CGCTCAATCG GCGTTACGGC ACGTCGATCG AGGCGATCCA GGTCTACGAC
TATCCGAACG TCGAGAAGCT GACCGAGCTG CTGTCGAAGT CGATCGATGC GCTCTCGCCG
CCGCATGCGC GGTGCGAGAG CGACGTGCTG CCCGGCGAAC CCATGACCGC CGCGACCGCC
GCGAACGACA CGAACGACAC GAACGACACG AACAACGCGA ACAACGCGAA CAACGCGAAC
GAGTGCGGCG TTGTGGCGGC AGAAGCCGCC TTTGCCGCGC CGGCTGCCGC GCCGCCTGCT
GCGCCGGCGG CCGCTTGCGC GGCTGCCGCA ATCGCGGCCC CAATCGCCCC AACCGCCCCG
GCCCCCCTCG CCGCCCCAGC CGCGAAGCCC GCCACGTTGC TGGACGAACT GGTCGCGAGC
CTCGCGCAGG CGCTGTTCCG CCCGGCGGAC ACCATCGACG CCGAACGCGG ATTCGAAGCG
GCGGGCCTCG ATCCGATCGT CGCCGACGAG TGGCTCGCGC GCGTCCATCG CCGATACGGC
GTGCGCGTGC GGGCCGACGA GGCGCTCGCG TGCCCGCGCA TCGCGGATTT CGCCGCGCTC
GTCGCCGCGC GGCAAGAGGC GGGCGGCGCG CAAGCCGCCG GGGCCGCCCC GGTTGCGGCG
CACGACGCGC GATGCGCCGC GCGCGCGAGC GCCGACGCCC GCGCCGGCGA TCGCGAACCG
ATCGCGATCG TCGGCATGTC GGGCCGCTAT CCGGGCGCGC GCGATCTCGA CGCGTACTGG
GAGAACCTCG CGGCCGGGCG CAGCGCGATC GGCGAGATTC CCGCGAGCCG GTGGGACGTC
GCGCGCCACT TCGACGCCCA TCCCGCGACG CCCGGCAAGG TCTACAGCAA ATGGATCGGC
CTCCTGGACG ACGTCGACTG CTTCGATCCG GCGTTTTTCC GGATCTCGCC CGCCGAGGCG
CAGGAGATGG ATCCGCAGCA CCGGTTGTTC CTGCAGGAAG GCTATCGGGC CTTCGAGAAC
GCCGGCTATT CGGCCGATAC GCTCGACGGC CGCAACTGCG GCGTCTATCT CGGCATCATG
AACCAGGAGT ACCGGCAGCT CGGCGCGGGC GGCGCGGTCA CGATGCTCGA GAAGAGCAAC
AGCTTCGCGA TCGGCGCCGC GCGGCTCGCC TATCACCTGA ACCTGAAGGG GCCGGCGATT
CCCGTCGACA CCGCGTGCTC GTCCGCGCTC GTCGCGATCC ATCTCGCGTG CCAGGCGCTG
CGCGCGGGCG AGATCGACAT GGCGCTCGCG GGCGGCGTCA CGCTGTACCT GTCGCCCGAC
GCGTACATCG AGATGTGCTC GTCGGGCATG CTGTCGCCGG ACGGCCGCTG CAAGGTCTTC
GACGATTCGG CGGACGGCTT CGTGCCGGGC GAGGGCGTCG GCGCCGTCGT GCTCAAGCGC
CTCGGCGACG CGCAGCGCGA CGGCGATCCG ATCATCGCGA CGATCATCGG CTCGGGCATC
AATCAGGACG GCAAGACGAA CGGCATCACC GCGCCGAACA TGGCGAGCCA GTTCGAGCTC
GTGTCCGGCG TCCACGGCCG GTACGGCATC GATCCGGCGA CGATCCGCTA TGTCGAGGCG
CACGGCACGG GCACGAAGCT GGGCGATCCG ATCGAGCTGA CGGCGCTCGG CGACGCGTTT
CGCGTGCGCA CCGCGCAGAC CGGCTTTTGC GCGCTCGGAT CGGTGAAGAG CAACATCGGC
CACACGTCGG CGGCCGCGGG CGTCGCCGGC CTGCACAAGG TGCTGCTCTG CATGCGGCAC
CGCACGCTCG TGCCGACGCT GCACTTCGCC GTGCCGAACC GGCATTTCGA TTTCGCCGCG
TCGCCGTTCT ACGTGAACAC CGAGCGCGCG CCCTGGGCGC CGCTCGCCGC GTCGCCGCGC
CGCGCGGCGG TCAGCTCGTT CGGTTTCAGC GGCACCAACG CGCATCTCGT CGTCGAGGAA
TACGTGCATC CCGCCGCGGC GGCGCCCGAG GCCGGCGGGC CGTTCCTGTT TCCGCTGTCC
GCGCGAACCC GCGAGCAACT CGCGGCGTAC GCCGCGCAAC TGCGCGACCA CGTGCGGCGC
GCCGCGCACG AGGACGCCGG GCTCGCGGAT CTCGCCTATA CGCTCCAGGT CGCGCGCAAG
CCGATGGCCG AGCGGGTCGG GCTGATCGCG CGCACGAAAC ACGAACTCGC CGCGTTGCTC
GATGCGTTCG TCGACGGGCG CGACGGCGGC GACGGGCTGA TTGCCGGCCG GCGCGACCGG
GCCGGCGGCA CGCCGCCGGC GCCGTCGCCC GAGGCGCTGC GCGCGCTCGT CGACGCGGGC
GAGTCGCGCA CGATTCTGCA ACGGTGGGCG CTCGGCGCGA CGATCGACTG GGCATGCCTG
TACCGCGACC TGGACGCAGC CGCGCGGCCG CGCCGGATCG CGGCGCCGTC CTACCCGTTC
GCGCGGGAGC GCTACTGGTT GCCCGATCCC GGCACGCAGC GCGCGCCGGC CGCGCGCGGC
GCGCGCGCAG CCGGGCCGCA TCCGCTGCTG CGCGAGATCG ACTCGGCGCA GTCGGCCGCG
TGCTTCGGCG CGACGCTCGC GGGCGACGAG CCGTTCTTGC GCGATCACCG GATCGACGGC
CGGCCGGTGA TGCCGTCGAG CGCCTATCTC GAGATGGTGC GCGAGGCGGC CGCGCGGGCG
CTCGGCGAGC CGGCGGACGC GATGCTCGTG ATCGAGGGCA TCGCGTGGCG CAATCCGCTG
ACGGTGAGCG GCGGCGCGCG CCGGCTGCAA CTGCGCGCGC GGGCGGAACC CGGCGCGCGC
GCGTTGCGCT TCGACGTGAG CTCGCAGGCG GCGGGCGACG CCGCGGCCGC GCCGCTCGCG
CACTGCGACG GCGTCGCGCG CTACGTGCCG CGCGCGCCGG CCGGCGCGCC GGATCTCGCC
GCGCTGCGCG CGCGGCTCGC ATCGGGGCCG CGCGCGGCTG ACGAAGCGGC GTGCGCGGCG
CTCCATGCGC GCTTCGCGCG GCTCGGCATC GAATATGGCG CCACGCATCG CGTGCTGCTG
CGCCTGCGCG TGGACGGCGA CGAAGCGCTC GCCGAGCTCG CGCCGGCGGC GGACGGGATC
GCGCAGCACG AACTGCACCC CGGCACGCTC GACGCGGCGC TGCAACCGAT GCTCGCGCTG
CTCGGCGAGC GGGTCGGCGA CGGCGTGCCC GTCGTGCCTT ACCGGATCGA GCGCGCCGAA
ATCCATGCGC CGACGCACGG CGCGAGATGG GCGTGGCTGC GGATGCGGCC CGACGCGCAC
GAATGGATCT TCGATGTCGA TCTTTGCGAC GCGCGCGGCG CGCTGTGCGT CGCGTTGCGC
GGCATCGCGG TGACCGCGTG GCGGCGGCCG GACGAAGTGG TGCGGCTCGA GCCGGTCTGG
CGCGCGGCGC CCGTCGAGGC CGACTTGCAC GACGAGCGCG CCGGGCCCGA CGCGCAGCGC
GTGGTGTTCG TCTGCGGGAC GCACGGCGCG CCGCGCGCGT GGCCGGCGGA CGGCATCGCG
CCGGTGCGCT ACGCGGCGCT CGCCGCGGGC GCGCCGCCGG GCGAGCCCGA TGCGCTCGCC
GGCTGGTTCG AAGCGCACGC GCTCGCGCTG TTCGACGAAG TGCGCCGGCT GCTCGCGCCC
GGCATGCGGC AGGCGACGCT CGTGCAGATC GTCGTGCCCG CGGCCGGGCC CGGCGCGATC
CTGCATGCGC TCGGCGCGCT GCTGCAGACC GCGCACCTGG AGAATCCGCT GCTGCACGGG
CAGGTGATCG CGATCGACGA CATCGACGCG CCCGACCTGC CGCGGCGGCT CGCGCGCGAC
GCGCGCCGCG CGGCCGACAC GCGCATCCGC TACGTGAACG GCGAGCGCCA GGTGGCGTGC
TTCGACGAGG CGGCCGCGCC GGACGGCGCG CGCGCGTTGC CGTGGCGTCA GGGCGGCGTG
TATCTCGTGA CCGGCGCGGC CGGCGGCCTC GCGCGGGCGC TGGCCGACGC GATCGCGCGC
GGAGTCGGCG GGGACGCGCC GCGGGCGACG CTCGTGCTCA CCGGGCGCTC GCCCGCCCGC
GACGACATGC GCGCGCTCGT CGCGAGCCTT TGCGCGCTCG GCGTCGCGGC CGACTACCGC
GTGCTGGATG TCGCCGATCG CGACGCGGTC GCGCGGATGG TCGAGGCGAT CGTCGGCGAG
TTCGGCGCGC TGCACGGCGT CGTGCATTGC GCCGGCGTGC TGCGCGACAA CTATCTGCTG
CGCAAGTCGG CCGACGAGTT CGCGCAGGTG CTGGCGCCGA AGGTGCGCGG CACCGTGAAT
CTGGATCTCG CGACGCGCGA CGTGCGCAGC CTCGATTTCT TCGTGACGTT CTCGTCGGGC
GCCGGCGTCG TCGGCAATCC CGGGCAGGCG GACTACGCGG TCGCCAACGC GTTCATGGAT
GCGTTCGCGG CGCATCGCGC GTCGCTCGGC GCGGCGCGGC CCGGCGTCAG CGTGTCGATC
GCGTGGCCGA TATGGCAGGC GGGCGGCATG CGGATCGACC GGCAGACCGA GGCCGAACTC
GAGCGCCGCC TTGCGATGCG GCCGATGCCG ACCGCGCTCG GGCTCGACGC GCTGCATGCG
TGCCTGCTGG GCGCGAGCCC GTGCCCGACG GTCATCCACG GCGCGCGCGC GCGAATCCTC
GCGCTCGCGC GACAGGGCTT CGCCGCGCCG CCCGCCGCGC CGCCCGGATT CGGCGCGGCC
GATCGCGGCG CGGATGCGCC GGGCGCCGAT GCCGACGCCG ACGCCGACGC CGTGAAGGCG
CGGGTGCGCG CGGCGATCGA CGGCGCGCTC TCGGCTGTGC TGAAGCTGCC CGACGCGCGG
CTTCGCGAGC CCGAATATTT CGAGAGCTAC GGGATCGATT CGATCAACGC GATCCGTCTC
ACGGTCGAGC TGGAGCGCAC GTTCGGGCCG CTGCCGAAGA CGCTGTTCTT CGAATACCGG
CACGCCGACG AGCTCGAGCG CTACCTGGTC TCGGTGCATG GCGCGGCCGT CGGCGCGCGA
ATCCGCTCGG CCGCGCGCGG CGCGCCCGCC GGCCCGGCAT GCGCGCGCGC GGGCGAGTCG
GGCGAGCCGC CGGGCGCACC GACGCGGCCG CCGGCGAGCG AAGGCGGGGC GGCAGGCGCG
CCTCGCGCGA GCGCCGCGCC GCCGCTCGCG GAGCGCGACA TCGCGGTGAT CGGCATGGCC
GGGCGCTATC CGCAGGCGGA CGATCTGCAG CAGTACTGGG ACAACCTGCG CGACGGCCGC
GACTGCATCG AGGAGATTCC GCCGCATCGC TGGGACTGGC GCAAGCACTA CGATCCGGCC
CGCGGCCACG GCGCCCACCA CAGCAAGTGG GGCGGCTTCA TCAACGATGT CGACGCATTC
GATCCGCTGT TCTTCAACAT CTCGCCGAAG GAAGCGGTGT CGATGGACCC GAAGGAGCGG
CTGTTCCTCG AACAGGTATG GACCGCGATG GAGGATGCCG GGCTGCGGCC CGAGGATCTG
CGGCGCGACG CGCAACGCGG CACGGGCGTC TACGTCGGCC TGATGTACGA GAACTATCAG
TTGCTCGCGG CCGAGGCCGC GGCCGCGGGC AGCGACGTCG GGATGGCGGG CGGCAGCTAC
GCGAGCATCG CGAACCGCGT GTCGTTCTTC CTCGACCTGC GCGGCCCGAG CCTTGCCGTC
GACACGATGT GCTCGAGCTC GATGGTCGCC GTGCATCTCG CCTGCCGCGA TCTGCTCGCG
GGCGAGATCG GCGTGGCGAT CGCGGGCGGC GTCAATCTCA GCCTGCATCC GAACAAGTAC
CGGATGCTGA GCGCCGCGCG CTTCATGTCG GGCGACGGGC GCTGCGCGAG CTTCGGCAGC
GGCGGCGAGG GTTACGTTCC CGGCGAGGGC GTCGGCGTGC TGTTGCTGAA GCGGCGCGCC
GACGCCGAGC GCGACGGCGA CCGCATCCTC GGCCTCATCA AGGCGAGCGC GATCAATCAC
GGCGGCCGCT CGAACGGCTA CACGGTGCCC AACGCGGCCG CCCAGGGCGG CGTGATCGCG
AACGCGATCC GGGCCGCCGG CATCGACGCG CGCGCGATCA ACTACGTCGA GGCGCACGGC
ACGGGAACGG CGCTCGGCGA TCCGATCGAG CTGGCCGGCC TCGCGCGCGG GTTCGCCGAC
AGCGGCGCGA ACGGGCCGTG CCGGATCGGC TCGGTGAAAT CGAACATCGG CCATTGCGAG
GGCGCGGCGG GCATCGCGGC GCTGACGAAG GTGCTGCTGC AACTGGCGCA TCGCCAGATC
GTGCCGTCGC TCCATTCGCG CGAGCTGAAT CCCGATCTGC CGCTCGACGG CTCGCGCTGG
ATCGTGAACC AGTCGCTGTG CGATTGGGAG CGCGTCGTCG TCGACGGCGT GCCGCTGCCG
CGCACGGCCG GCGTGTCCGC GTTCGGCGCG GGCGGGACCA ACGCGCATCT GATTCTGTCC
GAATACCCGG CCGACGCGTG CGCGGCGCCG GCGGGCGTCA TCGAGCCGGC GGGGCGCGAT
GCGCACGACA TGCACGACAT GCACGACATG CACGACATGC ACGACATGCA CGACATGCAC
GACATGCAGG ACATGCATGA CACGCAGGAC ATGCAGGACA TGCAGGACGC GCTCGTCGTG
CCGCTGTCCG CGCGCAACGC GCAGCGGCTG CACGCGTATG CGCAACGGCT GCGCGCGTTC
GTCGCCGCGC ACGCGCGCGG CGAGCGCGGC GCGCCGCCGC GGCTCGTCGA TCTCGCCTTT
ACGTATCAGC GCGGCCGCAT CGCGATGCCG GAGCGGCTCG CGATCGTCGC GCGCTCGCTC
GCGGAGCTCG AACGCGCGCT CACTGCGTAC GTCGCCGGGC AGCGCGCGGG CGACGGCATT
TACGCCGGCC GGGCGGATCG CGCGGCGGCG GGCGACGCAC GCGGCGCGGC GTCCGAGCGA
ACCGCCGCCG ATTTCGCCGA CCGGCTCGCG GCGCGCTGGG TCGCGGGCGA GGCTGTCGAC
TGGCACGCGC TGTTCGACGG GCGCGCGCCG CGCCGGATTG CCGCGCCGAC CTATCCGTTC
GAGCGCGGCC GGTATTGGAT CGGCGCATCG CGCGCGGCGG CGGGGGCCGC CGAGGCCGCG
ATGCGTGCCG GGGGCGGCGC ATCGCGCGCG ACGACGGCGC CGCGCGCGCT TTCCGAATCA
GGGGGAGAGG TGAGACAACC AGACCAGACG ATACGGCGCG TGCGGTTGGC GCCGACTTCG
AGCTTCACCG CGACGGCGCG CGCGCCGTCG CCTACGCAGG CTACGCAGGC TACGCAGGCT
ACGCAGGCTA CGCAGGCTAC GCAGGCTACG CAGGCTACGC AGGCTACGCA GGCTACGCAG
GCTACGCAGG CTACGCAGGC TACGCAGGCT ACGCAAGCTA CGCAGGCCAC GCAGGCCGCG
CAAGCCACGC AAGCCACGCA GGCCGCGCAG GCCACGCAAG CCACGCAAGC CACGCAAGCC
TCACAAGCCA CGCGGGCCGC CGCGGCGTTC GCGCCCGCGA CGAAAGCGTC GCTCGAGGCG
GCGTTGCGCG ACAGTCTCGC GAGCGCGCTG TTCGTCGGGG TCGACGAAAT CGACCCCGGC
CGGCCGTTTA GCGAGCTGGG GCTCGATTCG ATCGTCGGCG TCGAATGGAT TCGCGACGTC
AACCGCCGGT ACGGCGTGTC GATCCGGACG ACCGACGTCT ACGATTATCC GAGCGTCGGC
GAATTCGCCG GCTTGCTCGA GCGGCTGCTG CGCGAATCGT CCGTGGCCGG CGCGCCGCCG
GCGCCCGCGA CGGAGCCGAC GATGGAGCCG GTGACGGCGC CGCGGCCCGA AGCCGCCGGC
GCGGCTCCGT GCGAGCCGCC GCCCGACGAA GGCCGGGGCG GCGCGCCGGG CAACGTCGAG
CGGATCCGGC GCGAGCTGAT GCGCAGTCTC GCCGACGCGC TGTTCGTCGA CATCGCCGAG
ATCGACGTCG ATCGGCCGTT CGCGCAGATC GGCATGGATT CGATCGTCGG CGTCGAGTGG
ATCAAGGGAA TCAACCAGCG CTATCGCGTC GCGCTGAAGG CGACGGACGT CTACGATCAC
CCGACGATCG CCAGCATCGC GGCACTCGTC GACGCGCGCG GCGCGAGTGC CGCAAGCACC
GCAAGCACCG CAAGCACCGC AAGCACCGCA AGCACCGCAA GCACCGCAAG CACCGCAAGC
ACCGCAAGCA CCGCAAGTCC CGAAGCCGAC GTGCGGCTCG CGGCCGATGC CCGCATCGCG
TCCGCCGCAT CCGGCGCGCC GGAAAGCGAA CCCGCGCGCG CGGCGCACCC GGGCGCTGCC
GCTTCGCGCG CGCGCGCCGA CGTGGCGCCC GACGCCGGCC CGGCACCGCG CCGTCTCGAC
GCGGCGGCGC CCGACGCGCG GGCGGCGCGC GCCGAGCCCG TGCCCGAGCG GATCGCGATC
GTCGGCATGT CGGGCCGCTA TCCGGGCGCG CCCGATCTCG ACGCGTTCTG GGACAACCTC
GCGGCCGGCC GCGACGCGAT CGCCGAGATC CCGCCGAGCC GCTGGCCCGT CGGCGCGTTC
TACGATCCCG AGCCGGGCAA GCCCGGCAAG GTTTATTGCA CGCGCATCGG CCTGCTCGAC
GATGTCGACC GTTTCGATCC CGACTTCTTC CGGATCTCGC CGGCGGAAGC CGAAGAGATG
GATCCGCAGC ACCGGCTGTT CCTGCAGGAG GGATACCGCG CGATCGAGCA GATGGGCTGC
GCGCCGGCGT CGCTGTCGCG CCGCAAGTGC GGCGTCTATC TGGGCGTGAT GAACCACGAG
TACGGCGAGC TCGCGATGCG CCACCGCGGC GCCGCATCCG GGATCGGCAG CAGCTACGCG
ATCGGCGCCG CGCGTCTCGC GTATTACCTG AACCTGAAGG GGCCGGCGAT TCCGGTCGAC
ACCGCGTGCT CGTCCGCGCT CGTCGCGACG CACCTCGCAT GCCAGGCGCT GCGCAACGGC
GAGATCGATC TCGCGCTCGT CGGCGGCGTG ACCGTCTATC TGACGCCCGA ATCGTATGTC
GCGATGTGCG CGGCGGGCAT GCTGTCGCCC GAGGGGCGCT GCAAGACCTT CGACGACGCG
GCCGACGGCT TCGTGCCGGG CGAGGGCGTG GGCGCGCTCG TGCTGAAGCG GCTCGCCGAC
GCCGAGCGCG ATCGCGATCC GATCCTCGGC GTGATCGTCG GCTCGGGCCT GAACCAGGAC
GGCCGCACGA ACGGCATCAC CGCGCCGAGC GGCAGCAGCC AGACCGAGTT GCTGCGCGAC
GTCTACCGCC GGCACCGGAT CGATCCGGCC GGCATCGGCT ACGTCGAGGC GCACGGCACC
GGCACGAAGC TCGGCGATCC GATCGAGCTG ACCGCGCTGT CCGCGGCGTT CGGCGATTAC
ACCGACCGGC GCGGGTTCTG CGCGCTCGGC TCGGTGAAGA CCAACATCGG GCATACGTCG
GCGGCGGCCG GCGTCGCGAG CATCCACAAA GTGCTGCTGT GCCTCGCGCA TCGCGAGTTG
GTGCCGACGC TGAACTACGC GAACCCGAAC CGCCATTTCG ATTTCGCCGA TTCGCCGTTC
TACGTGAACA CCGACCGGCG CGCGTGGGAC GCCGCCGGCG ACGCGCCGCG CCGCGCGGCC
GTCAGCTCGT TCGGCTTCAG CGGCACCAAC GCGCACGTCG TGATCGAGGA GTACCGCCCG
GCCGCCGCCG CCGCGCCGGA CGCGTCGCCG CCGCGCGTGA TCGTGCCGCT CTCGGCGCGG
CATCCGGAGC GGTTGCGCGC CTACGCGCGC AACCTGGCCG ACTGGCTCGC GCAGGCGGCC
GCGCGCGGCG CGCCGGAGCG GCTCGCCGCG CATCTCGCGT ACACGATGCA GGTCGGCCGC
GACGCGATGG CCGAACGCGT CGCGTTCGTC GCCGACGGCC GCGACGAACT CGAGCGGCAG
TTGCGCCGCT ACGCCGACAC CGGCGAGACG AGCGACGGCG TGTACGCGGG CCGCGCCGAG
CCGCACGCTC AGGCGTCGAA CGCGCTGATG CTCGACGAGG CGTTCGGCGC GGCGATCGAC
GGATGGATGC GCACGGGCAA GCACGAGCCG CTCGCGAAGC TGTGGGCGGG CGGCTTCGAT
CTGGACTGGG CGCGGCTTTA CGACGGCGTG CCGGCCGCCG CGATGCCGCG CCGGATCGCC
GCGCCGACTT ATCCGTTCGC ATCGGGGCGC TACTGGATCG ATGTCGAACC CGACGGGCGC
GCCGCGGGCC CGGATGCGGA CGCCGCTTCG CCCGAGGCCG ATTCCGGCTC CGACTCCGAG
CACGCACACG AACATGAACA TGAACATGAA CATGAACATG AACCCGCCGC GACGCTTGCA
TACCTGCCCG TCTGGGAGGA ACTGCCGCCC GCGCAGCCGC GCGCCGCGCC CGATGCGCAG
GCGGGCGGCG TACTCGTCGT GCATCGCGGC GGCGCGTGGG GGCTCGTCGA CGCGATCGAG
CGCGAGTGCG TGGACGGCCG CCATGCCGGC GCGACCTGCG TGACGCTCGA TCTGTCCGGG
CATGCGCCGT CGCCGGAGGG CCGCGCGTGG CGCAACGCCG CGCCCGACGC GGCGCGGCTC
GCCGCGTGGC TCGGCGAATT CGGCCCGGTG CGCGCGGTAT TCTTCGCGGC GGGCTGCAGC
GAGGCCCGCC ACGATGCGCC GGGCGCGCAC GGCTGGGCGA GCGCGCCCGA CGCGCACGGC
GAGGACGAGC GTGCGCTGTT GCAGCTCGCG CAGGCGCTGA TGCGCTCGCA GGCGGCCGAC
GCGTCGATCG AGTTCGTCGT GCTGTCGCTC GATCACCATC GCACCGACGG CACGCCGTCG
AATCCGGCGG GCGGCGGCGT CGCCGGCATT GCCTACGCGA TCGCGCAGGG CGATCACCGC
TTTCGGGTGA CGAACGTCGA CGTGTCGCTC GACGAGCTGC GCGCGGCGCG GCATGCGCCG
GCGCCGCATC CGGTGCTCGC GGCCGTGCTG CGGCTCGCGC CGTCCGATCG CGGCGCGCTC
GTCAGGCTGC GCGCCGGCCG CGGCTACCGG CAGGCGTTCG TGCGGCTCGA CTGGGCCGCC
GAGGCGGGCG CCTCGGGGCT GAAGCAGGGC GGCGTCTACG TGATGCTCGG CGGCGCGGGG
CGCGTCGGCC GGGCGCTGAC GCGGCGGCTC ATCGAGCGCT ATCGCGCGAA CGTCGCGTGG
ATCGGCCGCA GCCCGGCCGA TTCGGCGAGC GTCGCGCATG CGCTGCGCGC GCTCGGCCCG
GCGGGCCCCG CGCCGTATTA CGCGCAGGCC GACGCGACCG ATGCGGCGCA GATGCGGCGC
GCGATCGAAG CCGTCCGGCA GCGCCACGGC CGCATCGACG GCGCGGTGTT CTGCGGGATG
GTGTTCGACG CGAACCACGC GATCGCGAGC GTGCCGGCGC ACCGGTTCGA CGAGATTCTG
GACGTCAAGG CGCGCGGCAG CCGCATCTTC TACGAGGCGC TCGCGCACGA GCCGCTCGAT
TTTCTCTGCT ACTGCTCGTC GGCGCAGTCG TTCTCGTTCT CGGGCGCGGC GCGGCTCGGC
GCGTACGCGG CGGCGACGAC GGCGGGCGAC GCGATCGTGC GCTCGATCGC GCCCGTCGCC
GCGTTTCCGG TCGGCACGAT CCACTGGGGG TTCTGGGAAA CGTCGGTCGA GGATTCCGCG
CTCGGCTCGC GGCATCTCGG CGCGCTGTCC GACGACGAGG GGTTCGCGTG CTTCGAGCGG
TTCGTCGGCC AGTGCATGCG CGGCAATCCG CTGCGCGAGG TCGTCTGCAT GCGGGCGTCG
CCGGAGGTCG AGCATCTGAT GCAGGTGCTG CCGGGCGAAA CCGCGACGCT CGCCGCGCCG
GGGCAGCCCG CGCAGCCGGC GCCGCTTCGC GCCGCGCCGG ACGGCGCGGC GGCGTCCGAG
GCCGCGCCGC CCGACGGCGC GGCCGACGTA TCGGCGGATA TCGACGCATG GCTCGCGCGG
CTCACGTTCG CGACGCTGCG CCCGATGCTC GACGGCCCGC GGCCCGCGCG CGCGTGCCAT
GCGCGCTGGT GGGACGAGAC GCTGCGGATC TTCGCGGCGC GCGGCTGGTT GCGCATCGTC
GACGGCGCGC CGCGCGTGAT CGCCGAGCCC GATGCGGGCG AGCACGTCTG GCGCGACTGG
GCGCGTTACC GGTTCGACAC GCCCGCGGCC CGCGGCCGGC GCGCGCAGAT CGACCTCGCC
GACGTGTGCG TGCGCGCGCT GCCGGACGTG CTCGCGGGCC GGCTGCCCGC CGCCGACGTG
CTGTTTCCGG GCGGCTCGAT GGAGCGCGTC GAGGGCGTGT ACCGCGACAA TCCGATCTCG
GATTACTTCA ACGCGGTGCA GGCCGACGCG CTGATCCGCC ATGTCCGCGC GTGGATCGAT
GCCGGCCGGC GCGAGCCGAT CCGCATTCTC GAGGTCGGCG CGGGCACGGG CGGCACGACG
GCGCTCGCGC TCGAGCGGCT GCGGCCCTAC GCGGCCGCGA TCGGCGAGTA TTGCTTCACC
GACGTGTCGC AAGCGTTCCT GCAGCACGCG CAGGCCGCGT TCGGGGCGCG GGCCGGGTAC
TTGCGCACCG CGCTGTTCGA CGTCGAGCGG CCGCTCGACG CGCAGCGGAT GCCGGCGGGC
CGCTACGACA TCGTGATCGC GACCAACGTG CTGCATGCGA CGCGGCAGGT ACGCGGCGCG
CTGCGCAACG TGAAGGCGTG CCTGCGCGCG GGCGGCGTGC TGCTGCTCAA CGAGATCAGC
GAGAAATCGC TGTTCGCGCA TCTGACGTTC GGCCTGCTCG AAGGCTGGTG GCTGCACGAG
GATTCGTCGC TGCGCGAACC GGGCAGCCCC GTGCTCGCGC CCGCCACCTG GCGCCGGCTG
CTCGAAGACG AAGGCTTCGG CGCGATCGCG TTTGCGGCGC GCGACGCGCA TGCGCTCGGG
CAGCAGGTCG TCTGCGCGAC GAGCGACGGC GTGATCCGCC AGCGCGCCGG CGAACCTTCG
GGCCATTCGA GCCGGCAAGG CCATCGGAAC CATCAGGATC ATCAGGGCCG TCAGGAGAGT
TCGACCCATG CGGGCGAGGC CGGCGCGCCG GCGCCGGCGA GCGCCGGGCG TGCCGGCGCC
GCGCCGGCCG CGAGCCCCGC CGGCACGGCG CGCGAGTCTG TCGTCGCGGC GATCCATCGC
GCGCTGCAAC AGTCGCTGAA ATTGCCGGAA GCCCGGATCG GCGATCACAC GCCGTTCCTC
GACTACGGGA TCGATTCGAT TCTCGGCGTG CGCTTCGTCG ATTCGCTCAA GCAGGCGCTC
GACGTGCCGC TCAACACGGC TGTCCTGTTC GACTATCCGA CCGTCGAGCG GCTCGCGGAT
TTCATTGTGG CCACCTACGG CGCGCGGCTC GCCGCGCGCG GCGCTTCCGC CGCGCCGGCG
AGCGTCGCGA CCGCCTCCGC GACACTTGCC GCATCGACTG CATCGACTGC ATCGACTGCA
CCGACTGCAT CAACTGCACC GACTGCATCA ACTGCACCGA CTGCATCAAC TGCACCGACT
GCATCAACTG CACCGACTGC ACCGACTGCA CCGGCGGCAC CGACTGCACC GGCCGCGCCG
AACGCACCCA CCGCGCCGGC GATGCCCGCC GAAGCCGTTT CGCGCGACGC CGCCGCGCCG
CGCGCCGAAC CGGCGGGCGC GCGGCCGGCG GACATCGCGG TCATCGGCAT GGCCGGGCAG
TTCCCGGACG CGCCCGACGT CGACGCGTTC CGCGCGCTGC TCGAGCACGC GCGGGACGGC
GCGCCCGGCG TGTCGGGCGG CATGCTCGAG AATCGCGACC GCTTCGATCA CGCGTTCTTT
CACATCACGC CCGACGAGGC CGACGCGATG CATCCGTATC AGCGGCTCGT GCTGCAGGAA
TCGTGGAAGG CGCTCGAGGA TGCCGGCTAC AACCCGGCCG CGCTCGCCGG CGCGCGGGTG
GGCGTGTTCG TCGGCGCGGA GCCGGCCGAT TACCGGTCGA CGACGTTCAG CGGCTCGTCC
GACGCGCTGA TCGCGTCGCG CGTGTCGTAT CACCTGAATC TGCGCGGCCC GGCGTACGTG
GTCAACACCG GGTGTTCGTC GGGCGCCGTC GCGATTCATC TCGCCTGCGA GAGCCTGCGC
CGCAACGAAT CGGACGTCGT GCTCGCGTGC GGCATCTTCG CGGCGATGGG GCCGCGCATG
CTGGGGGCGC TGGGGCAGGC CGGCATGCTG TCCGCCGGCG GGCGGTGCCG CAGCTTCGAC
GCGGGCGCCG ACGGCACGGC GTTCGCCGAG GGAATCGGCG TCGTCGCGCT CAAGCGGCTC
GCGGACGCGA TCGCCGACGG CGATCCGATC CACGGCATCG TGAAGGCGTC CGGCGTGAAC
CAGGACGGCA CCAGCAACGG GATCATGGCG CCCAACGGCG TCGCGCAGGA GGAACTGATC
GTCGATGTCT ACGAGCGCTT CGGGATCGAT CCGGCCGACA TCCGCTATGT CGAGGCCCAC
GGCACCGGCA CGCTGTTCGG CGACGCGGTC GAGGCCAACG CGCTCGTCAG GGCGTTTCGC
CGCTTCACCG AGCGCAGCGC GTACTGCGCG CTCGGCACCG TGAAGGCGAC CATCGGGCAT
ACGGCGGCCG CGGCCGGCGT GATCGGGCTG ATCCGCATCC TGCTGTCGAT GCGCGCGCGC
CGGCTGCCGG GCATGCCCGG CCTCGGCCGC GCGAACCCGA TGATCGATCT CGACGCGTCG
GCGTTCTCGC TCGGCCTCGT CAGCCGTGAA TGGCCGGCCG GCCGCGACGG CCGGCCGAGG
CTCGCCGCGT TGAACACGTT CGGCCACAGC GGCACCAATG TCCATATCGT CGTGCAGGAG
CCGCCGCAGG CGCGGGCGCG GCCAGCCCGC GCGGCGGACG GCCCGCGCGT CGCGGTGCCG
CTTTCCGCGA TGGACCGGGA GGCGCTGCGC CGCTACGCGG CGCGCCTTTG CGAGCGGCTC
GAAGCGGAGG GCGCGGCGCT TTGCGTCGGC GACGTCGCGC ACACGCTGCG CGTCGCGCGC
GAGCCGATGG CGCAGCGAAT CGTGCTGTTC GCGTCGACGA CGGGCGAGCT CGCCGCGTTG
CTGCGCGCGT TCGTCGACGG ACGGGATTCG CCGTGCCTGC TCGACGGCGC GGTGACGGCG
GCCGCGCGAG CGGCGGGCCT CGACGCGGCG CAGCTCGCGC AGGCGGCGCG CTGGCTCGCC
GGCGAGCGCG TCGACTGGCC GCCCGCCGGC GGGACGCCGA TGCGCGTGCA TTTGCCGGCC
TATCCGTTCG CCGGGCGGCG CTGCGGCGCG GCCGGATGGG CGCGCGCCGA GGCCGGCGCG
TCGCGCGACT GCGCGGCGGC GGGGCCGTGC GAGCCGCCCG CGGGCGTCGC GGCCGCCGCG
ATGACCGTGG CGGCCGCCGG GCCCCGCGTG GACGCGACGC CGTCCGCCGC GGCCGACCGC
GCGCGCGGCG CGGCGCGGCC GGCCGAGTGG CTCGCGGCGC GCGTCGCCGC GCGGCTCGGC
GTGCCGGCCG CGCGCGTCGA CCGGCGCCGC AGCCTGCTGG ATCTCGGGCT CACGTCGCAG
GACCTCGTGA GCCTCGCGGG CCAGTTGCGG GACGCGACGG GCGAAGCGCT GCTGCCGAGC
GTGCTGTTCG ACTATCCGAC GATCGAACGG CTCGCCGCTC ATCTGGCCGA CACCTGTCCC
GCCGCGTTCG GCGCGGCCGA GCCCGCCGAG ACCGCCGAGA CCGGCCGCGC CGCGGCGGGC
GACGCGGCGA GCGGCCCCGC GCCGGGCGTG ATCGCCCTGC TGGAACGACT CGAGGGCGGC
GGCCTGAGCC TCGAGGAGAC GATTTACTTG ATCGAGAACA CCAAATGA
 
Protein sequence
MSFDQTLIAD LLDQWLGGQS PDWGRLYPAG RPPIEHAPTY PFARNVYWVH AADRTPEPGL 
AAAPARSRDA APPLEAGTML TAAIAVVGAT AAATATTAVT ARAATTVTNA TNAIDAIDAI
DATNATNATA ATTAAAATTA VTASGATGAS AAIVLKSAAD EARRHFTPSR ERAPVVLRAL
APAAADVADV TDVAASRAAQ TQVFDGAAVE AARLAPDAIG AALIDSLARR LLVSPQSIGA
RDAFDALGVD SLIGQEWLRE LNRSYGTSID GATLAECGHI GALASRIAGA RADAAAPAWV
PRAEPPQRAA QCAPDAGTVS ASVLASISTA ASITASTSTL ASTSTSASTP ASASTLASAS
TPASASTLAS ASTSTSTSAS ASAAAAAATT HAGDRSAPRI TRDALVHALA ASLAKALYMD
VADIDIEQPF MEMGLDSIVG VEWVHQVNRA YGIGINAIQV YDYPNIVTFA GLVESLTNAG
GAAGGDVASG VGGSDSNADT DRDRDRDRDV DVDVDGDVNI DGVLTRPTGA AEPARAAAAA
TSPAAAEIAA ATAAAASSAI AAPPTTAETM AAASAACAPR GELRREVIDS LAAALYMDAS
EIDVLQPFVE MGLDSIIGVE WIHALNRRYG TSIEAIQVYD YPNVEKLTEL LSKSIDALSP
PHARCESDVL PGEPMTAATA ANDTNDTNDT NNANNANNAN ECGVVAAEAA FAAPAAAPPA
APAAACAAAA IAAPIAPTAP APLAAPAAKP ATLLDELVAS LAQALFRPAD TIDAERGFEA
AGLDPIVADE WLARVHRRYG VRVRADEALA CPRIADFAAL VAARQEAGGA QAAGAAPVAA
HDARCAARAS ADARAGDREP IAIVGMSGRY PGARDLDAYW ENLAAGRSAI GEIPASRWDV
ARHFDAHPAT PGKVYSKWIG LLDDVDCFDP AFFRISPAEA QEMDPQHRLF LQEGYRAFEN
AGYSADTLDG RNCGVYLGIM NQEYRQLGAG GAVTMLEKSN SFAIGAARLA YHLNLKGPAI
PVDTACSSAL VAIHLACQAL RAGEIDMALA GGVTLYLSPD AYIEMCSSGM LSPDGRCKVF
DDSADGFVPG EGVGAVVLKR LGDAQRDGDP IIATIIGSGI NQDGKTNGIT APNMASQFEL
VSGVHGRYGI DPATIRYVEA HGTGTKLGDP IELTALGDAF RVRTAQTGFC ALGSVKSNIG
HTSAAAGVAG LHKVLLCMRH RTLVPTLHFA VPNRHFDFAA SPFYVNTERA PWAPLAASPR
RAAVSSFGFS GTNAHLVVEE YVHPAAAAPE AGGPFLFPLS ARTREQLAAY AAQLRDHVRR
AAHEDAGLAD LAYTLQVARK PMAERVGLIA RTKHELAALL DAFVDGRDGG DGLIAGRRDR
AGGTPPAPSP EALRALVDAG ESRTILQRWA LGATIDWACL YRDLDAAARP RRIAAPSYPF
ARERYWLPDP GTQRAPAARG ARAAGPHPLL REIDSAQSAA CFGATLAGDE PFLRDHRIDG
RPVMPSSAYL EMVREAAARA LGEPADAMLV IEGIAWRNPL TVSGGARRLQ LRARAEPGAR
ALRFDVSSQA AGDAAAAPLA HCDGVARYVP RAPAGAPDLA ALRARLASGP RAADEAACAA
LHARFARLGI EYGATHRVLL RLRVDGDEAL AELAPAADGI AQHELHPGTL DAALQPMLAL
LGERVGDGVP VVPYRIERAE IHAPTHGARW AWLRMRPDAH EWIFDVDLCD ARGALCVALR
GIAVTAWRRP DEVVRLEPVW RAAPVEADLH DERAGPDAQR VVFVCGTHGA PRAWPADGIA
PVRYAALAAG APPGEPDALA GWFEAHALAL FDEVRRLLAP GMRQATLVQI VVPAAGPGAI
LHALGALLQT AHLENPLLHG QVIAIDDIDA PDLPRRLARD ARRAADTRIR YVNGERQVAC
FDEAAAPDGA RALPWRQGGV YLVTGAAGGL ARALADAIAR GVGGDAPRAT LVLTGRSPAR
DDMRALVASL CALGVAADYR VLDVADRDAV ARMVEAIVGE FGALHGVVHC AGVLRDNYLL
RKSADEFAQV LAPKVRGTVN LDLATRDVRS LDFFVTFSSG AGVVGNPGQA DYAVANAFMD
AFAAHRASLG AARPGVSVSI AWPIWQAGGM RIDRQTEAEL ERRLAMRPMP TALGLDALHA
CLLGASPCPT VIHGARARIL ALARQGFAAP PAAPPGFGAA DRGADAPGAD ADADADAVKA
RVRAAIDGAL SAVLKLPDAR LREPEYFESY GIDSINAIRL TVELERTFGP LPKTLFFEYR
HADELERYLV SVHGAAVGAR IRSAARGAPA GPACARAGES GEPPGAPTRP PASEGGAAGA
PRASAAPPLA ERDIAVIGMA GRYPQADDLQ QYWDNLRDGR DCIEEIPPHR WDWRKHYDPA
RGHGAHHSKW GGFINDVDAF DPLFFNISPK EAVSMDPKER LFLEQVWTAM EDAGLRPEDL
RRDAQRGTGV YVGLMYENYQ LLAAEAAAAG SDVGMAGGSY ASIANRVSFF LDLRGPSLAV
DTMCSSSMVA VHLACRDLLA GEIGVAIAGG VNLSLHPNKY RMLSAARFMS GDGRCASFGS
GGEGYVPGEG VGVLLLKRRA DAERDGDRIL GLIKASAINH GGRSNGYTVP NAAAQGGVIA
NAIRAAGIDA RAINYVEAHG TGTALGDPIE LAGLARGFAD SGANGPCRIG SVKSNIGHCE
GAAGIAALTK VLLQLAHRQI VPSLHSRELN PDLPLDGSRW IVNQSLCDWE RVVVDGVPLP
RTAGVSAFGA GGTNAHLILS EYPADACAAP AGVIEPAGRD AHDMHDMHDM HDMHDMHDMH
DMQDMHDTQD MQDMQDALVV PLSARNAQRL HAYAQRLRAF VAAHARGERG APPRLVDLAF
TYQRGRIAMP ERLAIVARSL AELERALTAY VAGQRAGDGI YAGRADRAAA GDARGAASER
TAADFADRLA ARWVAGEAVD WHALFDGRAP RRIAAPTYPF ERGRYWIGAS RAAAGAAEAA
MRAGGGASRA TTAPRALSES GGEVRQPDQT IRRVRLAPTS SFTATARAPS PTQATQATQA
TQATQATQAT QATQATQATQ ATQATQATQA TQATQATQAA QATQATQAAQ ATQATQATQA
SQATRAAAAF APATKASLEA ALRDSLASAL FVGVDEIDPG RPFSELGLDS IVGVEWIRDV
NRRYGVSIRT TDVYDYPSVG EFAGLLERLL RESSVAGAPP APATEPTMEP VTAPRPEAAG
AAPCEPPPDE GRGGAPGNVE RIRRELMRSL ADALFVDIAE IDVDRPFAQI GMDSIVGVEW
IKGINQRYRV ALKATDVYDH PTIASIAALV DARGASAAST ASTASTASTA STASTASTAS
TASTASPEAD VRLAADARIA SAASGAPESE PARAAHPGAA ASRARADVAP DAGPAPRRLD
AAAPDARAAR AEPVPERIAI VGMSGRYPGA PDLDAFWDNL AAGRDAIAEI PPSRWPVGAF
YDPEPGKPGK VYCTRIGLLD DVDRFDPDFF RISPAEAEEM DPQHRLFLQE GYRAIEQMGC
APASLSRRKC GVYLGVMNHE YGELAMRHRG AASGIGSSYA IGAARLAYYL NLKGPAIPVD
TACSSALVAT HLACQALRNG EIDLALVGGV TVYLTPESYV AMCAAGMLSP EGRCKTFDDA
ADGFVPGEGV GALVLKRLAD AERDRDPILG VIVGSGLNQD GRTNGITAPS GSSQTELLRD
VYRRHRIDPA GIGYVEAHGT GTKLGDPIEL TALSAAFGDY TDRRGFCALG SVKTNIGHTS
AAAGVASIHK VLLCLAHREL VPTLNYANPN RHFDFADSPF YVNTDRRAWD AAGDAPRRAA
VSSFGFSGTN AHVVIEEYRP AAAAAPDASP PRVIVPLSAR HPERLRAYAR NLADWLAQAA
ARGAPERLAA HLAYTMQVGR DAMAERVAFV ADGRDELERQ LRRYADTGET SDGVYAGRAE
PHAQASNALM LDEAFGAAID GWMRTGKHEP LAKLWAGGFD LDWARLYDGV PAAAMPRRIA
APTYPFASGR YWIDVEPDGR AAGPDADAAS PEADSGSDSE HAHEHEHEHE HEHEPAATLA
YLPVWEELPP AQPRAAPDAQ AGGVLVVHRG GAWGLVDAIE RECVDGRHAG ATCVTLDLSG
HAPSPEGRAW RNAAPDAARL AAWLGEFGPV RAVFFAAGCS EARHDAPGAH GWASAPDAHG
EDERALLQLA QALMRSQAAD ASIEFVVLSL DHHRTDGTPS NPAGGGVAGI AYAIAQGDHR
FRVTNVDVSL DELRAARHAP APHPVLAAVL RLAPSDRGAL VRLRAGRGYR QAFVRLDWAA
EAGASGLKQG GVYVMLGGAG RVGRALTRRL IERYRANVAW IGRSPADSAS VAHALRALGP
AGPAPYYAQA DATDAAQMRR AIEAVRQRHG RIDGAVFCGM VFDANHAIAS VPAHRFDEIL
DVKARGSRIF YEALAHEPLD FLCYCSSAQS FSFSGAARLG AYAAATTAGD AIVRSIAPVA
AFPVGTIHWG FWETSVEDSA LGSRHLGALS DDEGFACFER FVGQCMRGNP LREVVCMRAS
PEVEHLMQVL PGETATLAAP GQPAQPAPLR AAPDGAAASE AAPPDGAADV SADIDAWLAR
LTFATLRPML DGPRPARACH ARWWDETLRI FAARGWLRIV DGAPRVIAEP DAGEHVWRDW
ARYRFDTPAA RGRRAQIDLA DVCVRALPDV LAGRLPAADV LFPGGSMERV EGVYRDNPIS
DYFNAVQADA LIRHVRAWID AGRREPIRIL EVGAGTGGTT ALALERLRPY AAAIGEYCFT
DVSQAFLQHA QAAFGARAGY LRTALFDVER PLDAQRMPAG RYDIVIATNV LHATRQVRGA
LRNVKACLRA GGVLLLNEIS EKSLFAHLTF GLLEGWWLHE DSSLREPGSP VLAPATWRRL
LEDEGFGAIA FAARDAHALG QQVVCATSDG VIRQRAGEPS GHSSRQGHRN HQDHQGRQES
STHAGEAGAP APASAGRAGA APAASPAGTA RESVVAAIHR ALQQSLKLPE ARIGDHTPFL
DYGIDSILGV RFVDSLKQAL DVPLNTAVLF DYPTVERLAD FIVATYGARL AARGASAAPA
SVATASATLA ASTASTASTA PTASTAPTAS TAPTASTAPT ASTAPTAPTA PAAPTAPAAP
NAPTAPAMPA EAVSRDAAAP RAEPAGARPA DIAVIGMAGQ FPDAPDVDAF RALLEHARDG
APGVSGGMLE NRDRFDHAFF HITPDEADAM HPYQRLVLQE SWKALEDAGY NPAALAGARV
GVFVGAEPAD YRSTTFSGSS DALIASRVSY HLNLRGPAYV VNTGCSSGAV AIHLACESLR
RNESDVVLAC GIFAAMGPRM LGALGQAGML SAGGRCRSFD AGADGTAFAE GIGVVALKRL
ADAIADGDPI HGIVKASGVN QDGTSNGIMA PNGVAQEELI VDVYERFGID PADIRYVEAH
GTGTLFGDAV EANALVRAFR RFTERSAYCA LGTVKATIGH TAAAAGVIGL IRILLSMRAR
RLPGMPGLGR ANPMIDLDAS AFSLGLVSRE WPAGRDGRPR LAALNTFGHS GTNVHIVVQE
PPQARARPAR AADGPRVAVP LSAMDREALR RYAARLCERL EAEGAALCVG DVAHTLRVAR
EPMAQRIVLF ASTTGELAAL LRAFVDGRDS PCLLDGAVTA AARAAGLDAA QLAQAARWLA
GERVDWPPAG GTPMRVHLPA YPFAGRRCGA AGWARAEAGA SRDCAAAGPC EPPAGVAAAA
MTVAAAGPRV DATPSAAADR ARGAARPAEW LAARVAARLG VPAARVDRRR SLLDLGLTSQ
DLVSLAGQLR DATGEALLPS VLFDYPTIER LAAHLADTCP AAFGAAEPAE TAETGRAAAG
DAASGPAPGV IALLERLEGG GLSLEETIYL IENTK