Gene BURPS1106A_A2212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2212 
Symbol 
ID4903610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2159944 
End bp2178189 
Gene Length18246 bp 
Protein Length6081 aa 
Translation table11 
GC content69% 
IMG OID640145317 
Productnon-ribosomal peptide synthase 
Protein accessionYP_001076245 
Protein GI126456041 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGCGA ACGATCTGCT GGCGTTGCTG AATTCGAAGG GCATTGCGCT TTCCGTCAAG 
GGAGACAATC TGGCGATCAC AGGCGACGAG AAGGTGCTGG AGGACCCGGG CCTTCTCGCG
CTGTTGCGCG CGAACAAACC GGCGTTGATC GACCTGATCA AGGACGGGCA TGGCACCGTG
GGCGGCGCGG TTCGCGTGCC GCCGAACCGG ATACCGGCCG ACGCCGAACG TATCGCGCCG
GAGCAACTGA CGCTGGTTCG ACTGGATCAG GGCGAGATCG ATGCGTTGGT CGCGCGCGTG
GACGGCGGCG CGGCGAACGT TCAGGATATC TATCCGCTCG TACCGATGCA GGAAGGCATG
CTGTTCCACC ATCTGCTGAG CCAGCAAGGC GATGCCTATC TGGAAAGCTA TCTGCTCGCA
TTTCGTACGC GCGAGAGGCT GGATCGTTTT TTGTCGGCAT TGCAACAGGT GATCGACCGG
CATGACATCC TGCGCACCGC GTTCTTCTGG GAAGGGCTGC CGCACTCCGT CCAGGTCGTG
CAGCGTCGCG CGACGTTGCC GTTGAACGTC GTCGAACTCG ATCCTCGCGA TGGCGACGTC
GGACAGCAAC TCGAGGCGCG CTACGATCCG CGCAGCCATC GCATCGATGT CGGTCGGGCG
CCGCTGATGC AAGTCCACGC CGCGCATGAT GCGGCCGGCG GAAGATGGCT CGTGCGGCTG
CTTTCTCATC ACTTGGCCGT CGATCATACG ACGCTCGAGC GTGTGATCGA GGAGGCGCGC
GCGATCGAGC AGGGCCGAGC GGAGGACTTG CCGCGGCCGG AGCCGTTCCG GAATTTCGTC
GCGCAGGCGA GGCTGGGGGT GAGCGAGGCG GACCACGAAG CGTACTTCAG GGCGAAGCTG
GGGGACATCG ATGAGCCGAC GGCGCCGTTC GGTCTGCTGA GCGTTCAGGG AGACGGGCGT
GAGATAGCGG AGGCGGCGCG GACGTTGAAG CCGGAGCTGT CGGGGGCGCT TCGCGGACAT
GCGCGCCGGC TGGGGGTGAG CGCGGCGAGC ATGATGCATG TGGCGTGGGG GCTGGTGCTG
TCGCGCACGA CGGGGCGGCA GGACGTGGTG TTCGGCACGG TGCTGTTCGG GCGGATGCAG
GGAGGCGCGC AATCGGATCG AGCGCTGGGC TTGTTCATTA ACACGTTGCC GGTGCGGATG
AAGGTAGCGC AGACGGGCGT GGAGGCGAGC GTGAAGGAGA CGCATGCGCA GCTGGCGGAA
TTGATGCGTC ACGAGCATGC GCCGCTGGTG CTGGCGCAGC GTTGCAGCGG GGTGCCGGCG
CAGACGCCGC TGTTCACGTC GCTGCTGAAT TATCGATATG GATTGCGGCA TCGCGCCGAT
GCGGCGACGC CGGGCGGTGA CGATATCGAA TTGCTGAGCG CGAGGGAACG GACGAATTAT
CCGCTGACGC TATCGGTCGA CGATCTGGGG CAGGATTTTT CGCTGACGGT TCAGGTGAGC
GGACACGTCG ATCCGCAGCG CGTATGCGCG TTCATGGAGA CGGCGCTGGA GCAATTGGCG
CAGGCGCTGG GCGAGCAACC GCAATGCGAC ATCGGCGGAC TCGATGTGTT GCCGCGGTCC
GAGCGTGAGC AGATGGTGTA CGCGTGGAAC GCAACAGAGC GTGACTATCC GATCGAGCAA
TGTATTCATC AGTTGTTCGA AGCGCAGGTG GATCGGAAGC CGGAGGCGAT TGCGCTGACG
TTCGAGGGAC AGCGACTGAG CTACGCGGAA CTCAACGCCC GAGCGAACCG GCTTGCGCAC
TATCTGCAGG CGCGCGGCGT GGGCCCGGAC CGACTGGTCG CGCTTTGCGC GGAACGCGGA
ATCGAGATGG TGGTGGGGCT GCTGGCGATC CTGAAGGCGG GCGGCGCGTA TGTGCCGCTG
GATCCGGCAT ATGCGTCGGA TCGCCTGCGC GGGATCGTGC AGGACAGCCA GCCGGCCTTG
GTGCTGGCGG ACGCGGTGGG GCGCGCGGCG TTGGGCGAGT TGGATGGTGC GCTGCCGGTG
ATCGATCCGG AAACGGATGC GCTGCGCTGG CGTGAGATGC CGGCGACCAA TCCGGAGGTG
GCATCGCAGC ATGTGCACCA CCTTGCCTAT GTGATCTACA CGTCGGGCTC GACGGGCCGG
CCGAAGGGGG TGATGGTCGA GCACGCGCAG GTTGTGCGCC TGTTCGGCGC GACGCAGGCA
TGGTTCGGCT TCGACGAGCG GGACGTGTGG ACGCTGTTCC ATTCGTACGG CTTCGACTTC
TCGGTATGGG AGTTGTGGGG CGCGCTGCTG CACGGTGGTC GGCTGGTGAT CGTGCCGACG
GAGGTAACGC GCACGCCGTC GGCGTTCTTT GCGCTGCTGT GCGCGGAAGG CGTGACGGTG
CTGAATCAGA CGCCGAGCGC GTTCCAGGCG CTGATGTCGG CGCAGGAGGA GCGGGAGGAA
GCGGCGTTAA AGGCGGATGT AGACGCGGAA GCGGAGGCGG CCGGGAATAT CGAGCGCGCA
AACGTAATCG CCCACCGGCT GCGCTATGTC ATCTTCGGTG GAGAGGCGCT GGAGCCGCGG
ACGCTCGCGT CGTGGTATGC CCGTCACGGC GAGCGTACGC AGTTGGTGAA CATGTATGGA
ATCACCGAGA CGACGGTGCA CGTGACGTAT TGTGCGCTGC GAGCGGAAGA CGCCATGCGT
TTGGGTGCGA GTCCGATCGG CGTGCGGATT CCGGATTTGC AGCTGTACGT GCTGGATGAT
CGTCGCGAGC CGGTGCCGAT GGGCGTGACG GGAGAGCTGT ATGTGGGCGG GGCGGGTGTC
GCACGCGGGT ACTTGAACCG GCCGGAGCTG ACGCGGGAGC GGTTCATCGA CGATCCGTTC
GTGGCGGGCG GGCGGCTGTA TAAGACGGGG GATCTGGCGC GTTGGCGTAC GGACGGGAGC
CTGGAGTATC TGGGTCGAAA CGACTTCCAG GTGAAGATAC GCGGTTTCCG GATCGAGTTG
GGGGAGATCG AGGCGCAGCT GGCGAAGGTG ACGGGGGTGC GCGAAGTGGT CGTGCTTGCG
CGAGATTCGG CATCGGAGGT GCACGATAGC GCGACGGAAC ACGCAACTCC GAATGCGCTT
TCGCCCTCGC CCGAGACCTC AACCGCGACA GCAGCGGCAA CAGCAACAGC AACAGCGACC
GAGAAACGCC TCGTCGCGTA CTACACGGGC GATGCCGATG TCGTGGCATT GAGAGCGCAA
GCCGCGCAGC ACTTGCCGAG CTACATGGTG CCGTCGGCGT ACGTGCGGCT GGACGCGTGG
CCGCTGACGC CGAACGGCAA GCTGGACCGG CGCGCGCTAC CCGCGCCGGC GGACGACGCA
TACGCCCGCG CCGAATACGA AGCGCCGCAG GGCGCAAGAG AAGAAGCACT GGCCGCGATC
TGGCGGGACT TGCTGCATGT GGAGCGCGTC AGCCGCCACG ACAACTTCTT CGAACTCGGC
GGCCACTCGC TGCTGGCGGT GCAACTGGTA TCACGCCTGC GGCAGGCGCT GTCGGTGGAG
GTGGCGCTGG GCACGGTGTT CGACGCGCCG GTGCTGTCGG CTTTGGCATC CCGATTGGAC
GATAACACCG CGGAGGTCCT GCCGCCGATA CCGCTGGCGC CACGCGACGG AAGAATCGCG
CTGTCGCTGG CGCAGCAACG GCTATGGTTC CTGACGCAAC TGGAAGGCGT CAGCGAGGCG
TACCACATGA GCGGCGCGGT GCGGCTTGAT GGGCCGTTGA ATCGAGAGGT GCTGCAACGT
GCGCTGAACC GTATCGTGAT GCGTCACGAA GCACTGCGGA CGTGCTTCGC TCGGGAGGAG
GGTGAACCGA TCCAGGTGAT CCAGCCGCAT GCCGATCTGA CGATGAGCTA TCACGACCTG
CGCGAAGCGG AGTCGATCCG ACATGAAGCC GGGAACCGCG AACAGCGGGC AAAGGACCTG
AGCCAAGCCT ACGCATCGGC GCCATTCGAC CTGAGCCGAG ACCTGCCGGT GCGAGTGCTG
CTGCTGCAAT TGGCGGATGA AGCCCACGTC GTGCAGGTGG TGATGCATCA CATCGCATCG
GACGGCTGGT CGGTCGGGGT GTTCCTGCAA GAGCTGAGCG CGCTGTACGG GTCGTTCATC
GCGGAGCAGG ATGATCCGCT GGCGCCGCTG CCGTTGCAAT ACGCGGACTA CGCCGCGTGG
CAACGCAGGT GGCTGGCGAG CGGCCAGTTG GAGAAGCAAG GGGCGTTCTG GCAAACGAAC
CTGTCCGGCG CACCGACGCT GCTGGAGCTG CCGACGGACC GTCCGCGTCC GCCGAAGCAA
TCGCACGCGG GCGCGAGCGT CGAGGTGAAG CTGGGCGCGG CGTTGAGCGA ACGGGTGAAG
CGCCTGAGCC AACGCCACGG GGTGACGCCG TACATGACGC TGCTGTCGAG CTGGGCGGCG
GTGCTGAGCC GGTTGAGCGG GCAGGAGGAG GTGGTGATCG GCAGCCCGGT CGCGGGGCGG
AACCGAACGG AGGTCGAAGC GCTGATCGGC TTCTTCGTGA ACACGCTGGC GTTGCGGCTG
GATCTGTCGT CGGAGCCGAC GGTGGGCGAA TTGCTGAAGC GGACGAAGGC GCAGGTGCTG
TCGGCGCAGG CGCATCAGGA CTTGCCGTTC GATCAGGTGG TGGAGCGGGT GAAGCCGCCG
CGCAGCACCG CGCATCCGCC GCTGTTCCAG GTGATGTTCG TCTGGCAGAA CATGCCGGCG
GGGGAACTGA CGATACCGGG GCTGACGATC CGCGCCGTGG AGACGCCGCT GCAGACGGCG
CAGTTCGAAC TGACGCTGTC GTTGCAGGAG GCGGGCGACG ACATCGTCGG CCACCTGAAT
TACGCGAGCG CGCTGTTCGA CGAATCGACG GTGAGGCGTT ACGTGACCTA TTGGTGCCGT
TTGCTGGAAG GCATGACAGC GGGCGCCGCG GACCAGACGA TCGTGGGCTT GCCGTTGCTC
GACGAAGCCG AACGCAAGCA GGTGGTGTCC GAGTGGAACG CAACGGAGCG TGACTATCCG
ATCGAGCAAT GCATTCATCA GTTGTTCGAA GCGCAGGTGG ATCGGAAGCC GGAGGCGATT
GCGCTGACGT TCGACGGGCG GCGACTGAGC TACGCGGAAC TCAACGCCCG AGCGAACCGG
CTTGCGCACT ATCTGCAGGG GCGCGGCGTG GGCCCGGACC GACTGGTTGC GCTTTGCGCG
GAACGCGGAA TCGAGATGGT GGTGGGGCTG CTGGCGATCC TGAAGGCGGG CGGTGCGTAT
GTGCCGCTGG ATCCGGCATA TGCGTCGGAT CGCCTGCGCG GGATCGTGGA GGACAGCCAG
CCGGCCTTGG TGCTGGCGGA CGCGGTGGGG CGCGCGGCGT TGGGCGAGTT GGATGGTGCG
CTGCCGGTGA TCGATCTGGA AACGGATGCG CTGCGCTGGC GTGAGATGCC GGCGACCAAT
CCGGAGGTGG CATCGCAGCA TGTGCACCAC CTTGCCTATG TGATCTACAC GTCGGGCTCG
ACGGGTCGGC CGAAGGGGGT GATGGTCGAG CACGCGCAGG TTGTGCGCCT GTTCGGCGCG
ACGCAGGCAT GGTTCGGCTT CGACGAGCGG GACGTGTGGA CGCTGTTCCA TTCGTACGGC
TTCGACTTCT CGGTATGGGA GATGTGGGGC GCGCTGCTGC ACGGTGGTCG GCTGGTGATC
GTGCCGACGG AGGTAACGCG CACGCCGTCG GCGTTCTTTG CGCTGCTGTG CGCGGAAGGC
GTGACGGTGC TGAATCAGAC GCCGAGCGCG TTCCAGGCGC TGATGTCGGC GCAGGAGGAG
CGGGAGGAGG CGGCCGGGAA TATCGAGCGC GCAAACGTAA TCGCCCACCG GCTGCGCTAT
GTCATCTTCG GTGGAGAGGC GCTGGAGCCG CGGACGCTCG CGTCGTGGTA TGCCCGTCAC
GGCGAGCGTA CGCAGTTGGT GAACATGTAT GGAATCACCG AGACGACGGT GCACGTGACG
TATTGTGCGC TGCGAGCGGA AGACGCCATG CGTTTGGGTG CGAGTCCGAT CGGCGTGCGG
ATTCCGGATT TGCAGCTATA CGTGCTGGAC GCTCGTCGCG AGCCGGTGCC GATGGGCGTG
ACGGGAGAGC TGTATGTGGG CGGGGCGGGT GTCGCACGCG GGTACTTGAA CCGGCCGGAG
CTGACGCGGG AGCGGTTCAT CGACGATCCG TTCGTGGCGG GTGGGCGGCT GTATAAGACG
GGGGATCTGG CGCGTTGGCG TACGGACGGG AGCCTGGAGT ATCTGGGTCG AAACGACTTC
CAGGTGAAGA TACGCGGATT CCGGATCGAG TTAGGGGAGA TCGAGGCGCA GTTGGCGAAG
GTGACGGAGG TGCGCGAGGT GGTCGTGCTT GCGCGAGATT CGGCAGCGGA CACGGATCAA
AACGCGGACC TGAATGCAAG CGCGACGGCG AACTCAAGCG AGAAACGCCT CGTCGCGTAC
TACACGGGCG ATGCCGATGT CGCGGCATTG AGAGCGCAAG CCGCGCAGCA CTTGCCGAGC
TACATGGTGC CGTCGGCGTA CGTGCGGCTG GACGCGTGGC CGCTGACGCC GAACGGCAAG
CTGGACCGGC GCGCATTGCC CGCGCCGGCA GACGACGCAT ACGCCCGCGC CGAATACGAA
GCGCCGCAGG GCGCAAAAGA AGAAGCACTG GCCGCGATCT GGCGGGAGCT GCTGCATGTG
GAGCGCGTCA GCCGCCACGA CAACTTCTTC GAACTCGGCG GCCACTCGCT GCTGGCGGTG
CAACTGGTAT CGCGCCTGCG GCAGGCGCTG TCGGTGGAGG TGGCGTTGGG CACGGTGTTC
GACGCGCCGG TGCTGTCGGC ACTGGCCGAG CGGCTCGAAG CGGAGAACAC GGCGGTCTTG
TCGCCGATAC CGCTGGCGCC ACGCGACGGA AGAATCGCGC TGTCGCTGGC GCAGCAACGG
CTATGGTTCC TGACGCAACT GGAAGGCGTC AGCGAGGCGT ACCACATGAG CGGTGCGGTG
CGGCTTGATG GGCCGTTGAA TCGAGAGGTG CTGCAACGTG CGCTGAACCG TATCGTGATG
CGTCACGAAG CACTGCGGAC GTGCTTCGCT CGGGAGGAGG GTGAACCGAT TCAGGTGATC
CAGCCGCATG CCGATCTGAC GGTGAGCTAC CACGACCTGC GCGAAGCGGA GTCGATCCGA
CATGAAGCCG GGAACCGCGA ACAGCGGGCA AAGGACCTGA GCCAAGCCCA CGCATCGGCG
CCATTCGACC TGAGCCGAGA CCTGCCGGTG CGAGTGCTGC TGCTGCAATT GGCGGATGAA
GCCCACGTCG TGCAGGTGGT GATGCATCAC ATCGCATCGG ACGGCTGGTC GGTCGGGGTG
TTCCTGCAAG AGCTGAGCGC GCTGTACGGG TCGTTCATCG CGGAGCAGGG CGATCCGCTG
GCGCCGCTGC CGTTGCAATA CGCGGACTAC GCCGCATGGC AACGCAGGTG GCTGGCGAGC
GGCCAGTTGG AGAAGCAAGG CGCGTTCTGG CAAACGAACC TGTCCGGCGC GCCGACGCTG
CTGGAGCTGC CGACGGACCG TCCGCGTCCG CCGAAGCAAT CGCACGCGGG CGCGAGCATC
GAGGTGAAGC TGGGCGCGGC GTTGAGCGAA CGGGTGAAGC GCCTGAGCCA ACGTCACGGG
GTGACGCCGT ACATGACGCT GCTGTCGAGC TGGGCGGCGG TGCTGAGCCG CTTGAGCGGA
CAGGAGGAGG TGGTGATCGG CAGCCCGGTC GCGGGGCGGA ACCGAACGGA GGTTGAACCG
CTGATCGGCT TCTTCGTGAA CACGCTGGCG TTGCGGCTGG ATCTGTCGTC GGAGCCGACG
GTGGGCGAGT TGCTGAAGCG GACGAAGGCG CAGGTGCTAT CGGCGCAGGC GCATCAGGAC
TTGCCGTTCG ATCAGGTGGT GGAACGGGTG AAGCCGCCGC GCAGTACCGC GCATCCGCCG
CTGTTCCAGG TGATGTTCGT CTGGCAGAAC GCGCACGAAG GCAGCTTGCA GATTCCTGGG
TTGCGTCTGA GCACATGGGG CGATCCGCTG ACGATGGCGC CTTTCGAGCT GACGCTTGCC
GTCCGCGAAC ATCAGGACGA TATCGCGTGT ACGTTGACCT ATGCGACGTC GTTGTTCGAT
CGAGCGACGG TCGAGCGCTA TCTCGGCCAC TGGCTCCGGC AGCTCGACGC AATGGCGACC
GATGCGGATC CCGTCGTCAC AGGCCTGCCG CTGCTCGGCG AAGCCGAGCG CGCGCAGGTA
CTGCACGGCT GGAACGAAAC AGGCCGCGCG TATGCGCGGG ATGCCTGCCT CCATCAGTTG
TTCGAAGCCC AGGTATCGCG AACGCCGGAA GCGGCGGCGG TGATCTGCGG CGACGAGACG
CTGAGCTATA CGGACCTCGA CGCGCGTGCG AATCGCCTCG CGCACTACCT GCGCGGACAA
GGCGTCGGGC CGGACACGCG CGTGGGGCTG GCGCTCGGGC GCGGCGTCGA GATGATGACG
GGATTGTTGG CGGTCCTGAA GGCGGGCGGT GCATATGTGC CGCTGGACCC GGGCTATGCG
TCGGAGCGCT TGCGCGCGAT CCTGGACGAC AGCCGGCCGG CGATCGTGCT GGCGGACGCG
GCGGGGCGCA CGGCGCTGGA TGCGCTGGCC GGTGCGCCGC CGATTGCGGA CCTGCAAGCG
GATGCGTCGC GCTGGAGCGC GTTGCCTTCG ACGCCGCCGC GTGTCGAAGG CCTGACGCCG
CGCCACCTTG CCTATGTGAT CTACACGTCG GGCTCGACGG GACAGCCGAA GGGGGTGATG
GTCGAGCACG CAAGCGTGGT GAATCTGTGG CGTGCGCTGG ACGAGGCGAT CTACCGCGCG
CATCCGAGCG CACGGCGCGT GAGCCTGAAC GCATCGATCG CGTTCGATTC GCTGGTCAAG
CAGTGGGTGC AGTTGCTGTC GGGGCGGACG CTGGTGGTGG TGCCGGAGCC GGTGCGTTTC
GACGGGAGGC GTCTGCTCGA TGCGATCGGG CGAGACCGGA TCGACGTCTT CGATTGCACG
CCGTCGCAAC TGGCGTTGAT CGAGGGGGCG CGAGGGCCGG AGGACGAAGC CTATCCGCAA
GTGACGCTGG TGGGGGGCGA GGCGATCGGC GAAGGGATGT GGTCGGAGTT GGCGAGCGTA
TCGAGCCGGA CGTACTACAA CGTGTATGGT CCGACGGAAT GCACGGTGGA TGCGACGCTC
GCGCGGATCA CGGCGGAGCA TGCGCCGCAC ATCGGCGGGC CGCTGGCGAA CGTGCGGGCC
TATGTGTTGA ACGAGCGGTT GAGCCCGGCG CCGGTGGGCG TGCGCGGGGA GCTCTACATC
GGCGGGGCGG GTGTTGCGCG CGGGTACTTG AACCGGCCGG AGCTGACGCG GGAGCGGTTC
ATCGACGATC CGTTCGTGGC GGGCGGGCGG CTGTATAGGA CGGGGGATCT GGCGCGTTGG
CGCACGGACG GGAGGCTGGA ATATCTGGGT CGAAACGACT TCCAGGTGAA GATACGCGGA
TTCCGGATCG AGTTGGGGGA GATCGAGGCG CAGCTGGCGA AGGTGACGGG AGTGCGCGAG
GTGGTCGTGC TTGCGCGAGA TTCGGCATCG GCGGTGCGCG ATAGCGCGAC GGAACACGCA
ACTCCGAATG CGCTTTCGCC CTCGCCCGAG ACCTCAACCG CGACCGCGAC CGCGACGGCA
ACAGCAACGG CAACAGCGAC AGAGAAACGC CTCGTCGCGT ACTACACGGG TGACGCCGAT
GTCGCGGCAT TGAGAGCGCA AGCCGCGCAG CACTTGCCGA GCTACATGGT GCCGTCGGCG
TATGTGCGGC TGGACGCGTG GCCGCTGACG CCGAACGGCA AGCTGGACCG GCGCGCATTG
CCCGCGCCGG CGGACGACGC ATACGCCCGC GCCGAATACG AAGCGCCGCA GGGCGCAAAA
GAAGAAGCAC TGGCCGAGAT CTGGCGGGAC TTGCTGCAAG TCGATCGAAT CAGCCGCCAC
GACAACTTCT TCCAGTTGGG CGGTCATTCG CTGCTGGCGA TCAGCCTGGG CGACATGATG
CGCGAGCGCG GCTTGCACGC CGACGTGCGC ACGCTGTTCA ACGCGGAGAC GCTGGCCGCG
CTCGCCGCGC AATCGGGCAC GGACAGCATC GACGTCGACG TCCCGCCGAA CCTGATCCCC
GTCGGCGCCG CGCGAATCAC GCCCGACATG CTGCCGCTCG TCGCGCTGAC CCAGGCGCAG
ATCGACGCGA TCGCGCAGCA GGTAGACGGC GGCGCGACGA ACGTGCAGGA CATCTATCCG
CTCGCGCCGC TGCAGGAAGG CATGCTGTTC CATCATCTGC TGCACACGCA GGGCGACCTC
TATCTGGAAC CGCATCTGCT CGCGTTCCGC ACGCGCGAGC GGCTCGAGCG GTTCCTGTCC
GCGCTGCAAT GCGTGATCGA CCGTCACGAC GTGCTGCGCA CCGGCTTCTT CTGGGAAGGC
GTCCCGCAGC CGGTGCAGGT GGTGTGGCGG CGCGCGCGGC TGCCGGTCGA ATACGTCGAG
CTGCCGGACA GCCACGGCGA TGTCGCGAGC CAGCTCGAAG CCCGCTGCGA TCCGCGCCGC
CATCGCATCG ACATCGGCCG CGCGCCGCTC GTGCACTGTC ACGTCGCGCA CGATGCGCGC
AACGACCGCT GGGTGCTCGG CGTGCTGACG CACCATCTGG TCAGCGACCA TACGACACTC
GCGCTCCTCG CCGAAGAAGC GCAGGCGTTC GAGCAGGGCC GCGGCGATGC GCTGCCGCCT
GCGGTGCCGT TTCGCAACTT CGTCGCGCAC GCGCGCCTCG GCACGAGCGA GCGCGAGCAC
GAAGCGTTCT TCCGCGAGAT GCTGGGCGAC GTCGACGAGC CGACCGCGCC GTTCGGCCTG
CTCGACGTGC AGGGCGACGG CAGCGCGATC GTCGAGCACC GGCGCGCGCT CGCGCCCGGG
TTGTCCCGCT CGGTGCGCGC GCACGCGCGG CGTCTCGGCG TGAGCGCGGC GAGCGTGATG
CACGTCGCAT GGAGCCTCGT GCTCGCGCGG ACGGCGAACC GGCGCGACGT GGTGTTCGGC
ACGGTGCTGT TCGGCCGCAT GCAAGGCGGC GCGCACGCGC ATAGAACGAT GGGCCTCTTC
ATGAACACGC TGCCCGTGCG GATCGCGCTC GACGAGTCGG ATGTCGAGAC GAGCCTGATC
GCGACGCACG ATCGCCTCGC GCGGCTGCTG CGCCACGAGC ACGCGCCCCT CGCGCTCGCG
CAGCGCTGCA GCGCGGTACC CGCGCAGGCG CCGCTGTTCA CGTCGCTGCT GAACTACCGC
TATTCGCCGC ACGAAGAGCA GGGCGACGCA ACGGACGACG ACGTTCAGTT CATCGCCGCG
CGCGAGCGCA ACAACTACCC GCTGACGATG ATCGTCGACG ACACGGGCGA GGGCTTCGCG
CTCACGGCGC AGGTGGACGC CTCGATCGAC GCGGCGCGCG TCTGCGCGTT CATGCATACG
GCGCTCGAGC AGCTCGTGCG CGCGCTCGAC GACGCACGCG GCGCGGTGCT CGCCGAGCTC
GACGTGCTGC CCGCCGACGA GCATCGGTGC GTGGTGTCGG CCTGCAACGA TACGGATGCC
GAACTGCCGG GCGTCGACTT CGTCGATCGC CGGTTCGAGG CGCAGGCGGC GCGCACGCCC
GAGGCGATCG CCGTCGCGTG CGGCGCGCAC GCGCTCAGCT ACGCGGCGCT GAACCGGCGT
GCGAATCGCC TCGCGCACTA TCTGCGCGCG CACGGCGCGG GCCCGGAGCG CGTCGTCGCG
CTCGCGCTCG AGCGCTCGGT CGACATGATG GTCGGGCTGC TCGGCATCCT GAAATCGGGC
AGCGCCTATC TGCCGCTCGA TCCCGCGTAT CCGGCCGAGC GGCTCGCGTA CATCGTCGAC
GACGCGCGCC CCGCGCTGCT GTTGACTGAA GCCGCGCTGC GGGACGACTG GCGAGACGCC
GGCGCACCCG TCGTGCTGCT CGACGCGGAC GGGCCGGCGA TCGACGCGTG TCCGGATCAC
AACCCGGACG CCGCCGCCGG CCGGGATGCG CGCACACTGT CGTCGCTCGC GTACGTGATC
TACACATCGG GTTCGACGGG GCGCCCGAAG GGCGTGATGA TCGAGCATCG AAATCTCGCG
AACTTGCTCG GCGCGATGGG CGAGCAGCCC GGCATCGGCG CGCACGACGT GCTGCTCGCG
GTGACCTCGC TGTCGTTCGA CATCGCCGCG CTGGAGCTCT TCCTGCCGCT GCTGCACGGC
GCGCGCGCGG TGATCGCGGC GCGCGACGAC GCGGCCGATC CGGCGCGGCT CGCGCATCTG
ATCGAAAGCA GCGGGGCGAG CCTGATGCAG GCGACGCCTT CGACGTGGCG CATGCTGGCG
CAGCACGGCT GGCCGCGATC GGCGCGGCCG CTGACGCTGC TGTGCGGCGG CGAGGCGCTG
CCGCCCGCGC TCGCCGAGCG GCTGCTCGCG CATGTCCCCG CGATCTGGAA CCTGTACGGG
CCGACGGAGA CCACGGTATG GTCGACCGTG CGGCGCGTGA CGACGCCCGT CGTCGACATC
GGAGGGCCGA TCGCCAACAC GCAGGTCTAC GTGCTCGACG AGCGGCTGCG CCCCGCGCCG
ATCGGCGTCG CGGGCGAGCT GTACATCGGC GGCGCGGGCG TCGCGCGCGG CTATCTGAAC
CGCCCCGAGC TCACGCGCGA GCGCTTCGTC GACGATCCGT TCCGGCGCGG CGGGCGGCTG
TACCGCACCG GCGATCTGGC GCGGCGGCGC GCGGACGGCA ACCTCGAGTA CCTCGGCCGC
AACGATTTCC AGGTGAAGAT TCGCGGCTTC CGGATCGAGC TCGGCGAAAT CGAGGCGCAG
TTCGCGAAGG CGCACGGCGT GCAAGGCGTG GCGCTTGCCG CGCGCGACAC GCCCACGGCA
GACAAGCGGC TCGTCGCGTA CTACGTCGGC GACGCGAGCG CCGCGGCGCT GCGCGAGCAC
GCGGCCGCGC GATTGCCGGC GTACATGGTG CCGGCGGCCT ATGTGCGGCT CGCCGCGTGG
CCGCTGACGC CGAACGGCAA GCTCGACCGC GCGGCGCTGC CCGCGCCGGA CGACGAAGCG
TACGCGCGCG CCGAATACGA AGCGCCGCGG GGCGAGCACG AGTGCAAGCT CGCGGCGATC
TGGCGGGCCG TGCTGCAGGT CGAACGGATC GGCCGTCACG ACGATTTCTT CGAGCTGGGC
GGCCATTCGC TGCTCGCGGT GCGCGCGGTC ACGGCGATGC GCGATGCGTT CGGCAGCGAC
ACGAGCCTGC GCGACCTGTT CGCGCGGCCC GTGCTGAAAG ATCTCGCCGA ACACGCGAGC
ACGGCCGCGC GTGCGCGCGA CGCGGCGATC CCGAAGGCCG CGCGCGGCGA GCCCGCGCCG
ATGTCGTTCG CGCAGCAGCG GCTGTGGTTC CTCGCGCGGA TGGGCGGGCT CGGCGATGCG
TATCACATGC CGATCGCCGT GAGGCTGCGC GGCGCGCTCG ACGTCGACGC GCTGCAGCGC
GCGCTGAGCC GAATCGTGTC GCGCCACGAT GCGCTGCGCA CGACGTTCGC GCTCGAAGGC
GAGCAACCGG TTCAGCGCGT GCACGCGGAT GATGGCGCGG GGCTGCGCTT GCGCATCGAC
GATCTGCGCG GGTGCGCCGA CGCCGGCGCG CGGCGCGCGC GGATCCTGGC CGGGCAGGCG
AGCGAGCCGT TCGATCTGGC GCGCGGGCCG CTGGTTCGCG GCGCGCTCGT GCGCGAGGCC
GACGACGTGC ACACGCTATG CGTGACGATC CATCACATCG TGTCGGACGG CTGGTCGATC
GACGTGTTCT GCCGCGAGCT GAGCGAGTTG TATCGCGCAT TCGCCGGCGG GCAGCCCGAC
CCGTTGCCGC CGCTGCCGGT GCAGTACGCC GATTACGCGG CGTGGCAGCA ACGCGGCATC
GGCGGCGCGG CGCTGCACGC GCAGGCCGAA TACTGGCGCG ATGCGCTCGC GGGCGCGCCG
ACGCTGCTCG AACTGCCGAC GGACCGGCCG CGTCCGCCGC AGCCCGACTA TGCGGGCGCG
ACGGTCGGGC TCGCGCTCGA CGCGCCGCTG ACGGCGGGCT TGCGCGCGCT CGCGCGGCGT
CACGGCGCGA CGCTCTTCAT GACCGTGTTC GCCGCGTGGA GCGTGCTGCT GTCGCGCCTG
TCGCGGCAAA CCGACGTGGT GATCGGCACG CCGAGCGCGA ACCGCGGCCA TGCGCAGATC
GAGGGCTTGA TCGGCTTTTT CGTCAACACG ATCGCGCTGC GCGTGGACCT CGACGGCGCG
CCGACCGTGG CCGAGCTGCT CGCGCGCGTG AAGGCGCGCA CGCTCGCCGC GCAGCAGCAT
CAGGACATTC CGTTCGAGCA TGTGGTCGAG CGGGTGCAGC CGGCGCGCAG TCTCTCGCAT
AGCCCGGTGT TTCAGGCGAT GTTCGCGTGG CAGCACGCGT CGCGCGGCGA GATGCGGCTC
GAAGGGCTGC GCGCGGAGCC GCTCGACGAC GCGGCGCGCA CGATCGCGAA GTTCGATCTG
ACGCTGTCGC TGCGCGAGAG CGGCGATGCG ATCGACGGCG GTCTCGAATA CGCGAGCGCG
CTGTTCGAGC GCGCGACGAT CGAGCGCTTC GCCGGCTACC TGCGGCGTTT GCTGGAAGGG
ATGGTCGCCG ATGACACGCA GCGCGTCGAT GCATTGCCGA TGCTGTCGCG CGACGAACGG
CGCGATCTGA TCGAGCGCCG GAACGCGACC GCGCGGCCGT ATCCGGCGAA CAGCGGCGTG
CATCGGCTGT TCGAGGCGCA GGCGGCGCGC ACGCCCGATG CGACCGCGAT CGTCGACGGG
GCGACGACGC TCGACTATCG CGCGCTCGAT GCGCGCGCGA ACCGCATCGC ACACGCGCTC
GCGCACGCCG GCGTGCGCGC GGGCGATCGC GTCGCGCTGC ATCTCGAGCC GTCGATCGGG
CTCGTCGCGG CGCAGCTCGC GGTGCTCAAG CTCGGCGCCG CCTACGTGCC CGTCGATGTC
GGCAATCCGC CCGCGCGCAA GGCGTTCGTC GCGCAAGACA GCGGCGCGCG GCTCGTGCTC
GGCGACGCGG CGCTCGACTG GCCGGCGGCG GCCGGCGTGC CGCAGCGCGA TCTGGCGGCG
CTGCTCGCCG GGCCGTGGCC GTCGGACGCG CCCGCTCGCG CGCCGCAGTG CGGCGGCGAC
ACACCGGCAT ACGTGATGTA CACGTCGGGC TCGAGCGGGC AGCCGAAGGG CGTGCTCGTC
ACGCATCGCG GCATCGCGCG GCTGGCGGTG AACAGCGGTT ATGCGACGTT CGACGCGTCG
GACCGGTTCG CGTTCGCATC GAACCCGGCG TTCGACGCGT CGACGTTCGA AGTGTGGACG
GCGCTTCTCA ACGGCGCGAG CATCGGCATC GTGAAGCGCG ACGATCTGCT CGATCTCGGC
GCGCTCGCCG GCAAGCTGTC GTCGATCGGC GTCACCTGCC TGTTCCTCAC GACGGCGCTG
TTCAACCGGT GCGTGTCGTT CGATCCGGCG ATGTTCGCGC GGCTGCGCTG CGTGATCTCG
GGCGGCGAGC GCGCCGATCC GGCGGTCTAC CGGAAGGTGA TGGAAGCGGG CCCGCCGCGC
CATCTGCTGA ACGCGTACGG CCCGACCGAG ACCACCACGT TCGCCGCGGT CTGGGAAGCC
GAGCCGCGCA CGCTCGCCGC GCAGGCCGCG CCGATCGGGC GGCCGATCGG CAATACGTCG
GTCTACGTGC TCGACGCGTA CGGCGCGCCG GTGCCCGTCG GCGTGACGGG CGAGATCCAC
ATCGGCGGCC CGGGTGTCGC GCAAGGCTAC CTGAACCGAC CGGCGCTTTC GGCCGAGCGC
TTCGTGCGCG ATCCGTTCGT CGGCGGCGAC GCGCGGATGT ACCGCACGGG CGACCTCGGC
CGATGGCGGC CCGACGGCAT GCTCGACTGC ATCGGCCGCG CCGACTTCCA GGTGAAGATT
CGCGGCTTTC GGATCGAGCT CGGCGAAATC GAAGCGTGCC TGCTCGAACA CGGCGCGCTC
GCGCAGGCGG CGGTGCTCGC GCGCGACGAC GGCGGCGACG GCGGCAAGAC GCTCGTCGCG
TATTACGTGC CGCGCGCGGG GCACGAGGAT GGCGCGCCCG CGCTGCGCGC GCATCTGGCC
GCACGCCTGC CCGAATACAT GGTGCCCGCC GCGTACGTGC GGCTGCCGGC GATGCCGCTC
ACGCCCAACG GCAAGCTCGA GCGCCGCGCG CTGCCCGCCC CCGACGAGCG ATCGTACGTG
CGGCGCGACT ACGCGGCGCC GCAGGGCGAG ATCGAGACGA CACTCGCGCG GATCTGGGCG
GAGCTGTTCG GCATCGAGCG CGTCGGCCGG CACGACGGCT TCTTCGAACT CGGCGGGCAT
TCGCTGCTCG CGGTGCGGAT GGTCGCGCGC GTGCACGATG TGCTGGGCGT CGAGGTGCCG
CTGCGCGCGC TGTTCGCCGA TCCGGTGCTG CACGTGTTCG CGTCGGCGGT CGCGCGCGCG
TCGACGCGCC AGGCGTCGTC GAATCTCGTC GCGTTCCGCA GCGCGGGCAC GGCCGCGCCG
CTCTTCTTCA TTCATTCGGG GCTCGGCGAG ATCGGCTTCG TCGGCGATCT GCTGCCCGGC
ATCGCGCCGG AGATTCCGGT GTACGGCTTC GCGGCGGTCG GTTTCCTCGC GGGCGAGACG
CCGCACGCGA CGATCGAGGA GATGGCCGCG CAATATGTCG ACGCGATGCG GCGCGTGCAG
CCGCATGGGC CGTATCGGCT CGCCGGGTGG TGCGCGGGCG GCAACATCGC GTTCGAAATG
GCCCATCAGC TGATCGCGGC CGACGAGACG GTCGAGTTCC TCTGCATGAT CGATTCGCCG
ACGTCCGCGC CGATCGACCG CTCGGTCACC GCGTGCGTGC TCGCGCGCAT TCCCGACGAC
ATTCCGGAGG CGTTGCGCAC GCGGCTTCAT GCGCTCGGCG ATGCCTTCGA CGTGCGCGGC
ATGCTGCACG CGTGCCAGGC GGCGGGCATG CTGCCGATCG ATCTGCCGAC CGGGCTGATG
GAGCGGCACG TCGCGGTGCA ATACGCGATC AAGCATGCGA AGCTGAACTA CGTGCCGCCG
CGTCTGCCCG TCGACGTGAT TCACTTCGTC GCGCAGGACG AGCCGATGTG GCGCAACGGC
TGGGCGATGG ACGGCTGGCA CGACGTCGCG GACCGGGTGA TCTGCCTGCC CGCGAGCGGC
GACCACATGA CGATGGTGGC GGCGCCGCAC GCGGAGCAAC TGGGGCGGCG CATCACGGAG
GCGCTCGCCG TGCACGGCGG GCCGCGCGCG GATGGCGCGG AGCGCGGCTA CGCGCCGCGC
ATCGCGATCC AGACGGCCCC GCGCGACGCG CGCGCGCCGA CGCTCTTCTG CATTCCGGGC
GCCGGCGCGA GCGTGACGAC GTTCTCGACG CTCGCGCGGC ATCTGCCGGC GACGTTCTCC
GTGGACGGGC TGCAGCCGCG CGGCCTGTGC GGGACGATGG TGCCGTATCT CGACGTCGAG
ACGGCCGCGC GCGCGTACCT GAGAAGCATC CGGAAAGCCG CGCCGCGCGG GCCGTACCAC
CTCGTCGGCC ATTCGTTCGG CGGCTGGGTG GCCTACGAGA TCGCGTGCCG GCTGCAGGAG
CAGGGCGAGC GCGTCGCGAC GCTGATGCTG CTCGACACCG AGCGGCCCGG CGCGACCGAC
ATCGTGCGCG GGCGCAAGAC GCGCGTGGAC GCGCTCGCGA AGCTCGTCGA GCTGTACGAG
ATGCATCTGG GCCGCCCGCT CGGTGTGAGC CGCGACGATC TGGCCGCGCT CGCGCACGAC
GCGCAGATCG AGCATCTGCG CGCGGCGCTC GTGCGCGCGA AGATCCTGCC GCCGTCCGTG
CATCCGAACG TGCTGCTTGG CGTCGTGCGG GTGCTCGAGA TGAACGTGAA CACGCCGTAT
CGGCCCGCGG GTCTCTACGC GGGGACGATG CACGTCGTGC TGATCGCGAA CGCGAAAGCG
GACGCGGACC TCGACGCGTG GCGCGACGAG CAGGCCGAGC AGTGGCGCGG CCTCGCGGAC
GACGTGCGGA TCGTGCGCGC GGGCGGCAAT CACATGACGA TGCTGCAGCC GCCGCACGCG
GCGTCGATCG CGGCGCTGCT CGAGCGCACG GCCGGCGCGC CCGCGCGGCT CGCGCAGGTG
CACTAG
 
Protein sequence
MTANDLLALL NSKGIALSVK GDNLAITGDE KVLEDPGLLA LLRANKPALI DLIKDGHGTV 
GGAVRVPPNR IPADAERIAP EQLTLVRLDQ GEIDALVARV DGGAANVQDI YPLVPMQEGM
LFHHLLSQQG DAYLESYLLA FRTRERLDRF LSALQQVIDR HDILRTAFFW EGLPHSVQVV
QRRATLPLNV VELDPRDGDV GQQLEARYDP RSHRIDVGRA PLMQVHAAHD AAGGRWLVRL
LSHHLAVDHT TLERVIEEAR AIEQGRAEDL PRPEPFRNFV AQARLGVSEA DHEAYFRAKL
GDIDEPTAPF GLLSVQGDGR EIAEAARTLK PELSGALRGH ARRLGVSAAS MMHVAWGLVL
SRTTGRQDVV FGTVLFGRMQ GGAQSDRALG LFINTLPVRM KVAQTGVEAS VKETHAQLAE
LMRHEHAPLV LAQRCSGVPA QTPLFTSLLN YRYGLRHRAD AATPGGDDIE LLSARERTNY
PLTLSVDDLG QDFSLTVQVS GHVDPQRVCA FMETALEQLA QALGEQPQCD IGGLDVLPRS
EREQMVYAWN ATERDYPIEQ CIHQLFEAQV DRKPEAIALT FEGQRLSYAE LNARANRLAH
YLQARGVGPD RLVALCAERG IEMVVGLLAI LKAGGAYVPL DPAYASDRLR GIVQDSQPAL
VLADAVGRAA LGELDGALPV IDPETDALRW REMPATNPEV ASQHVHHLAY VIYTSGSTGR
PKGVMVEHAQ VVRLFGATQA WFGFDERDVW TLFHSYGFDF SVWELWGALL HGGRLVIVPT
EVTRTPSAFF ALLCAEGVTV LNQTPSAFQA LMSAQEEREE AALKADVDAE AEAAGNIERA
NVIAHRLRYV IFGGEALEPR TLASWYARHG ERTQLVNMYG ITETTVHVTY CALRAEDAMR
LGASPIGVRI PDLQLYVLDD RREPVPMGVT GELYVGGAGV ARGYLNRPEL TRERFIDDPF
VAGGRLYKTG DLARWRTDGS LEYLGRNDFQ VKIRGFRIEL GEIEAQLAKV TGVREVVVLA
RDSASEVHDS ATEHATPNAL SPSPETSTAT AAATATATAT EKRLVAYYTG DADVVALRAQ
AAQHLPSYMV PSAYVRLDAW PLTPNGKLDR RALPAPADDA YARAEYEAPQ GAREEALAAI
WRDLLHVERV SRHDNFFELG GHSLLAVQLV SRLRQALSVE VALGTVFDAP VLSALASRLD
DNTAEVLPPI PLAPRDGRIA LSLAQQRLWF LTQLEGVSEA YHMSGAVRLD GPLNREVLQR
ALNRIVMRHE ALRTCFAREE GEPIQVIQPH ADLTMSYHDL REAESIRHEA GNREQRAKDL
SQAYASAPFD LSRDLPVRVL LLQLADEAHV VQVVMHHIAS DGWSVGVFLQ ELSALYGSFI
AEQDDPLAPL PLQYADYAAW QRRWLASGQL EKQGAFWQTN LSGAPTLLEL PTDRPRPPKQ
SHAGASVEVK LGAALSERVK RLSQRHGVTP YMTLLSSWAA VLSRLSGQEE VVIGSPVAGR
NRTEVEALIG FFVNTLALRL DLSSEPTVGE LLKRTKAQVL SAQAHQDLPF DQVVERVKPP
RSTAHPPLFQ VMFVWQNMPA GELTIPGLTI RAVETPLQTA QFELTLSLQE AGDDIVGHLN
YASALFDEST VRRYVTYWCR LLEGMTAGAA DQTIVGLPLL DEAERKQVVS EWNATERDYP
IEQCIHQLFE AQVDRKPEAI ALTFDGRRLS YAELNARANR LAHYLQGRGV GPDRLVALCA
ERGIEMVVGL LAILKAGGAY VPLDPAYASD RLRGIVEDSQ PALVLADAVG RAALGELDGA
LPVIDLETDA LRWREMPATN PEVASQHVHH LAYVIYTSGS TGRPKGVMVE HAQVVRLFGA
TQAWFGFDER DVWTLFHSYG FDFSVWEMWG ALLHGGRLVI VPTEVTRTPS AFFALLCAEG
VTVLNQTPSA FQALMSAQEE REEAAGNIER ANVIAHRLRY VIFGGEALEP RTLASWYARH
GERTQLVNMY GITETTVHVT YCALRAEDAM RLGASPIGVR IPDLQLYVLD ARREPVPMGV
TGELYVGGAG VARGYLNRPE LTRERFIDDP FVAGGRLYKT GDLARWRTDG SLEYLGRNDF
QVKIRGFRIE LGEIEAQLAK VTEVREVVVL ARDSAADTDQ NADLNASATA NSSEKRLVAY
YTGDADVAAL RAQAAQHLPS YMVPSAYVRL DAWPLTPNGK LDRRALPAPA DDAYARAEYE
APQGAKEEAL AAIWRELLHV ERVSRHDNFF ELGGHSLLAV QLVSRLRQAL SVEVALGTVF
DAPVLSALAE RLEAENTAVL SPIPLAPRDG RIALSLAQQR LWFLTQLEGV SEAYHMSGAV
RLDGPLNREV LQRALNRIVM RHEALRTCFA REEGEPIQVI QPHADLTVSY HDLREAESIR
HEAGNREQRA KDLSQAHASA PFDLSRDLPV RVLLLQLADE AHVVQVVMHH IASDGWSVGV
FLQELSALYG SFIAEQGDPL APLPLQYADY AAWQRRWLAS GQLEKQGAFW QTNLSGAPTL
LELPTDRPRP PKQSHAGASI EVKLGAALSE RVKRLSQRHG VTPYMTLLSS WAAVLSRLSG
QEEVVIGSPV AGRNRTEVEP LIGFFVNTLA LRLDLSSEPT VGELLKRTKA QVLSAQAHQD
LPFDQVVERV KPPRSTAHPP LFQVMFVWQN AHEGSLQIPG LRLSTWGDPL TMAPFELTLA
VREHQDDIAC TLTYATSLFD RATVERYLGH WLRQLDAMAT DADPVVTGLP LLGEAERAQV
LHGWNETGRA YARDACLHQL FEAQVSRTPE AAAVICGDET LSYTDLDARA NRLAHYLRGQ
GVGPDTRVGL ALGRGVEMMT GLLAVLKAGG AYVPLDPGYA SERLRAILDD SRPAIVLADA
AGRTALDALA GAPPIADLQA DASRWSALPS TPPRVEGLTP RHLAYVIYTS GSTGQPKGVM
VEHASVVNLW RALDEAIYRA HPSARRVSLN ASIAFDSLVK QWVQLLSGRT LVVVPEPVRF
DGRRLLDAIG RDRIDVFDCT PSQLALIEGA RGPEDEAYPQ VTLVGGEAIG EGMWSELASV
SSRTYYNVYG PTECTVDATL ARITAEHAPH IGGPLANVRA YVLNERLSPA PVGVRGELYI
GGAGVARGYL NRPELTRERF IDDPFVAGGR LYRTGDLARW RTDGRLEYLG RNDFQVKIRG
FRIELGEIEA QLAKVTGVRE VVVLARDSAS AVRDSATEHA TPNALSPSPE TSTATATATA
TATATATEKR LVAYYTGDAD VAALRAQAAQ HLPSYMVPSA YVRLDAWPLT PNGKLDRRAL
PAPADDAYAR AEYEAPQGAK EEALAEIWRD LLQVDRISRH DNFFQLGGHS LLAISLGDMM
RERGLHADVR TLFNAETLAA LAAQSGTDSI DVDVPPNLIP VGAARITPDM LPLVALTQAQ
IDAIAQQVDG GATNVQDIYP LAPLQEGMLF HHLLHTQGDL YLEPHLLAFR TRERLERFLS
ALQCVIDRHD VLRTGFFWEG VPQPVQVVWR RARLPVEYVE LPDSHGDVAS QLEARCDPRR
HRIDIGRAPL VHCHVAHDAR NDRWVLGVLT HHLVSDHTTL ALLAEEAQAF EQGRGDALPP
AVPFRNFVAH ARLGTSEREH EAFFREMLGD VDEPTAPFGL LDVQGDGSAI VEHRRALAPG
LSRSVRAHAR RLGVSAASVM HVAWSLVLAR TANRRDVVFG TVLFGRMQGG AHAHRTMGLF
MNTLPVRIAL DESDVETSLI ATHDRLARLL RHEHAPLALA QRCSAVPAQA PLFTSLLNYR
YSPHEEQGDA TDDDVQFIAA RERNNYPLTM IVDDTGEGFA LTAQVDASID AARVCAFMHT
ALEQLVRALD DARGAVLAEL DVLPADEHRC VVSACNDTDA ELPGVDFVDR RFEAQAARTP
EAIAVACGAH ALSYAALNRR ANRLAHYLRA HGAGPERVVA LALERSVDMM VGLLGILKSG
SAYLPLDPAY PAERLAYIVD DARPALLLTE AALRDDWRDA GAPVVLLDAD GPAIDACPDH
NPDAAAGRDA RTLSSLAYVI YTSGSTGRPK GVMIEHRNLA NLLGAMGEQP GIGAHDVLLA
VTSLSFDIAA LELFLPLLHG ARAVIAARDD AADPARLAHL IESSGASLMQ ATPSTWRMLA
QHGWPRSARP LTLLCGGEAL PPALAERLLA HVPAIWNLYG PTETTVWSTV RRVTTPVVDI
GGPIANTQVY VLDERLRPAP IGVAGELYIG GAGVARGYLN RPELTRERFV DDPFRRGGRL
YRTGDLARRR ADGNLEYLGR NDFQVKIRGF RIELGEIEAQ FAKAHGVQGV ALAARDTPTA
DKRLVAYYVG DASAAALREH AAARLPAYMV PAAYVRLAAW PLTPNGKLDR AALPAPDDEA
YARAEYEAPR GEHECKLAAI WRAVLQVERI GRHDDFFELG GHSLLAVRAV TAMRDAFGSD
TSLRDLFARP VLKDLAEHAS TAARARDAAI PKAARGEPAP MSFAQQRLWF LARMGGLGDA
YHMPIAVRLR GALDVDALQR ALSRIVSRHD ALRTTFALEG EQPVQRVHAD DGAGLRLRID
DLRGCADAGA RRARILAGQA SEPFDLARGP LVRGALVREA DDVHTLCVTI HHIVSDGWSI
DVFCRELSEL YRAFAGGQPD PLPPLPVQYA DYAAWQQRGI GGAALHAQAE YWRDALAGAP
TLLELPTDRP RPPQPDYAGA TVGLALDAPL TAGLRALARR HGATLFMTVF AAWSVLLSRL
SRQTDVVIGT PSANRGHAQI EGLIGFFVNT IALRVDLDGA PTVAELLARV KARTLAAQQH
QDIPFEHVVE RVQPARSLSH SPVFQAMFAW QHASRGEMRL EGLRAEPLDD AARTIAKFDL
TLSLRESGDA IDGGLEYASA LFERATIERF AGYLRRLLEG MVADDTQRVD ALPMLSRDER
RDLIERRNAT ARPYPANSGV HRLFEAQAAR TPDATAIVDG ATTLDYRALD ARANRIAHAL
AHAGVRAGDR VALHLEPSIG LVAAQLAVLK LGAAYVPVDV GNPPARKAFV AQDSGARLVL
GDAALDWPAA AGVPQRDLAA LLAGPWPSDA PARAPQCGGD TPAYVMYTSG SSGQPKGVLV
THRGIARLAV NSGYATFDAS DRFAFASNPA FDASTFEVWT ALLNGASIGI VKRDDLLDLG
ALAGKLSSIG VTCLFLTTAL FNRCVSFDPA MFARLRCVIS GGERADPAVY RKVMEAGPPR
HLLNAYGPTE TTTFAAVWEA EPRTLAAQAA PIGRPIGNTS VYVLDAYGAP VPVGVTGEIH
IGGPGVAQGY LNRPALSAER FVRDPFVGGD ARMYRTGDLG RWRPDGMLDC IGRADFQVKI
RGFRIELGEI EACLLEHGAL AQAAVLARDD GGDGGKTLVA YYVPRAGHED GAPALRAHLA
ARLPEYMVPA AYVRLPAMPL TPNGKLERRA LPAPDERSYV RRDYAAPQGE IETTLARIWA
ELFGIERVGR HDGFFELGGH SLLAVRMVAR VHDVLGVEVP LRALFADPVL HVFASAVARA
STRQASSNLV AFRSAGTAAP LFFIHSGLGE IGFVGDLLPG IAPEIPVYGF AAVGFLAGET
PHATIEEMAA QYVDAMRRVQ PHGPYRLAGW CAGGNIAFEM AHQLIAADET VEFLCMIDSP
TSAPIDRSVT ACVLARIPDD IPEALRTRLH ALGDAFDVRG MLHACQAAGM LPIDLPTGLM
ERHVAVQYAI KHAKLNYVPP RLPVDVIHFV AQDEPMWRNG WAMDGWHDVA DRVICLPASG
DHMTMVAAPH AEQLGRRITE ALAVHGGPRA DGAERGYAPR IAIQTAPRDA RAPTLFCIPG
AGASVTTFST LARHLPATFS VDGLQPRGLC GTMVPYLDVE TAARAYLRSI RKAAPRGPYH
LVGHSFGGWV AYEIACRLQE QGERVATLML LDTERPGATD IVRGRKTRVD ALAKLVELYE
MHLGRPLGVS RDDLAALAHD AQIEHLRAAL VRAKILPPSV HPNVLLGVVR VLEMNVNTPY
RPAGLYAGTM HVVLIANAKA DADLDAWRDE QAEQWRGLAD DVRIVRAGGN HMTMLQPPHA
ASIAALLERT AGAPARLAQV H