Gene Lcho_3430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3430 
Symbol 
ID6163124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3814793 
End bp3831634 
Gene Length16842 bp 
Protein Length5613 aa 
Translation table11 
GC content70% 
IMG OID641666205 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001792453 
Protein GI171060104 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0177269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACG GCCCCGGCTC CATGAATCGC GTCTATCGCC TGGTCTGGAA CGACATCGTC 
GGCGCCTTTG TCGCCGTGGC CGAGTTCGCC CGTGGGCGCG GCAAGCGCAG CTCGTCGGTG
GTCGGCGCGG TGCTCGCCGT CACGCTGCTG GGCACGGGAT CGGGGGTGCA GGCGGCGGGG
CCGCCGGCGG TCAACGCGCT GCCGAGCGGC GGCACGGTGG TGGCGGGCCA GGCCGGCATC
AGCCAGAACG GCAACGCGCT CAACATCCAG CAGACCAGCG TGCGCGCGGC CATCAACTGG
CTGACCTTCG ACATCGGCGC GCAGGCCCAG GTCAACATCC GGCAGCCCGA CGCCCAGTCG
GTGACGCTCA ACCGCGTGCT CGGCGCCGAT CCGTCGGCCA TCCACGGCCG CCTGAATGCC
AACGGTCAGG TGATCCTGGT CAACCCGAAC GGCATCGTCT TCGGCAAGGG CTCGCAGGTC
GACGTGGGCG GCCTGGTGGC CAGTTCGCTG GATCTGGCCG ACGCCGACTT CATGTCGGGC
AAGCTCAATT TCGTGCGCGG CGACGCGCTC GGCAAGGTGC TCAACCAGGG CACGCTCAAG
GCCGACGGCG GCTACGTGGC CCTGCTGGCG CCCGAGGTGA TCAACGAAGG CGTCATCAGC
GCCCAGCTCG GCACGGTGGC GCTGGCCGCC GGTGACGCGG TCACGCTCGA CCTCGGCGGC
AACCAGCTGC TGGGTGTCAA GGTCGATCCG GCCAGCGTCA AGGCGCTGGT CGCCAACCGC
CAGCTGGTGC AGGCCGAAGG CGGCCGGGTG ATCCTGAGTG CCGGCGCCGC CAACCGCCTG
CTCGAACAGG CCGTGGCCGG CAGCGCGGGC GCCACCGAAC TGGTCGAACA CGACGGCGGC
GTGCGGCTGG TTTCGGCCGG CGGCACGGTG AAGGCGGGCG CCGGGCGCAT CGACGTCGAA
GGCAGCACGG TCGACCTTTC CGGCACGCTC GACGCCAGCG GCGCGCAGGC CGGCAGCGTG
CGGGTCGGCG CCGATTTCGT CAGCCAGTCG GGCCGCATCG ACGCCAGCGC CAGCCAGGGT
GCCGGTGGCC GGATCGCGAT CAGCGCCGAC ACCACGATCC AGACCGCCTC CGCCACGCTC
AATGCCGACG GCGCGGGTGG CGGTGGCAGC GTGCGCATCG CCGCCGATGA CGGCGCCGCG
GGCGTCGTCT ACAGCTCGGC GCAGATCAGC GCCCGCGGCA CCGGGGCCGC GGCGGTCGGC
GGCGAGATCG CCGTCAGCGC CGACAGCGTG CAGCTGCGCG CGGCCAACCT TGACGCCAGC
GGCCAGGCCG GTGGCGGCCG GGTGCGGGTG GGCGGCGGCT TCCAGGGCCA CGACACCGAC
CTGGCCAACG CCTTGAACGT CGGCATCAAC GCCAGCACCG TGCTGCGCGC CGATGCCGGC
GTGCAGGGTG ACGGCGGCCA GGTGGTGGTC TGGTCGGACG ACCGCACCAC CTTCGGCGGC
CACGTCTTCG CACGCGGCGC GGGCGCCGGT GGTGACGGCG GCCAGGCCGA GGTGTCGGGC
AAGGGCGACC TGATCTTCGC CGGCACGGCC GACCTGTCGG CGCCGCAGGG CCAGGCCGGC
CGCCTGCTGC TCGATCCGCG CAACATCATC GTCGACAACG CCGCCAGCTC GATCGCCAGC
CTGGCGATGG ACGACCCCAC GCCCGACGCG ACCAGCGGCT TCGGCACCGT GACCCAGGTG
CTCGGCAACG GCAACGTCGT CATCACCGCA CCGAACGCCG ACACCGCCGC CACGGCCGAC
ACCGGCGCGG TCTACCTGTT CGACAGTTCG AACGGCGCCC TGCTGTCGAA CCTGCGTGGC
AGCGGCGCGG GCGATCACGT CGGCAGCGCC GGCATCCAGG TACTGGGCAA CGACAACTAC
CTGGTGTTGA GCCCGCAGTA CGGCACGGTC AGCGGCGTCA ACGTCTTCGG TACGTCGACC
AGCGATTCGT CCACCATCGG CACACCCAGC GCCTACGCCA TCACGGCCAG CAGCACGGCG
TCGGCCGGTG CGATCACCTG GCAAAGTGCC ACCGGGGCGG CCGGCAGCAG CTTGGTGAGT
GCGGCCAACA GCCTGGTCGG CAGCACCGCC AACACCGACA GCGTGACGAG CTACAGCTAC
AGCGGCAGCA CCGTCACGGC CGGAGCCAGC GTGTCGGTCA CCGCCAATGA CCAGCTCGGC
AAGGTGTCCA CCTACGAGCC GTGGGGCGAT GTGGCCGTTG CCGGCACGAC CACCGTGCAC
GAACTGGCCG ACGGCAATGT CGCCATCGCC ACGCCGAACT GGTTTAACGG CCGCGGTGCA
GTGACCTGGA TGAACGGCGC CACCGGCGCG CTGTTCACCG GTGCAGCGGG CGCCGAGGTG
TCGGCCAGCG TGGCCACGAC AGGGGCTACG CCGATCCGCA CCCTGGCGCC GGTGTCGACC
GATGCCAGCG GCGACAAGGT CTACGTGATC GGCTTCGACG CCTCGCTCCC CTTCACCTAC
ACGGCCGGCG CCACCAACCG GCGCAGCACA CCGCCCGGTG CCGGCGGCGA CGCCATCGGC
AGAAACCTGA CGCTGTTCGA CGACGGCAGC TACGTGATCG CGAGCCCGCA GTGGAGCAAC
GGCGCTGCGG CCTATGCCGG CGCCGTGACC TGGTCCGGTG CGGGAGGCAC GGCCGGGGTG
GTGTCGAGCG GCAACAGCCT GGTGGGCAGC AATGCCTACG ACTTCGTCGG CAGCGGTGGC
GTGACGCCGG TGGGTGCCAT CGACAGCAAC TACCTGGTGG CCAGTCCGTA CTGGACCGAC
AGCGGCAATG CCGCGGTCGG GCGCTATCTC GAGAACGCTC CCAACGGGGC CGTCACCTGG
GTGGACGGCA GCACCGGCCG GGCCTTCGGC GAGGCCGGCA CGGGCGCTGC CGTGGGCGGC
GGCAACAGCC TGATCGGTGG TGCGGGGGAT GGGTTGGGCG CACGCAGCAA CAGCATCGGC
ACGCCGACCG CCACCTACGA CCTCGTGCCC AGCCCGTACA GCTACAGCAC CACCTCGTCG
ACACGCACCT TCGCGACCAC CGACGGCATC ACCGAACTGG CGAACGGCAA CTACCTGGTG
GTCGACACCT CGTGGAGCGG CAGCCGCGGC TCGGTCACGT TCGGCGACGG CGCAGCGGGT
GTCGCCGGCA GCGTGTCGGC CGCCAACAGC CTGGTGGGCA GCACGGCGTC TGATGGCGTG
GGCGCGTCGG TGCTCGAATT GAGCGGCAGC CACTACGCGG TCGTCTCCAG CACCTGGGAC
AACACCGCTG CCGGCGCAAC CGATGCCGGC GCCGTCACCT GGGGTTCGGG CACCAGCGGC
GTCAGCGGGG CGGTCTCGGC CGGCAACAGT CTTGTCGGCA GCGATGACTA TGAACAGGTC
GGCTCGGGCG GCGCCATCGG CGTGGGCGCC ACAGGTGCGG ATGGCCTGCG CACCGACTAC
ATCGTGCTCA GCCCGCTGTG GGGCAACCGC AGCAGCGCGC AGTATTCGAC CACCGCCTAT
GGCGCGGTGA CCTGGGTCGA CGGCAGCAAC GGCGACGTGC ACGGCAGCAG CGCCAACGGC
GCCGCGGTTT CGAGCACCAA CAGCCTGGTC GGCAGCAATA CCGGCGACTA CGTCGGCAGC
TACGTCGGCT CGTACTCGCG CGTGCTGGGG AACCCAGTGG ACGGTGCCTC GCCGATCACG
ACGCCGTGGT ACGCCAACCC GACCGCCACC GTCGACGTGC TCGCCGACGG CAACTACGTC
GTGCGCAGCC CGAGCTGGGA TGCCGGCAAG GGCGCCGCGA CCTGGGCCGA CGGCAGCGCC
GGCATCGCCG GCTCGGTGAG CGCGGGCAAC AGCTTGGTGG GCAGCACCTC CGACCAGTAC
GCCACGGTCA CCGGCAGCCA CACCGTCAAC GCTGCTGTCT ATACCGACAC GACCTATCAA
CTCACCACCA CCGGCGACCA CGTCGGCCTG CTCGGCACGG CCCTGCAGGA TGGCAACTTC
CTGGCCATCA GCCCCAGCTG GGGCAACGGC CGCGGCGCGA TCAGCTGGCT CGGCGCGAGT
GCGCCCACCG GCGCCGTGAG TGCGGGCAAC AGCCTGGTCG GGAGCACCCA GGACGTTTTC
ACCGACGCCA CCCACACAGC CCTGGTGTCG GCCGGTGACC GCCTCGGCAC GTTGGCCGAC
AGCACCTGGG TTGTGCCGGT CGTGAGCAGC CGCAACTCGG ACAGCTACTC CCCCACCGGC
ACACGCAACC TCGGGCCCTA TCAGCTCGGC ACCAGCTGGA ACGTCACCTT CGCCTACTGG
AACGGCAGCA CCTGGACTTA TCCGGTGGCC GGCAAGAACT ACGGCACGAC TTCGGTGGCG
AGCACGCAGA TCGAGTCCGA CGGTTGGGGC TACCTGACGA GCCATTTCGC CTACAACGCC
AAACCCATCG TCGCGACCTT GTCGAACGGC AACGCGCTGA TCGGCAGCCC GAGCTGGAAC
AACGGCAGCG CCACGCGCGC CGGCGCCGTC ACCTGGATCA ACGGCGCCAC CGGCACGCTG
GCCGATGCCA GCCAGGGTGG CACGCTCGGT GCCACCAACA GCCTGGTCGG TGACCACACC
GACGATGTGC TGGGCTTTCG CCTTCCGCTC GACGGCGTGG CGGAGCTCGC CAACGGCAAC
TTCCTGCTGA TCAATCCGCA GTGGAACGAG GAGCGCGGTG CCGTCACCTG GGGCAGCGGC
ACGGCCGGCG TGACCGGCAC GGTGTCGGCC GCCAACAGCC TGGTCGGTGA GATCGCCTCG
GGCAGTCTCG GCTTCAGTCT GCCGAGCGAC TATGTCTATC GACTGGTCGG CTCCAACTTC
TCGTACGACT GGACCGACCG CCGTTCCGAT CAGGCCGGTG ACCGCGTCGG CTTGGGCGGT
GTCACCGTGC TGGCCGACGG CAATGCCGTG ATCGGCTCGC CGTTCTGGAA CCAGTCCGCC
GCCTGGGACA GTTATTCAGC CCCCAGCTCG CTCGGCGCCG CCACCTGGAT CGACGGCAGC
AACGGGCTGC TCAAGGATGG CAGCGCCGGC GGCAGCATCA GCGCGAGCAA CTCGCTGGTC
GGATCCCAGC ATGGCGACGC GGTGTCGTAT GCGGCCTTCG TCGACCCCTA CTCACTTCAA
CACTATGTGA CCTCCGGCAT CACGGCGCTG GCCGCCGGCA ACTACGTGGT GGCCAGCCCC
TGGTGGTCGA ACGGCGCCAC CTCGCAAGTG GGCGCCGTCA CCTACGGCAC GGCAGGCGGC
ATCGTCGGCG AGGTCGGCGT CGGCAACAGC CTGGTCGGGG CCAGTGCCGG CGACCACGTC
GGCCGAGGCC TGTCGAGTTA CGACCCCTGG TACTACACCG CCATCATCTA TTCCGGCGTC
CAGGTGCTCA GTGACGGCAG CTACACGAAC TACCTCGTGC GCACGTCCGA CTGGACCAAT
GCCAACGACA TCGCCGGCGG TGGTGCCGGC GCTGGCGCGG TGACCTGGGT GGACGGCAGC
ACCGGCCACG CATTTGGTGA AGGCGGTACC GGTGCGGTGG TCTCGGCCGC CAACAGCGTG
GTCGGCAACG TCGCCGGCGA TGGCGTCGGC GCCCAGCTGA TCTCGCTGAC CCGTAACGTG
AGTGGGCAGA GCGTGGCCAC CGGCGACCAG TTGCTGCTGA GCAACCTGGC CTATTGCGGC
GTGCCCGGCG TGGGTGCGAT CACCTTGCTG TCCGGTGCGC AGGGCGCCGC AGGGCCGGTG
AGCTGGCGCA ACAGCATGAT CGGATTCGCC TCCGCAGCCG ACGGCATGAA CACGTCGAGC
TTTGACGGCT ACAGCTACAG CACCGACGTG CACGCCACGC TCTTGCCCAC CGCCGTGACC
GGCGCCGAGC AGGTCGCCTG GCGGCCGCTG ATCTGGACCA CGCCCAACAC CACCAGTGGC
AACAACGGCA GCCGCGCGCT GGCGCTGACG CTGGTCAGCG ACAGCAACAC CGCGCCCGAC
AGCGCCGATC AGGTCAACAC CGGCGGCGGG GCCAACTGGG CCGGCAGCTG GTTTGCCGGC
GACAACGCGG GCTACACCGG CGTGGGCGGC AACACCGGCA CCGGCTTGCT GGGCTTCTCG
GCCAACACCG GTGAAGACGT GGTCATCACG CCGGCGGCCC TCACGGGCCT GCTCGATGCC
GGCACCGACG TCACGCTGCA GGCCAGCAAC GACATCACGG TGCTGCGCGA CATCACCACC
TCGGCCGGCG GCAACGGCGG CGACCTGACG CTGGAGGCGG GTCGCTCGAT CCACCTTTAT
GCCGACATTC GCACCGACAA CGGCAACTTC ACCGCCATCG CCAACCAGAG CCTGGCCGCC
GGTGTGGTCG ACGCCGACTG CGCCACCTGC GTGGCCGAGA TCGTCCAGCA GGCCGGCAGC
CACATCGATG CGGGCACCGG CAGCGTCAGC CTGACCCTGC TGGCCAGCAC CGACAAGACG
TCCGACGACG CCGGCGACAT CCGCCTGTCC AACCTCGACG GCGCGAGCAT CTCGGTGGTC
AACCAGGGCA TCGACGCGGC CGGCAACGGC CGCGGCATCC GCTTCAACGA AGGCGCCACG
ATCGGCAGCA GCGCCACCGA GTCGCTCACG CTGCAAACCC GCGGCTACAC CGGGCTGGCC
GGCGGGCTGG CGCTGCAGAG CGACACCGTG CTGCAGGGCT CGAACACGGC CACGCTCTCG
GTCGGTGCAT CGGACGCGAG CATCGCGGTC GACCTGGGCA GCGCCAGCGG CAGCGGCCTG
GCGATGACCG GCGCCGAGAT GGGCAGCGTG ATCCAGCAGT CCAGCGGCTT TGCCGAGATC
GTCTTCGGCC GCGCCGACCA GAGCGGCGCC ACCACCGTCG GCACACTCGA CTTCACGCAG
GCTGCGATGC TGCGCAGCGG CAGCAGCAGC CTCGACGCCG ACCTGACCTT GCAAGGCGGC
AGCGGCGGCA CGACGCTGTC GGGCGCCGTC ACCTCGGCCG AGGCCGCCGG CCGCACGCTG
ACCGTGGCCG GCACGGACGG TGCCATCACG CTCGCCAACG GCGCATCGAT CACCGCAGCG
ACCGGTGTGC TGCAACTCGC CGCCCACGGC AGCGCGGCGC TGACGCAGAA CGCCGGCAAC
ACGCTCAACG CCACGCAACT GCTGCTCGAC GGCACCGGCT CCGTGGCGCT GAACGCCGGC
AGCAACACCA TCGGCACGCT GGCCGGCACG GTCGGCACGG CCAACGTGCG CACCAGCAGC
GGCAACCTCA CCGTGGGATC GGTGGGCACC GTCGACGGGC TCACCACCAC CAGCGGCATG
ACGCTGCAGG CCGCCGGCAG CGCCACCGAC GTCGTGCTCG ACCAGGCCGT CACCAACGCC
AGCGGCACGC TCGTCATGGC CGCCGCAGGC GACTTCGTCA ACAACGTCGC CGCCGATACC
GGCATCGACG CGGGCACCGG CCGCTACCTC GTGTACTCGG CCAGCCCGGT CGACACCACC
GAGGGCATGA CGGGCTACTC GAAGCATTAC AACCAGAGCT ACAGCGCCGG CTCCGACCCC
GCCTATGCCA GCGCCGGCGA CTGGTTCCTG TACAGCGTGG CGCCGACCCT GACCGCCTCG
CTCGGCGCGG GTTCGACCAT CACCTACGGC CAGCCGGGCA GCGCCCCGGG CGTCAACATC
AGCGGCCTGA TCGACGGCGA CACGCTGGCG AGCGCCACCA CCGGCAGCCT GGCCTCTTCC
ACGAGCAGCT ACACGGCGTC CGGCGCGGGC TTCATCCCGG TCGGTTCGCA CACCCTGACG
CTCAGCGGCC AGGGCACGCT GACCAGCGAT CTGGGCTACC AGATCGCCGT CACCACCGGC
AGCGCCACCC TCACGGTGCA GGCCAAGGCG ATCAACGTGG CCGGCCTCGC CGCCGACAGC
AAGGTCTACG ACGGCAGCAC CACCACCACG GTCAGCGGCA CCGCCAGCCT GACCGGCGGA
GGCGCCACCA ACCTCGACGG CAAATACCTC ACCGGCGACA CGCTCGCCAT CGGCGGCACG
GCCTCGGGCA GCTTCGTCGA TCGCCATGTC GGCACCGCCA AGAGCGTCGC GCTCGGCGGC
CTGAGCCTGA GCGGCGCCGA CGCGGCCAAC TACACCATCA GCACCGACTC GGTCACGGCC
GACATCACGC CCAAGACCGT CACCCTGACG GGCCTGAGCG TGGCCGCGAG CAAGGTCTAC
GACGGCACTG CCCAAGCCAC CCCGATCGGC AGCGCCGCCC TGCTGGCGGC CGAGGCGGTC
GGCAGCGGCA GCACGGGCGA TGGCCGTCGC TACACCGGCG ACAGCGTCAG CCTCGTCGGC
ACCGCGAGCG CCACCTACAA CGGCGTCAAC GTGGCCGATG CCAGCGCCGT CACCTTCGGC
GGCCTGAGCC TGAGCGGCAC CGAAGCGGGC AACTACGTGC TCGCGGCCGG CAGCCAGGCG
GCCACCATCA CGGCCAAAGC CCTGACGGTG ACCGGCGCGA CGGCCGTCGA CAAGGTCTAC
GACGGCAACA CCCTGGCCAC GGTCGGCAGC GCCGGCAGCC TGGTCGGCGT GATCGGCAGC
GACGCGGTCA GCCTTGACGC CAGCCAGGTG ACCGCTGAGT TCGCCGACAA GAACGTCGGC
ACCGGCAAGA CGGTGACGCT CGGCAACCTG AGCCTGGATG GCGCGGGTGC GGGCAACTAC
AGCATCACCG GCCAGGCCAG CACGACGGCC GACATCACGG CCAAGGCTCT GACGGTCAGC
GGTGTGACGG CCAGCAGCAA GGTCTACGAC GGCGGCGTCA ACGCGACCGT CAACACGGCA
GTCGCAAGCC TCACCGGGCT GGTGGAAGGC GACGACCTGA CGCTCAGCGC GACGGGCACG
TTCGCGGACA AGAACGTCGG CACGGGCAAG ACCGTGACGC TGAGCAGCAG CTACGACGGT
GCCGACCTCG GCAACTACAG CATCACCGGC CAGGCCAGCA CGACGGCCGA CATCACGGCC
AAGGCTCTGA CGGTCAGCGG TGTGACGGCC AGCAGCAAGG TCTACGACGG CGGCGTCAAC
GCGACCGTCA ACACGGCAGT CGCAAGCCTC ACCGGGCTGG TGGAAGGCGA CGACCTGACG
CTCAGCGCGA CGGGCACGTT CGCGGACAAG AACGTCGGCA CGGGTAAGGC CGTGACCTTG
AGCAGCAGCT ACGGCGGCAC CGACCTCGGC AACTACAGCA TCACCGACCA GGCGAGCACG
ACGGCCGACA TCACGGCCAA GTCGCTGACG GTCAGCGGTG TGACGGCCAG CGGCAAGGTC
TACGACGGCG GCGTCAACGC CATCGTCAAC ACGGCAGTCG CAAGCCTCAC CGGGCTGGTG
GAAGGCGACG AACTGACGCT CAGCGCGACG GGTGCATTCG GCGACAAGAA CGTCGGCACC
GGCAAGACAG TGACGCTGAG CAGCAGCTAC GACGGCGCGG ATGTGGGCAA CTACAGCATC
ACGGGCCAGA CCACGACGAG CGCGGACATC ACCGCCAAGA CGCTGACGGT CAGCGGTGTG
ACGGCCAGCA GCAAGGTCTA CGACGGCGGT GTCAACGCGA CCGTCAACAC GGCCGGCGCA
AGCCTTGCCG GGCTGGTGGA AGGCGACGAC CTGACGCTCA ACGCGACGGG CACGTTCGCG
GACAAGAACG TCGGCACCGG CAAGACAGTG ACGCTGAGCA GCAGCTACGG CGGCACCGAC
CTCGGCAACT ACAGCATCAC CGACCAGGCG AATGCGACGG CCGACATCAC GGCCAAGGCG
CTGACGGTCA GCGGTGTGAC GGCCAGCAGC AAGGTCTACG ACGGCGGCGT CAACGCGACC
GTCAACACGG CCGGCGCAAG CCTTGCCGGG CTGGTGGAAG GCGACGACCT GACGCTCAAC
GCGACGGGCA CGTTCGCGGA CAAGAACGTC GGCACCGGCA AGACAGTGAC GCTGAGCAGC
AGCTACGGCG GCACCGACCT CGGCAACTAC AGCATCACCG ACCAGGCGAA TGCGACGGCC
GACATCACGG CCAAGGCGCT GACGGTCAGC GGTGTGACGG CCAGCAGCAA GGTCTACGAC
GGCGGCGTCA ACGCGACCGT CAACACGGCC GGCGCAAGCC TCGCCGGGCT GGTGGAAGGC
GACGACCTGA CGCTCAGCGC GACGGGCACG TTCGCGGACA AGAACGTCGG CACGGGTAAG
GCCGTGACCT TGAGCAGCAG CTACGGCGGC ACCGACCTCG GCAACTACAG CATCACCGAC
CAGGCGAGCA CGACGGCCGA CATCACGGCC AAGGCTCTGA CGGTCAGCGG TGTGACGGCC
AGCAGCAAGG TCTACGACGG CGGTGTCAAC GCGACCGTCA ACACGGCCGG CGCAAGCCTT
GCCGGGCTGG TGGAAGGCGA CGACCTGACG CTCAACGCGA CGGGCACGTT CGCGGACAAG
AACGTCGGCA CCGGCAAGAC AGTGACGCTG AGCAGCAGCT ACGGCGGCAC CGACCTCGGC
AACTACAGCA TCACCGACCA GGCGAATGCG ACGGCCGACA TCACGGCCAA GGCGCTGACG
GTCAGCGGTG TGACGGCCAG CAGCAAGGTC TACGACGGCG GCGTCAACGC GACCGTCAAC
ACGGCCGGCG CAAGCCTCGC CGGGCTGGTG GAAGGCGACG ACCTGACGCT CAGCGCGACG
GGCACGTTCG CCGACAAGAA CGTCGGCACG GGTAAGGCCG TGACCTTGAG CAGCAGCTAC
GGCGGCACCG ACCTCGGCAA CTACAGCATC ACCGACCAGG CGAGCACGAC GGCCGACATC
ACGGCCAAGG CTCTGACGGT CAGCGGTGTG ACGGCCAGCA GCAAGGTCTA CGACGGCGGT
GTCAACGCGA CCGTCAACAC GGCCGGCGCA AGCCTCGCCG GGCTGGTGGA AGGCGACGAC
CTGACGCTCA GCGCGACGGG CACGTTCGCC GACAAGAACG TCGGCACGGG TAAGGCCGTG
ACCTTGAGCA GCAGCTACGG CGGCACCGAC CTCGGCAACT ACAGCATCAC CGACCAGGCG
AGCACGACGG CCGACATCAC GGCCAAGGCT CTGACGGTCA GCGGTGTGAC GGCCAGCAGC
AAGGTCTACG ACGGCGGCGT CAACGCGACC GTCAACACGG CAGTCGCAAG CCTCACCGGG
CTGGTGGAAG GCGACGACCT GACGCTCAGC GCGACGGGCA CGTTCGCGGA CAAGAACGTC
GGCACGGGCA AGGCCGTGAC CTTGAGCAGC AGCTACGGCG GCACCGACCT CGGCAACTAC
AGCATCACCG ACCAGGCGAG CACGACGGCC GACATCACGG CCAAGGCCTT GACGGTCAGC
GGTGTGACGG CCAGCAGCAA GGTCTACGAC GGCGGTGTCA ATGCGACCGT CAACACGGCA
GTCGCAAGCC TCGCCGGGCT GGTGGAAGGC GACGACCTGA CGCTCAACGC GACGGGCACG
TTCGCGGACA AGAACGTCGG CACGGGTAAG GCCGTGACCT TGAGCAGCAG CTACGACGGC
GCCGACCTCG GCAACTACAG CATCACCGAC CAGGCGAGTG CGACGGCCGA CATCACGGCC
AAGGCGCTGA CGGTCAGCGG CGTGACGGCC AGCAGCAAGG TCTACGACGG CACCACCGGT
GCGACGGTCG ACACGAGCGC CGCGAGCCTC GCCGGGCTGA TCACCGGTGA CGACCTGACG
CTCCGCGCCA CGGGTGTCTT CGCCGACAAG AACGTCGGCA CGGGCAAGAC GGTGACACTG
AGCAGCAGCT ACGGCGGCGC GGACGCAGGC AACTACAGCA TCACCGACCA GACCACGACC
AGCGCCGACA TCACGGCCAA GGCCTTGACG GTCAGCGGCG TGACGGCCAG CGGCAAGGTG
TACGACGGCA CCACCGGTGC GACGGTCGAC ACGAGCGCCG CGAGCCTCGC CGGGCTGATT
ACCGGTGACG ACCTGACGCT CCGCGCCACG GGTGTCTTCG CCGACAAGAA CGTCGGCACG
GGCAAGACGG TGACACTGAG CAGCAGCTAC GACGGCGCCG ACCTCGGCAA CTACAGCATC
ACCGACCAGG CGAGTGCGAC GGCCGACATC ACGGCCAAGG CCTTGACGGT CAGCGGCGTG
ACGGCCAGCA GCAAGGTCTA CGACGGCGGT GTCAACGCGA CCGTCAACAC GGCCGGCGCA
AGCCTCGCCG GGCTGGTGGA AGGCGACGAC CTGACGCTCA GCGCCAGCGG CACGTTCGCG
GACAAGCACG TCGGCACCGG CAAGACGGTG GCCTTGAGCA GCAGCTACGG CGGCACCGAC
CTCGGCAACT ACAGCATCAC CGACCAGGCG AGCACGACGG CCGACATCAC GGCCAAGGCG
CTGACGGTCA GCGGTGTGAC GGCCAGCAGC AAGGTCTACG ACGGCGGCGT CAACGCGACC
GTCAACACGG CCGGCGCAAG CCTCGCCGGG CTGGTGGAGG GCGACGACCT GACGCTCAAC
GCCACGGGTG CATTCGGCGA CAAGAACGTC GGCACCGGCA AGACAGTGAC GCTGAGCAGC
AGCTACGACG GCGCGGATGT AGGCAACTAC AGCATCACGG GCCAGACCAC GACGAGCGCG
GACATCACCG CCAAGACGCT GGCGGTCAGC GGTGTGACGG CCAGCGGCAA GGTCTACGAC
GGCGGCGTCA ACGCGACCGT CAACACGGCA GTCGCAAGCC TCACCGGGCT GGTGGAAGGC
GACGACCTGA CGCTCAACGC GACGGGCACG TTCGCGGACA AGCACGTCGG CAGGGGCAAG
ACCGTGACGC TGAGCAGCAG CTACGGCGGC ACCGACCTCG GCAACTACAG CATCACCGAC
CAGGCGAGCA CGACGGCCGA CATCACGGCC AAGGCTCTGA CGGTCAGCGG TGTGACGGCC
AGCAGCAAGG TCTACGACGG CGGTGTCAAT GCGACCGTCA ACACGGCAGT CGCAAGCCTC
GCCGGGCTGG TGGAAGGCGA CGACCTGACG CTCAACGCGA CGGGCACGTT CGCGGACAAG
AACGTCGGCA CGGGTAAGGC CGTGACCTTG AGCAGCAGCT ACGACGGCGC CGACCTCGGC
AACTACAGCA TCACCGACCA GGCGAGTGCG ACGGCCGACA TCACGGCCAA GGCGCTGACG
GTCAGCGGCG TGACGGCCAG CAGCAAGGTC TACGACAGCA CCACCGGTGC GACGGTCGAC
ACGAGCGCCG CGAGCCTCGC CGGGCTGATC ACCGGTGACG ACCTGACGCT CCGCGTCACG
GGTGTCTTCG CCGACAAGAA CGTCGGCACG GGCAAGACCG TGACGCTGAG CAGCAGCTAC
GACGGTGCCG ACCTCGGCAA CTACAGCATC ACCGACCAGG CCGGCACGAC GGCCGACATC
ACGGCCAAGA CTCTGACGGT CAGCGGCGTG ACGGCATCGG GCAAGGTCTA CGACGGCAAC
ACCGGTGCGA TGGTCGACAC GAGCGCCGCG AGCCTCGCCG GGCTGATCAC CGGTGACGAC
CTGACGCTCC GCGCCACGGG TGTCTTCGCC GACAAGAACG TCGGCACGGG CAAGACGGTG
ACACTGAGCA GCAGCTACGA CGGCGCCGAC CTCGGCAACT ACAGCATCAC CGACCAGGCG
AGTGCGACGG CCGACATCAC CGCCAAGGCC TTGACGGTGA CCGGCGCTAC CGCCAGCGGC
AAGGTCTACG ACGGCACGGC CGTGGCGAGT GTCGGCAGCG CCGGCAGCCT GGTCGGCGTG
ATCGGCGACG ATGTCGTCGG CCTCGATGCC AGCCGGGTCG CGGCAGCGTT CGTCGACAAG
AACGTCGGCA CGGGCAAGGT GGTGACGATC GACAGCCTGA GCCTGGACGG CGCCGATGCG
GGCAACTACA GCATCACGGG CCAGACCACG ACCCGCGCCG ACATCACGGC CAAGGCCTTG
ACGGTGACCG GCGCGACGGC CGGTGGCAAG GTCTACGACG GCACGGCCGT GGCGAGTGTC
GGCAGCGCCG GCAGCCTGGT CGGCGTGATC GGCGGCGATG CCGTCGGCCT CGACACGAGC
CTGGCCACGG CAGCGTTTGC CGACAAGAAC GTCGGCACGG GCAAGGCGGT GACGATCGAC
AGCCTGAGCC TGGACGGCGC CGATGCGGGC AACTACAGCA TCACGGGCCA GACCACGACC
CGCGCCGATA TCACCGCCAA GGCCTTGACG GTGACCGGCG CGACGGCCGG CGGCAAGGTC
TACGACGGCA ACACCCTGGC CACCGTCGGC AGCGCCGGCA GCCTGGTCGG CGTGATCGGC
GCCGATGCCG TCGGCCTGGA CACGAGCCTG GCCACGGCAG CGTTTGCCGA CAAGAACGTC
GGCACGGGCA AGGTGGTGAC GATCGACAGC CTGAGCCTGG ACGGCGCCGA TGCGGGCAAC
TACAGCATCA CCGGCCGGAC CACGACCCAT GCCGACATCA CGGCCAAGGC CCTGACGGTC
AGTGGCGTGA CGGCCGCCGG CAAGACCTAC GACGGCAACA CCGCCGCGGT GGTCGACACG
AGTGCCGCGA GCCTCGCCGG CCTCGTGTCC GGCGATGACC TGACGCTCAG CGCCACGGGT
GTCTTTGCCG ACAAGAATGT CGGCACGGGC AAGACCGTGA CGCTGACCAG CAGCTACGGC
GGCGTCGACC TTGGCAACTA CACCATCACG GGCCAGACGA GCACGACGGC CGACATCACC
GCCAAGGCCT TGACGGTCAC GGGCGTGACG GCGACCGGCA AGGTCTACGA CGGCAACACC
GCCGCGAGGG TCGACACGAG CGCCGCGAGC CTGAACGGGC TGGTGACGGG CGACGACCTG
ACCCTGCGCG CCACGGGCAC GTTTGCCGAC AAGAACGTCG GCGCGGGCAA GACCGTGACG
CTGGCCAGCA GCTACGCCGG CGCCGACCTC GGCAACTACA GCATCACCGG CCAGGCCGGC
ACCACCGCCG ACATCACGGC GCGCACCCTC GAGTTGCGTG CCGACGACAA GGCCAAGACC
TACGGCGATG CCGACCCGGT GCTGACGTTC GGCATCGGTG GCCTGGGGCT GGCCGAAGGC
GACAGCGCCG CCGATGTCTT TGCCGGCGCG CTCTCCACCG CGGTGGGGGC GGCAGCGACG
GCCGGGCTGC ATGCCATCGA CATCGGCACG CTCGCCGTCG GTGACAACTA TCGCGTGGGC
CAGTTCACGG CCGGTCAGAT GGTGGTCGGC AAGGCGGCGC TGACCGTGCA GGCCGACAAC
CAGCGCAAGA CCTACGGCGA CGCCGATCCG ACGCTGACCC ACAGCCTCGA CACGGCGCAG
CTGAAGTACA CCGACACGGT CGCCGTGGCC GACGGCGTGA CGCTGGGCAC CGCCACGGGC
GCAGCGGCCA CGGCGGGCAC GCACCGCATC GTCGGCGAGG GCGTGGCGGA CAACTATGTC
ATCACCGTGC TCGACGGCGA ACTGGCGGTG GACCGGGCGG CACTGACCGT CACGGCCGAC
AACCGGCACA AGACCTACGG CGACGCGGAT CCGCTGCTGA GCCACTCCGT CGACGCCTCG
CAGCTCAAGT ACAGCGACAC CGCCGCCGTG GTGACGGGTG TCGGCCTGAG CACCGCGACG
GGTGCGCAGG CGACCGCGGG CAGCCACGTG ATCACCCTCT CGGGTGCGAC GGCGGCCAAC
TACGCCATCG TGTCGGTCGA CGGCACGCTG AGCGTCGACA AGGCCGCCTT GACCGTGACG
GCCGACGACA AGAGCAAGAC CGCGGGCGAG AGCGACCCGG CGCTGAGCTA CAGCGTCGAC
GCCAGCCAGC TCAAGTACGC CGACGATGCC GGCGTGGTGC GTGACGTGAC GCTGACCGCG
CCCACCGGCA ACGGCCTGCC TCCGGGCGAC TACGCCATCG TGGCCGGCCA GGGCACGGCC
GACAACTACG AGCTGGGTTT TGTCGACGGC CGTCTGACCG TCAAGCCTTC GCCGAGCGTG
AAGGCCGAGA ACCTGGCCAC GCAGACGCGG GTGGTGACTG CGCCCGTGGT CGCACCCACA
CCGGTCTCGA CCGGTGCCGC GGTGGTCCAG CCGGGCAGTC TGACCGTGAT TGGCGGCGGG
CTGACCGTCC CCGTGTCGTC GTCGGGCTCG GCAGGCTCGG CCGGCTCGCC GGAGGGCGCA
GGTCTTTCGG GCGGATTCGG TTCCTCGGGG GGCCTTGGTG GTGGCTTCGG TGGGGGTACT
TCAGGCGGCA ATGCCGGATT CGGCTCCGCC GGTGCGAACG GTGCGGGGGG CGGTTCGGCT
GCCGATGGCG CATCGGGTGG GGCTGGCGGC ACTGGTGGTG CAGGCCGCAC CAATGCCACC
ACGGGCACAA CGGGCACGGG CAGCACCGCC GGGACCGCTG GTGCAACCGG TGCCGCAGAT
GCGGGTGGCC TGGTCGACAC CGGTGGCGCA AACGCTGCCG GCGGCGCAGG TGGCTCGGCG
GGGATCACCG GCGACAGCTC GGCTACGGGT CAGTCCACTG GCGGCACGGG TGGTGCCGGC
GGTGCATCCG CCAGTGCCAG TGCCGGCGCG GGTGGCGGTG ACGGACGATT GGTCGCGGTG
CGGCCGATCA GCCAGATCAG CGCCGACCCG GTCCGCGGCG TGACTTTCGA GGCCGGCGCT
GTGTTCGAGA AACCGGCGGG CAGCAGCGTG GCGTTTACGG CCACGCTGGC GGACGGCACC
GCGTTGCCGA GCTGGCTCAA GATCGATCCC CAGACCGGCA CGCTGACGGG GCTTCCGCCG
GCCGGCAGCC GCGATGTGTT CGAGGTGGTG GTGACGGCAC AGAGCGATGC CGGCCAGAAG
GCCGCCACGC GGGTGACGCT GGAATCCGGC GCCGTCCGCT GA
 
Protein sequence
MKNGPGSMNR VYRLVWNDIV GAFVAVAEFA RGRGKRSSSV VGAVLAVTLL GTGSGVQAAG 
PPAVNALPSG GTVVAGQAGI SQNGNALNIQ QTSVRAAINW LTFDIGAQAQ VNIRQPDAQS
VTLNRVLGAD PSAIHGRLNA NGQVILVNPN GIVFGKGSQV DVGGLVASSL DLADADFMSG
KLNFVRGDAL GKVLNQGTLK ADGGYVALLA PEVINEGVIS AQLGTVALAA GDAVTLDLGG
NQLLGVKVDP ASVKALVANR QLVQAEGGRV ILSAGAANRL LEQAVAGSAG ATELVEHDGG
VRLVSAGGTV KAGAGRIDVE GSTVDLSGTL DASGAQAGSV RVGADFVSQS GRIDASASQG
AGGRIAISAD TTIQTASATL NADGAGGGGS VRIAADDGAA GVVYSSAQIS ARGTGAAAVG
GEIAVSADSV QLRAANLDAS GQAGGGRVRV GGGFQGHDTD LANALNVGIN ASTVLRADAG
VQGDGGQVVV WSDDRTTFGG HVFARGAGAG GDGGQAEVSG KGDLIFAGTA DLSAPQGQAG
RLLLDPRNII VDNAASSIAS LAMDDPTPDA TSGFGTVTQV LGNGNVVITA PNADTAATAD
TGAVYLFDSS NGALLSNLRG SGAGDHVGSA GIQVLGNDNY LVLSPQYGTV SGVNVFGTST
SDSSTIGTPS AYAITASSTA SAGAITWQSA TGAAGSSLVS AANSLVGSTA NTDSVTSYSY
SGSTVTAGAS VSVTANDQLG KVSTYEPWGD VAVAGTTTVH ELADGNVAIA TPNWFNGRGA
VTWMNGATGA LFTGAAGAEV SASVATTGAT PIRTLAPVST DASGDKVYVI GFDASLPFTY
TAGATNRRST PPGAGGDAIG RNLTLFDDGS YVIASPQWSN GAAAYAGAVT WSGAGGTAGV
VSSGNSLVGS NAYDFVGSGG VTPVGAIDSN YLVASPYWTD SGNAAVGRYL ENAPNGAVTW
VDGSTGRAFG EAGTGAAVGG GNSLIGGAGD GLGARSNSIG TPTATYDLVP SPYSYSTTSS
TRTFATTDGI TELANGNYLV VDTSWSGSRG SVTFGDGAAG VAGSVSAANS LVGSTASDGV
GASVLELSGS HYAVVSSTWD NTAAGATDAG AVTWGSGTSG VSGAVSAGNS LVGSDDYEQV
GSGGAIGVGA TGADGLRTDY IVLSPLWGNR SSAQYSTTAY GAVTWVDGSN GDVHGSSANG
AAVSSTNSLV GSNTGDYVGS YVGSYSRVLG NPVDGASPIT TPWYANPTAT VDVLADGNYV
VRSPSWDAGK GAATWADGSA GIAGSVSAGN SLVGSTSDQY ATVTGSHTVN AAVYTDTTYQ
LTTTGDHVGL LGTALQDGNF LAISPSWGNG RGAISWLGAS APTGAVSAGN SLVGSTQDVF
TDATHTALVS AGDRLGTLAD STWVVPVVSS RNSDSYSPTG TRNLGPYQLG TSWNVTFAYW
NGSTWTYPVA GKNYGTTSVA STQIESDGWG YLTSHFAYNA KPIVATLSNG NALIGSPSWN
NGSATRAGAV TWINGATGTL ADASQGGTLG ATNSLVGDHT DDVLGFRLPL DGVAELANGN
FLLINPQWNE ERGAVTWGSG TAGVTGTVSA ANSLVGEIAS GSLGFSLPSD YVYRLVGSNF
SYDWTDRRSD QAGDRVGLGG VTVLADGNAV IGSPFWNQSA AWDSYSAPSS LGAATWIDGS
NGLLKDGSAG GSISASNSLV GSQHGDAVSY AAFVDPYSLQ HYVTSGITAL AAGNYVVASP
WWSNGATSQV GAVTYGTAGG IVGEVGVGNS LVGASAGDHV GRGLSSYDPW YYTAIIYSGV
QVLSDGSYTN YLVRTSDWTN ANDIAGGGAG AGAVTWVDGS TGHAFGEGGT GAVVSAANSV
VGNVAGDGVG AQLISLTRNV SGQSVATGDQ LLLSNLAYCG VPGVGAITLL SGAQGAAGPV
SWRNSMIGFA SAADGMNTSS FDGYSYSTDV HATLLPTAVT GAEQVAWRPL IWTTPNTTSG
NNGSRALALT LVSDSNTAPD SADQVNTGGG ANWAGSWFAG DNAGYTGVGG NTGTGLLGFS
ANTGEDVVIT PAALTGLLDA GTDVTLQASN DITVLRDITT SAGGNGGDLT LEAGRSIHLY
ADIRTDNGNF TAIANQSLAA GVVDADCATC VAEIVQQAGS HIDAGTGSVS LTLLASTDKT
SDDAGDIRLS NLDGASISVV NQGIDAAGNG RGIRFNEGAT IGSSATESLT LQTRGYTGLA
GGLALQSDTV LQGSNTATLS VGASDASIAV DLGSASGSGL AMTGAEMGSV IQQSSGFAEI
VFGRADQSGA TTVGTLDFTQ AAMLRSGSSS LDADLTLQGG SGGTTLSGAV TSAEAAGRTL
TVAGTDGAIT LANGASITAA TGVLQLAAHG SAALTQNAGN TLNATQLLLD GTGSVALNAG
SNTIGTLAGT VGTANVRTSS GNLTVGSVGT VDGLTTTSGM TLQAAGSATD VVLDQAVTNA
SGTLVMAAAG DFVNNVAADT GIDAGTGRYL VYSASPVDTT EGMTGYSKHY NQSYSAGSDP
AYASAGDWFL YSVAPTLTAS LGAGSTITYG QPGSAPGVNI SGLIDGDTLA SATTGSLASS
TSSYTASGAG FIPVGSHTLT LSGQGTLTSD LGYQIAVTTG SATLTVQAKA INVAGLAADS
KVYDGSTTTT VSGTASLTGG GATNLDGKYL TGDTLAIGGT ASGSFVDRHV GTAKSVALGG
LSLSGADAAN YTISTDSVTA DITPKTVTLT GLSVAASKVY DGTAQATPIG SAALLAAEAV
GSGSTGDGRR YTGDSVSLVG TASATYNGVN VADASAVTFG GLSLSGTEAG NYVLAAGSQA
ATITAKALTV TGATAVDKVY DGNTLATVGS AGSLVGVIGS DAVSLDASQV TAEFADKNVG
TGKTVTLGNL SLDGAGAGNY SITGQASTTA DITAKALTVS GVTASSKVYD GGVNATVNTA
VASLTGLVEG DDLTLSATGT FADKNVGTGK TVTLSSSYDG ADLGNYSITG QASTTADITA
KALTVSGVTA SSKVYDGGVN ATVNTAVASL TGLVEGDDLT LSATGTFADK NVGTGKAVTL
SSSYGGTDLG NYSITDQAST TADITAKSLT VSGVTASGKV YDGGVNAIVN TAVASLTGLV
EGDELTLSAT GAFGDKNVGT GKTVTLSSSY DGADVGNYSI TGQTTTSADI TAKTLTVSGV
TASSKVYDGG VNATVNTAGA SLAGLVEGDD LTLNATGTFA DKNVGTGKTV TLSSSYGGTD
LGNYSITDQA NATADITAKA LTVSGVTASS KVYDGGVNAT VNTAGASLAG LVEGDDLTLN
ATGTFADKNV GTGKTVTLSS SYGGTDLGNY SITDQANATA DITAKALTVS GVTASSKVYD
GGVNATVNTA GASLAGLVEG DDLTLSATGT FADKNVGTGK AVTLSSSYGG TDLGNYSITD
QASTTADITA KALTVSGVTA SSKVYDGGVN ATVNTAGASL AGLVEGDDLT LNATGTFADK
NVGTGKTVTL SSSYGGTDLG NYSITDQANA TADITAKALT VSGVTASSKV YDGGVNATVN
TAGASLAGLV EGDDLTLSAT GTFADKNVGT GKAVTLSSSY GGTDLGNYSI TDQASTTADI
TAKALTVSGV TASSKVYDGG VNATVNTAGA SLAGLVEGDD LTLSATGTFA DKNVGTGKAV
TLSSSYGGTD LGNYSITDQA STTADITAKA LTVSGVTASS KVYDGGVNAT VNTAVASLTG
LVEGDDLTLS ATGTFADKNV GTGKAVTLSS SYGGTDLGNY SITDQASTTA DITAKALTVS
GVTASSKVYD GGVNATVNTA VASLAGLVEG DDLTLNATGT FADKNVGTGK AVTLSSSYDG
ADLGNYSITD QASATADITA KALTVSGVTA SSKVYDGTTG ATVDTSAASL AGLITGDDLT
LRATGVFADK NVGTGKTVTL SSSYGGADAG NYSITDQTTT SADITAKALT VSGVTASGKV
YDGTTGATVD TSAASLAGLI TGDDLTLRAT GVFADKNVGT GKTVTLSSSY DGADLGNYSI
TDQASATADI TAKALTVSGV TASSKVYDGG VNATVNTAGA SLAGLVEGDD LTLSASGTFA
DKHVGTGKTV ALSSSYGGTD LGNYSITDQA STTADITAKA LTVSGVTASS KVYDGGVNAT
VNTAGASLAG LVEGDDLTLN ATGAFGDKNV GTGKTVTLSS SYDGADVGNY SITGQTTTSA
DITAKTLAVS GVTASGKVYD GGVNATVNTA VASLTGLVEG DDLTLNATGT FADKHVGRGK
TVTLSSSYGG TDLGNYSITD QASTTADITA KALTVSGVTA SSKVYDGGVN ATVNTAVASL
AGLVEGDDLT LNATGTFADK NVGTGKAVTL SSSYDGADLG NYSITDQASA TADITAKALT
VSGVTASSKV YDSTTGATVD TSAASLAGLI TGDDLTLRVT GVFADKNVGT GKTVTLSSSY
DGADLGNYSI TDQAGTTADI TAKTLTVSGV TASGKVYDGN TGAMVDTSAA SLAGLITGDD
LTLRATGVFA DKNVGTGKTV TLSSSYDGAD LGNYSITDQA SATADITAKA LTVTGATASG
KVYDGTAVAS VGSAGSLVGV IGDDVVGLDA SRVAAAFVDK NVGTGKVVTI DSLSLDGADA
GNYSITGQTT TRADITAKAL TVTGATAGGK VYDGTAVASV GSAGSLVGVI GGDAVGLDTS
LATAAFADKN VGTGKAVTID SLSLDGADAG NYSITGQTTT RADITAKALT VTGATAGGKV
YDGNTLATVG SAGSLVGVIG ADAVGLDTSL ATAAFADKNV GTGKVVTIDS LSLDGADAGN
YSITGRTTTH ADITAKALTV SGVTAAGKTY DGNTAAVVDT SAASLAGLVS GDDLTLSATG
VFADKNVGTG KTVTLTSSYG GVDLGNYTIT GQTSTTADIT AKALTVTGVT ATGKVYDGNT
AARVDTSAAS LNGLVTGDDL TLRATGTFAD KNVGAGKTVT LASSYAGADL GNYSITGQAG
TTADITARTL ELRADDKAKT YGDADPVLTF GIGGLGLAEG DSAADVFAGA LSTAVGAAAT
AGLHAIDIGT LAVGDNYRVG QFTAGQMVVG KAALTVQADN QRKTYGDADP TLTHSLDTAQ
LKYTDTVAVA DGVTLGTATG AAATAGTHRI VGEGVADNYV ITVLDGELAV DRAALTVTAD
NRHKTYGDAD PLLSHSVDAS QLKYSDTAAV VTGVGLSTAT GAQATAGSHV ITLSGATAAN
YAIVSVDGTL SVDKAALTVT ADDKSKTAGE SDPALSYSVD ASQLKYADDA GVVRDVTLTA
PTGNGLPPGD YAIVAGQGTA DNYELGFVDG RLTVKPSPSV KAENLATQTR VVTAPVVAPT
PVSTGAAVVQ PGSLTVIGGG LTVPVSSSGS AGSAGSPEGA GLSGGFGSSG GLGGGFGGGT
SGGNAGFGSA GANGAGGGSA ADGASGGAGG TGGAGRTNAT TGTTGTGSTA GTAGATGAAD
AGGLVDTGGA NAAGGAGGSA GITGDSSATG QSTGGTGGAG GASASASAGA GGGDGRLVAV
RPISQISADP VRGVTFEAGA VFEKPAGSSV AFTATLADGT ALPSWLKIDP QTGTLTGLPP
AGSRDVFEVV VTAQSDAGQK AATRVTLESG AVR