Gene GSU0279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0279 
Symbol 
ID2687003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp289984 
End bp307683 
Gene Length17700 bp 
Protein Length5899 aa 
Translation table11 
GC content63% 
IMG OID637124945 
Productcadherin domain/calx-beta domain-containing protein 
Protein accessionNP_951340 
Protein GI39995389 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA CGTCCGACGA TCAAGTGCGT GCAGCCAAGG GAAAATCCAT ACCGCTGTCT 
TCTTCGGGAA AGGATTTTAT CGAAGGAAGT CTTGATTTTC GGCCTGTCTC TCCCAAAAAG
CAGACTGCCG TGCGCAAGCC AGGAGAAGAA CAGGAGGAAA AGGCGGCGGT GGCCGCGGAG
CAGACCGAGT CCGACCACAT GACTGCCGAT GAGGCCGCGG ATGTTTCCTA TGACCATGTG
GCGGCCATTA CCGGCGAACA GTCCTTCGCT GATACGCTGT CTGTCGCTGA CCAGACAAAA
ACCGGAAAAG AGGAAAAATG CGGCGACAAC AATGACGATG ATGACTGTGA CGACAAAGGT
GGCTGGCTCT GGTGGGCCGG TGGGGCAGTG GGTGTTGCCG GCGGGAGCAT CGGTTTCGCC
GTTGCGGCGC TCAACGGTGA CGACGACGAA GGGACCCACG TTGACACTGC GGCCGTTGCC
TTTGCGGGAA GGGTGACCGA CGGCCCAGTT CACGGGGCGA CAATCTACAA CGATATCAAC
AAGAACGGCG TCTATGACGA TGGCATAGAC AAAGCCATGA CGCACAACAG CGAGGCCATA
ACCAGCGATG CCGACGGCAA CTTCACCATA ACAGTCGGCG ACCTGATCGA GAATGGCATA
ACTGACATCA ATAAGTTAAA GCTGGTGGCC TACGGGGGGA TCGACACGGT AACGGGCGAG
GCGGTCACGG TTGATTTCAC GGCACCCGAA GGCTATCGCT ACCTGAACCC GGTCACCAGC
CTCATTGCCG CCTATATGGA GGCATACAAC GAAGCCAATC CGAACGCAAA AATTACTGCC
GCAGAGGCCG AAGAGGCCGT GATACAGGCC CTTGGTCTCC CCCAGATCGA CTACGCGACG
ACCGACCTGG CGCTGCCCGA AACGGCCGTA GAGGCTCAAA AAGTGGCCGC GATTCTCGCT
GTCGCCGCCA TGCTCATTGA GGAGTCGGGC ACGGACAGCG ATGGTTTTGC CTTCCTGGCC
GCCCACCTGG CCCCGTCTGA AACCCCTCTG CCCGGCACCA TGACCTATCT GACGGATGAG
GTCACCACTG CACTGCAGTC GGCCAACGAC ACGACGGCAG CCAACCAGTT CTCATCAACC
GTGGTGGCCG TAAACGACGC GACCAGCCTC GACGACATCA ATACCGCCCT GAACAATACC
ATCTTCGCCG ACCTGATTGT GGCCGGCAAT GTCAAGGTGG GGCAGACGCT GGACGGCGGA
CTGGGGATGG GCAGCACCAC GGGGTTGGAG GTGACCTACC AGTGGCTGGA AAGCATCGAC
GGAACCACCT GGACCCCCAT AGCCGGCGAA ACCGGAAGTG ACTACACCAT CAGGCCAACC
GACATACTTC ACCATTTGCG TTTACAGGCA ACGTACATCG GTACGGACGG ACAACCGCGA
ACCATTTTTT ACGACGTGGG AGTCGTGCCC GACAGTTCGC CGGTGTTCGC ATCCAGCACC
AGCGGAGCCG TCGCCGAGAA TGAAGCCGTT GGCACGGTGG TCTACCGGGC GGAGGCAACC
AGCGACCTGG AGAACAACCC CCTGAGCTAC AGCCTGGGAG GAACCGACGC CGACCTGTTC
ACCATTGACG TCGCCACCGG CGAGGTCACG CTCAAGAACC CCGCTGACTA TGAATCGAAA
AGCAGTTACT CGATCGACAT CACCGCCACT GACACCTATG GCCTGACCAG CACCACGTCG
GTAACCATTG GCATCGACAA CCTGGACGAG GTGGCACCCT CCATCACCTC GGGACCCACG
GCTGCGACGA TTGCGGAAAA CAGCGGCCCC GGGCAGGTGG TCTATACCGC CGCGGCAGAC
GACTCGGCAG ACATCAGCGG CGGCGTCACC TTCAGTCTCA AGGCGGACGG AGACGCTGCC
CTGTTCACCA TCGACGCCGC CACCGGCGAG GTGACTCTCA CCGGCAACCC GGACTACGAG
GCCAAGCCCG CCTACAGCTT CACCGTCGTG GCAACCGATG CCGCGGGGCA CTCCACCGAG
CAGACCGTCA CCCTGGCCAT TGACAACCTG GACGAGGTGG CACCCTCCAT CACCTCGGGA
CCCACGGCTG CGACGATTGC GGAAAACAGC GGCCCCGGGC AGGTGGTCTA TACCGCCGCG
GCAGACGACT CGGCAGACAT CAGCGGCGGC GTCACCTTCA GTCTCAAGGC GGACGAAGAC
GCTGCCCTGT TCACCATCGA CGCCGCCACC GGCAAGGTGA CTCTCACCGG CAACCCGGAC
TACGAGGCCA AGCCCGCCTA CAGCTTCACC GTCGTGGCAA CCGATGCCGC AGGGCACTCC
ACCGAGCAGA CCGTCACCCT GGCCATTGAC AACCTGGACG AGGTGGCACC CTCCATCACC
TCGGGCCCCA CGGCTGCGGC GATCGAGGAA AACAGCGGCC CCGGGCAGGT GGTCTATACC
GCCGCGGCAG ACGACTCGGC AGACATCAGC GGCGGCGTCA CCTTCAGCCT CAAGGCGGAC
GGAGACGCTG CCCTGTTCAC CATCGACGCC GCCACCGGCG AAGTGACTCT CACCGGCAAC
CCGGACTACG AGGCCAAGCC CGCCTACAGC TTCACCGTCG TGGCAACCGA TGCCGCGGGG
CACTCCACCG AGCAGACCGT CACCCTGGCC ATTGACAACC TGGACGAGGT GGCACCCTCC
ATCACCTCGG GACCCACGGC CGATATTGCC GAAAACACCG GTGCCGGCCA AGTCATTTAC
ACTGCCGTGG CCGACGATGC GGCAGACATC AGCGGCGGCG TCACCTGGAG CCTCAAGGCG
GGTAGCGATG CCGCTCTGAC GATCGATGCC GTCACCGGTG CGGTCACACT AGCCGACAAC
CCTGACCATG AAGCAAAATC CGGCTACAGC TTTACGGTTG TGGCCACTGA TGCCGCGGGG
CACTCCACCG AGCAGGCGGT AGAGCTCTCT GTTCTGGACA ATAATGCCAG CGCCGCCATT
GCCGTTGACC TGACAACCAT CGCAGAAGGT TCCGAGGGGA CAAGCACCAT CCTCACCTAC
ACCGTGACGC GCACCAGTGC CCTCAATGCC AGTAGCGTCG ACTGGGCCAT CAGTGGCGTC
GATGCCGCCG ACCTTGCCGC CGGCCAGGCC GCAGTCGGAA CCGTCACCTT CGCCATAGGC
GAAACCAGCA AGACATTTAC GGTAGAAGTT GTCGGCGACA GGACCATCGA GAGCAACGAG
GACTTGGTCG TCACCCTTGG CAACCCTGGC AACGATATCG ATCTCGGCAC GGCCGACTCT
TCGGCCACAA CCATTGCCGA CGACGACGGC GAGGTGTCGA TAGCGGCGAC GGCCGTGTCG
GTTCCCGAAG GGGATACCGG CGACAGCCGG GTGGTCACCT TCACCGTCAC CCGTACCAAC
ACCCTCAGCG CCAGCAGCGT CGACTGGGAT GTGGCCGGCG GCACCGTGAA TGCGGCTGAT
TTCGGCGGGA CGCTTCCTTC AGGCACGGTT ACGTTCGCCG AGGGTGAAGC GACCAAGACC
ATCAGCATCA CGGTGACCGG AGACCGGATT ATCGAGCCCG ACGAAACCCT GACGGTACGC
CTGTCCAATC CGGGCCTGAA CCTCGTTCTG GGGGTGGACG AAGCGAGCTC CACCATTGTC
AACGACGACG TGGGCTTCAG CATCTTCGGC GACGTCATGG ACGTGGTCGA GGGGGGAATC
GGCGAACAAA GGGCGATCAC CTTCCACGTT GTCCGCTCCG ACAGCCTTAC CCTGCCCATG
ACCATCGATT ACCGACTGAT CCCCCGCGGC AGCACGGTGC CGGACGGCTT CGACTTCACC
GGCTCTCCGG ACAGCCTGGG CGACAACGCC GGCCGTCCGA GCGGCACCAT CAGCTTCGGC
CCCGATGAAA CCAGCAAGAC GGTCACGATC TACGTGGCCG GGGATGCCGT GCCGGAACTG
AACGAAACCT TCAGCATCGT GCTGGCCAAT GCGCCCCCCA GCACCATCAT CATCAACGGG
GAAATCGAGG GCGTCATCCG CTCCGACGAA ACCCAGTACA GCATTCATGC GGTCACCGCC
GCTACGGTGG AAGGAAACGG GACCGGGGGA ATCCAGCAGT TCCTCATCAC CCGCACGGGA
GACACGTCTC AGCCCGGTTC CGTCGGCTAC ACGCTCTCGG AATATGGCGA AAACCCCACC
GAGGCCAACG ATTTTGCCGC AGGAACCCCT CTGACCGGCA CTATCAGCTT CGCAGCCGGC
GAAACATCGA AGATTCTTTC CGTAAACCTT GAAGGAGACA GCGTACTCGA GGGGTACGAG
AGCTTTCAGG TCGCCCTGCA CACCCTCGAC AGCAACTCAA TCATCGGGAC AAACACGGCC
GTAGCATCCA TCATCCCGGA TGATGCGGCA ATCAACATCG CGGCAACTGA CAGCATCGTC
AAAGAAGGAA CCGGCGCCGT CTCCCGCAGC CATACCTTCA CGCTGACCCG CTCATCTCAT
GTTGATAGCG AAGTCACCGT CGACTGGCAC CTTGCCGGAA CGGGGGCCAA TCCGGTCGAT
GCCGCTGACT TCGGCGGCAC TCTGCCGTCG GGCAGCGTTA CCTTCGCTCC GGGCGAAACA
GTGAAGACCC TGACCATCAC CCCCAGCACT GATGCAGCCT ATGAACCGCA CGAGAGCTAT
GAGATCGTGC TCTCTACCAG TCAGCTGGGA GTCGTGCTGG AAACGGACCA CGCCTCTGGC
ATGATCCTCA ATGACGATTC GGGACTCACG CTGGTTGCCA CCAATCTGGA CAAGGCAGAA
GGGAATCCGG GTACGCCGTC GCAACTGACG TTTACCGTGC AGCGTACGGG CGACACTACC
GGCGAGTCCA CCGTGCACTG GGAACTGGTC TCCGCGGACG GCAGCGGCGT CAGCGCCGCG
GATTTCGCCT CGGGGATACT CCCTTCCGGC GATCTGACTT TCAGCCGCGG CGTTACCAGC
CGCGTCGTAA CCATTCCGCT CACCACCGAT AACATCATCG AACCGGACAA GGGATTCACC
ATCCGCTTGA GCAGTCCTTC GGAGGGAACC GAGCTGCTGG TCAGCGAAGT CGGCGGTTAC
ATCCGCAACG ACGATGCCGC CTTTACCCTT GAGTCCGTTT CCCCGGTCGC CGAGGGACAC
AACGGCACCA CTACGGTTAC CTTCACGGTT GTCCGGACCG GCGACATCAG CGGTGCGGAC
ACGGTGGAGT ACGTCGTAGC CCCTGCCGAC GGCGGAGCAG TTGTAGATGG GGCCGACTTC
GTCGGCGGAC AACTGCCGGA CGGCCTCATC ACCTTTAACG CCGGCGAGGC CAGCAAGACC
GTCACCCTTG CCGTTGCAGG AGACAACGCT CTCGAAAGTG ACGAAGCGTT CACTATCACC
CTCGTTAACC CCGGCGTGGG GAGCACCATC GCCAGCGGCA GCACCGATGT CGTAATTCTC
AATGATGATG ACGCCCTGAG CATCGTGGCA ACCGACGCCG ACCAGGCAGA GGCGGCCGCC
GGCGGCACGC GCGATTTTAC CTTCACCGTC AACCGAACCG GCTTCCTCGA CCGCGCCACC
ACGGTAAACT GGAGCGTCGC TGGCGTCGGT GCCAACCAGG TCGATGCTGC GGACTTCGGC
GGCGCACTCC CCTCCGGAAC CCTCGAATTC GCCGCCAATG AAAGCAGCAA GACCATCACG
ATCACCGTCA ACGGCGACTA CTTCCAGGAA GCGGACGAAG GATTCCGCGT CACCCTCTCC
TCGCCGTCGG ACGGAACCAC TCTTACCACC GCCAGCGCGG ATGGCGTGAT TCGCAACGAC
GATACCGGAC TGGCCATAAC GGCAACAACC ACGACACTGG CGGAAGGAGA CAGCGGCACA
GTAACCCACG TCTTCACCGT CACCCGCACC GGCGTCACCA CCGGAACCAC CACCGTCGAC
TGGGCCCTTG CCGGCTCGGG CGGGCATCCG GTTGACGCTG CGGATTTCGG CGGCACGCTC
CCGTCCGGAA CGCTGGTCTT CGCCCCGGGC GAAACCACAA AAACAATCGA AGTACAGGCC
TCCGGCGACA CCGACATCGA GCCCGGCGAG GGCTTTACCA TCACCCTCTC GGGCGCCGAC
GGCAATGCCG ACATCATGAC CGCATCGGCA AACGGCACGG TGGTGGCAGA CGATATCAGC
ATCGCTATCA GCGCCGGTAC CGCGTCGGTT ATGGAAGGCG CAACGGGCAG CAGCCGCGTG
CTGCAGTTCA CCGTTACCCG TACGGGTGAC CTGGCTTCCC CGGTCAGCAT CGACTGGTCC
GCCAGCGGCA TGGACGCCGC CGACTTCGCC AACGGCACGG CCCTCTCCGG CACGATCAAC
TTTGGCGCCG GCGAAACCGT CAAAACCATC AATCTCACCC AGATCGGCGA CAATGTCAGC
GAAAGCGACG AAACTCTCAC CATCACGCTG AGCAACCCGG CGGGCAACCC GGCCCACGAC
CGGACCTACA TCACCTCAGC CACAGCCACG ACTGACGTGG TCAACGACGA CGCCTCCCTG
ACAATCACCG CCGACGCTGC CTCGCAAAAC GAGCACAACA CCGGCGACGG CGAAGCTACT
TCCTTCACCT TCACCGTGAC CCGCACCGGC GACACCTCCA CGGAAACCAC CATCGACTGG
GTGCTGCAGC TTCCCGGCGG AGCCGGTTCC GCCGCGGGCA ACGACTTTGT CGCGGGGCAG
GACCTGCTCG GCACAAACAG CGGCCTGCCG TCGGGAACCA TATCCTTTGC CGCCGACGAG
ACCTCCAAAA CCATTACCGT CTTAGTGGCC ACGGACAACC AGGTGGAGCA GGATGAGACC
TTCAGCATCC AACTGCAGGG AGCCGGCGCC AATACCGAAG TGTCCGGCAA CAGCGCCAGT
GCCGTCATCA GCAACGATGA TACCGGCTTT TCCATCATTG CCCTTGCTGC CGACCATACC
GAAGCCAATG GCGGCACCGT CACTTACACC TTCCGGGTTA CCCGGGCCGG CGACATCAGC
TCCGCGGCCA CCGTCGACTG GGATGTTGCC GGTAGCGGCG CATCCCCAGC CAATGCCGAC
GACTTCGGCG GCAGCCTGCC GGGCGGCACG CTTTCCTTTG CCGAGAATGA AGCCAGCAAG
GAGATCAGCT TCACGGTCAG CGGCGACACC GTAGTGGAGC AGGATGAGGA GTTCACTGTC
ACCATCAGCA ATGCCCAGCT TACCGACGCA ACGCCGCAGC TGATCCAGGA CGCCACCGTC
GGCGGCATCA TCCGCAACGA CGACCAGAGC TTCAGCGTCA GCGCTGCCAA CGCCTCCGTG
ACCGAAGGCT CCGCCGGCAC CACTCAGATC GCCTACACCA TTACCCGCAC CGGCGACCTG
AGCGACAGCG TCACCATCGA CTACGCCGTC ACCGGCGCGG GCGGCGCCGC CACAAGCGAC
GTTCAGGGCG GTGTTCTACC CACCGGCACC CTTACCTTTG CCGCCGGTGA AACCAGCAAA
TCCGTCACCT TCGACGTCAT CGCCGACACC CTGGCCGAAG GGAACGAGAC CTTTACCCTC
ACCCTGACCA ACCCGTCGGC CGGGATCATC GGCACCGCCA GTGACAGTAC GGTGGTTGTC
AACGACGACA CCAACTTCGC GCTGAGCGCA CCGGCGCCCT TTGCGGAAGG TGAATCCGGC
AGCGCCACGG CCACCTTCAC CGTTACCCGT AGCGGCGACA GCACCGGCGC CGGCAGCGTA
CAATGGTCCG TGGCCCCCGC CACCGGCCTG ACTACCGCCG ACTTCACCGG GAACCAGGAC
CTGTTGGGCA CCAACAGCGG CCTGCCCAGC GGCACCATCA CCTTTGCCGC GGGAGAGACC
AGCAAGAACA TCACCATTCA AGTGGCCGGG GACCTGACCC TGGAAAACGA TGAAACCCTG
CGAGTGATAC TGGCCGATCC CACCGGCGGC ACCATCGAAG GGACGGACGG TGACAAGAGC
ACCACCATCC TGACCGACGA CGACAGCTTC AGCATCAGTA CCCTGACCGC CAGCCGCGCC
GAGGGGAACA GCGACAGCAC CATCACCTAC ACAGTGACCC GCACGGGCTC TCTGGTGGGT
GCCCGTGATC TCACCTGGAC CATCACCGGC GCCGATGGCT TCGCGACCGG CAACGACCTG
GCCGGCGGGC AGGCGGCAAC CGGCACGGTC AGTTTTGCCG ACGGCCAGGA GAGCGCCACC
ATCGTGGTCA ATGTCAAGGG CGATTCCGCC GTGGAGTCGG ACGAAACCAT GACCGTGACC
CTCTCGGGGG CTCCGGCCAA CAGCGTCATC GGCACGGCCA GCGCATCCAC GGTGCTGACC
AACGACGATG CAAGCGTCTC CATCGTCACG CTGATAGCCG ACAAGAACGA AGGCAACGTG
ACCATCGTCA CCCCCACTGG TGAAGTACCG GGCAGCACTG CCAGCACATT TACCGTCAGT
GCGGCCGGGA CGGTCAGCGG CACCGTGGAG TCCGCCGGTG ATCGCGACTG GTATAAAGTC
AACCTCGTCG CCGGCCACCA GTATCAAATC GATCTCATCG GAAACGGCAG CTATACGGCC
GGTGACGTGT TCCTCTCGCT GCGTAACAGT ACGGGTATCC AACTTGCCTC TAACGACGAT
TTCATCGGCG TTAATTCGCG CATAACCTAC ACTGCCCCCT CAAACGGCAT TTACTTTATC
GATGCTGGGC ACCTGGGTTC CGGCACTGGC ACCTATGGTG TAACCATCGC CGACCTGACT
GTTCCCGGTA CCGATATGAG CGCCCCGGCT TACGGGGTCG GCGCCCAACC GTACACGTTC
ACCATCACCC GCACCGGTGA CACGACCCAC GGCAGTACAG TGGAGTGGCG AGTAGCCCAA
GGGGTGGGTG TCGACGCCGT GGACTTCGGC AGTGTCGGCT CGCAGGACCT GCTCGGAGAC
AACAGTGGCC TGCCCAGTGG CACAGTCACC TTTGCAGCCG GTGAAATCAG CAAGACGCTG
ACCGTCAACA TCGCCACCGA TTCAGCGAAG GAGACTGACG AGATCCTTCG CGTCGTACTC
TCCAATCCGT CAGCCGGCAC CGAAGTCATC ACCGCCAGTG CCGATGGTAT CGTGCGCAAC
GACGACGCCG AACTCAACAT CACCGCCGGC ACGTTCAACC TTCTCGAAGG AGACGGCCTC
CACGGCACCG GCAAGGCCAT GACCTACACG GTGACCCGCA CCGGCAACAT CAACCAGACC
AGCACGGTGG ACTGGAGCGT GGTGCACGGC ACCACCAGCA GCGCTGACTT CACTAATGGC
GTTGGCTCAA ACCTAACCCC CAGCGGGACC CTGACCTTCG CCTCCGGTGT CGCCACCCAG
ACCATAGTGG TCTACGTCTA CGGTGATACC GGCGTGGGCA GCGTCGAAGG TGACGAAACC
TTCTCCATCC AGCTCTCCAA CCCCAACTCA GGCAGCGCGT TGGGCAACAT CACCAGCTAT
ACCAGCACCA TTCTCGAGGA CGATACGCGG CTGGTGCTGA GTGCCGCCGA CTACAGCCAG
GCTGAAAAAA CCGCGGGCAA CAACACCACC TACACCTACA ACATCGCGCG TGAAGGCTAC
ACCGGCGGCA CCACGAACTA CAGCTGGGCG GTTGGCTATA CCGACCCCTA CACCGGCAAC
CCCGCCTACA TGTATGACAA TACGCAGTCG CGGTACGAGA CGGTGACGGC GAATGCCTCG
GACTTCACCG GATCGTTGAG CGGCAGTGGC AGTTTCTCGG CCGGCCAGAC CAACGCCAGC
TTCACGGTCA CGGTGACGGG CGACGATACA CCGGAAGACG ACGAATGGTT CGCGGTCAAC
CTGACCGCCA GCAGCGGCTA TGACGAAGTT ACCGTGATCT ATGACGATCC GACCAAAGGC
ACGGGGACCC AACTCGCCAG GACCTATTAC TCCCCGTACC GGACCTACTA TGACGGCCAG
CAGGTCAGCA GCGCCACCAA CGGTGTCGCC AGCAACACCA ACTACCTGTT CTCCTCCATC
GAACGGGACG AAGCGGTCTA CTATCTGAGC GACCGCGAAG TGGCCAGCAC CAGCGTACAG
ACGCTGAACC CGGGCGACGG GCTGCGCACG CGCGTCGAAG GCGATACGCC GGCCGACGGC
GGTGCGGGCG CCACCACCGT CACCATTGAG GGCGTGGAGT ACGGCTATGT CGAGCATATC
TTCGCCGTGC AGCGCCAGGT TGCCACTGCA GGAACGGCAA GCGTGGGCTG GCGCATCGGC
ACGTACTACA ACGCAGCGGT CAGTGCGGAC GATTTCCTCA CCATCACCCG TGACGGTAAC
GGCGACATCA CCGCCATCAC CACCGCCGGC GCCCTCCCAT CCGGCACCGT CACCTTTGCC
GACGGCCAAG AGTGGGCCTA CATCAAGTTC TACACCAAGG TGGACGATAT TGGAGAATAC
GACGAGTATT TCAGCATCTT CCTCGAGAAC CCCAGCGCCG GCTCCTCGAT CTATACCTAC
GACACCGTGA GCTATCCGCA GTACAACTAC GGCATCATCA CCAACGACGA CACCCGCTTC
GACGCATCGG TCAACGATGT GGTGGAAGGG GGAACTCTCA CCTATACCGT CACCCGCAGT
GGCGACAGCC GGGGCACCGA CACCGTGGAC TGGTCCCTGG CACTGCCCGG CTCCGAAGCC
ACCAATGAAA GCAACAACTC TACCGGCACG TGGTACAAGC TGGATCCTTC GGACATCGAC
AGCGTCACCC CCTCAAACGG TACCGCCACC TACAATGCAG GGACCCTCAC CTGGTCCGGG
ACCCTGACCT TTGAAGACGG CGAGACAACC AAGACCATCA CCGTGGTCAC CACCGATGAC
AGCTGGACCG AAACCTGGCG GGAGGAGTTG CCCATCGTCC TTTCCAACGC CACCAACGTC
AATGCCGGCG AAGGCAACCA CGACCAGGAG ACCGCGTCCA CCGGCTACAC CGATACCGCC
AGGGTCTACG ACAACGAATC GGACCCGCTG ATTGGCGTTT CGGTCGGCAG CAGCACCACC
TGGGAGGGGA CCGGCGCCAA CGATTCGGCC ACCGGCAACA GCGTCACCTT CACCATCACC
CGCACCGACC AGGGCGGCCG GGACGGCAGC CTGAACTACC CCACTACCGT GGCCTGGCGA
CTTGACGGCA GCGGCATCAA CTGGGGTAGC GCCAACAACA GCGCAGAGAT TTTGACCTAT
GGCGGTGATG CCGCCAGCGT AAATGAATAC ACTTCCAACA CCACCTACGG CGTGGTCACC
TTCGCCGCCG GAGAAACCAG CAAAAACGTT GTGGTTACCT TCACCGGCGA CCGCTACGTG
GAGTCTGACA AGACACTCAC CTTCACCGTG CTGGACCCGG ACGACGCCGA ACACGGGCCG
CTCTATACCG ACTTCTACGG TCCGGCGGAC ATCAACAACG CCCAAGCCAG CGTCACCACA
ACCCTGAAGA ACGACGACAT CCGCCTCTGG GTCGGTGGGT GGGACACCTA CAGCGGCGAC
GCCAACGGCT ACTACACCAA TGTGCAGACC AGCGCCTACG AGGGGAACCC GCTGACCTTC
GCGGTCAACC GCTACGGCCG GCTCGACTGC GACATCGTCG TCAACTACAC CCTGATCAAC
GGCACCACCA CAAACGGCGA CTTCACCACC ACCAGCGGCA GCTTTACCCT TGCCGCACAG
GGCAGCGCCT ACGGCGAGTA CACCTACAGC ATCTCCCTGG CCGACCTGCT CACCGACGAC
ACCACGGTTG AGGCAAACGA AACCTTTACC CTGCGCCTTT CCGCCCCCGG CGACAGCGCC
GGCTCCAGCG TGCGTTTCCA GAGTTACTAC GCCGATTACA CCAGCAGCTA CAACTCGCCG
GCCACCACGC TGGACGTACG CGGCACCGTC TACGACGACG ACACCACCTA CACCCTGACG
CCGGCCAGCA CCAGCTTGGT GGAAACCGAC CAGGGAGCCA GCCAGACCTT CTCATTCGAT
GTCACCCGCG GCGGCACCGG CTATACCGGC GCAGCCCAGC TGCGCTGGCG CGTCGAGGCG
GTGGGGGGCA CCCCGGCAGA CAGTGCCGAC TTCACCAGCA CCGATCTCCT CGGTACCAAC
AATGGCCTGC CCAGCGGCAC GGTCAGCTTT GCCAACGGCG AACTCACCAA GACGTTCAGC
GTGCTAATCA GGGGAGACCT AGTCGCAGAG AACAATGAAA CCTTCCGGGT GGTGCTCTAC
GAAGACGTGC TGACCAGTTC TTCCCCCACC ATCACCAACA GCCAGAGCGT GGCCAGCAGC
ACCCTGACCA TTGTCACCGA CGATACCGGC ATCAGCATTG CCGACAGCAC CCTGACGGAA
AGCGATGCCA ACCAGACCAT GACCTTCACC ATCACCCGTT CCGGTGACAC CAGCGGCACG
TCATCCATGA ACTGGACCCT CTACCACGGC ACCACCACCG CTGGCGACTT CAGCGGCGCC
ACCACCGGCA CGGTCAGCTT CGCCGCCGGC GAGACCAGCA AGACCATCTC GGTGACAGTG
GCGGGCGATG CCACGCCGGA GGCCGACGAG ACCTTCACCA TCCTGCTGTC AAACCTGGTG
GGGGTCGATG AGGCCATCGA CATCAGCGCA ACCGGCACCA TCAAGAACGA CGACTCCTCA
TTCGCCATAG CCGGTGACGC TGCCAGCTCG CCGGAAAGCG GCAGCCAGAC CTTCACCATC
ACTCGCACCA ACGACACGGC ACAGAGCCAG ACCATCACCT GGTCGGTAAG CGCCGGCTCA
GCTGGTGCAG CCGACTTCGG CGGTTCGCTC CCCTCCGGCT CCGTCACCTT TGCGCCGGGC
GAGATGAGCA AGACCATCAC CATCTCCCCC AGCAGCGATG CCACGCCGGA AACCGACGAG
TCATATACCG TCAGCATCGC CTTGGGGGCC GGCACCACCG GCGACACCAT CACCCAGGCC
ACGGCCACCG GCACCATCGA GAACGATGAC GCCGCGATCT ACATCGCTGC CGATCAGACG
AACCAGCAGG AAGGACACAG CGGCACAACA CCGTTTACCT TCACCGTAAC CCGTACCGGC
AACACGACGG GCGCAGCCTC GGTCGACTGG GCTCTCTCGT CAGCCGGCGC GTCGGCCGCC
GACTTCACCA CGGCGGACGG CCTCGGGTCC AATGGCGGCC TTCCCTCCGG CACCATCACC
TTTGCCGACG GCGAGTCCGC AAAGACCATC ACCATCGAGA TTGTGGGCGA CGAGGTAGTC
GAAGCCGACG AATCGTTCAC CATCACCCTG TCTAACGCAG CTGGCGGCGC GATCATCACC
GGCTCGGCCG GCTCCACGAT CGCCAACGAT GATTCGACCA TCGCCATCGC GGCCGACAGC
GCGGTAAAGA ATGAAGGCAA CAGCGGCACG ACCGCCTACA CCTTCACCAT CACCCGTACC
GGCTACCTGG GCGAGGCGGA AACGGTCGAA TACTCCGTCG CCGGTTCCGG GGCCCATCCC
GCCGACGGCA CTGATTTCAA CGGCACGACG GGCACCCTGA CCCTCGCCGC CGGCGAGGCC
ACCACTACCT TGACCATCAA CGTGAGCGGC GATCTCTCCG GCGAACCTGA CGAGGATTTC
ACCGTTACCC TCAGCAACCC GAGCAGCGGG GTGACCATCA CCACCGACAC GGCAACGGGG
AGCATACTGG CCGACGACAT CGTCTTCGAC GTAGCGGCAC CGGCCTCCCA GACCGAAGGG
AATCCGGGGG ATACCACCTA TTTCGATTTC GTGGTGACAC GCAGCGGCAA CCTGAGCGGC
TCCCAGACCC TCACGTGGAG CGTGGCCGGT ATCGGCGCCG ATGGCACATC CGGCTCGGAC
TTCGACAGCA CCACCGGAAC GGTTACCTTC GACCCGGGCG AGACCAGCAA GACAATCAGC
GTTCCGGTCA AGGGGGACTA CCTGGGCGAG GCCGACGAAA ACTTCCGCCT CACCCTCACC
GGTCCGGACG GTGTGGTGTT CACCCACAAC TCAGCCGACG CCACCATCAT CGACGACGAG
GCCTCACTGC GCATTAGTGC CACCGACGCC GGCAGGGCGG AAGGAGCCAA CGGCGTCACC
AGTTACACCT TCACCGTGAC CCGCACCGGC AACACCGCAC TTGAGGCCAC CGTTGACTGG
TCGCTGGCTG CGGGCGCCAC GGACCCCGAC GACTTCGCGG GGGGAACGCT TCCCTCGGGG
AGCCTGAGCT TCGCCGCCGG CGAACTGAGC AAGACCATTA CCGTGGATGT GGCCGGCGAC
ACGGCGATTG AAGGCGATGA AAGCTTCACC GTAAGTCTCT CCAATGCTTC CACCGGCGCC
GATATCGTGA TCGGCAGCGC CACCGGCACC ATTGTCAGCG ACGACGTAGA GTGGACGGTC
TCCCCCCTGA GCGTTCCCGC TGTTGAAGGT GATGGCGCGA GCAGCTACGT CTTCCGGGTT
ACCCGTACCG GCAGCCTCTC GGCCACCACG CTGGACTGGA GCACTGCCGG CAGCGGAACG
AATCCCGCCG ATGCCGACGA CTTCCTGGGC AGCTTCTTCC CGTCGGGGAC CCTGGTCTTT
GCCCAGGGAC AGACAAGCCA GGACATCGTC GTGCAGATAG CGGGCGACAA CCTGCTGGAA
GCCGACAAAG AGTTCTCCGT GACGCTGGCC GCTCCCGTCA ACGGCCTGAC TCACAGCTAT
GCCGAGCAGA CCGCAAGCGC CACTATCGTC AATGACGATG ACGTGATCTC CATTGCCCCC
CTCTCGGCCG ACCATGCCGA GGGGACGGAC AGCTCCAGCC CCTTCACATT CACCGTCACC
CGGACGGGCA GCCTGACCGG CACATCCACC GTGGGCTGGC GCATCGTGCA CGGCGACACC
TCAGCCGACG ACTTTGTCGC CACCACCGGC ACGGTCAGCT TTGCGGACGG CCAGGATACC
GCCACCCTCA CCGTCCTGGT CAGCGGCGAC CGCAACCTGG AAGGAGATGA AGGTTTCAGT
GTCGAACTCT ATAACCCGGG CGCAGGAAGC ACGGTCGACG ATACTGCCAC AACGGCCTCG
GGCATCATCC ACGACGATGA CGTGGACCTC TCTCTGGCCG CTGCAGACGC CAACGTGGCG
GAAGGCGACA GCAGCACCGC CGGGCATGCC ACCTTTACGG TCACCAGGAG CGGCGATCTG
AGCGTGGAAA CCAGTGTCAA CTGGAATGTC GTTGCCGGCA CGGCAACGGC CGCAGACTTT
GCCGGCGGGG AATTGCCGGG TGGAACCGTG GTGTTCGGGG CCGGCGAATC AAGCAAGACC
ATCACGATCG ATCTGGCCGG CGACGGCGCC TGGGAAGGAA ACGAAACCTA CACGGTGCAG
TTGTCGGGCG CCAGCGATCA TGCCGATATC GTGGCCAACA ACGTCTCCGG CCAGATCATC
GACGATGACG ACACCCTGAC TCTTTCGGCT GTCTCCGCCG ATCATGCCGA AGGGAACTCC
GGGGCCACCA TCTACACCTT CCGCATCGAC CGCGCCGGGA CCGCCACCGG TGCCACCAGC
GTAGAATGGA TTGCTGCGGG AAGCGGTGCG CACCCGACCG ACCAGGACGA CCTGCTGGCC
ACGACCGGCA CGGTCACGTT CGCTGACGGC GAAACAAGCA AGACCTTTAC CGTGGAAGTC
GCCGGCGACA CCACCGGCGA ATACGATGAA ACCTTCAGCG TATCTCTTGC CAACCCGGCC
TATGGCTCCA CCACGGTGGG AGCGCCGGTC ACGGCAACGG TGCGCAACGA CGACGCCGTT
CTGTTCGTCA GGGCCGACCA CGTGTCGGTG GCCGAGGGAG CGGACGGCGT CGAAACCACC
TTCACCTTCA CCGTCACCCG CAGCGGGGAC ACTTCCGGCG CTGCCAGCGC CCTCTGGGAG
GTGACGGGCA GCGGCCTCCG GCCGGCCAAC GCGGCTGACT TCGGCGGGAT ATTCCCCTCG
GGTGCGGTCG CCTTCCAGCC GGGTGAAAGC ACGCAACAGA TCAGCCTGAC CGTCCTCGGC
GACGCAGTTG GCGAATATGA CGAAACATTC TCCCTGGTGC TTTCGGACCC CGAAGGCGCC
ACCATCCTGG AAGGGACCGC AGAAACCATC ATTGCCAACG ACGATACCGG CATTTCGATC
ACGGCTCTCG ATGCCGACAA GGCGGAAGGC AACAACGGTC TCACCGACTT CACCTTCCGG
ATCGAACGGG TCGGCCTGGC CAACGGAGCC GCATCGGTCT CCTGGGCCGT AGCCGGCACC
GGGAGCTACC CTGCGGGTGC CGACGATTTC GCAGGGGGCA TCCTCCCCTC GGGGACGGTG
TACTTTGCCG ACGGCGAAAG CGTGAAAGAC ATCACCATTC AGGTGGCGGG GGACGAAACC
TATGGTCAGG ATCAGACCTT CCGCGTGCTC CTCTCCAATC CCGCGGGAGC CAACCTCATT
AACGCCGATG CTACGGGCGT CATTCGCAAC GATGATTCCC AGGTCGCCAT CACGGCGCTC
GACACAGCCA AGCTGGAAGG AAATGCCGGC ACGACGACGT TCAGCTTCCA GGTAACCCGC
ACCGGTGCTC TGGACACCTC GGCAACCATC GATTGGGACG TAATCGGCAG CGGGGGACAT
CAGACCGTTG CCGGAGACTT CGCAGGCAAC AGCTTCCCCG GCGGCGCCCT GACCTTCGCT
GTGGGCGAGT CGTCGAAAAC CATCACGGTT GAAGTGGCTG GTGACACTCT GACCGAAGTG
GACGAAGAGT TCAGTGTCCG TCTGCGCAAT CCTGGCTCCG GCGTCAGTAT CGCGCCCAAT
GCCGGCGAAG CCTCAGCCAC TATCCTGAGC GACGATGACG GGGTGGTGCT GATCGGCCTC
GACGTGGACC GCCACGAAGG GGCGTCCGGA ACCCAGACCG TTTATACCTA TCAGGTGCTC
CGCTCCGGCA ACATTGACGC GCCGATAACG CTCAACTACG CCGTCAGCGG AGATGTCGAC
AGCGCCGACT TCATGTCTCC CCTTACCGGG TCCTTCGAAA TGGGGGCCGG AGAGAACAGC
CGTCTTCTGA CCTTGACGGT GAACGGCGAC GACATCGTCG AGCCGGACGA ATTCTTCCAG
GTAACGCTGT CCGGCAGTGG AATCAACATC GACAGCACCC CGGTTACCGG CGCTATCCGC
GGTGACGACG TGGCCGGTGA CGGGAATGAC GTCATCCATG CCGCGGCAAC CGCCGACACG
ATCAACTCCG GCGCCGGCGA CGATGTCATC CATCTGACCA TGGACAATCT GCTCCATCTC
CAAGTGACGG ATGGGGCGCA TGTGGACGGT GGCCTCGGCT TCGACACCAT CCTGTTCGAC
GCCGCGGGCC AAGAATTTGA TCTGGTCGCG CTCGTGGCAA ACGATGCCAT GAGTGGCATC
GAGAAGATCG ACCTGGGGGG CGAGGGCAAT ACCCTTCGCC TCACCACGGC GGAACTTCTT
CACCAGGACC AGAACCTGTT CAGTATCCTG GCGAACGGCT CGGAGCCGTT CCACCAGCTG
ATGGTGGACG GTGACGCCGA CGATGAGGTC ATCATCGCCG ACATTACCAA CTGGAGCCAC
GGAGCCGCCG ACACCTACAC CGACGGCAGC GTGACCTATG ATGTGTACAC CAACGGCACC
GACCACACCC AGCTGCTGAT CAACCAGGCC ATTACCAATG TGCATGGAGT CGCAGGGTAA
 
Protein sequence
MKKTSDDQVR AAKGKSIPLS SSGKDFIEGS LDFRPVSPKK QTAVRKPGEE QEEKAAVAAE 
QTESDHMTAD EAADVSYDHV AAITGEQSFA DTLSVADQTK TGKEEKCGDN NDDDDCDDKG
GWLWWAGGAV GVAGGSIGFA VAALNGDDDE GTHVDTAAVA FAGRVTDGPV HGATIYNDIN
KNGVYDDGID KAMTHNSEAI TSDADGNFTI TVGDLIENGI TDINKLKLVA YGGIDTVTGE
AVTVDFTAPE GYRYLNPVTS LIAAYMEAYN EANPNAKITA AEAEEAVIQA LGLPQIDYAT
TDLALPETAV EAQKVAAILA VAAMLIEESG TDSDGFAFLA AHLAPSETPL PGTMTYLTDE
VTTALQSAND TTAANQFSST VVAVNDATSL DDINTALNNT IFADLIVAGN VKVGQTLDGG
LGMGSTTGLE VTYQWLESID GTTWTPIAGE TGSDYTIRPT DILHHLRLQA TYIGTDGQPR
TIFYDVGVVP DSSPVFASST SGAVAENEAV GTVVYRAEAT SDLENNPLSY SLGGTDADLF
TIDVATGEVT LKNPADYESK SSYSIDITAT DTYGLTSTTS VTIGIDNLDE VAPSITSGPT
AATIAENSGP GQVVYTAAAD DSADISGGVT FSLKADGDAA LFTIDAATGE VTLTGNPDYE
AKPAYSFTVV ATDAAGHSTE QTVTLAIDNL DEVAPSITSG PTAATIAENS GPGQVVYTAA
ADDSADISGG VTFSLKADED AALFTIDAAT GKVTLTGNPD YEAKPAYSFT VVATDAAGHS
TEQTVTLAID NLDEVAPSIT SGPTAAAIEE NSGPGQVVYT AAADDSADIS GGVTFSLKAD
GDAALFTIDA ATGEVTLTGN PDYEAKPAYS FTVVATDAAG HSTEQTVTLA IDNLDEVAPS
ITSGPTADIA ENTGAGQVIY TAVADDAADI SGGVTWSLKA GSDAALTIDA VTGAVTLADN
PDHEAKSGYS FTVVATDAAG HSTEQAVELS VLDNNASAAI AVDLTTIAEG SEGTSTILTY
TVTRTSALNA SSVDWAISGV DAADLAAGQA AVGTVTFAIG ETSKTFTVEV VGDRTIESNE
DLVVTLGNPG NDIDLGTADS SATTIADDDG EVSIAATAVS VPEGDTGDSR VVTFTVTRTN
TLSASSVDWD VAGGTVNAAD FGGTLPSGTV TFAEGEATKT ISITVTGDRI IEPDETLTVR
LSNPGLNLVL GVDEASSTIV NDDVGFSIFG DVMDVVEGGI GEQRAITFHV VRSDSLTLPM
TIDYRLIPRG STVPDGFDFT GSPDSLGDNA GRPSGTISFG PDETSKTVTI YVAGDAVPEL
NETFSIVLAN APPSTIIING EIEGVIRSDE TQYSIHAVTA ATVEGNGTGG IQQFLITRTG
DTSQPGSVGY TLSEYGENPT EANDFAAGTP LTGTISFAAG ETSKILSVNL EGDSVLEGYE
SFQVALHTLD SNSIIGTNTA VASIIPDDAA INIAATDSIV KEGTGAVSRS HTFTLTRSSH
VDSEVTVDWH LAGTGANPVD AADFGGTLPS GSVTFAPGET VKTLTITPST DAAYEPHESY
EIVLSTSQLG VVLETDHASG MILNDDSGLT LVATNLDKAE GNPGTPSQLT FTVQRTGDTT
GESTVHWELV SADGSGVSAA DFASGILPSG DLTFSRGVTS RVVTIPLTTD NIIEPDKGFT
IRLSSPSEGT ELLVSEVGGY IRNDDAAFTL ESVSPVAEGH NGTTTVTFTV VRTGDISGAD
TVEYVVAPAD GGAVVDGADF VGGQLPDGLI TFNAGEASKT VTLAVAGDNA LESDEAFTIT
LVNPGVGSTI ASGSTDVVIL NDDDALSIVA TDADQAEAAA GGTRDFTFTV NRTGFLDRAT
TVNWSVAGVG ANQVDAADFG GALPSGTLEF AANESSKTIT ITVNGDYFQE ADEGFRVTLS
SPSDGTTLTT ASADGVIRND DTGLAITATT TTLAEGDSGT VTHVFTVTRT GVTTGTTTVD
WALAGSGGHP VDAADFGGTL PSGTLVFAPG ETTKTIEVQA SGDTDIEPGE GFTITLSGAD
GNADIMTASA NGTVVADDIS IAISAGTASV MEGATGSSRV LQFTVTRTGD LASPVSIDWS
ASGMDAADFA NGTALSGTIN FGAGETVKTI NLTQIGDNVS ESDETLTITL SNPAGNPAHD
RTYITSATAT TDVVNDDASL TITADAASQN EHNTGDGEAT SFTFTVTRTG DTSTETTIDW
VLQLPGGAGS AAGNDFVAGQ DLLGTNSGLP SGTISFAADE TSKTITVLVA TDNQVEQDET
FSIQLQGAGA NTEVSGNSAS AVISNDDTGF SIIALAADHT EANGGTVTYT FRVTRAGDIS
SAATVDWDVA GSGASPANAD DFGGSLPGGT LSFAENEASK EISFTVSGDT VVEQDEEFTV
TISNAQLTDA TPQLIQDATV GGIIRNDDQS FSVSAANASV TEGSAGTTQI AYTITRTGDL
SDSVTIDYAV TGAGGAATSD VQGGVLPTGT LTFAAGETSK SVTFDVIADT LAEGNETFTL
TLTNPSAGII GTASDSTVVV NDDTNFALSA PAPFAEGESG SATATFTVTR SGDSTGAGSV
QWSVAPATGL TTADFTGNQD LLGTNSGLPS GTITFAAGET SKNITIQVAG DLTLENDETL
RVILADPTGG TIEGTDGDKS TTILTDDDSF SISTLTASRA EGNSDSTITY TVTRTGSLVG
ARDLTWTITG ADGFATGNDL AGGQAATGTV SFADGQESAT IVVNVKGDSA VESDETMTVT
LSGAPANSVI GTASASTVLT NDDASVSIVT LIADKNEGNV TIVTPTGEVP GSTASTFTVS
AAGTVSGTVE SAGDRDWYKV NLVAGHQYQI DLIGNGSYTA GDVFLSLRNS TGIQLASNDD
FIGVNSRITY TAPSNGIYFI DAGHLGSGTG TYGVTIADLT VPGTDMSAPA YGVGAQPYTF
TITRTGDTTH GSTVEWRVAQ GVGVDAVDFG SVGSQDLLGD NSGLPSGTVT FAAGEISKTL
TVNIATDSAK ETDEILRVVL SNPSAGTEVI TASADGIVRN DDAELNITAG TFNLLEGDGL
HGTGKAMTYT VTRTGNINQT STVDWSVVHG TTSSADFTNG VGSNLTPSGT LTFASGVATQ
TIVVYVYGDT GVGSVEGDET FSIQLSNPNS GSALGNITSY TSTILEDDTR LVLSAADYSQ
AEKTAGNNTT YTYNIAREGY TGGTTNYSWA VGYTDPYTGN PAYMYDNTQS RYETVTANAS
DFTGSLSGSG SFSAGQTNAS FTVTVTGDDT PEDDEWFAVN LTASSGYDEV TVIYDDPTKG
TGTQLARTYY SPYRTYYDGQ QVSSATNGVA SNTNYLFSSI ERDEAVYYLS DREVASTSVQ
TLNPGDGLRT RVEGDTPADG GAGATTVTIE GVEYGYVEHI FAVQRQVATA GTASVGWRIG
TYYNAAVSAD DFLTITRDGN GDITAITTAG ALPSGTVTFA DGQEWAYIKF YTKVDDIGEY
DEYFSIFLEN PSAGSSIYTY DTVSYPQYNY GIITNDDTRF DASVNDVVEG GTLTYTVTRS
GDSRGTDTVD WSLALPGSEA TNESNNSTGT WYKLDPSDID SVTPSNGTAT YNAGTLTWSG
TLTFEDGETT KTITVVTTDD SWTETWREEL PIVLSNATNV NAGEGNHDQE TASTGYTDTA
RVYDNESDPL IGVSVGSSTT WEGTGANDSA TGNSVTFTIT RTDQGGRDGS LNYPTTVAWR
LDGSGINWGS ANNSAEILTY GGDAASVNEY TSNTTYGVVT FAAGETSKNV VVTFTGDRYV
ESDKTLTFTV LDPDDAEHGP LYTDFYGPAD INNAQASVTT TLKNDDIRLW VGGWDTYSGD
ANGYYTNVQT SAYEGNPLTF AVNRYGRLDC DIVVNYTLIN GTTTNGDFTT TSGSFTLAAQ
GSAYGEYTYS ISLADLLTDD TTVEANETFT LRLSAPGDSA GSSVRFQSYY ADYTSSYNSP
ATTLDVRGTV YDDDTTYTLT PASTSLVETD QGASQTFSFD VTRGGTGYTG AAQLRWRVEA
VGGTPADSAD FTSTDLLGTN NGLPSGTVSF ANGELTKTFS VLIRGDLVAE NNETFRVVLY
EDVLTSSSPT ITNSQSVASS TLTIVTDDTG ISIADSTLTE SDANQTMTFT ITRSGDTSGT
SSMNWTLYHG TTTAGDFSGA TTGTVSFAAG ETSKTISVTV AGDATPEADE TFTILLSNLV
GVDEAIDISA TGTIKNDDSS FAIAGDAASS PESGSQTFTI TRTNDTAQSQ TITWSVSAGS
AGAADFGGSL PSGSVTFAPG EMSKTITISP SSDATPETDE SYTVSIALGA GTTGDTITQA
TATGTIENDD AAIYIAADQT NQQEGHSGTT PFTFTVTRTG NTTGAASVDW ALSSAGASAA
DFTTADGLGS NGGLPSGTIT FADGESAKTI TIEIVGDEVV EADESFTITL SNAAGGAIIT
GSAGSTIAND DSTIAIAADS AVKNEGNSGT TAYTFTITRT GYLGEAETVE YSVAGSGAHP
ADGTDFNGTT GTLTLAAGEA TTTLTINVSG DLSGEPDEDF TVTLSNPSSG VTITTDTATG
SILADDIVFD VAAPASQTEG NPGDTTYFDF VVTRSGNLSG SQTLTWSVAG IGADGTSGSD
FDSTTGTVTF DPGETSKTIS VPVKGDYLGE ADENFRLTLT GPDGVVFTHN SADATIIDDE
ASLRISATDA GRAEGANGVT SYTFTVTRTG NTALEATVDW SLAAGATDPD DFAGGTLPSG
SLSFAAGELS KTITVDVAGD TAIEGDESFT VSLSNASTGA DIVIGSATGT IVSDDVEWTV
SPLSVPAVEG DGASSYVFRV TRTGSLSATT LDWSTAGSGT NPADADDFLG SFFPSGTLVF
AQGQTSQDIV VQIAGDNLLE ADKEFSVTLA APVNGLTHSY AEQTASATIV NDDDVISIAP
LSADHAEGTD SSSPFTFTVT RTGSLTGTST VGWRIVHGDT SADDFVATTG TVSFADGQDT
ATLTVLVSGD RNLEGDEGFS VELYNPGAGS TVDDTATTAS GIIHDDDVDL SLAAADANVA
EGDSSTAGHA TFTVTRSGDL SVETSVNWNV VAGTATAADF AGGELPGGTV VFGAGESSKT
ITIDLAGDGA WEGNETYTVQ LSGASDHADI VANNVSGQII DDDDTLTLSA VSADHAEGNS
GATIYTFRID RAGTATGATS VEWIAAGSGA HPTDQDDLLA TTGTVTFADG ETSKTFTVEV
AGDTTGEYDE TFSVSLANPA YGSTTVGAPV TATVRNDDAV LFVRADHVSV AEGADGVETT
FTFTVTRSGD TSGAASALWE VTGSGLRPAN AADFGGIFPS GAVAFQPGES TQQISLTVLG
DAVGEYDETF SLVLSDPEGA TILEGTAETI IANDDTGISI TALDADKAEG NNGLTDFTFR
IERVGLANGA ASVSWAVAGT GSYPAGADDF AGGILPSGTV YFADGESVKD ITIQVAGDET
YGQDQTFRVL LSNPAGANLI NADATGVIRN DDSQVAITAL DTAKLEGNAG TTTFSFQVTR
TGALDTSATI DWDVIGSGGH QTVAGDFAGN SFPGGALTFA VGESSKTITV EVAGDTLTEV
DEEFSVRLRN PGSGVSIAPN AGEASATILS DDDGVVLIGL DVDRHEGASG TQTVYTYQVL
RSGNIDAPIT LNYAVSGDVD SADFMSPLTG SFEMGAGENS RLLTLTVNGD DIVEPDEFFQ
VTLSGSGINI DSTPVTGAIR GDDVAGDGND VIHAAATADT INSGAGDDVI HLTMDNLLHL
QVTDGAHVDG GLGFDTILFD AAGQEFDLVA LVANDAMSGI EKIDLGGEGN TLRLTTAELL
HQDQNLFSIL ANGSEPFHQL MVDGDADDEV IIADITNWSH GAADTYTDGS VTYDVYTNGT
DHTQLLINQA ITNVHGVAG