Gene XfasM23_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXfasM23_2221 
Symbol 
ID6203337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXylella fastidiosa M23 
KingdomBacteria 
Replicon accessionNC_010577 
Strand
Start bp2517735 
End bp2528162 
Gene Length10428 bp 
Protein Length3475 aa 
Translation table11 
GC content65% 
IMG OID641703736 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001830888 
Protein GI182682728 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGG ACCTCTACCG CCTCATCTAC AACCGTGCCC TGCGTCTGTG GCAAGTGGCC 
TCAGAACTCG CCACCGCATC CGGCGGTACC CCAGGTCCTT CCCCGACAGC GCAACGGCCA
GCCCGTGCCT GCCTCCATCC CATCCCCTTT GCCCTCTGGC TCACCCTTGG CTGGGTGACC
ATCACCGGCA TTGCCACCGC CCAAGTGGTG GCTGACCCCC ACGCCCCAGG CCAACAACGC
CCGACCGTCC TTGCCGCCCC CAACGGCACC CCCCTGATCA ACATCCAGAC CCCCAGCCCA
GCGGGCGTCT CCCGCAACAC CTACCAACAA TTTGACATCA CCCCACAAGG CGCCATCCTC
AACAACGCCC GTACCCCGAC CCAGACCCAC CTGGCGGGCA CCGTCCAAGG CAACCCCTGG
CTGGCCGCCG GCACCGCCAA AATCATCCTC AACGAAGTCA ACAGCTCCAC TCCCAGCCAA
CTGCATGGCT CTATGGAAGT GGCTGGCGCC CGCGCCCAAC TCATCATTGC CAACCCCTCC
GGCATCACCT GCAACGGCTG CGGCGTCATC AACGCCCACC AACTTACCCT CACCACCGGC
ACCCCCATCT TTAACGCCCG TGGCGCCCTA GACCACTACC GCGTCCAAGG CGGCGCCATC
CAGATTGACG GCCTAGGCCT AGACAGCCGC AGCGCCGACT ACACCGCCCT CATTGCCCGC
ACCGTCCAAC TCAACGCTGG CCTCTGGGCC CACACCCTGC AGACCACCAC CGGCCCCGCC
ACCGTCGCCC TGGACGGACA CCCCACCGCC TCCCTGCCTG TCACCCCAGG CGACCGCCCC
ACCGTGGCCC TGGACGTCTC CGCCCTGGGC GGCATGTATG CCGGCAAAAT CACCCTGATT
GGCACCGAAC ACGGCCTGGG CGTCCGTAAC GCCGGCCAAC TTAGCGCCAC CAGCGCCCCC
CTCACCGTCA CCGTGGATGG TCTTCTGGAG AACACCGGCC GCCTTCAATC CGCCACCGAC
ACCCAACTCA ACGCCACCGC AGAAGTGAAC AACAGCGGCC TCATCAGCGC CGCCCAGACC
CTGACCCTGC ACACCCCCAC CACCATTGAC AACCGCAGCG GCACCCTCAA CGCCGCCCGC
CTGGACATCA CCGGCGCCCG CCTGGATAAC CGCGGCGGCC ACATCCAACA AACCGGCCTC
CAACCCCTCA CCCTCCAGAC CCAACACCTG GACAACCAGG ACCAGGGCCG CCTTGGCGTC
CTAGACACCC CGGCGCCCGC CTCCCCGGCC ACCCCGACAG TCACGGCCCC GATTAGCAAC
GCCCCTCCCA CCGTCACCGC CCCCCCGGCC ACCGACCCCA CCACCTCCCC CGTTGCCCCC
ACCGTCCCCC ACCTGGCCCA TGGCACCCTT ACCCTCACAC AGACCATTGA CAACCGCGGC
GGCCACATCA CCGCCGGCGG TGCCATCGAC GCCATCCTCA CCGACCTAGA CAACCGCGAC
GGCACCGCCG CCCTCAACCG CCTCACCCTC CAAGGCCAAC GCCTGGATAA TCAACACGGC
ATCCTCACCC TCGCCACCGA TGCCACCATT CACACCCACA CCCTCAACAA CGCCGCCGGC
CAACTTCACG CCAACGGCAC CCTGGACCTC ACCGCCGACA CCTTCAGCAA CCAAAACGGC
CAACTCCTCC ACACCGGCTC CCAGAACGCC ACCCTCACCA TCACTGACCT CCTGGACAAC
CAACACGGCA TCATTGCTAG CGCCGCGAAC CTCCTGACCC TAAAGACCGA CCACCTCAAC
AACGCTGCTG GCCAACTCCA CGCCAACGGC GCCCTGGACC TCACCGCCCA ACGTTTCAGC
AACCAAAACG GCCAACTCCT CCACACCGGC TCCCAGAACG CCACCCTCAC CATTGCCAAC
CTCCTGGACA ACCAACACGG CCTAGTAGCC AGCGCGGCTA ACGCCCTGAC CCTGCACACC
GGCCACCTCA ACAACGACGC CGGTCAATTC CAGACTAACG GCGCTCTTGA CCTGACCGCC
CAACGTTTCA GCAACCAACA CGGCCAATTC CTGCACAACA GCCCGCAAAG CGCCCACCTG
CGGATTGATG GCCAGCTGGA CAACCAACAA GGCGTACTCG CCAGCAACGC CGCCGAACTG
ACACTAGAGA CTGGCCAATT CAACAACGAC AGCGGCACCC TCCAACAGAG CGGGCAGGGC
ACCCTGCACA TTGACGCCGC CACCCTGACC GGCCACGGCG GCACCCTGAC CAGCCAAGGC
GCCCTCACCC TCACCGGCAC CCACACCGAC CTCAGCCACG CCACCACCAC CGCCCAACAC
ATCACCATCC ACACCGACGA CCTCACTACC GCCGGCGGCC ACCTCACCGC CTACGGAGAA
CACACCCTCC AACTGAATGC CCGTACCCGC ATCGACAACA CCGCCGGCAC CATTGCCACC
AACGGCAGCC TAGACCTGCA CACCGCCGCC CTGGATAACA CCGGCGGCAC CCTCCACAGC
ACCGCCACCG GCCCCAACCG CCTAGACATC ACCCACACCC TCACCAACAC CGCCGGCCAC
CTCCTCCTTA ACGGCCCCAC CACCCTCACC ACCGGCACCT GGACCAACAC GGGCGGCCAA
CTCCAGATCA CCGGCCCGGC CACCCTCCAC GCCACCACCC TAGACAACCG CGGCGGCATC
CTCCACACCG CCACCGGCCC CCTGGACCTG CGCGTCACCG GCACCATTAA CAACCAAGAC
AACGGCATCC TCTCCAGCAC CGCCGCCCTC ACCCTCACCG CCGCCAGCCT CCACAACCAA
CACGGCACCC TGGATGCCGC CGGCCCCGCC CACCTCACCC TTACCGGCCT ACTGGATAAC
ACCGCCGGCC TCCTCCAAAC CGCCCACACC CTCTGGCTCA CCAGCGCCGG CCTCACCAAC
CGCAGCGGCA CCCTCACCGC CGCCGCCCTC ACCCTAGACA CCCAAGCCCA CACCCTGGAC
AACACCAGCG GCCGCCTCGG CACCACCACC GGCAACCTCA CCCTTCACAC CGGCCTACTG
GACAACACCG CCGGCCTCCT CCAAACCGCC GCCACCCTCA CCATCGACAC CGGCGCCGCC
CCCCTGACCA ACCGCGACGG CGGCACCCTC CTGGCTGCTG ACACCCTGGA CCTACACACC
ACCACCCTGG ACAACCGCGG CGGCACCATC GACTCCCAGA CCGCCACCCA CCTGCACACC
ACCACCATTG ACAACACCAC CGCCGGACAC ATCAGCAGCA ACGGCACCCT CCAGATTGAC
GGCACCACCC TCACCAACAC CGGCGGCCGC CTCCACAGCG GCGGCGACAC CCGCCTCCAC
CTCCAAGACA CCCTGAACAA CCACGACGGC CGCATCACCG CCGCCGGCAC CCTGGACATC
ACCACCACCA CCCTGGACAA CCACAGCACC CCCCTTACCG CGCCCCCGGC CACCCAGACC
CGCGCCCCCA CCGGCGCCCC AGACAACGGC CTCTACGCCA CCCACATCCA AATCGCCAGC
ACCACCCTGG ACAACACCGC CGGCACCCTC AGCGCCGCTC AAAACCTCAC CCTCACCCTG
AGCGACACCC TCACCAACAC CGCCGGCCAC CTCAGCGCCG GCGCCACCCT GGACCTCACT
GCAGACCACC TGAGCAACCA CACCGGCACC CTCCTCAGCG GCGCCAGCCA AACCCTGCAC
CTCCACCGCC TCACCGGCGA CGGCCGCCTC CATGCCGGCA ACGCCCTCAC CCTGACCCTC
CAAGACAGCC TCGACACCGC CGGCACCCTC AGCGCCACCG GCCTGCTCAC CCTCACCACC
GCTGGCGACC TCACCAACCG CGGCCTCATC CAAGCCGCCG ACCTCACCGC CCAGGCCCGT
GACATCACCA CCACCGCCAC CGGCCAACTC CTGACCACCG GCCACACCCA CCTCACCGCC
ACCGGCACCC TGAACAACAG CGGCCACCTC CAAGCTGCCG ACCTCACCGC CCAGGCCCAT
GACATCACCA CCACCGCCAC CGGCCAACTC CTGACCACCG GCCACACCCA CCTCACCGCC
ACCGGCACCC TGAACAACAG CGGCCACCTC CAAGCCGCCG ACCTCACCGC CCAAGCCAAC
ACCATCACCA ACACCGGCAC CTTCCTGGCT ACGAGCCACG CCACCCTCAC CGCCACCGAC
ACCCTGACCA ACAGCGGCTT GCTCCAAGCC GCCGACCTCA CCGCCCAAGC CAACACCATT
ACCAACACCG CCACCGGACG GCTCCTGACC ACCGCCCACA CCCAGCTCAC CGCCACCGAC
ACCCTCACCA ACAGCGGCCT TGTCCACGCC GGTGACCTGA CCGTCCACGC CCGTGACATC
ACCAACACCG CCACCGGCCA ACTTATCGCC AGCAACCTCG CCCAACTCAC CGCCACCGCC
ACCCTCACCA ACCGCGGCCT CATCGACGCC TTCACCACCC ACCTCAGCGC CCCCACCATT
GACAACCTCG GCACCGGCCG CCTCTACGGC GACCACATTG CCCTCCAAGC CCACACCCTC
ACCAACCGCG ATGAAACCAG CGACGGCCAC ACCCACACCG CCACCATCGC CGCCCGTGAG
CGCCTAGACA TTGGCGCGGA CACCCTGCGT AACACCGCTA ACGCCATGAT TCTCAGTGAT
GGAGATGCCG CCATCGGCGC CACCCTGGAC AACACCCTCC ACGCCACCGG CATCGCCACC
CTCATCGACA ACCGCAGCGC CACCATCGAC ATCACCGGCA CCCTGAACAT CACCACCACC
ACCCTCAACA ACATCCGCGA AAACGTCCAC ATCGCCCACG CCCCCGACGT CGTGACCGAA
GCCCGCATGT ACCAACCCCA CTGGCGCAAA AACAAACCCA ACGGCGGCTC AGGCGACTTC
CGACTGAGCA GCAACTACGA CGCCCACGAC ATCTACTACC TCAACCCCGC CGACATCCTC
GAAGACACCC CCTACATCAC CCCCGATGGC CAAAAGATCC ACCGCGCCAT CGTCCGCCTC
ACCCCCCAGA CCAGCGCCTA CTTCTATGCC CGTGGTGGCC TCTATGCCAG CCAAGCTGAA
CGCCGCCGCC TGGACCTCAC CGCCCGCACC GGCGACAGCC TCGTCCTGTA CTACACCGAC
CGTCAGGACA AACAACCCAA CCCCGACCAC GTCGCCGCCG CCGCAACCAA TGACAGCGCC
TTCATCGGCC TGGACACCCC CCAGCAAAAC GAACGCTTAA AAATCGTCCC CATCACCTAC
GCCCCCGGCG ATGACCGCCT CACCTACGAC CCCACCTACG GCACCTGCAC TGACGACTGC
GTCCGCCTCG TCACCTGGCA CGACTACACC GACCCAGACC ACACCCTCAT TGACATGCGC
CGCGGCCCCA ACGATGTCGA CGACAACGAG CGCGAACGTC ATGCCACCAG AACCACCCAA
CAAGAAATCC TCAACCCCGA TGCCGGCGCC CCAGCCCTCA TCCAGTCTGG CGGCACCATG
CGTATTGACG TCGGCTACCT CTACAACCAC TACGCCGACC TGCTGGCCGG CGGCGACCAA
ACCATCGTTG GCCTGCCCCC CCATCCGACC AAAGAAACAG CAGATGACGA ACACAAGTAC
AACAGAGCCC TGCTGATCGA CAACCGCGCC CTCCAACTCT CCCGTACCGA CAGATTCCAA
AACATCAGCA CCACCTACCG TGGCAAAGAC TCCGCACCAT GGAGCAATGA ATCCCGGACC
ACCCCCACCA CACAGATTGG CGGCCGCATC ACCAGCGGCG GCCACCAACA CATTGCCGCC
CAAACATTCA ACAACGTCAC CGACTCCACC CACGCCCCTG AGCCCATCCA ACATGTCACC
TACAACCCCA GCACCCAAAC CCTGACCATT GCTGACGGCC ACATCACCGT CACCGACACC
CCCCCCAGCC TCCACACCGT CTCCCTTGCC GACAACGGCT TCAGCCACGG CCAAGAACTC
ACCTACATCC CTGAAAAGAG CATCACCACC CCCAACGCCC CCATCCGCGA CCCCGCCGCC
CCCCCCGCCG TCACCGTCAC CCCCACCGGC CCCCTCACCC TGCCCAACAA CAGCCTCTTC
ACCATTCACC CCGACACCGC CACCCTCATC ACCACCGACC CCCGCTTTAC CCTCGGCCGC
CCCTACACCA GCGCCGACAG CCAACTCCAC GCCCTGGGCG ACCACGACAC CCTCCACAAA
CGCCTCGGCG ACGGCTACTA CGAACAACGC CTCATCCGCG AACAAATCGC CCAACTCACC
GGCCGCCGCC GCCTGGACGG CTACACCGAC GACGACCACC AATACCGCGC CCTCCTGGAC
GCCGGCCTCA CCGTCGCCAA ACAGCACCAA CTGCGCCCCG GCATTGCCCT CAGTGCCGAC
CAAATGGCCC AACTCACCAG CGACATCGTC TGGCTCGTCC AACAAGACGT CCACCTGCCC
GACGGCACCA CCACCGTCGC CCTCGTCCCC CGCCTCTACC TGCGCCCCCG CACCGGCGAC
CTCACCCCAG ACGGCGCCCT CCTGGCGGCC GCCAGCACCA CCATCAACGC CCACACCCTC
ACCAACACCG GCACCATTGA CGCCCGCGAC CTCATCAACA TCAACACCCA CATCATGGAC
CAACAAGGCG GCCGCCTTAC CGCCGACGCC ATCAACATCC ACACCACCGG CGACTTCACC
AACCTGGGCG GACAATTCAC CGCCGGCGAC TTCCTCAAAG TCCATGCCCA AGGCAACTTC
CTTGCTAGCA GCACCCTCCG CGACGCCACC ACCCAAGGCA CCCGCCACCA CAGCGTGACA
GAACTGGACC AACAGGCCGG CTTCACCGTC ACCGGCCCCG GCGCCTACCT TGGCTTGAGC
ACCGACCAAG CCATGACCCA ACAAGCCGTT GCCATCAGCA ACACCGGCCT TGACGGCTAC
ACCTCCCTCA AAGCCACCGG CCGCCTACAC CTAGGCACCC TCAACACCCA CCGCAGCGAC
ACCACCCAGT GGGACCCCCG CAACAGCCGC CACACCCGCA TCGATACCGA ACACGGCACC
AGCATCCGCA GCGCCGGCGA CATCCAACTC AACAGCGGCC AAGACATCAA CCTGCGTGCC
GTCACCCTCC ACAGCACCCA AGGCACTGTC AGCGCCCTGG CCACCGGCAA CGTCACCATT
ACGCACGGGG ACACCCTCCA ATACACCAGC CAAGATAACC ACAGCAAACG CAGCGGCCTC
CTCAACAGCC GCACCACCAC CACCCACGCC GACCAACAAC AGACCCAGGC CATGAGCAGC
ACCCTCAGCG GCACTAAAGT CCTCGTCAAA GGCAACAACA TCACCGTCAC CGGCAGCCAC
CTCCTTTCCG ATGCCGGCAC CTACATGCAG GCCAAAGGCG ACCTCACCCT CCAAGCCGCC
ACCAACACCA CCCAATCCAC CTACTCAGAA CACACCAAAC AACGCGGCCT CATCCGCAAC
GGCGGCGCCT CCCTCACCCT GGGCAACCAA AGCCAGCGCA CCGACAGCAC CACCACAGCC
ACCACCACCA CCGGCTCCCT CATCGGCGCC ACCAACGGCA ATGTGACCCT GCTGGCGGGC
GGCCACTACC AACAGATCGG CAGCGACGTC CTGTCCCCTA ACGGCGACAT CGACATCCAC
GCCAAAAAAG TCGACATCAT CCAAGCCCAC CACACCAGCC ACACCACCCA ACACACCGCC
ACCCGCCAAA GCGGCCTCAC CGTCGGCCTC AGCACCCCCC TGATTGCCGG TGCCCAGACC
GCCCAGCAAA TGCAACACGC CGCCGCTCGC AGCGGCGACC CCCGCCTCCA CGCCCTGGCC
GGCCTCACCA CCGCCCTGGG CGCCAAAAAC ACCATTGATG CCGTGCGCCA AGACCCCCGC
GCCCTGGGCG GCCTCAACGC CTCCCTCACC CTTGGCCGCA GCACACACGA CAGCACCACC
ACGACCACCA CCACCACCGC CGCAGGCTCC AACGTCAACG CCGGCGGTAA CGTCCGCATC
AGCGCCACCG GTGACGGCGA AGCCTCCACC CTCACCATCC AAGGCAGCCA CGTCCGCGGC
GACAACATGA CTTACCTCAA AGCCGATGGC GACATCGCCC TACTGGCCGC CGCCAACACC
ACCACCAGCG ACCGTCAAAG CCGCGGCCGC AGCGCAGGTG TCGGCGTGGC CGTGAACCTA
GGCTCCAGCG GCACCAGTGC CGGCCTCACC GCCCACGCCA GCACCTCCAC AGGCAGCGGC
CAGTCTACCG ACCTCACCTG GACCAACAGC CACGTCGGAG GCGGCAACCT ACTGACCATT
GAAGCCGGCG GCGACCTCCT CATGAAAGGC GCCATTGGCA CCGCCAAACA CGTCATTGCC
GACATTGCTG GCAACCTCAC CATCCAAAGC CTCCAAGACA CCCACCACTA CCGCAGCAAA
GACCGCAGCC TTGGCGGCAG CCTCACCGTC GGCGCAGGCG TCAGCGGCAG CGCCAACCTC
AACAACCAAA CCATCCGCAG CGACTACGCC AGCGTCACCG AACAAAGCGG CCTGTTTACT
GGCGATGGCG GCTATGACAT CACCGTTGGC GGCCAGACCC ACCTTATCGG CGGCGCCATC
ACCTCCAACA GCACCGCCAT CCACAACGGC CTGAACACCC TGGACACCGG CACCCTGATC
CTGCAAGACA TTGAAAACCG CGCCACCTAC ACCGCCACCC AAGTCAACCT GGGCGGCGGC
TACAGCCGCA ACGGCGGCAC CGTCGGCACC GACCAACAAG GCCACGCCGC CACCGCCACC
CAGGTTCCTG GCACCACCTT ACCCACCCAC AACGGCCTCA GCGCTGCCCC TCCTGGCGCC
ATGACCGCCA GCGACAGCAG CCACAGCACC ACCTACAGCG GCATCAGCCA AGGCGCCCTC
ACCATCCGCG ACCCCGCCGC CCAACACGCC CTCACCGGCC ACACCGCCGC CCAGACCATC
GCCGGCCTCA ACCGCGACAT CCTCACCGAC ACCGCCACCA GCAACGCCCT CACCCCCATC
TTTGACGAAC AACGCATCAA CGCCGCCTTT GACATCGTCA CCGCCCTACA ACGGGAAACC
GGCACCTTCA TCAACAACCG TGCTGCAGAA GCCACCCAAG CCCAACAGGC CCTGCAGGCC
GAACACGCCA AACCAGCAGA CCAACGCGAC CCGGCCCACA TCGCCGCCCT ACAACAACGC
ATCCAAAACA CCACCACCTG GGAACTGGGC GGCACCGGCC ACACCATCGT GACCGCCCTG
ACCCTGGCCG CCGGCCAACA AGTGACCGGC CCGGCCACCC AAATGCTGCA AAACGCCGCA
GTCAACTACA TCCAAAGCCT GGGCGCCCGC GAAATCAAAG ACCTTGCCGA CACCCTGGGC
AGCGACACCG CCCGCAGCGC CCTGCAAGGT CTCCTGGGTT GTGCCGGTGC CGCCGCCCAA
GGCCAGGCTT GTGGTGCTGG CGCGGTAGGT GGTGCCGCCG CCGTTGTCAT CAACAGCCTT
CTAGACCGCG CCAACGGTGC TGAAGCGGCC AGCCTCAGCG CTGAAGAAAA ACAACACCGT
ACCGACCTAG TCACCAGCCT GGTGGCCGGC ATCACCACCG CCGCCGGAGG CGATGCCGCC
GTCAGCAGCG CCGCCGCCCG CCTGGAAACC GAAAACAACG CCGCCTTTAT CCCAGTGATT
CTTGGCGCCG TCTGGCTAGC CGATAAAGGC ATCACCGCCT ATCAAGCTTG GCAAGACATC
AAAGCGATTC GTTCCGGAGA AAAAACCCTA GAACAAGTGG CCCTAGAAAG AGGGCAGGAC
TACGTCACCT CCATAGTGAT TGGCAACCTC GCTAAATACG GCCTTAAAGC GGCAATGATC
GGCGGCCGCT GGATTTCTGG CACCGCAAAA GAGATTGCCA ATGCAGAAAA AGAAGCGCTT
AGGCAAATAA GAAATAACCC CAAAGGCCCT GATTTAACCC AAAAGCCGCC TGGTCAAATC
ATGGCGCTGC AGCGGCAAAA GCGCCTGGAT GATGTGAAAA GCGTGATTGG CAGGCGTAGT
CAAAAGGATA CATTGGTCGT GGGAGGGATT GAGGTCAAAG CTGTACCCTA TGACCGCAAC
GTTCCAGGGG GAAGTAATAA AAGTGGTACA ACCAAGGTAT TTGATTCACA TGCATTAACC
GATGCGCAGA TTAAAGACTA TGCACAGCAA TTAACTGGAG GTGTGCCTTT AAAGCAGACG
AGTAGGCCGG GGGTTTATAC GGCTAAATTG AGTGATGGGA GCACTGTGAC ATTGAGGTCG
GTTTCTAAAT CAAATCAAGA GACTCAAGCG AGGTGGACTA TTGATATAAA GGATAATCCC
GCTTTAAGTG AGATTACAAA TAAAACAGTT GAGCTTAAAT TTAGGTGA
 
Protein sequence
MNKDLYRLIY NRALRLWQVA SELATASGGT PGPSPTAQRP ARACLHPIPF ALWLTLGWVT 
ITGIATAQVV ADPHAPGQQR PTVLAAPNGT PLINIQTPSP AGVSRNTYQQ FDITPQGAIL
NNARTPTQTH LAGTVQGNPW LAAGTAKIIL NEVNSSTPSQ LHGSMEVAGA RAQLIIANPS
GITCNGCGVI NAHQLTLTTG TPIFNARGAL DHYRVQGGAI QIDGLGLDSR SADYTALIAR
TVQLNAGLWA HTLQTTTGPA TVALDGHPTA SLPVTPGDRP TVALDVSALG GMYAGKITLI
GTEHGLGVRN AGQLSATSAP LTVTVDGLLE NTGRLQSATD TQLNATAEVN NSGLISAAQT
LTLHTPTTID NRSGTLNAAR LDITGARLDN RGGHIQQTGL QPLTLQTQHL DNQDQGRLGV
LDTPAPASPA TPTVTAPISN APPTVTAPPA TDPTTSPVAP TVPHLAHGTL TLTQTIDNRG
GHITAGGAID AILTDLDNRD GTAALNRLTL QGQRLDNQHG ILTLATDATI HTHTLNNAAG
QLHANGTLDL TADTFSNQNG QLLHTGSQNA TLTITDLLDN QHGIIASAAN LLTLKTDHLN
NAAGQLHANG ALDLTAQRFS NQNGQLLHTG SQNATLTIAN LLDNQHGLVA SAANALTLHT
GHLNNDAGQF QTNGALDLTA QRFSNQHGQF LHNSPQSAHL RIDGQLDNQQ GVLASNAAEL
TLETGQFNND SGTLQQSGQG TLHIDAATLT GHGGTLTSQG ALTLTGTHTD LSHATTTAQH
ITIHTDDLTT AGGHLTAYGE HTLQLNARTR IDNTAGTIAT NGSLDLHTAA LDNTGGTLHS
TATGPNRLDI THTLTNTAGH LLLNGPTTLT TGTWTNTGGQ LQITGPATLH ATTLDNRGGI
LHTATGPLDL RVTGTINNQD NGILSSTAAL TLTAASLHNQ HGTLDAAGPA HLTLTGLLDN
TAGLLQTAHT LWLTSAGLTN RSGTLTAAAL TLDTQAHTLD NTSGRLGTTT GNLTLHTGLL
DNTAGLLQTA ATLTIDTGAA PLTNRDGGTL LAADTLDLHT TTLDNRGGTI DSQTATHLHT
TTIDNTTAGH ISSNGTLQID GTTLTNTGGR LHSGGDTRLH LQDTLNNHDG RITAAGTLDI
TTTTLDNHST PLTAPPATQT RAPTGAPDNG LYATHIQIAS TTLDNTAGTL SAAQNLTLTL
SDTLTNTAGH LSAGATLDLT ADHLSNHTGT LLSGASQTLH LHRLTGDGRL HAGNALTLTL
QDSLDTAGTL SATGLLTLTT AGDLTNRGLI QAADLTAQAR DITTTATGQL LTTGHTHLTA
TGTLNNSGHL QAADLTAQAH DITTTATGQL LTTGHTHLTA TGTLNNSGHL QAADLTAQAN
TITNTGTFLA TSHATLTATD TLTNSGLLQA ADLTAQANTI TNTATGRLLT TAHTQLTATD
TLTNSGLVHA GDLTVHARDI TNTATGQLIA SNLAQLTATA TLTNRGLIDA FTTHLSAPTI
DNLGTGRLYG DHIALQAHTL TNRDETSDGH THTATIAARE RLDIGADTLR NTANAMILSD
GDAAIGATLD NTLHATGIAT LIDNRSATID ITGTLNITTT TLNNIRENVH IAHAPDVVTE
ARMYQPHWRK NKPNGGSGDF RLSSNYDAHD IYYLNPADIL EDTPYITPDG QKIHRAIVRL
TPQTSAYFYA RGGLYASQAE RRRLDLTART GDSLVLYYTD RQDKQPNPDH VAAAATNDSA
FIGLDTPQQN ERLKIVPITY APGDDRLTYD PTYGTCTDDC VRLVTWHDYT DPDHTLIDMR
RGPNDVDDNE RERHATRTTQ QEILNPDAGA PALIQSGGTM RIDVGYLYNH YADLLAGGDQ
TIVGLPPHPT KETADDEHKY NRALLIDNRA LQLSRTDRFQ NISTTYRGKD SAPWSNESRT
TPTTQIGGRI TSGGHQHIAA QTFNNVTDST HAPEPIQHVT YNPSTQTLTI ADGHITVTDT
PPSLHTVSLA DNGFSHGQEL TYIPEKSITT PNAPIRDPAA PPAVTVTPTG PLTLPNNSLF
TIHPDTATLI TTDPRFTLGR PYTSADSQLH ALGDHDTLHK RLGDGYYEQR LIREQIAQLT
GRRRLDGYTD DDHQYRALLD AGLTVAKQHQ LRPGIALSAD QMAQLTSDIV WLVQQDVHLP
DGTTTVALVP RLYLRPRTGD LTPDGALLAA ASTTINAHTL TNTGTIDARD LININTHIMD
QQGGRLTADA INIHTTGDFT NLGGQFTAGD FLKVHAQGNF LASSTLRDAT TQGTRHHSVT
ELDQQAGFTV TGPGAYLGLS TDQAMTQQAV AISNTGLDGY TSLKATGRLH LGTLNTHRSD
TTQWDPRNSR HTRIDTEHGT SIRSAGDIQL NSGQDINLRA VTLHSTQGTV SALATGNVTI
THGDTLQYTS QDNHSKRSGL LNSRTTTTHA DQQQTQAMSS TLSGTKVLVK GNNITVTGSH
LLSDAGTYMQ AKGDLTLQAA TNTTQSTYSE HTKQRGLIRN GGASLTLGNQ SQRTDSTTTA
TTTTGSLIGA TNGNVTLLAG GHYQQIGSDV LSPNGDIDIH AKKVDIIQAH HTSHTTQHTA
TRQSGLTVGL STPLIAGAQT AQQMQHAAAR SGDPRLHALA GLTTALGAKN TIDAVRQDPR
ALGGLNASLT LGRSTHDSTT TTTTTTAAGS NVNAGGNVRI SATGDGEAST LTIQGSHVRG
DNMTYLKADG DIALLAAANT TTSDRQSRGR SAGVGVAVNL GSSGTSAGLT AHASTSTGSG
QSTDLTWTNS HVGGGNLLTI EAGGDLLMKG AIGTAKHVIA DIAGNLTIQS LQDTHHYRSK
DRSLGGSLTV GAGVSGSANL NNQTIRSDYA SVTEQSGLFT GDGGYDITVG GQTHLIGGAI
TSNSTAIHNG LNTLDTGTLI LQDIENRATY TATQVNLGGG YSRNGGTVGT DQQGHAATAT
QVPGTTLPTH NGLSAAPPGA MTASDSSHST TYSGISQGAL TIRDPAAQHA LTGHTAAQTI
AGLNRDILTD TATSNALTPI FDEQRINAAF DIVTALQRET GTFINNRAAE ATQAQQALQA
EHAKPADQRD PAHIAALQQR IQNTTTWELG GTGHTIVTAL TLAAGQQVTG PATQMLQNAA
VNYIQSLGAR EIKDLADTLG SDTARSALQG LLGCAGAAAQ GQACGAGAVG GAAAVVINSL
LDRANGAEAA SLSAEEKQHR TDLVTSLVAG ITTAAGGDAA VSSAAARLET ENNAAFIPVI
LGAVWLADKG ITAYQAWQDI KAIRSGEKTL EQVALERGQD YVTSIVIGNL AKYGLKAAMI
GGRWISGTAK EIANAEKEAL RQIRNNPKGP DLTQKPPGQI MALQRQKRLD DVKSVIGRRS
QKDTLVVGGI EVKAVPYDRN VPGGSNKSGT TKVFDSHALT DAQIKDYAQQ LTGGVPLKQT
SRPGVYTAKL SDGSTVTLRS VSKSNQETQA RWTIDIKDNP ALSEITNKTV ELKFR