Gene YpAngola_A0145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0145 
Symbol 
ID5798609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp144158 
End bp153472 
Gene Length9315 bp 
Protein Length3104 aa 
Translation table11 
GC content37% 
IMG OID641338168 
Productputative phage minor structural protein 
Protein accessionYP_001604775 
Protein GI162418635 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID[TIGR01665] phage minor structural protein, N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.520123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000741368 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCTGGGT TTTATATAGA TAAGCTGTCA CTCTCTCAAC GGCTTTCTAT TGTTTCTGAA 
ACATATGATC GGGTTAATAA AAACAATAAA AAAGAAAAAT TAAAATACTC TTATGATGAT
ATTGAAATGA TCAAGAAAAG ATTTGTTAAA TATATCGACG CGCAACTTTA CAGTTTAATT
AGAGAAGGGT TATCTGTACC GGCCACTTTA ACGCAAGAAG AAAAAATAAA AATTACTGAC
CTGGCAATTG ATGCGGCTCT TTTCAATGAC TATGAAAGAT TCAATGAGTT AATTATATAT
ATTAGTTCAT TAGGTATATC TGTGACACCT CCACTACCAC AAGAAGAGGG AGGGAGCAGA
TTGTATATCT ACTTTAGTGG TGACATTCAT GCTTATATGG ATGTGTGGCG AGGTGATTTA
TTATTAGGGA GTGGAACTGA ACTTTCTGAT ATACAAAGTA TCACTGGTCT GCGATTTATG
ATTGATATGG CAGAATCACT GAAACTTAAT ATTATTAATC CAGCGGATAA GGCGATGGTG
GATTTAATCA ATCACCTTCG GTATGAAATG ATTTCATATG CCAGTTCATT TTGTGCAACC
TACAGTGCTG AGCGTGGTGG GACGGTTTAC TTATCATCAC CAGATGGCCT ACGGATAAAT
AACTATTTTT GGAACAGTGA GCTGCCTGTA CTGCGTGCAT TGCAGAAGAA AGGATTGATA
GGCGATATCC GTATTCTTCA TAAACCTTTA GAGTTTTATA AAGATACACC ACTGGATGAG
CTGGGGGATT TGCTGACAGC AAAAGATATT TCAATGACTG TGGAGTATCA GTTTCTTCCT
GTTTGGTTGC AAGAAAAATT ATTGGTCGAC ATCTATCAAC AATGGCTAGA TGAGGAGTTT
CAGCCCAGTT TATTGGCTGT TAGAAAAGAG ATTATTAATA CAATCGATAT TGATAGGCAT
GCACCTGAAG TAGAGTTGCT ACGCTATTTT CTTAGAAAAA TACATGAGCA GTTAGATGAA
ATCACAGAGT TTAAAGTACT AAAAGAAGCG GAGCGTATAG ATTCTATCAA GAAAAAATTA
GCCGTGGGCA GTGAAATAGA AAGCTGGTTA GATAATGTAC CCGCAATAGA TGTTAATGAA
AGAAAAATTA TTCTTGAGAG TCTATTACAA AAAGAAAGTC TGCTTTTTTC TAATGTTCGT
GATATTAAAA AATCCATTAT TCCTCTGGAT TTTAATAGTG ATGTGATTAA TATTAATACA
TCACTATTAA AAAACACTTT CATTCCTTTT AATTTGTTAC GGGAAAGTTG GGATGTCATA
ATCAGTGATC GTAGTTTAGT GGAAGGAACA CTTACCATAC ATTTTTCTGT TGGTAGAAAA
ATAATGATTA AGGTGGATGC TAACCGTAAT CAGTTAAAAC AGATGGCTAC TCTTGAACGT
TTTTTATTGG CCAATTTCAC CCCAAGAAAT ACACCTCAAG ATTTAAAAAT AATTGATAAT
GTTATTATGA GTGGTGATGT AGTATTAGCT GAGAGGAGAG ATGATAAAGG TTGGCATAAT
GATCAGCAGA TGATAGAGAA AGTTAAATTA AGTGAGTTTG ACTATTTTCT AAAAAGTAAT
GATTTGGGGG TAAAGATAAA TGACAATGGA TTTGTCCTCT ATTTAATCTC AGATCCTGAG
GATAATCGAG ATGTAATTAT AAAACCTAAT GATGATTATA ATTTAAAATC GATTAAAGAC
TTTATCGAAA ATAACTATTT ATTTTTTGAT GATGTGCCAG AGTACCTTAT TGTAAAGAAA
AATGTAGAAA ATAAAGAATG TATATTTGCC CATGATGAAG GGGAGACGTA CCAAGTTGCT
TATAGAGATG GAGAAGCATG GGTTCTACTT TCTAAAATGA ATACTTCAGA TCAAATTAAA
AATCTAAACG AAATAACCAT GTCCGTAAAT CTAAATAATG CAGAATCAAG GTCTGTAGCA
TTAATATTAT CTTTATTAAA CAAACACGCT AGATTAGTAT CGATACTTCC TGATACTCAT
CCTAAAGTCA TGGAGAACTT TTTAGATATA GACTCTCTAT TAAAAAACAG CACGCATCCA
TTTGAGCACC CTCTCTATAG GAAGTTATTA AATAGCATAT CAAATGATAT AAATAAATAT
ATGGAGTCAT TAAATGAGAT AAAGGATTCA TACCATTTGT TACCATTCGA TGTTCGGCCT
GGGCAATATA TGAATACTTG GAGTAAAATT GATAGAGACA CTGTTATTGA ATATTCCTTT
AAACAAAATG ATAAAAATCA CCACCCACAA TTTATTGTAT TATTACAAGA TGACTCTCTT
TCTAAGAGAG TCGGAGAGAT TATTGCTTCG TATAATAATA ATAAATCAAT AGTATTACAG
TTCGATGCCA GGAGTTCTGA AGCACGAATA GCCTACGGTA CTCAAAATAA TATAACTGAA
ATGGGGAAAT TTGAATTATC ATTTGTAACT CATGGTACTA CAGATGGAGT ATATAGTTTC
AGTGTAGCTA ATGTTATTGA AATTTATAAA CTTACAATTA ATTCATTCGC ATTGCCTCCT
CCGGTAAAAA TTAGATTAGT AATATGCAGT ATTGCTGATA ATGGACAGGG AGCACAGGGA
TTTAATGGTA CTAATCCAGC CCTGGGAATT GTCAATATGA TGCACCAGGA AGGATTTGAT
ATTCCTATAT TGGCCTATAC AACAAAAGTG GGGGTTTCAG TAGAACACCC CGGTGAGTTA
GTGGTATTCA ATTCAGAAAA ACCGGGAGAA GTACTGAAAA ATATAGATGA CTATCAGGTG
TTATATCATT ATAAAAATAA TATATTACTT ACCGATGGTG TCCCGGTAGT TGAACTGTTA
CTCAAGGATG TAAGGAATAA AATAAAGTCT GTTGATCAAC TTATTGAATC TTATTCACAA
TATCTAGTGC CTTTTTTCTC TGATGATAAT GGTGTTGTTG ATCGTGAATT ATTAGAGTTA
ACTATTAATG ATTTTGATAC ATATTCAAAA TTTGAAAATT TCCTTGATAT TATTAGGCAG
CGTCCTGAAT TACGTAATAG TGATAATTGG CAGTTAGTTG TCGCTAATAA TGCGACTGGC
TTCCTTATAA CAACACTAGA TGAACCCGTT GTAAAATATC CAGACATTGT TAAAGTTAAT
GAGTGGGACC TGCCTGCTAT TGCCAATATA GATAAAACGG CTACGGCAAG TCAATACGAT
ATGCAGATTG TTTTTCAATG TGAAAATAAC CCAACGGTTA ATCGTGCAGC GACGAGACTT
GCGGGTAAAC ATGCTAAGAA TTCAATTATT ATTCAGTTAG ATGTTGATAA TAATCACAGG
GCTTTTATCA TTGACGATAA TACTCACGCC GAGTGGCGTG AAATTAGTCA TAATGAACTC
GTCACTAAAC TAAAAATCCA GCCTGAAAAC GGTAAAATAC GGTGGCAAGT TGTTGGGCAT
GGCCGTTCTG AAGGTGGCAA TGATAAGCAT CCGACACTGG CTGGGCAACG ACCCGAGCAA
CTAACAGCCC GCTTGAATCA ATTTTCTGAC TATTTACAGA CTGAGCATCA GATAAATATT
TCACCTCAGC AAGTCAGTTT GGTGGGGTGT GCTATGAGTA GCAGTGATAG GTACACCAGT
TTTGCACATA AATTTATGTC TCATTTGAAT GAAAATGGCA TTAGAACTAA TGTTTCAGCG
TCAACAAAGG CTATTGAGGT TGATCCTCAG GGGCATAAGC ATGATGTGGA TACTCCCGAT
ATTGATAGCT ACAACAATAA ATACTTATCG TCAATAAAAG GAACTGAAAA ATTATATTGG
AACCGTTGGG GGGAAATAAC AACAGAACGC AAAAAAGACA TAAATGGCCG TTTGAATAAT
ATAGATAGTC TACTGGATAA GTTGATAACA AGACAATTGT CAGTCAATCA ACTTAATAAA
AAACAACAGC GTAAATTAGC GGAAATATTT CCACAGTTAA CAGATAAGAA ACTGAATAAA
GGTGAATTGT TATTAACCTT ACATGACTCT TGGCGTATGC AGGCGTTGAA ATATGATTTG
CTGTTTTTAC AGAAAATATC AGACAGGCCA GATTTTGATA CTGAACTTTG GCGAGTAACT
GACCGTTGGC GAATAACAGA GACTGATGGT AATACGTTAC AAGATGTAAG GATTAAATCA
GGTTCACAGC ATAAAACTGA TTTAGCGACT TATCCTCACT CGATAACATC TGATCCTGAT
CTAAAAACAT CTGCTCCTGA ACTAAAAACA TCTAATCCTA AAGCAAGAAC AGCGATTTTT
GGGCGATTTG GTTATGGTAT GCAGGGATAT GGCTTTATTT CTGCTTTACG TTTATCTGCT
GATTACCAAC GTTGGATGAG TAACGGTGAC TTAACAGAAA AACAAGAGGA AGAAATACAA
TTGCAATTGG CCATGGCCTG GGGAGGTATC GGGGCTAATC TGGCGACCGA TGGCTTGCAA
TATGCATTTG GTAAGTGGGG AATTGGCTAC TTACAAAAAT TAGTGAGTAA AGGAGGGAGG
TTGTCACCGG CCCTTTTGAG CCAATTAACA TTATTAAAAC GCAACCCTGC ATTGCTATTG
GCCCCAGGTT TTCTTAAAGA CCTAAGAAAA TTAGCACTCA ACCAATTCGC ACATGGTGCT
GCACGTTTTA GTATGCCACT GCTCTCAGCA CTCACTTCTG GTATTGATAT CTATCAGGCA
TACCATGCAT TTTCACAATT GGCGACAGAA ACAGATCCTC ATGTTAGACG AGATCTTATT
GCCAGCGGGG TCTTTTCGAC CATTAATGCA ACCATTGGTT TGGGCGTCGC GTTTGCCATG
GCAATGGGGG GGACTGCTGC TACAGCTGCT GGGCCGGCGG GTATTGCCCT GGCTTTTACC
ATGATTATTG TTGGCGATAT TTACTCAGCC GTTAGTCAGA TTGAAAGGAT CCGCGATATT
GTACCTGATA TGACGGGAAG CCAACGGTTT GAGAATGGCT TGCGGTTATT TTTAAAATTT
GGTTTAACAC CGGGGCTGGA TAATCAGATT AGATATAACC AGACTATGGA AAGTGTTTAC
CAGCGACAGA GAGATTATTA TGAGGCGCTA TTAGCGAGTA AACAGGGCGT AGATACGCTA
TTTTATAGCC GGGGCGAAGC AGTTCTTAAA GCTATCCCTT TTATAAAACG AGATGAACGA
TCGCAGACAG AGAGAGATCT GGAGAAAATC AGTATTTTTT CAGGTGATCC CTTTACCAAT
GCCAAAATTT ATACCACTTA TGCAGAAATG GGAAAGCATG AGTATTATGA ACTGGATAAA
ATAAATGATG TTGACGATTA TGTTATCGCT GATTTTTTCG AGGATAATAA TCGTAGTGTA
GTTAAGTTAC AAAATAAAAA TCTGCATCAG GCGTTCTCTG AGCTTGATAT TGATTCTACC
TATAGCCCAT TTATCCTTTC TGCTGATGTT GATCGTAATG GGCTAAACGA TTTTATCGTG
ATCAACGAAA AATATAATAC AACAATAGCA TCAAGAAAAA ATTCAGTGGG TATGACAGTG
ATTGATGACT ATGTTAGTCG CTGGCATTAT GAGCTATACA CTTGGTTGGC TCAGCCGGAT
GGCAGCTACC TAAAAATAGA TACAAGGCTG GAGTGGGAGA AATTATTTCA TGCAATAGAA
GTCGATAAAT TTAATGAAGT TGTGTTCCCT GTGCTAGGGG ATTTTAACGG TGATAACGTG
TTTGAATTGG TGATTTTTCA TGATGATAAA ATGACAACTT ATCATTATGA TAGTCTGGAT
TTTAATCAGA GTGGAAAAGA TAATCACAAT GTAATTAATA TCGGAGATTT TATTGAGCCA
GTGAGGCTGG CATTTGAGGG TGAAAAAAGT AAAAATTATC CTTATTCACT TGTTGGCGAT
ATCAATAATG ATGGCTTTGA TGATATTTTA CTGCTAAATA AATCAGGGGA TATGTTACAC
CTGATGGGAA ATAGTAGTGG TGTTTTTAGG CAGCATAAGA CCAAGTTATC TTCTGAACTA
ACATCATTAC TCTCCTCCTC TAATCTTCAT CGTTCACAGT TACAATTAAC TGATCTCAAT
AAAGACGGGG GGCTTGATTT AGTTATCATT CTTAATGATG GAATTTATTA TCAGGCGTTG
GGGGATAAAA TAGATGGGGA GTATCATTTT GATACACCTT CCATGGTTAA TAAAATAACG
ATTAAGAACG AGGGAGGAGA CAGTGTCCGT TATCAGCAAA ATAGACTGTC TCAAATAGAT
AAACATAAAA TAATAGCGAT ATCTCCGAGT GATCAAGGTG AGAATAGATT AATCTCTCTC
TCTGATTCAG GGGAATTGCT TGCTCACCCT CTTCGCGAGA TAAAAGAGAA TGATGTCGCC
GCTTTATTTG ATTTAGGTGG GGGCGATGAT GTCGCAAAAG GTTATCATAA AAAGAAAAAT
ATATTTACTA TTGGTAGTGG GTTTAAGCAA TATCAAGGCG GTGAAAATGC GGATACGTTT
ATATTGACCA GTGCTGTGGC CTCTAAGAGT CATATCCTTA GCGGTGGCGA AGGAAATGAT
ACAGTCGCTC TGGGAGAAGT ACTGGGAAAT GAAATTGACA GTATTATTGA TATCAGCAAT
GGCTATTATA GTCAGGTGAA TGGTGGTGTC GAAAAACAAG TTGCTTTACT TTATGATTTT
GAAAATATTC TGGGTCATGA AAATGTTAAT GATACTATCA TCGGAAATGA TGTAGATAAT
TATCTTAATG GCATGGGCGG CGATGATAAA ATATGGGGTA ATGGAGGCAA TGATTTATTA
GCACTACAGT CTGGTTTAGC CCAGGGAGGC ACGGGGTTAG ATAGTTATCA TATTCTAAAA
AGTACTCATG AAAAATCGTT ACAGATTAGA ATTGAGGAAG TATCAGAAAA TAACAATACA
GATATGCAAA TCAGTAATAT CTTTCTTGAG CATAAGCTCA ACCAAATAAA ATCTATTGAG
TTGGATAATA TTGATGTGTT AATTAATATA AACAATGATA ATGGATTTAT GACCCAGATC
AGGTTGGTTG GTGTCTATAA TATAAATAAC AATCAAAAGC AGCAAGTACT TAACTTCACC
ATCCAGACTG TTGATGGGTT TACAATGGTG CCATTGTGGC CAAGCTACCT TAATGAAATT
ACTGAATTCT CCCCGAATAT GGTCGCATAT TACTCCTCGT TGGTTGATCG TAATTATAAG
GAATTAGTGG GTAAGGGAGA TCCTGATGAT ATCGTCGTAC GGTTCTCATT AGACAATGGT
TATCAACAGC AGCAGGTGAC TCATCTTCAA AGGGTAGAAG GAGAGAAAGA TATTGTTTTA
CGGCAAGCTA TATTACCTGA TTTTATTAGG CTCTCACCTC AAGAACATTC AATGCTAATG
GGGTTTTTAC CTCGCTATGA ATTATTGGGT GATAATAAAG ATAATCTATT GCAAGTGTTA
AGCGGGGAAG GGTTACTCGA GGGGCGCGGT GGACAAGATA CTTACTTAAT TCAAGAGAAA
GAGGGGAGTC CAACAGATAT TATTATCAAT AATTTTGATG ACTCATTAGC TTCAGATAAT
TTAGTTCTGT CATCCTGGTT ATTATGTGAT GTTATTGTTG AGCGTTCGGA TGATGACTTG
TTATTACGCT ACCGCGATCA ACCAGAAAAA CACCAGAGTA TACGGTTAGT TAACTATATG
AATGATGAAC GTTATCGGCA TCTAAAAATA ACGGATAAAA GTGGGCAGTC ACAATATCGA
GATCCTGTTA CTGGAACATT TATTGATTAT CAGATAAACC TCGATAAAAA TGGTCATCCT
TTTATAGCAG CCCAACAGGC CCCTGTCGTC AGCAGCGGTA ATGATGAGGT TGTCATTACT
TCGGCGACAT TCTTACCGGG TAATTATATT GATACAGGCG ATGGCAATGA TGCGATTATT
TATATTCGTG GGCACGAAGG TACCATGCTT AAAGGGGGGG GCGGTGACGA TACTTATTAT
TATAGCGCAG GGAGTGGGGC GATAAATATT GCCGATACCA GTGGGCTGGA TCATCTTTAT
CTGGATAAGC ACATTCTACT GCATACATTG TCAGCAGAGC GGCGTGAAAA TAATCTGGTG
CTGAATATCG CGGATAATAC ATCAGGTCGT ATTATTTTTG TTGACTGGTA TCTTGCTGAT
GAAAATAAAG TTGAGTTTAT TTGGGTAGAA GACTCTCAAA TTACTTTTGA TGAACTATTC
AGTTTGCGTC CGTATTCCGA TGAATACTAT CAATTATGCC AACACCTTAA ATCCATGGGA
TTATCCCTTA CCGTGAGGCA ATTAGCCGAT CTTGATTCTC AAGATGGCTA TAACACCCTC
AATCAGCTAA GAACCATTAA AGCATGGGTG ACAAAGAATC CAATTTATGA TGTTGCTGAT
TTGGATTATC TGGTGGCAAT GTCGTCAATT GCCTGGCGTG GTAACGCCCG TAACACTGAC
CCACTACCGC TAATAGAGCA GAAAATCGAT GCATTCTTCC AACCTTTGAT AGCCGAACGA
ATTAGCCTTA CAGAAGAGCA TGTTACCTGG ATCCAGCGCG AGGAGTTCGA TACTGTCGAT
ATCGCCAAAT GGGTAAAAAA TTATCATCTA CGTAGCCAGA ATGAAATTAA TTATCTGCTG
GAACAACTGG GTTTACTCAA GGAATCGCCA TTAAGTGATA AAGCCTTGGA TTTTACATTC
AAAAATAGAA TCGATCTGGC ACAGGCCGAT ATTGAATTGT GTCAGCAAGA ATGTGGGATC
AACCGCCAGA GTCTTATTAA TTTGGCGATG AAATATCATG TCACAGGCCG TGGGCACTTT
GAGTTACTGA TATCAAATAT TCAGGTGCTT AAGGAATATG GTGTGGTAGT GAGTGAGTCT
GAACAGCCTC TCGTCTTGAG AAAGCCCATA GACTTAAGGC AGTACTTCAA TCAGAAAAAT
TTAACAAAAG ATCATGTTGG CCGTTTAGCG GAACACGATA TGAGTTTTGA TGAATTGACC
CTGCTCTTGG ATAAAAACAT TCCCATTGAG CAGGCTTTTA CCCAAAGATT ACAAACTCAG
CTTGGGCCTC TGAAATTATT TAATGACGAG AGAGTATTTA ATAAAGGGGA CGTATTTGAT
CAAGATATTA GCCAATTAGC GGAGGCTATG GGGGGATTGG AATCAACTGA AAGCTATTCG
CTACCGCTAG AGCGGCAAAC AGTGATGGCA ATAACTACTC ATCAGTTTGT GAGTGATTCT
ATTGCCGCTT ATTGA
 
Protein sequence
MAGFYIDKLS LSQRLSIVSE TYDRVNKNNK KEKLKYSYDD IEMIKKRFVK YIDAQLYSLI 
REGLSVPATL TQEEKIKITD LAIDAALFND YERFNELIIY ISSLGISVTP PLPQEEGGSR
LYIYFSGDIH AYMDVWRGDL LLGSGTELSD IQSITGLRFM IDMAESLKLN IINPADKAMV
DLINHLRYEM ISYASSFCAT YSAERGGTVY LSSPDGLRIN NYFWNSELPV LRALQKKGLI
GDIRILHKPL EFYKDTPLDE LGDLLTAKDI SMTVEYQFLP VWLQEKLLVD IYQQWLDEEF
QPSLLAVRKE IINTIDIDRH APEVELLRYF LRKIHEQLDE ITEFKVLKEA ERIDSIKKKL
AVGSEIESWL DNVPAIDVNE RKIILESLLQ KESLLFSNVR DIKKSIIPLD FNSDVININT
SLLKNTFIPF NLLRESWDVI ISDRSLVEGT LTIHFSVGRK IMIKVDANRN QLKQMATLER
FLLANFTPRN TPQDLKIIDN VIMSGDVVLA ERRDDKGWHN DQQMIEKVKL SEFDYFLKSN
DLGVKINDNG FVLYLISDPE DNRDVIIKPN DDYNLKSIKD FIENNYLFFD DVPEYLIVKK
NVENKECIFA HDEGETYQVA YRDGEAWVLL SKMNTSDQIK NLNEITMSVN LNNAESRSVA
LILSLLNKHA RLVSILPDTH PKVMENFLDI DSLLKNSTHP FEHPLYRKLL NSISNDINKY
MESLNEIKDS YHLLPFDVRP GQYMNTWSKI DRDTVIEYSF KQNDKNHHPQ FIVLLQDDSL
SKRVGEIIAS YNNNKSIVLQ FDARSSEARI AYGTQNNITE MGKFELSFVT HGTTDGVYSF
SVANVIEIYK LTINSFALPP PVKIRLVICS IADNGQGAQG FNGTNPALGI VNMMHQEGFD
IPILAYTTKV GVSVEHPGEL VVFNSEKPGE VLKNIDDYQV LYHYKNNILL TDGVPVVELL
LKDVRNKIKS VDQLIESYSQ YLVPFFSDDN GVVDRELLEL TINDFDTYSK FENFLDIIRQ
RPELRNSDNW QLVVANNATG FLITTLDEPV VKYPDIVKVN EWDLPAIANI DKTATASQYD
MQIVFQCENN PTVNRAATRL AGKHAKNSII IQLDVDNNHR AFIIDDNTHA EWREISHNEL
VTKLKIQPEN GKIRWQVVGH GRSEGGNDKH PTLAGQRPEQ LTARLNQFSD YLQTEHQINI
SPQQVSLVGC AMSSSDRYTS FAHKFMSHLN ENGIRTNVSA STKAIEVDPQ GHKHDVDTPD
IDSYNNKYLS SIKGTEKLYW NRWGEITTER KKDINGRLNN IDSLLDKLIT RQLSVNQLNK
KQQRKLAEIF PQLTDKKLNK GELLLTLHDS WRMQALKYDL LFLQKISDRP DFDTELWRVT
DRWRITETDG NTLQDVRIKS GSQHKTDLAT YPHSITSDPD LKTSAPELKT SNPKARTAIF
GRFGYGMQGY GFISALRLSA DYQRWMSNGD LTEKQEEEIQ LQLAMAWGGI GANLATDGLQ
YAFGKWGIGY LQKLVSKGGR LSPALLSQLT LLKRNPALLL APGFLKDLRK LALNQFAHGA
ARFSMPLLSA LTSGIDIYQA YHAFSQLATE TDPHVRRDLI ASGVFSTINA TIGLGVAFAM
AMGGTAATAA GPAGIALAFT MIIVGDIYSA VSQIERIRDI VPDMTGSQRF ENGLRLFLKF
GLTPGLDNQI RYNQTMESVY QRQRDYYEAL LASKQGVDTL FYSRGEAVLK AIPFIKRDER
SQTERDLEKI SIFSGDPFTN AKIYTTYAEM GKHEYYELDK INDVDDYVIA DFFEDNNRSV
VKLQNKNLHQ AFSELDIDST YSPFILSADV DRNGLNDFIV INEKYNTTIA SRKNSVGMTV
IDDYVSRWHY ELYTWLAQPD GSYLKIDTRL EWEKLFHAIE VDKFNEVVFP VLGDFNGDNV
FELVIFHDDK MTTYHYDSLD FNQSGKDNHN VINIGDFIEP VRLAFEGEKS KNYPYSLVGD
INNDGFDDIL LLNKSGDMLH LMGNSSGVFR QHKTKLSSEL TSLLSSSNLH RSQLQLTDLN
KDGGLDLVII LNDGIYYQAL GDKIDGEYHF DTPSMVNKIT IKNEGGDSVR YQQNRLSQID
KHKIIAISPS DQGENRLISL SDSGELLAHP LREIKENDVA ALFDLGGGDD VAKGYHKKKN
IFTIGSGFKQ YQGGENADTF ILTSAVASKS HILSGGEGND TVALGEVLGN EIDSIIDISN
GYYSQVNGGV EKQVALLYDF ENILGHENVN DTIIGNDVDN YLNGMGGDDK IWGNGGNDLL
ALQSGLAQGG TGLDSYHILK STHEKSLQIR IEEVSENNNT DMQISNIFLE HKLNQIKSIE
LDNIDVLINI NNDNGFMTQI RLVGVYNINN NQKQQVLNFT IQTVDGFTMV PLWPSYLNEI
TEFSPNMVAY YSSLVDRNYK ELVGKGDPDD IVVRFSLDNG YQQQQVTHLQ RVEGEKDIVL
RQAILPDFIR LSPQEHSMLM GFLPRYELLG DNKDNLLQVL SGEGLLEGRG GQDTYLIQEK
EGSPTDIIIN NFDDSLASDN LVLSSWLLCD VIVERSDDDL LLRYRDQPEK HQSIRLVNYM
NDERYRHLKI TDKSGQSQYR DPVTGTFIDY QINLDKNGHP FIAAQQAPVV SSGNDEVVIT
SATFLPGNYI DTGDGNDAII YIRGHEGTML KGGGGDDTYY YSAGSGAINI ADTSGLDHLY
LDKHILLHTL SAERRENNLV LNIADNTSGR IIFVDWYLAD ENKVEFIWVE DSQITFDELF
SLRPYSDEYY QLCQHLKSMG LSLTVRQLAD LDSQDGYNTL NQLRTIKAWV TKNPIYDVAD
LDYLVAMSSI AWRGNARNTD PLPLIEQKID AFFQPLIAER ISLTEEHVTW IQREEFDTVD
IAKWVKNYHL RSQNEINYLL EQLGLLKESP LSDKALDFTF KNRIDLAQAD IELCQQECGI
NRQSLINLAM KYHVTGRGHF ELLISNIQVL KEYGVVVSES EQPLVLRKPI DLRQYFNQKN
LTKDHVGRLA EHDMSFDELT LLLDKNIPIE QAFTQRLQTQ LGPLKLFNDE RVFNKGDVFD
QDISQLAEAM GGLESTESYS LPLERQTVMA ITTHQFVSDS IAAY