Gene YpsIP31758_0824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0824 
Symbol 
ID5384438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp995132 
End bp1004446 
Gene Length9315 bp 
Protein Length3104 aa 
Translation table11 
GC content37% 
IMG OID640863788 
Productputative phage minor structural protein 
Protein accessionYP_001399808 
Protein GI153949079 
COG category 
COG ID 
TIGRFAM ID[TIGR01665] phage minor structural protein, N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGGGT TTTATATAGA TAAGCTGTCA CTCTCACAAC GGCTTTCTAT TGTTTCTGAA 
ACATATGATC GGGTTAATAA AAACAATAAA AAAGAAAAAT TAAAATACTC TTATGATGAT
ATTGAAATGA TCAAGAAAAG ATTTGTTAAA TATATCGACG CGCAACTTTA CAGTTTAATT
AGAGAAGGGT TATCTGTACC GGCCACTTTA ACGCAAGAAG AAAAAATAAA AATCGCTGAT
CTGGCAATTG ATGCGGCTTT TTTCAATGAC TATGAAAGAT TTAATGAGTT AATTATATAC
ATTAGCTCAT TAGGTATATC TGTGACACCA CCACTGCCAC AAGAAGAGGG AGGGAATAGA
TTGTATATCT ATTTTAGTGG TGACATTCAC ACTTATATGG ATGTGTGGCG AGGTGATTTA
TTCGTAGGAA GTGGAACTGA ACTTTCCGAT ATGCAAAGTA TCACTGGTCT GCGATTTATG
ATTGACATGG CAGAATCACT GAAACTTAAT ATTATTAATC CAGCGGATAA GGCGATGGTG
GATTTAATCA ATCACCTTCG GTATGAAATG ATTTCATATG CCAGTTCATT TTATGCCACC
TATAGCGCTG AACGCGGTGG AACGGTTTAC TTATCATCAC CAGATGGCCT ACGGATAAAT
AACTATTTTT GGAACAGTGA GTTGCCTGTA CTGCGTGCAT TGCAAAAGCA AGGATTGATA
GGCGATATCC GTATTCTTCA TAAACCTTTA GAGTTTTATA AAGATACACC ACTGGATGAA
TTGGGGGATT TGCTGACAGC AAAAGATCTT TCAATGACTG CGGAGTATCA ATTTCTTCCC
GTTTGGTTGC AAGAGAAATT ATTGGTCGAC ATTTATCAAC AATGGTTAGA TGAGGAATTT
CAGCCCAGTT TATTTACTGT TAGAAGAGAG ATTATTAATA CAATTGATAT TGATAGAAAT
GCACCTGAAG TAGAGTTGCT ACGCTATTTT CTTAGCAAAA TACATGGGCA GTTAGATGAA
ATTACAGAGT ATAAAGCATT AAAAGAAGCG GAGCGTATAA ATTTTATCAA GAAAAAATTA
GCCGTGGGCA GTGAAATAGA AAGTTGGTTA GATAATGTAC CCGCAATAGA TGTTAATGAA
AGAAAAGTTA TTCTTGAGAG TTTATTACAA AAAGAAAGTC TACTTTTTTC TAATGTTCGT
GATATTAAGA AATTCCCTAT CCCTCTGGAT TTTAATAGTG ATGTGATTAA TGTTAACACA
AATAAATTAA AAAACACTTT CATTCCGTTT AATTTGTTAC GGGAAAAATG GGATGTCATA
ATCAGTGATC GTAGTTTAGT GGACGGAACA CTTACGATAC ATTCTTCTGC TGGTAGAAAA
ATAATGATTA AGGTGGATAC TAACCGTAAT CAGTTAAAAC AGATTGCTAC TCTTGAACGT
TTTTTATTGG CTAATTTTAC CCCAAAAAAT GCACCCCAAG ATTTACAGTT AATTGAAAAT
TTTATTATGA GTGGTGATGC AGTATTAGCA GAGAGGAAAG GTGATAAAGG TTGGCATAAT
GATCAGCAGA TGATAGAAAA AGTTAAATTA AGTGAGTTTG ACTATTTTCT AAAAAGTAAT
GATTTGGGGA TAAAGAAGAA TGACAATGGA TTTGTTATCT ATTTAATATC AGATCCTGAG
GATAGTCGAG ATGTAATTAT AAACCCTAAT AATGATTATA ATTTAAATTC GATTAAAGAT
TTTATAGAAA ATAACTATTT ATTTTTTGAT GATGTGCCAG AGTACCTTAT TGTAAAGAAA
AATGTAGAAA ATAAAGAATG TATATTTGCC CATGATGAAG GGGAGACGTA CCAAGTTGCT
TATAGAGATG GAGAAGCATG GGTTCTACTT TCTAAAATGA ATACTTCAGA TCAAATTAAA
AATCTAAACG AAATAACCAT GTCCGTAAAT CTAAATAATG CTGAGTCAAG GTCTTTAGCA
TTAATATTAT CTTTATTAAA AAAACATGAT CGATTAGTAT CGATACTTCC TGATACGCAT
CCTAAAGTCA TGGAAAACTT TTTAGATATA GACTCTCTAT TAAAAAATAG CACGCATCCA
TTTGAGCACC CTCTCTATAG GAAGTTATTA AATAGCATAT CAAATGATAT AAATAAATAT
ATGGAGTCAT TGAATGAGAT AAAGGATTCA TACCATTTGT TACCATTCGA TGTTCGGCCT
GGGCAATATA TGAATACTTG GAGTAAAATT GATAGAGACA CTGTTATTGA ATATTCCATT
AAACAAAATG ATAAAAATAA CCACCCACAA TTTATTGTGT TATTACAAGA TGACTCTCTT
TCTAAGAGAG TCGGAGAGAT TATTGCTTCG TATAATCATA ATAAATCAAT AGTATTACAG
TTCGATGCCA GGAGTTCTGA AGCACGAATA GCCTACGGTG CTCAAAATAA TATAACTGAA
ATGGGGGAAT TTGAATTATC ATTTGTAACT CATGGTACTC CAGATGGATT ATATAGTTTC
AGTATAGCTA ATGTTATTGA AATTTATAAA CTTACAATTA ATTCATTCGC ATTGCCTCCT
CCGGTAAAAA TTAGATTAGT AATATGCAGT ATTGCTGATA ATGGACAGGG ATCACAGGGA
TTTAATGGTA CTCATCCAGC CCTGGGAATT GTCAATATGA TGCACCAGGA AGGATTTGAT
ATTCCTATAT TGGCCTATAC AACAAAAGTG GGGGTTTCAG TAGAATACCC CGGTGAGTTA
GTGGTATTCA ATTCAGAAAA TCAGGGAGGA GTGCTGAAAA ATATAGACGA CTATCAGGTG
TTATATCATT ATAAAAATAA TATATTACTT ACCGATGGTA TCCCGGTAGT TGAACTGTTA
CTCAAAGATG TAAGGAATAA AATAAAGTCC ATTGATCAAC TTGTTGAATA TTATTCACAA
TATCTAGTAC CTTTCTTCTC TGACGATAAC GGTGTTATTG ATCGTAATTT ATTAGAATTA
ACTATTAATG ACTCTGATAC GCATTCAAAA TTTGAAAATT TCCTTGATAT TATTAGGCAG
CGTCCTGAGT TACGTAATAG TGATAATTGG CAGTTAGTCG TCGCTAATAA TGCAACTGGC
TTCCTTATAA CAACACTAGA TGAACCCGTT GTAAAATATC CAGACATTGT TAAAGTTAAT
GAGTGGGATC TGCCTGCTAT TGCCAATATA GATAAAACGG CTACGGCAAG TCAATACGAT
ATGCAGATTG TTTTTCAATG TGAAAATAAC CCAACGGTTA ATCGTGCAGC GACGAGACTT
GCGGGTAAAC ATGCTAAGAA TTCAATTATT ATTCAGTTAG ATGTTGATAA TAATTACAGG
GCTTTTATCA TTGACGATAA TATTCACGCC GAATGGCATG AAATTAGTCA TAATGAACTC
GTCACTAAAC TAAAAATCCA GCCTGAAAAC GGTAAAATAC GGTGGCAAGT TGTTGGGCAT
GGCCGTTCTG AAGGTGGCAA TGATAAGCAT CCGACACTGG CTGGGCAACG ACCCGAGCAA
CTAACAGCCC GCTTGCATCA ATTTTCTGAC TATTTACAGA CTGAGCATCA GATCAATATT
TCACCTCAGC AAGTCAGTTT GGTGGGGTGT GCTATGAGTA GTAGTGATAG ATATACCAGT
TTTGCACATA AATTTATGTC TCATTTGAAT GAAAATGGCA TTAGAACCAA TGTTTCAGCG
TCAACAAAGG CTATTGAGGT TGATCCTCAG GGGCATAAGC ATGATGTGGA TACTCCCGAT
ATTGATAGCT ACAATAATAA ATACTTATCG TCAATAAAAG GAACTGAAAA ATTATATTGG
AACCGTTGGG GGGAAATAAC AACAGAACGC AAAAAAGACA TCAATGGCCG TTTGAATAAT
ATAGATAGTC TACTGGATAA GTTGATAACA AGGCAGTTAT CAGTTAATCA ACTTAATAAA
AAACAACAGC GTAAATTAGC GGAAATATTT CCACAGTTAA CAGATAAGAA ACTGAATAAA
GGTGAATTGT TATTAACCTT ACATGACTCT TGGCGTATGC AGGCGTTGAA ATATGATTTG
CTTTTTTTAC AGAAAATATC AGACAGGCCA GATTTTGATA CTGAACTTTG GCGAGTAACT
GATCGTTGGC GAATAACCGA GGCTGATGGT AATACGTTAC AAGATGTAAT GATTAAATCA
GGTTCACAGC ATAAAACTGA TTTAGCGACT TATCCTCACT CGATAACATC TGATCCTGAT
CTAAGAACAT CTGCTCCTGA ACTAAAAACA TCTAATCCTA AAGCAAGAAC AGCGATTTTT
GGGCGGTTTG GTTATGGTAT GCAGGGATAT GGTTTTATTT CTGCTTTACG TTTATCTGCT
GATTACCAAC GTTGGATGAG TAACGGTGAC TTAACAGAAA AACAAGAGGA AGAAATACAA
TTGCAATTGG CCATGGCCTG GGGAGGTATA GGGGCTAATC TGGCGACCGA TGGCTTGCAA
TATGCATTTG GTAAGTGGGG AATTGGCTAC TTACAAAAAT TAGCGAGTAA AGGAGGGAGG
TTGTCACCGG CCCTTTTGAG CCAATTAACA TTATTAAAAC GCAACCCTGC ATTGCTATTG
GCCCCAGGTT TTCTTAAAGA CCTAAGAAAA TTAGCACTCA ACCAATTCGC ACATGGTGCT
GCACGTTTTA GTATGCCACT GCTCTCAGCA CTCACTTCTG GTATTGATAT CTATCAGGCA
TACCATGCAT TTTCACAATT GGCGACAGAA ACAGATCCTC ATGTTAGACG AGATCTTATT
GCCAGCGGGG TCTTTTCGAC CATTAATGCA ACCATTGGTT TGGGCGTCGC GTTTGCCATG
GCAATGGGGG GGACTGCTGC TACAGCTGCC GGGCCGGCGG GTATTGCCCT GGCTTTTACC
ATGATTATTG TTGGCGATAT TTACTCAGCC GTTAGTCAGA TTGAAAGGAT CCGCGATATT
GTACCTGATA TGACGGGAAG CCAACGGTTT GAGAATGGCT TGCGGTTATT TTTAAAATTT
GGTTTAACAC CGGGGCTGGA TAATCAGATT AGATATAACC AGACTCTGGA AAGTGTTTAC
CAGCGACAAA GAGATTATTA TGAGGCGCTA TTAGCGAGTA AACAGGGAGT AGATACGCTC
TTTTATAGCC GGGGCGAAGC AGTTCTTAAA GCTATCCCTT TTATAAAAAG AGATGAACGA
TCGCAGACAG AGAAAAATCT GGAGAAAATC AGTATTTTTT CAGGTGATCC CTTTACCAAT
GCCAAAATTT ATACCACTTA TGCAGAAATG GGAAAACATG AGTATTATGA ACTGGATAAG
ATAAATGATG TTGACGATTA TGTTATCGCT GATTTTTTCG AGGATAATAA TCGTAGTGTA
GTTAAGCTAC AAAATAAAAA TCTGCATCAG GCGTTTGCTG AGCTTGATAT TAATTCTACC
TATAGCCCAT TTATCCTTTC TGCTGATGTT GATCGTAATG GGCTAAACGA TTTTATCGTT
ATCAACGAAA AATATAATAC AACAATAGCA TCAAGAAAAA ATTCAGTGGG TATGACAGTG
ATTGATGACT ATGTTAGTCG CTGGCATTAT GAGCTATACA CTTGGTTGGC TCAGCCGGAT
GGCAGCTACC TAAAAATAGA TACAAGGCTG GAGTGGGAGA AATTATCTCA TGCAATAGAA
GTCGATGAAT TTAATGAAGT TGTGTTCCCT GTGCTAGGGG ATTTTAACGG TGATAACGTG
TTTGAATTGG TTATTTTTCA TGATGATAAA ATGACAACTT ATCATTATGA TAGTCTGGAT
TTTAATCAGA GTGGAAAAGA TAATCACAAT GTAATTAATA TCGGAGATTT TATTGAGCCA
GTGAGGCAGG CATTTGAGGG TGAAAAAAGT AAAAATTATC CTTATTCACT TGTTGGAGAT
ATCAATAATG ATGGCTTTGA TGATGTTTTA CTGCTAAATA AATCAGGGGA TATGTTACAC
CTGATGGGAA ATAATAGTGG TGTTTTTAGG CAGCATAAGA CCAAGTTATC TTCTGAACTA
ACATCATTAC TCTCCTCCTC TAATCTTCAT CGTTCACAGT TACAATTAAC TGATCTCAAT
AAAGACGGGG GGCTTGATTT AGTTATCATT CTTAATGATG GGAATTATTA TCAGGCGTTG
GGGTATAAAA TAGATGGGGA GTATCATTTT GATACACCTT CCATGGTTAA TAAAATAACG
ATTAAGAACG AGGGAGGAGA CAGTGTCCGT TATCAGCAAA ATAGACTGTC TCAAATAGAT
AAACATAAAA TAATAGCGAT ATCTCGGAGT GCTCAAGGTG AAAATAGATT AATCTCTCTC
TCTGATTCAG GGAAATTACT TGCTCATCCT CTTCGCGAGA TAAAAGAGAA TGATGTTGCC
GCTTTATTTG ATTTAGGTGG AGGCGATGAT GTAGCAGAGG GTTATCATAA AAAGAAAAAT
ATATTTACTA TTGGTAGCGG GTTTAAGCAA TATCAAGGCG GTGAAAATGC GGATACGTTT
ATATTGACCA GTGCTACGGC CTCTAAGAGT CATATCCTTA GCGGTGGCGA AGGAAATGAT
ACAGTCGCTC TGGGAGAAGT CCTGGGAAAT GAAATTGACA GTATTATTGA TATCAGCAAA
GGCTATTATA GTCAGGTGAA TGGTGGTGTC GAAAAACAAG TTGCTTTACT TTATGATTTT
GAAAATATTC TGGGTCATGA AAATGTTAAT GATACTATCA TCGGAAATGA TGTAGATAAT
TATCTTAATG GCATGGGAGG CGATGATAAA ATATGGGGTA ATGGAGGCAA TGATTTATTA
GCACTACAGT CAGGTTTAGC CCAGGGAGGC ACGGGGTTAG ATAGTTATCA TATTCTGAAA
AGTACTCATG AAAAATCGTT ACAGATTAGA ATTGAGGAAG TATCAGAAAA TAACAATACA
GATATGCAAG TCAGTAATAT CTTTCTTGAG CATAAGCTCA ACCAAATAAC GTCTATTGAG
TTAGATAATA TTGATGTGTT GATTAATATA AAGAATGATA ATGGATTTAT GACCCAGATC
AGGTTGGTTG GTGTCTATAA TATAAATAAT AATCAAAAGC AGCAAGTACT TAACTTCACC
ATCCAGACTG TTGATGGGTT TACAATGGTG CCATTGTGGC CAAGCTACCT TAATGAAGTT
ACTGAATTTT CCCCGAATAT GGTCGCATAT TACTCCTCGT TGGTTGATCG CAATTATAAG
GACTTAGTGG GTAAGGGAGA TCCTGATGAT ATCGTCGTAC GGTTCTCATT AGACAATGGT
TATCAACAGC AGCAGGTGAC TCATCTTCAA AGGATAGAAG GAGAGAAAGA TATTGTTTTA
CGGCAAGCTA TATTACCTGA TTTTATTAGG CTCTCACCTC AAGAGCATTC AATGCTAATG
GGGTTTTTAC CTCGCTATGA GTTATTGGGG GATAATAAAG ATAATCTATT GCAAGTGTTA
AGCGGGGAAG GGTTACTCGA GGGGCGCGGT GGACAAGATA CTTACTTCAT TCAAGAGGCA
GAGGGGAGTC CAACAGATAT TATTATCAAT AATTTTGATG ACTCATTAGC TTCAGATACT
TTAGTTCTGT CATCTTGGTT ATTATGTGAT GTTATTGTTG AACGTTCGGA TGATGACTTG
TTATTACGCT ACCGCGATCA ACCAGAAAAA CACCAGAGTA TACGGTTAGT TAACTATATG
AACGATGAAC GTTATCGGCA TCTAAAAATA ACGGATAAAA GTGGGCAGTC ACAATATCGA
GATCCTGTTA CTGGAACATT TATTGATTAT CAGATAAACC TCGATAAAAA TGGTCATCCT
TTTATAGCAG CCCAACAGGC CCCTGTCGTC AGCAGCGGTA ATGATGAGGT TGTCATTACT
TCGGCGACAT TCTTACCGGG TAATTATATT GATACAGGCG ATGGCAATGA TGCGATTATT
TATATTCGTG GGCAAGAAGG TACCATGCTG AAAGGGGGGG GCGGTGACGA TACTTATTAT
TATAGCGCAG GGAGTGGAGC GATAAATATT GCCGATACCA GTGGGCTGGA TCATCTTTAT
CTGGATAAGC ACATTCTACT GCATACATTG TCAGCAGAGC GGCGTGAAAA TAATCTGGTG
CTGAATATCG CGGATAATAC ATCGGGTCGT ATTATTTTTG TTGACTGGTA TCTTGCTGAC
GAAAATAAAG TTGAGTTTAT TTGGGTAGAA GACTCTCAAA TTACTTTTGA TGAACTATTC
AGCCTGCGTC CGTATTCCGA TGAATACTAT CAATTATGCC AGCAACTTAA ATCCATGGGA
TTATCCCTTA CCGTGAGGCA ATTAGCCGAT CTTGATTCTC AAGATGGCTA TAACACCCTC
AATCAGCTAA GAACCATTAA AGCATGGGCG ACAAAGAATC CAATTTATGA TGTTGCTGAT
TTGGATTATC TGGTGGCAAT GTCGTCAATT GCCTGGCGTG GTAACGCCCG TAACACTGAC
CCACTACCGC TAATAGAGCA GAAAATCGAT GCATTCTTCC AACCTTTGAT AGCAGAACGA
ATTAGCCTTA CAGAAGAGCA TGTTACCTGG ATCCAGCGCG AGGAGTTCGA TACTGTCGAT
ATCGCCAAAT GGGTAAAAAA TTATCATCTA CGTAGCCAGA ATGAAATTAA TTATCTGCTG
GAACAACTGG GTTTACTCAA GGAATCGCCA TTAAGTGATA AAGCCTTGGA TTTTACATTC
AAAAATAGAA TCGATCTGGC ACAGGCCGAT ATTGAATTGT GTCAGCAAGA ATATGGGATC
AACCGCCAGA GTCTTATTAA TTTGGCAATG ACATATCATG TCACAGGCCG TGGGCACTTT
GAGTTACTGA TATCGAATAT TCAGGTGCTT AAGGAGTATG GTGTGGTGGT GAGTGAGTCT
GAACAGCCTC TCGTCTTGAG AAAGCCCATA GACTTAAGGC AGTACTTCAA TCAGAAAAAT
TTAACAAAAG ATCATGTTGG CCGTTTAGCG GAACACGATA TGAGTTTTGA TGAATTGACC
CTGCTCTTGG ATAAAAACAT TCCCATTGAG CAGGCTTTTA CCCAAAGATT ACAAACTCAG
CTTGGGCCTC TGAAATTATT TAATGACGAG AGAGTACTTA ATCAAGGGGA CATATTTGAT
CAAGATATTA GCCAATTAGC GGAGGCTATG GGGGGATTGG AATCAACTGA AAGCTATTCG
CTACCGCTAG AGCGGCAAAC AGCGATGGCA ATAACTACTC ATCAGTTTGT GAGTGATTCT
ATTGCCGCTT ATTGA
 
Protein sequence
MAGFYIDKLS LSQRLSIVSE TYDRVNKNNK KEKLKYSYDD IEMIKKRFVK YIDAQLYSLI 
REGLSVPATL TQEEKIKIAD LAIDAAFFND YERFNELIIY ISSLGISVTP PLPQEEGGNR
LYIYFSGDIH TYMDVWRGDL FVGSGTELSD MQSITGLRFM IDMAESLKLN IINPADKAMV
DLINHLRYEM ISYASSFYAT YSAERGGTVY LSSPDGLRIN NYFWNSELPV LRALQKQGLI
GDIRILHKPL EFYKDTPLDE LGDLLTAKDL SMTAEYQFLP VWLQEKLLVD IYQQWLDEEF
QPSLFTVRRE IINTIDIDRN APEVELLRYF LSKIHGQLDE ITEYKALKEA ERINFIKKKL
AVGSEIESWL DNVPAIDVNE RKVILESLLQ KESLLFSNVR DIKKFPIPLD FNSDVINVNT
NKLKNTFIPF NLLREKWDVI ISDRSLVDGT LTIHSSAGRK IMIKVDTNRN QLKQIATLER
FLLANFTPKN APQDLQLIEN FIMSGDAVLA ERKGDKGWHN DQQMIEKVKL SEFDYFLKSN
DLGIKKNDNG FVIYLISDPE DSRDVIINPN NDYNLNSIKD FIENNYLFFD DVPEYLIVKK
NVENKECIFA HDEGETYQVA YRDGEAWVLL SKMNTSDQIK NLNEITMSVN LNNAESRSLA
LILSLLKKHD RLVSILPDTH PKVMENFLDI DSLLKNSTHP FEHPLYRKLL NSISNDINKY
MESLNEIKDS YHLLPFDVRP GQYMNTWSKI DRDTVIEYSI KQNDKNNHPQ FIVLLQDDSL
SKRVGEIIAS YNHNKSIVLQ FDARSSEARI AYGAQNNITE MGEFELSFVT HGTPDGLYSF
SIANVIEIYK LTINSFALPP PVKIRLVICS IADNGQGSQG FNGTHPALGI VNMMHQEGFD
IPILAYTTKV GVSVEYPGEL VVFNSENQGG VLKNIDDYQV LYHYKNNILL TDGIPVVELL
LKDVRNKIKS IDQLVEYYSQ YLVPFFSDDN GVIDRNLLEL TINDSDTHSK FENFLDIIRQ
RPELRNSDNW QLVVANNATG FLITTLDEPV VKYPDIVKVN EWDLPAIANI DKTATASQYD
MQIVFQCENN PTVNRAATRL AGKHAKNSII IQLDVDNNYR AFIIDDNIHA EWHEISHNEL
VTKLKIQPEN GKIRWQVVGH GRSEGGNDKH PTLAGQRPEQ LTARLHQFSD YLQTEHQINI
SPQQVSLVGC AMSSSDRYTS FAHKFMSHLN ENGIRTNVSA STKAIEVDPQ GHKHDVDTPD
IDSYNNKYLS SIKGTEKLYW NRWGEITTER KKDINGRLNN IDSLLDKLIT RQLSVNQLNK
KQQRKLAEIF PQLTDKKLNK GELLLTLHDS WRMQALKYDL LFLQKISDRP DFDTELWRVT
DRWRITEADG NTLQDVMIKS GSQHKTDLAT YPHSITSDPD LRTSAPELKT SNPKARTAIF
GRFGYGMQGY GFISALRLSA DYQRWMSNGD LTEKQEEEIQ LQLAMAWGGI GANLATDGLQ
YAFGKWGIGY LQKLASKGGR LSPALLSQLT LLKRNPALLL APGFLKDLRK LALNQFAHGA
ARFSMPLLSA LTSGIDIYQA YHAFSQLATE TDPHVRRDLI ASGVFSTINA TIGLGVAFAM
AMGGTAATAA GPAGIALAFT MIIVGDIYSA VSQIERIRDI VPDMTGSQRF ENGLRLFLKF
GLTPGLDNQI RYNQTLESVY QRQRDYYEAL LASKQGVDTL FYSRGEAVLK AIPFIKRDER
SQTEKNLEKI SIFSGDPFTN AKIYTTYAEM GKHEYYELDK INDVDDYVIA DFFEDNNRSV
VKLQNKNLHQ AFAELDINST YSPFILSADV DRNGLNDFIV INEKYNTTIA SRKNSVGMTV
IDDYVSRWHY ELYTWLAQPD GSYLKIDTRL EWEKLSHAIE VDEFNEVVFP VLGDFNGDNV
FELVIFHDDK MTTYHYDSLD FNQSGKDNHN VINIGDFIEP VRQAFEGEKS KNYPYSLVGD
INNDGFDDVL LLNKSGDMLH LMGNNSGVFR QHKTKLSSEL TSLLSSSNLH RSQLQLTDLN
KDGGLDLVII LNDGNYYQAL GYKIDGEYHF DTPSMVNKIT IKNEGGDSVR YQQNRLSQID
KHKIIAISRS AQGENRLISL SDSGKLLAHP LREIKENDVA ALFDLGGGDD VAEGYHKKKN
IFTIGSGFKQ YQGGENADTF ILTSATASKS HILSGGEGND TVALGEVLGN EIDSIIDISK
GYYSQVNGGV EKQVALLYDF ENILGHENVN DTIIGNDVDN YLNGMGGDDK IWGNGGNDLL
ALQSGLAQGG TGLDSYHILK STHEKSLQIR IEEVSENNNT DMQVSNIFLE HKLNQITSIE
LDNIDVLINI KNDNGFMTQI RLVGVYNINN NQKQQVLNFT IQTVDGFTMV PLWPSYLNEV
TEFSPNMVAY YSSLVDRNYK DLVGKGDPDD IVVRFSLDNG YQQQQVTHLQ RIEGEKDIVL
RQAILPDFIR LSPQEHSMLM GFLPRYELLG DNKDNLLQVL SGEGLLEGRG GQDTYFIQEA
EGSPTDIIIN NFDDSLASDT LVLSSWLLCD VIVERSDDDL LLRYRDQPEK HQSIRLVNYM
NDERYRHLKI TDKSGQSQYR DPVTGTFIDY QINLDKNGHP FIAAQQAPVV SSGNDEVVIT
SATFLPGNYI DTGDGNDAII YIRGQEGTML KGGGGDDTYY YSAGSGAINI ADTSGLDHLY
LDKHILLHTL SAERRENNLV LNIADNTSGR IIFVDWYLAD ENKVEFIWVE DSQITFDELF
SLRPYSDEYY QLCQQLKSMG LSLTVRQLAD LDSQDGYNTL NQLRTIKAWA TKNPIYDVAD
LDYLVAMSSI AWRGNARNTD PLPLIEQKID AFFQPLIAER ISLTEEHVTW IQREEFDTVD
IAKWVKNYHL RSQNEINYLL EQLGLLKESP LSDKALDFTF KNRIDLAQAD IELCQQEYGI
NRQSLINLAM TYHVTGRGHF ELLISNIQVL KEYGVVVSES EQPLVLRKPI DLRQYFNQKN
LTKDHVGRLA EHDMSFDELT LLLDKNIPIE QAFTQRLQTQ LGPLKLFNDE RVLNQGDIFD
QDISQLAEAM GGLESTESYS LPLERQTAMA ITTHQFVSDS IAAY