Gene HS_0478 details

Gene Information

Locus tag: HS_0478
Symbol:
ID: 4239960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 509163
End bp: 517718
Gene Length: 8556 bp
Protein Length: 2851 aa
Translation table: 11
GC content: 38%
IMG OID: 638104026
Product: cell-surface large adhesin
Protein accession: YP_718689
Protein GI: 113460623
COG category: [U] Intracellular trafficking, secretion, and vesicular transport; [W] Extracellular structures
COG ID: [COG5295] Autotransporter adhesin
TIGRFAM ID:
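
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 509163
END_BP = 517718
GENE_LENGTH_BP = 8556
PROTEIN_LENGTH_AA = 2851


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 8556
print(expected_protein_length(GENE_LENGTH_BP))  # 2851
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields reported for this gene.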


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA TCTTCAAAAC GAAATACGAT GTAACAACCG GTGAAACTAA AGTGGTATCT 
GAATTGGCGA AAAATTGTCC AGCGGCGAGC GGGGTGTCGT GTGCTTCGTC GGTCGGTGTG
GGTCAGCCGA AGTGCGGTGT GTTTTTCGGC GGAATGTTAG GGGCGTTTAA GATTCTGCCG
TTAGCGTTGT TGATATCGGG GGTGTTGTCG CCTTTGGGGT ATGCGGCTGC GGAAGTGGCG
GCGATGCCAA AAGTGGTAGA GAAAGAAAAA ACGCCGGCAC AACGAGAAGA AGAGCTAAAA
AATGCGTTGA AAAAAGAATT AAAGCAAGAA TTAAAAAGTG AAATAGAAAA ATTACAAAGA
AATTATGAGA GAAGTGTATC GTCGCCCTTA GTATTTAATA CGGATGTGAG AAGTGCGGGG
TTACCTGCGT TAAATTACTC GCCAAGAATA GCTAATCCGT ATAATACACA TTTAAAGCAT
GAAGGCGGGC AAATTAGAGT AAGAAATTCT GCCCAAGAAT GGTCTGATCA AACAACTAAT
AAGTATCCAA CTGGAGCACA TGATAACTGG AAATATGATC AGATTGACGG CGGGAGAAAT
GCTTTAGGTT ATGATGAATT TAGTTATAGA AATGAACCAG GATGGAATTA TATCGGTACA
GATAGATACA GTGGCGCAAC TAGTGATTAT GGATTGTATA ATAATAATGG AAAGAGTGAA
TACAGATGGT CTGCTCCTAA TAGAGGAGCG ATTGTATTAG CTCCTTATGA ACAGGTTGAA
ACGAATAAAG AGGAAAAAGG TGTCTTTGGT AGGATAAAAA AACATAAAAA TGAAGCAGTT
GCAGATAAAT ATAAAAATCT ATCAGATGCA AACCTATTTA AGGCTTATGT TGACGGGAAT
AGAACAAGAA GAAATATCGT CTCAATTGGG GTAGACTCTG GGACTCTAGC AGATAAGACA
ACTTTAGTGG GGAATGAATC CTATACATTT GGGGCACAAG CGGTTGTAGT AGGACACAAA
TCAGGGGCAG CCCATCAAGC GGTATCGGTT GGATCGGATA CCTATGCTTA TGGTAATTCG
TCAGTTGCCA TCGGTAATGA CGATTTATCA GTAGGAGGGA TAAATGATAC CCTACCAGAA
ACAACAATGA TAAAAATTTA TGAAAAATTA TATACATTGC CAAATGTGAA AAATGATAAG
GAGTATCATG AAGCACAAGA AAAACATATC CCAGGTGTAT ACTACGCACC TTTAGATGAG
AATAAAAAAA GAAACTATAG TATTGTTAGA ACATCTATGA TTGACAAGGA AAAATTTAGA
GATAAGTACC TACACGCCAG ACAATACTCT CCAACAGCTT CTATGGGATT TGCATCAATT
GCAATTGGGT CAAGATCGGT AGCTTATGAT ACAGGAACAA CTGCAATAGG GTCGCTTTCT
TATGCACTTG GTAAATATTC TACAGCATTA GGATTTCGTT CGTATGTAGA TTTTGATGCT
AATTCAGGAA TGGCACTTGG TAATGAATCT AGGGTGTTTG CTAAAAATTC CATTGCATTA
GGGACGAATG TTGAATCAAC AAATACAGGG GCAATGTCTT ACGGATACAA TGCAAAAGCC
GTAGGTGAGG GTGCTATTGC TATTGGTCAT ACTGTTGCTG CTAATGCTAC ATTAAAAGAG
GATGTATATA ATCATTTAAA ATCAATTTAT GAGGGCGACG ATTTATTAAC AAGACAAGCA
ACAACGCAAT CAAGTGAAGA TAATAAGGTA AGTGAAAAAA TAAAGACAAT AAATGAATAC
CTAGCAACTA AAGGAAGTAA TTTTGATAAA GGTACTAATG ACAATTTATT TGATTATGAA
GATAAGGAAA TTATAAAAAC AACGGTATCA GTTAAAAAAG CTAAAAAGAA AGGTGACAAT
GGTCTTGTTA TAGGGAGATC ATCATTTGCG TTGGGCGATA GAGCCCTTGC AATAGGTACA
GGTGTTGGAG CATTTTCAAA AAATGGTATG GCAATTGGAG ATTTATCATA TATTAAAGAA
AAAGCAGATA ATTCAATAGC GATTGGTACA GGCTCTATGG TTACTGAAAA AAATGCAACT
GCTATTGGAT ATTTCTCAAA AGCCACGATC AACAATTCAA TGGCTTTAGG CTATAAGTCG
GAGACGGATT ATACCGATGA AGAGTTTAAG ATGGCTGCTT ATACGCCAAA AGGGTCATTA
TCTATACCGA GTTCTAAAAA TACTGGGGTA TTTTCTATTG CCAGAAAAGG TGCTGAGAGA
AGAATTACAA ACGTTGCACC TGGGGCAATA GATACTGATG CAGTCAATGT TTCGCAATTA
AAAGCATTGG AAGATAGACT GAGCATCAAT ATTGATGATG AAGATGTAGA ATATAATGAG
CTACACTATA TCTCAATAAA ACCGGAGCAA AAAGATATAG ATGCGAGAAG CATTGAGCTG
AAGAACAAAA GATACATCAA GTATAAAACG GAGCTATTAA AATTAGACGC TAGAGATAAG
TATGATTCTG CGGGAATTGA TAAAAGTGCG GGCAGTCCTT ACGATAAATT GAAAAAAGAA
GTTGAAAAAC TTAAACAAGA GTTAGGGACT GCTAATAAAG CTACTAAATT AGATGCGATT
AATATTAATG CAAAAGGGAA AAATCTAAAT AAATTATCAA AGGAAATAGA TGAAGCCAAA
GAGGGTTCTC TTGCTGAGTT GAAAGCGTAT TTGGAAAAAG TTCAAGATAC GGATACAAAC
TACAATAATA AAGGTGCTAA AACAACAGGT TCAGTTGCTA TTGGTTATAA GGTGAGTGTA
GGAGCGGGGG ATAAACACGC TAACTCGGTC GCCATCGGTA GCAATTTAGC CGTTAAAGGA
AAAGAAAATC AGGTTATTGG GAATAATATA GACGTCGGAC AAAATACAAA CTGGTCTATC
CTGATAGGAA ACAATATAGA GAAGACTGAT GATAAAACGA CAGAAGCCAT CGTTTTAGGC
GATAGAAGCC AAATTGTTAA TGGTGCATTA TCTATTGGTG GATTTAATAA AAAGGGAATA
GACCAAGATA AGGACGCTAG ATTTAAAACA AGAAAAATCC ATTTTGTCAA AGCGGGTGAA
GTTAATCCTA AATCCAATCA AGCGGTAAAC GGTTCGCAGT TAGATCCTGT GTATGAAATA
TTGGGTATCA CTAATATGCA AGTGCAGGCT GATAAAGTGA CAGACGTAGC CAACAAATTA
GGAGAAGGCG GGCATGTAAC TGTTACAAGA CCTACATTTA ACCTAACGTC AGGTGCGAAT
AAATATTCTG GAGTAACAAT AGGAGATGGT GATACAGCAA GACAGAATGT CCATACAATC
GCTTCTGCAT TAGCCTATCT AGATCAAGCC ATCACGGCAG CAAGACCTGT GTATTATACA
AATGGGACAA AAGATTCAAT AGGAACAAAA GTACCTAGTA CAGGAAATAT ATCAAATATT
TCTTTTGGAA AAGAGTTTAA AGTAACTGAA AAAACTAATA ATGGTAATAA CGAAAAATAC
TTATTAGTGG AATTAAACGA AACAGAAATA CAGCAAAATC CAGCATTAAA AGGTCCTAAG
GGAGATAGAG GGCAACAAGG AGCTAGAGGG CTAAAAGGAG AAAGAGGACC TATAGGACCA
GTAGGACCTG CTGGAGAAAG AGGATTACAA GGTGAAAGAG GACCTCAAGG TGAACCAGGA
CCAGAGGGAC CTAAGGGAGA CACTGGAGCA AAAGGTGAAC CAGGCTTGCC AGGAGCACCT
GGAAAAGATG GAAAAGATGG TGAAAAAGGA GATAGAGGAC CGGCGGGACC TCAAGGACCT
AGAGGAGATA AAGGAGAAGC AGGGCCTCAA GGTGAACCAG GACCAGAGGG ACCTAAGGGA
GACACTGGAG CAAAAGGTGA ACCAGGACCT GCTGGAGAAA GAGGACCGGC TGGACCTGAA
GGGAAACCTG GAATTCAAGG ACCACAAGGT GAACCAGGCT TGCCAGGAGC ACCTGGAAAA
GATGCTTTTG AAGTATGGAA ATCGCTTGAT GGAAATTCTG ATAAAAGCAA AGATGACTTT
ATCAATTCGC TGAAAGGTGA AAAAGGTCAA GATGGTACCA ACGGTCAAGA CGGCAAATCA
GCTTATGACA TCTGGAAAGA GAAACCTGAA AATACCAAAA AAACAGAAGA GGAATTCTTG
AAATCCTTAA AAGGTGAAAA AGGTCAAGAT GGAAGAGCTG GAGAAAACGG TAAATCAGCT
TACGAGGTCT GGCAAGAAAA ACCTGAAAAC GCTGGAAAAT CAAAAGAAGA CTTCTTTAAA
GCAATAAAAG GCGATAAAGG TCAAGATGGT ACCAACGGTC AAGACGGCAA ATCAGCTTAT
GACATCTGGA AAGAGAAATC TGAAAATACC GGAAAAACAG AAGAGGAATT CTTAAAATCC
TTAAAAGGTG AAAAAGGTCA AGACGGAAGA GCTGGAGAAA ACGGTAAATC AGCTTACGAG
GTCTGGCAAG AAAAACCTGA AAACGCTGGA AAATCAAAAG AAGACTTCTT TAAAGCAATA
AAAGGCGATA AAGGTCAAGA TGGTACCAAC GGTCAAGACG GCAAATCAGC TTATGACATC
TGGAAAGAGA AACCTGAAAA TACCAAAAAA ACAGAAGAGG AATTCTTAAA ATCCTTAAAA
GGTGAAAAAG GTCAAGACGG AAGAGCTGGA GAAAACGGTA AATCAGCTTA CGAGGTCTGG
CAAGAAAAAC CTGAAAACGC TGGAAAATCA AAAGAAGACT TCTTTAAAGC AATAAAAGGC
GATAAAGGTC AAGATGGTAC CAACGGTCAA GACGGCAAAT CAGCTTATGA CATCTGGAAA
GAGAAACCTG AAAATACCAA AAAAACAGAA GAGGAATTCT TAAAATCCTT AAAAGGTGAA
AAAGGTCAAG ATGGAAGAGC TGGAGAAAAC GGGAAATCTG CCTTTGTAGT GTGGAAGGAA
AACTTGGGTG AAGAAGGTAA GGATAAAACA GAGAATGACT TTATAAATTC ACTTAAAGGA
AAAGATGGAG CACCTGGAAA CACGAAAAAA ATAACAGGTG ATAGCAATAT CAGCGTAAGT
GATAAGGGTA CAACTACAAA TGTATCCCTT AATGAAGATT TAAAAGGAAT TAATTCTATT
GGACGTTATA AAAAAGAAGG GGTTAATAAA ATAACATTCA TCGGTAAAAA TAAAGGCACT
TCTATTGGTA ACAATAGTTT ACGTAATAAT GTAACTATTT CTTCAAATGG AGGAATATTT
GAATTTAATA GAAAAGGGTT GCATATAAAC GATAAAAAAA TTACAGGGGT TGCCGATGGT
GCATTATCTG CCGACTCCAC AGATGCAATC AATGGTGGAC AGCTTGTCAA AGCCACAGGG
GCGAAATTGA TCGATGATCC AAGTTCTACT TCAGATTCTC CAAAACCTAA GATAACAGTA
TTTGCAGATG GTAAGGATGG AAAATCAGGT CTTGAAAATG GTAAAGATCC GATGGCAAAT
AAAGGATTAA CTTCTAAAGA TGGATTAAAT GGAAAGAATG CAAACGACAA AGCTAACGCA
ATACGTGATG GTGAAGCGGG AACAGTCGTA TTTACTGATG ATAAAGGAAA TCGTTTAGTA
AAAGCTAATG ATGGTAAATA CTATAAAAAG TCAGATGTTA ATGATGATGG AACGGTTAAA
AATGAAACTG ATGGTAAAGA TAAACCAAAA CCAGTTGAAA AACCACAACT TTCGTTAGTA
AATCAAGATG GAGAAATAAT TAATCCAATC GTATTAGGTA ATGTTAATTC AGGTTTAGGG
CTTGAAAAAT ATAAAGAGCC TGTTATTGAA GAGGGAACTA GTGAAGAATC TAAAAAACAA
AAAATAGGTG AAGCTAAGAA AGAGCATCAA AAGGCTAAAA AAATTGCTAT TGATAAATTG
TTAGGAAATA ACGATGATGA AAATAGTAAC ATTAAAGATA GTGATTTAAT ACTGAATAAT
GTTGCCAATA TAAGAGATCT GCAAGCATTA GGACAAGCTG GACTGGATTT TGCCGGCAAT
GATGCTGATG AAACGAAAAA AGTTCATAGA AATCTTGGGC AAAAACTGGT AATTAAGGGT
GATCAAAATG CACCTGCTAA ACCTTTTGAA TCTGCAAAAG ATAATATTAA TGTAGCTGTA
GAAGGTGAAG GGTTAGTAGT ACAACTATCT AAAGACTTAA AAAATCTAAC ATCAGCAGAA
TTTACAACTG AAGATGAAAA TAATCCAAAA ACTAAAACAA CTATTAATGG AAAAGGAACA
ACAATAGTTG AATTAGGTAA TGATGGTAAT GAAAATCCAG ATGGCAAAAG AGCAGAATAC
ACAATTGACG GTACTAAGGT TACAGATGGA AAAAATACAT TAGAAACTAA TTCAAAAGGA
ATTAAATTAA AATATAAATC TAATGATCCT AATAATTCAA ATGAAAAAGA AGTGTTTTCT
ATTAACAATG ATGATGGAAC AGTTACTGTT GATTTCAAAA ATACTGGAAA AATAACGGGA
TTAGCAGATC CAACTGAATC AACAGACGCA GCAAATAAAG GGTATGTTGA TGAAAAAGTG
ATAGACTTGG ATAGCAATCG TCCATTTGAC TTCTATATTA AAGATGGAGA TAAAGAAATC
AAAGTCGTGA AAGGACGTAA CGGTAAGTTC TATAAACCTG AAGACTTGAA TGGTGCGAAG
TATGATGAAA GTGGTAAAAA ATACACTAAA AATATCGAAA ATAACCAAAG TGAAGAGGTT
AAATCATCTA TAGAGGATAG ACAAAACGAG GTTGTAATTA AAGCAGAGCC GACAAATAAA
GCTATCGTTG TTACGAATAT TGCCGATGGT AAGTTGGCTG CTGACTCCAA AGATGCGATC
AATGGTGGAC AGCTTGTCAA AGCCACAGGG GCGAAATTGA TCGATGATCC AAGTTCTACT
TCAGATTCTC CAAAACCTAA GATAACAGTA TTTGCAGATG GTAAGGATGG ACTGTCAGGT
CTTGAAAAAA CATCAGACGG CAAAGAATCG ATGGCGGCGA AAGGCTTAAC TGGCAAAGAC
GGTTTAAATG GCAAAAATGC CAACGACAAA GCGAACGCTT TACGAGATGG TGAGGCTGGA
ACTGTGGTGT TTACCGATGA TAAAGGCGAA CGTTTGGTCA AAGCCAAGGA TGGTAAATAT
TATAAAAAAG AAGATATTAA AGATGATGGC ATAACGCCAA AAGACGTCAA AAATGAGGTT
AAAAATCCAC AACTTTCGTT AGTAAACCAC GAGGGCAATA CAACGATACC AACTGTCTTG
GGCAACGTGG CGAGCGGTTT GGGTATTGAC TCGGAGCAAA ATGAAAAAGC GAAGAAAGCG
AAGGATGATA TGAACAACCA AGCCACAATC GTGATAGAAA AAGTGACTGA GATGCTGAAA
AAACGTCAAG ATGTGGATAG CCTACAAACG GCGAAAGAGG CACAACAGTC AGCGATTGAT
GTGTTGAGTA TGTTGCCAGA AACCACAGTT GCGGAAAAAG CAGAGAAAGA AAAACGTCTG
AAAATCGAAA AAGACAAATT GGAGCAAATT GAGAGTAACT TGATAAAAGC TGATAAGGCT
TTGAAGAACG CACAACAAGA GATGACGGAA GCAAAAGAGA AACTTGACGA ACAGAAAATC
AACTACCAAA AAGCGTTGGA AGATGATTCG GTGCATCAGT TGTTGTCAGG TAATTCCTCA
ATTGATGATA AAAAAATTAA GCGAGCTGCC AATCTGCAAG ATTTGAAAGC CTTAGGTCAG
GCAGGGTTAA ATTTTGAGGG CAATGACGGC GTGCTTGTCC ATAAAAATTT AGGCGAGAAA
TTGACCATCA AAGGCGAGGG AACGTTTAAC AGCGACAACA CCGCCGCCGG CAATATTAAA
GTGACTGCTT CTGACAGCAG TATGGAAGTG AAGTTGTCCG ATACCTTGAA AAATATGACC
TCCTTTGAAA CTAAGGAAAC GGCAAAGGGG AATAAATCTC GTCTTGACGG CAACGGATTG
ACAGTGACAG GTAAAAACAA TCAGTCTGCA CATTATGGCT CGGAGGGTAT TACTCTCACA
GACGGTAAGA ACAACGTTAC CTTGACCTCC AGTTCGTTCA CCTTTAAAGA AGGGCAAGCT
GAAAAGATTG TGATTGACGG CAAGGAAGGG GAAATCCGTG TGCCTGATTT AACCTCGAAA
TCATCACCGA ATGCCGTGGC TAATAAACAA TATGTTGATG CCTTGCAGAC ACAGACTGAC
CAAAAATTCA ACCATCTTGA AAATAGGTTT GATGCGTTTA GTAAAGAGTC TCGAGCAGGA
ATTGCCGGTT CGAATGCGGC GGCGGCATTG CCTACGATTT CGATTCCGGG TAAATCACTG
CTTTCAGTCT CTGCCGGTAC GTATAAAGGA CAAAGTGCGG TAGCCGTAGG CTATTCTCGT
GTCAGTGATA ACGGCAAAAT TTTCTTGAAA GTACAAGGCA ACAGCAACTC TATCGGCGAC
TTCGGTGGCG GTGTGGGTAT CGGCTGGGCT TGGTAA
 
Protein sequence
MNKIFKTKYD VTTGETKVVS ELAKNCPAAS GVSCASSVGV GQPKCGVFFG GMLGAFKILP 
LALLISGVLS PLGYAAAEVA AMPKVVEKEK TPAQREEELK NALKKELKQE LKSEIEKLQR
NYERSVSSPL VFNTDVRSAG LPALNYSPRI ANPYNTHLKH EGGQIRVRNS AQEWSDQTTN
KYPTGAHDNW KYDQIDGGRN ALGYDEFSYR NEPGWNYIGT DRYSGATSDY GLYNNNGKSE
YRWSAPNRGA IVLAPYEQVE TNKEEKGVFG RIKKHKNEAV ADKYKNLSDA NLFKAYVDGN
RTRRNIVSIG VDSGTLADKT TLVGNESYTF GAQAVVVGHK SGAAHQAVSV GSDTYAYGNS
SVAIGNDDLS VGGINDTLPE TTMIKIYEKL YTLPNVKNDK EYHEAQEKHI PGVYYAPLDE
NKKRNYSIVR TSMIDKEKFR DKYLHARQYS PTASMGFASI AIGSRSVAYD TGTTAIGSLS
YALGKYSTAL GFRSYVDFDA NSGMALGNES RVFAKNSIAL GTNVESTNTG AMSYGYNAKA
VGEGAIAIGH TVAANATLKE DVYNHLKSIY EGDDLLTRQA TTQSSEDNKV SEKIKTINEY
LATKGSNFDK GTNDNLFDYE DKEIIKTTVS VKKAKKKGDN GLVIGRSSFA LGDRALAIGT
GVGAFSKNGM AIGDLSYIKE KADNSIAIGT GSMVTEKNAT AIGYFSKATI NNSMALGYKS
ETDYTDEEFK MAAYTPKGSL SIPSSKNTGV FSIARKGAER RITNVAPGAI DTDAVNVSQL
KALEDRLSIN IDDEDVEYNE LHYISIKPEQ KDIDARSIEL KNKRYIKYKT ELLKLDARDK
YDSAGIDKSA GSPYDKLKKE VEKLKQELGT ANKATKLDAI NINAKGKNLN KLSKEIDEAK
EGSLAELKAY LEKVQDTDTN YNNKGAKTTG SVAIGYKVSV GAGDKHANSV AIGSNLAVKG
KENQVIGNNI DVGQNTNWSI LIGNNIEKTD DKTTEAIVLG DRSQIVNGAL SIGGFNKKGI
DQDKDARFKT RKIHFVKAGE VNPKSNQAVN GSQLDPVYEI LGITNMQVQA DKVTDVANKL
GEGGHVTVTR PTFNLTSGAN KYSGVTIGDG DTARQNVHTI ASALAYLDQA ITAARPVYYT
NGTKDSIGTK VPSTGNISNI SFGKEFKVTE KTNNGNNEKY LLVELNETEI QQNPALKGPK
GDRGQQGARG LKGERGPIGP VGPAGERGLQ GERGPQGEPG PEGPKGDTGA KGEPGLPGAP
GKDGKDGEKG DRGPAGPQGP RGDKGEAGPQ GEPGPEGPKG DTGAKGEPGP AGERGPAGPE
GKPGIQGPQG EPGLPGAPGK DAFEVWKSLD GNSDKSKDDF INSLKGEKGQ DGTNGQDGKS
AYDIWKEKPE NTKKTEEEFL KSLKGEKGQD GRAGENGKSA YEVWQEKPEN AGKSKEDFFK
AIKGDKGQDG TNGQDGKSAY DIWKEKSENT GKTEEEFLKS LKGEKGQDGR AGENGKSAYE
VWQEKPENAG KSKEDFFKAI KGDKGQDGTN GQDGKSAYDI WKEKPENTKK TEEEFLKSLK
GEKGQDGRAG ENGKSAYEVW QEKPENAGKS KEDFFKAIKG DKGQDGTNGQ DGKSAYDIWK
EKPENTKKTE EEFLKSLKGE KGQDGRAGEN GKSAFVVWKE NLGEEGKDKT ENDFINSLKG
KDGAPGNTKK ITGDSNISVS DKGTTTNVSL NEDLKGINSI GRYKKEGVNK ITFIGKNKGT
SIGNNSLRNN VTISSNGGIF EFNRKGLHIN DKKITGVADG ALSADSTDAI NGGQLVKATG
AKLIDDPSST SDSPKPKITV FADGKDGKSG LENGKDPMAN KGLTSKDGLN GKNANDKANA
IRDGEAGTVV FTDDKGNRLV KANDGKYYKK SDVNDDGTVK NETDGKDKPK PVEKPQLSLV
NQDGEIINPI VLGNVNSGLG LEKYKEPVIE EGTSEESKKQ KIGEAKKEHQ KAKKIAIDKL
LGNNDDENSN IKDSDLILNN VANIRDLQAL GQAGLDFAGN DADETKKVHR NLGQKLVIKG
DQNAPAKPFE SAKDNINVAV EGEGLVVQLS KDLKNLTSAE FTTEDENNPK TKTTINGKGT
TIVELGNDGN ENPDGKRAEY TIDGTKVTDG KNTLETNSKG IKLKYKSNDP NNSNEKEVFS
INNDDGTVTV DFKNTGKITG LADPTESTDA ANKGYVDEKV IDLDSNRPFD FYIKDGDKEI
KVVKGRNGKF YKPEDLNGAK YDESGKKYTK NIENNQSEEV KSSIEDRQNE VVIKAEPTNK
AIVVTNIADG KLAADSKDAI NGGQLVKATG AKLIDDPSST SDSPKPKITV FADGKDGLSG
LEKTSDGKES MAAKGLTGKD GLNGKNANDK ANALRDGEAG TVVFTDDKGE RLVKAKDGKY
YKKEDIKDDG ITPKDVKNEV KNPQLSLVNH EGNTTIPTVL GNVASGLGID SEQNEKAKKA
KDDMNNQATI VIEKVTEMLK KRQDVDSLQT AKEAQQSAID VLSMLPETTV AEKAEKEKRL
KIEKDKLEQI ESNLIKADKA LKNAQQEMTE AKEKLDEQKI NYQKALEDDS VHQLLSGNSS
IDDKKIKRAA NLQDLKALGQ AGLNFEGNDG VLVHKNLGEK LTIKGEGTFN SDNTAAGNIK
VTASDSSMEV KLSDTLKNMT SFETKETAKG NKSRLDGNGL TVTGKNNQSA HYGSEGITLT
DGKNNVTLTS SSFTFKEGQA EKIVIDGKEG EIRVPDLTSK SSPNAVANKQ YVDALQTQTD
QKFNHLENRF DAFSKESRAG IAGSNAAAAL PTISIPGKSL LSVSAGTYKG QSAVAVGYSR
VSDNGKIFLK VQGNSNSIGD FGGGVGIGWA W
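
The sequences above are printed in blocks of 10 characters, so they need the whitespace stripped before any analysis. The following is a minimal sketch of how to clean such a block, compute its GC fraction, and translate it. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It assumes that translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
FIRST_LINE = "ATGAACAAAA TCTTCAAAAC GAAATACGAT GTAACAACCG GTGAAACTAA AGTGGTATCT"
print(translate(clean(FIRST_LINE)))  # MNKIFKTKYDVTTGETKVVS
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean` and `gc_content` is one way to compare against the reported GC content.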