Gene HS_1058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1058 
Symbol 
ID4240556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1163918 
End bp1172638 
Gene Length8721 bp 
Protein Length2906 aa 
Translation table11 
GC content41% 
IMG OID638104619 
Productlarge adhesin 
Protein accessionYP_719270 
Protein GI113461201 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[S] Function unknown 
COG ID[COG1196] Chromosome segregation ATPases
[COG4372] Uncharacterized protein conserved in bacteria with the myosin-like domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA TCTTCAAAAC TAAATACGAT GTAACAACGG GTCAAACGAA AGTAGTGTCT 
GAATTAGCGA ATAACCGTCA AGTGGCGAGC CGTGTTGAGG GGGCGGAAAG TCAGCCGAAG
TGCGGTGTGT TTTTCGGCGG AATGTTAGGG GCGTTTAAGG TCCTGCCGTT GGCGTTAGTG
ATGGCGGGTA TTTTGGGTGT AAACAATGTA GCTTATGGGG ATTATAAAGA AATAAAAATT
GATGAAGATA GCGGTCAAGT AGGGACTGAA GACCTAGAAA AAGATGAAAA TGTGGTTTTA
GCAAAGAAAT CTTCACTTGA GACAAATAAT GGCAGACAAA CTAAAAAAAC TGTTGTAATT
GGAGCAGGAG ATGTTCAGGC TAAGTATACT AACCAATTTA ATTGGACATT TGGTGTTGAA
AAAGGTGTTG TTATAGGATA TGCAGCTAAG GCAAGAGCAA ATGAATCAAT AGCAATAGGT
GCTGAAGCAG TAGCAGGTGA TAATGAAAAC GCTAAAGGAT CAACAGCAGT TGGGTATCTT
GCTAAAGCAA AGGGGAGTGA TTCTGTAGCT GTTGGTAAAA ATGCTGAATC GGTATCTAAA
GGTACATCAA TAGGTGCTAG TAGTAAAGTT ATAGGTGAAC AAGGAACGGC ATTAGGAAAT
AACGTTTATG CTGCAGGACA AGGGACAGCA CTAGGATCGG ATGTTATGGC TGCTGGATAT
GGATCTATCG CCATAGGTAA TGATGATATA GTTAAAAATG GATATACGGA TAAACTACCG
GAAGATACTA TTTTTCTAAT TTATGGTTAT AAAGATAATC ATAATTATGG GGATGATAAT
AAAACTTATA AAGATATTTT AACTAAAGAA GCATTTCAAA GAAAATATGT AAAAGATGGA
AATAAGGATA ATAGAATATA TTCTCCAACA TACGCAGCAG GATTAGGATC AATAGCATTA
GGATCAAGAA CAGTTGCTTT CGGACAAACA TCTCTTGCAG TAGGAGCATT ATCATTTGCT
CTAGCGGACG ATTCAACAGC ACTTGGAGTT CGTGCGTTTG TTGATACAAA AGCAAAAGGC
GGAGTTGCCA TAGGAGATGA ATCAAGGGTA TTTGCTGAAA ACTCATTTGC GATAGGAAAC
AGAGCAGAAT CTACATCTCA GGGATCACTA TCATTTGGAT CAAATGCAAA AGCAGTAGGA
GTAGGTTCAA TTGCGATAGG ACCGAGTGTT GCATCAAATG CTAAGTTGGC AGGAACTGGT
GATGAAGATT TTGCAAGATA TATTATGGAA AAAACAAAAA CTGCTAGTCC AACTGTAACA
ATATCTGGGG CTACTGCGAA TATTGAATAT AGTTCTGAAA CTGATAAAAA ACTAAAATTT
GGTAACCATA GTCATTTAGT TTCAAAAGGA TTTGGTTCAC AAAACTTTAA AAATGGTCAA
AAAAAAGGTA ATCCAGTTAC TGATATAATA AATGAAGTAA TTAAATTAAA AGAAGATAAG
AATAATCCAA TAAAATATTC TTATGATACA AACAATGAAA AATTAGAAAC AACTGAAAAA
ATATATACGA CTGAAAAAGA GGGGGAACAC GCAATATCCC TAGGATATCA CCTATCAAAC
AACGGAGATA ATACAATTAC AATAGGTAGT GCTTCTGTAG TAAGAGGAGC AAACTCGGTA
GTTTTAGGAG CATTAAATAA TATAGGTAAA TATGCAACAA ATACAATAGC ACTAGGTATT
GGTACAAATG TATTTAAAGA AAACTCTGTT GCAATTGGTA CAGGAGTGAA CGTTGCAGGA
GCAGGTGTAG TTGCTATTGG GTCGGGTGTT GGTGTAACGC AAGACAATAC TATCGCTGTA
GGATATGGAG CATATGGACT ATCAGCCGAA AGTATCGTTT TAGGAAATGC TGCGGGCTTG
GAAGAAAATG CAAAAAAATC AATAGTAATA GGTAATAAAG CGAAGGTAAT AAACAAAAAA
GCTGAAGAGG CGATAGCAGC TGCAGCAGCA GGTCAACCTA CAACAAGAAC AACTTCAACA
AAAATAAATC AAGATATGTC AGCAATCTCA ATAGGGACAG GTTCTTATGT GTATTCTGAA
AAAGGTATGG CTTTTGGTGA TGCGGCGAAA GTTGAAAACA ATGCAGAAAA TGCAATCGCT
TTCGGGACAA GTTCAAAGGC TACTAAAGTA AGTTCAATCG CTTTAGGTAA TGAGGCGCAT
TCTACTATGT TAAACTCGGT AGCATTAGGG TATAAATCAA GAACAGATTA TGCACATTTG
GATACAGCAC CTTATTCACC AAAAGGAGCA TTAACTATTC CTACATCAAG TAATGTTGGG
TTAATTTCTG TTGGAGCTCA AAACTATACG AGAAGAATAA TAAACGTTGC AGCGGGATCA
CAAGATACAG ATGCGGTAAA CGTGTCGCAA TTAAAGGGAT TGGAAGAAAA GGTTGAATTG
CTATCTGGTA GTATAGGAGA TAATAGTACC CCTTATTTTG GTGTGGAACA AACTAATAAT
AGCAGTGAGG CTAAAACTAT TAGTGATGGT ATAAATAAAC AAAAAAATTA TGACAGATAT
GTTTATTTGG CTGGAGAATA TGCTAGTTTG TTAAACCGTC AAAAAAATGG TGGAGAAGTA
TTTAATGAGA AGTCATTACA AGAAATACAA GAAGAAATTG ATAAAATAGG AACACCTGAA
ATAAAACGTA AGGCTAATCA AATAACAAGA GTTTTAAATG AGTTAAAGAG TGCTCCTCAA
AATAAAAATT CTATGCAACT AACTCAAAAA TCAAATCAAA TAGAGCGAGC TATTACGACA
GATAAAGTTA AAAAAGTTAA TGCTATATCA GAAGATGTAG TTAAAAAATC AAATTATAAA
GGAAACTATG CTGAGGGAGT AGATTCGATA GCAATAGGAT ACGGAGCACA TACAACCTCT
ACTGCAAATC ATGCTGTGGC TTTAGGATAT AAGGCTGAAG CACAAAAAGC TGATTCGATA
GCATTAGGAA GTAATTCAGT GGCTGATACA GCAGGAGATA TTACAGGATT TGATACGGTT
ACTGGAGAAG TAAAAAAAAC TGCAGATTAT GGTGCTTGGA AATCTAAGTA TGCCGCACTA
TCTATTGGAA AAGTGAGTGG GAATGATAAA ATTACAAGAC AAATTACAGG AGTGGCAGCA
GGGTCAGAGG ATACGGACGC TGTAAACTTG GCACAACTTA AAGAAGCAAC GTTGCATTTT
GTGTCTGTTA ATGGTGGAAG TAACACAGAC AAAAACTATG CTAATAATGG TGCGATTGGA
AAAAATTCTA TTGCAATAGG TGTAGGTGTA CAATCAAAAT CTGAAAATTC TATAGTAATG
GGTAATAATT CAACGATTGA AACTGATATA TCGAAATCAA TATCCATTGG GGTGAATAAT
CACATAAGAG CAAAGAAAGG GAATGCTAGA GAAGACCTTA CGAATACTGT TGCTCTTGGT
TCAGACAATA TCATCACTGG ACGAAAAGTA GTGAATTTGG GTTCGGGGAA TAAGATAGGT
AATACTGGAA ACGATTATCA AACTGACAAA CACGCAGGTG CTGTAAGCAT AAGATTAATC
GGGGACGATA ATATCGCACA TGGAGTATGG AATACAGTTA TTGGTGAAGG GAATAAGCTA
GAGTCATCAA GCTGGGCTCA AGTGATGGGG GATTACAATG CCGTCACGAA GTCAGACTAT
GCCATCGTTA TTAGTAGTAA TAAAGCAAAT AAAAAAGAAC CAATAACAAC AGATACCGTA
ACGGATTCAA AATATGCTAT CGTTATTGGT AATAACGCAA AAGCCACAAG TGCAGAAAAA
GGTGTCGTGA TTGGACAAGG GGCTTCAGTA ACCGTTGCAA ATAGTGTTGC AATAGGAAGT
GGTTCAAAAA CTAGTGGGGA TATTACTACT ATGGGATATG ATCCTTCGAC AAATACTACG
TATAGTTCTA CAAATACAGA CGATGATTAT AAATGGAAGC CAACCGCAGG AGAGTTTGCA
GTTGGAGATA CAACATATAC TGTTGAACAA GGTAAAGGTA AAAATAAACA GACTGTAAAA
AAACCTCAAA CAAGAAGAAT CACTGGTGTA GCAGCAGGTA TTGAGGATAC AGATGTAGTT
AACGTTGCAC AACTTAAAAA GATTCTTAGT GATGCTTTAG CAGCTAAACC AAATAATAAT
GCAGGAGCCA ACCCACAAAC GCAAGCGGAA GTGGAAGCAG CGAAACAAGG GGCTGTAGAT
GCTAAGAATG AAGCTGTAAC TGCTAAGGAT GAAGCAGAGG CTGCAAAACA AGGAGCTGAA
ACTGCTAAGA ATCAAGCGGA AAGTTTTGCG ACACAAGCAG ATACTGCAAA ACAAGGAGCA
GAAACTGCTA AGAATCAAGC GGAAAGTTTT GCGACACAAG CAGATACTGC AAAACAAGGA
GCTGAAACTG CTAAGAATCA AGCGGAAGAT GCTAAAACGG CTGCGTTAGA TGCAAAACAA
GGTGCTGAAA ATGCTAAGAA TCAAGCGGAA ACTTTTGCAA CACAAGCAAA TACTGCGAAA
CAAGGTGCGT TAGAAGCTAA GGATAAAGCA GAACAAGCTA AGGCTGATGC GGTGGCTGCA
AAATTAGGAG CAGATGCTGC ACAGACACTT GCGGTAAATG CTAAGGATAA AGCGTTAGAA
GCACAAGGAA AAGCAGAAGC TGCACAGGCA GCGGCTCAAA ATTCAGCATC ACAAGCACAG
ACAGCCCAAA ATAAAGCAGA GGCTGCAAAA CTAGGAGCAG AGCAAGCTAA GGCTCAAGCA
GATGCTGCTA AGAATCAAGC AGTGTTAGCA CAAGCAGGAG CAGAACAAGC AAAACAAGAT
GCGTTAGATG CACAAGGAAA AGCAGAGCAA GCAAAACAAG GTGCAGATGC TGCGAAAGAT
GAGGCGGTAG CGGCAAAAAC TCAAGCGGAA ACTTTTGCAA CACAAGCAAA TACTGCGAAA
CAAGGTGCGT TAGAAGCTAA GGATAAAGCA GAACAAGCTA AGGGTGAAGC AGAGGCTGCA
AAATTAGGAG CTGAGCAAGC ACAGACAGTT GCGGTAGATG CTAAGAATAA AGCGTTAGAA
GCACAAGGAA AAGCAGAAGC TGCACAGGCA GCGGCTCAAA ATTCAGCATC ACAAGCACAG
ACAGCCCAAA ATAAAGCAGA ACAAGCACAA GTAGCGGCGG TAGCGGCACA GCAAGGAGCA
GATACTGCGA AAACACAAGC AGAAGCGGCA AGAAATGAGG CTGTAACTGC TAAGAATCAA
GCGGAAGATG CTAAAACGGC TGCGTTAGAA GCACAAAATA AAGCAGAGGC TGCAAAACTA
GGAGCAGAGC AAGCTAAGGC TCAAGCAGAT GTTGCTAAGA ATCAAGCGGA AAGTGCAAGA
GATGAAGCGG TAGTGGCAAA AACTCAAGCG GAAGAATTTG CAACAAAAGC ACAGACAGCA
CAGGCAGCGG CTGTAAATGC ACAACAAGGA GCTGTAGAAG CTAAGAATAA GGCAGAACAA
GCTAAGTCTC AAGCGGAAAC TTTTGCGACA CAAGCAAATG CTGCGAAAGA TGCGGCGTTA
GAAGCCCAAG CAGGAGCAAA TAATGCGAAA CAAGAAGCAG AAGCTGCAAG ATATGAGGCT
GTAATGGCTA AGGATGATGC TGTAGCCGCA AAACGAGGAG CAGAAGCTGC ACAGACAGCG
GCTCAAGGTT CAGCATCCCA AGCTGAAGCT GCTAAAACAA AAGCGGAAGA ATTTGCAACA
AAAGCAGAAC AAGCTAAGGG TGAAGCAGAG GCTGCAAAAT TAGGAGCTGA GCAAGCACAG
ACAGTTGCGG TAGATGCTAA GAATAAAGCG TTAGAAGCAC AAGGAAAAGC AGAGCAAGCC
CAAAATAAAG CGGAAGAAGC ACAAGGAAAA GCAGAAGCTG CGAAAGATGA GGCGGTAGCG
GCACAACAAG GAGCTGTAAC TGCTAAGAAT CAAGCCGAAA CAGCAAGAGA TGGAGCAGTA
GATGCTAAGA ATAAGGCAGA ACAAGCTAAG TCTCAAGCGG AAACTTTTGC GACACAAGCA
AATGCTGCGA AACAAGATGC TGTAACTGCT AAGAATCAAG CGGAAAGTGC AAGAAATGAA
GCAAATGCTG CTAAAACGGC TGCGTTAGAT GCAAAACAAG GTGCTGAAAA TGCTAAGAAT
CAAGCGGAAA CTTTTGCAAC ACAAGCAAAT ACTGCGAAAC AAGGTGCGTT AGAAGCTAAG
GATAAAGCAG AACAAGCTAA GGCTGATGCG GTGGCTGCAA AATTAGGAGC AGATGCTGCA
CAGACACTTG CGGTAAATGC TAAGGATAAA GCGTTAGAAG CACAAGGAAA AGCAGAAGCT
GCACAGGCAG CGGCTCAAAA TTCAGCATCA CAAGCACAGA CAGCCCAAAA TAAAGCAGAA
CAAGCACAAG CAGCGGCGGT AGCGGCACAG CAAGGAGCAG ATACTGCGAA AACACAAGCA
GAAGCGGCAA GAAATGAGGC TGTAACTGCT AAGAATCAAG CGGAAGATGC TAAAACGGCT
GCGTTAGAAG CACAAAATAA AGCAGAGGCT GCAAAACTAG GAGCAGAGCA AGCTAAGGCT
CAAGCAGATG TTGCTAAGAA TCAAGCGGAA AGTGCAAGAG ATGAAGCGGT GGCTGCACAG
ACAGCAGCTC AAGGTTTAGC ATCACAAGCT GAAGCTGCGA AAACACAAGC AGAAACAGCA
AGAGATGGAG CAGTAGATGC TAAGAATAAG GCAGAACAAG CTAAGTCTCA AGCGGAAACT
TTTGCGACAC AAGCAAATGC TGCGAAAGAT GCGGCGTTAG AAGCCCAAGC AGGAGCAAAT
GCTGCACAAC AAGGTGCGGA AAGTGCTAAG GATGATGCTG TGGCTGCACA GAAAGCCTCA
GAAGATGCAA GAGATAAAGC AGAAGGATTT GCGACAAAAG CAGATGACGC TAAAACTAAA
GCGGTAGCGG CACAAGGAAA AGCAGAAGAT GCTCAAAATA AAGCAGAAGC GGCACAGGCA
GCGGCAGAAG ATGCTAAAAA TAAGGCAAAT CAAGCTAAGG ATGACGCGGT GGCTGCAAAA
TTAGGAGCAG AAGCTGCACA GACAGTTGCG GTAGATGCTA AGAATAAAGC GTTAGAAGCA
CAAGGAAAAG CAGAGCAAGC CCAAAATAAA GCGGAAGAAG CACAAGGAAA AGCAGAAGCT
GCGAAAGATG AGTCGGTAGC GGCAAGAGAT AAAGCATTAG AAGCACAGAA AGCATCAGAA
GTTGCAAGAG ATGAAGCAGA AAAAATACTT TCTAAAACGG AAGATATTGC AGCGAATAAT
CCATTTGAAT ATTACACAAA AGATGGTAAA GATAAGGTTA TAAAAGGAAG AGATGGTAAC
TTATATAAAG AGGATGAATT AGCAAACTAT CAATATGATA AAACTCAGAA GAAATATGTT
GCTAAGGATC CTTCTACTAA GGAAGACCTA AAATCATTAG AAGCAAAAGA TGTTTTAATA
AAAGCTGTAC CGAATACTAT TCCGATGGAA ATAACAAATG TGGCAAGCGG TTTGGGTATC
ACAACACCAA CAGACGATGA AAAAACACAA CTTAATAAAC TTGCTGAGAA AGTGAGTGAA
AAAGTGACGG CATTAGGCAA AAAAACAAAA GATTTTTCTG AAAAAGCAGA AAAATTTGCT
GACTTAGAGT TAATGGTAGA CAGTTTAAAA CAAACACTGG ATACCATGCC TGATGGAGAA
GCGAAAGAGA AAATTCGTGA GAATTTGGAA AAATACAAAA CGCAATTGTC AGATGCTCAA
GAGGCTAAAA AGAATGCTAA AGAAGCCGTA GAATCGGCAC GCAACGAGTT AATTGAAGCA
AATGGGGACT ATAACGCCTT TTCCGAGGCG ATGGCAAAAG TCGAAGAACT GGTTGAACCG
GATAGCCAAG CCGAGTTAAG CAATGTCGCT ACTATTGGCG ATTTGCAAGC GGTGGCAAAA
TCCGGCTTGA AGTTTAAAGG CAATGACGGT GTTGAAGTGC GAAAACAACT CAGCGAAACC
TTAGAAATTA AAGGTGAAGG TGAGTTTAAC AGCGACCGCA CTGCCACCGG CAATATCAAA
GTGGAAATGG CACAAGACGG CAATGGCTTA GAAGTGAAAT TGTCTGATCA ATTGAAAAAT
ATGACGTCCT TTGAAACCCG AAAAGTTGAC GGCAAACAAT CCGCTTTGGA CAGCAATGGT
TTGAAAGTTA GCAATAGCAA AACAGAAGAG CGTTCCCAAT TAAGCGAAAA TCGCTTGGCG
TTCTATGAAA ATGATAAGTT AGGGCTGAAT TTAGACGGTA AGTCTCGTGC GTTGAAAGTC
GGTGAAAAAG CGATTATTAG CATCAACGAC AAAAATGAGG CTTTAGTTGA AGATCTCAAC
GCCTCCAGTT CAAGTAAAGC AATTGCTAAT AAAAACTATG TGGACACCAA AAACAATGAA
CTACGGACAC AATTGAATAC TACTGACCGA AATTTACGTG CGGGTATTGC AGGAGCGTTG
GCGGCGGCTG GATTGCCGAT GTCTTCTGTG CCGGGTAAAT CCATGTTTGC TGCTTCGGCA
GGTTCCTATA AAGGACAAAG TGCGGTGGCG TTAGGTTATT CACGAGTAAG TGATAATGGT
AAAATAACGC TACGATTGCA AGGTACTCGC AGTTCAACCG GCGATGTCGG CGGTTCGGTT
GGTGTGGGTT ATCAGTGGTA G
 
Protein sequence
MNKIFKTKYD VTTGQTKVVS ELANNRQVAS RVEGAESQPK CGVFFGGMLG AFKVLPLALV 
MAGILGVNNV AYGDYKEIKI DEDSGQVGTE DLEKDENVVL AKKSSLETNN GRQTKKTVVI
GAGDVQAKYT NQFNWTFGVE KGVVIGYAAK ARANESIAIG AEAVAGDNEN AKGSTAVGYL
AKAKGSDSVA VGKNAESVSK GTSIGASSKV IGEQGTALGN NVYAAGQGTA LGSDVMAAGY
GSIAIGNDDI VKNGYTDKLP EDTIFLIYGY KDNHNYGDDN KTYKDILTKE AFQRKYVKDG
NKDNRIYSPT YAAGLGSIAL GSRTVAFGQT SLAVGALSFA LADDSTALGV RAFVDTKAKG
GVAIGDESRV FAENSFAIGN RAESTSQGSL SFGSNAKAVG VGSIAIGPSV ASNAKLAGTG
DEDFARYIME KTKTASPTVT ISGATANIEY SSETDKKLKF GNHSHLVSKG FGSQNFKNGQ
KKGNPVTDII NEVIKLKEDK NNPIKYSYDT NNEKLETTEK IYTTEKEGEH AISLGYHLSN
NGDNTITIGS ASVVRGANSV VLGALNNIGK YATNTIALGI GTNVFKENSV AIGTGVNVAG
AGVVAIGSGV GVTQDNTIAV GYGAYGLSAE SIVLGNAAGL EENAKKSIVI GNKAKVINKK
AEEAIAAAAA GQPTTRTTST KINQDMSAIS IGTGSYVYSE KGMAFGDAAK VENNAENAIA
FGTSSKATKV SSIALGNEAH STMLNSVALG YKSRTDYAHL DTAPYSPKGA LTIPTSSNVG
LISVGAQNYT RRIINVAAGS QDTDAVNVSQ LKGLEEKVEL LSGSIGDNST PYFGVEQTNN
SSEAKTISDG INKQKNYDRY VYLAGEYASL LNRQKNGGEV FNEKSLQEIQ EEIDKIGTPE
IKRKANQITR VLNELKSAPQ NKNSMQLTQK SNQIERAITT DKVKKVNAIS EDVVKKSNYK
GNYAEGVDSI AIGYGAHTTS TANHAVALGY KAEAQKADSI ALGSNSVADT AGDITGFDTV
TGEVKKTADY GAWKSKYAAL SIGKVSGNDK ITRQITGVAA GSEDTDAVNL AQLKEATLHF
VSVNGGSNTD KNYANNGAIG KNSIAIGVGV QSKSENSIVM GNNSTIETDI SKSISIGVNN
HIRAKKGNAR EDLTNTVALG SDNIITGRKV VNLGSGNKIG NTGNDYQTDK HAGAVSIRLI
GDDNIAHGVW NTVIGEGNKL ESSSWAQVMG DYNAVTKSDY AIVISSNKAN KKEPITTDTV
TDSKYAIVIG NNAKATSAEK GVVIGQGASV TVANSVAIGS GSKTSGDITT MGYDPSTNTT
YSSTNTDDDY KWKPTAGEFA VGDTTYTVEQ GKGKNKQTVK KPQTRRITGV AAGIEDTDVV
NVAQLKKILS DALAAKPNNN AGANPQTQAE VEAAKQGAVD AKNEAVTAKD EAEAAKQGAE
TAKNQAESFA TQADTAKQGA ETAKNQAESF ATQADTAKQG AETAKNQAED AKTAALDAKQ
GAENAKNQAE TFATQANTAK QGALEAKDKA EQAKADAVAA KLGADAAQTL AVNAKDKALE
AQGKAEAAQA AAQNSASQAQ TAQNKAEAAK LGAEQAKAQA DAAKNQAVLA QAGAEQAKQD
ALDAQGKAEQ AKQGADAAKD EAVAAKTQAE TFATQANTAK QGALEAKDKA EQAKGEAEAA
KLGAEQAQTV AVDAKNKALE AQGKAEAAQA AAQNSASQAQ TAQNKAEQAQ VAAVAAQQGA
DTAKTQAEAA RNEAVTAKNQ AEDAKTAALE AQNKAEAAKL GAEQAKAQAD VAKNQAESAR
DEAVVAKTQA EEFATKAQTA QAAAVNAQQG AVEAKNKAEQ AKSQAETFAT QANAAKDAAL
EAQAGANNAK QEAEAARYEA VMAKDDAVAA KRGAEAAQTA AQGSASQAEA AKTKAEEFAT
KAEQAKGEAE AAKLGAEQAQ TVAVDAKNKA LEAQGKAEQA QNKAEEAQGK AEAAKDEAVA
AQQGAVTAKN QAETARDGAV DAKNKAEQAK SQAETFATQA NAAKQDAVTA KNQAESARNE
ANAAKTAALD AKQGAENAKN QAETFATQAN TAKQGALEAK DKAEQAKADA VAAKLGADAA
QTLAVNAKDK ALEAQGKAEA AQAAAQNSAS QAQTAQNKAE QAQAAAVAAQ QGADTAKTQA
EAARNEAVTA KNQAEDAKTA ALEAQNKAEA AKLGAEQAKA QADVAKNQAE SARDEAVAAQ
TAAQGLASQA EAAKTQAETA RDGAVDAKNK AEQAKSQAET FATQANAAKD AALEAQAGAN
AAQQGAESAK DDAVAAQKAS EDARDKAEGF ATKADDAKTK AVAAQGKAED AQNKAEAAQA
AAEDAKNKAN QAKDDAVAAK LGAEAAQTVA VDAKNKALEA QGKAEQAQNK AEEAQGKAEA
AKDESVAARD KALEAQKASE VARDEAEKIL SKTEDIAANN PFEYYTKDGK DKVIKGRDGN
LYKEDELANY QYDKTQKKYV AKDPSTKEDL KSLEAKDVLI KAVPNTIPME ITNVASGLGI
TTPTDDEKTQ LNKLAEKVSE KVTALGKKTK DFSEKAEKFA DLELMVDSLK QTLDTMPDGE
AKEKIRENLE KYKTQLSDAQ EAKKNAKEAV ESARNELIEA NGDYNAFSEA MAKVEELVEP
DSQAELSNVA TIGDLQAVAK SGLKFKGNDG VEVRKQLSET LEIKGEGEFN SDRTATGNIK
VEMAQDGNGL EVKLSDQLKN MTSFETRKVD GKQSALDSNG LKVSNSKTEE RSQLSENRLA
FYENDKLGLN LDGKSRALKV GEKAIISIND KNEALVEDLN ASSSSKAIAN KNYVDTKNNE
LRTQLNTTDR NLRAGIAGAL AAAGLPMSSV PGKSMFAASA GSYKGQSAVA LGYSRVSDNG
KITLRLQGTR SSTGDVGGSV GVGYQW