Gene HS_1632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1632 
Symbol 
ID4241159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1858163 
End bp1866907 
Gene Length8745 bp 
Protein Length2914 aa 
Translation table11 
GC content41% 
IMG OID638105218 
Productlarge adhesin 
Protein accessionYP_719837 
Protein GI113461768 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA TCTTCAAAAC AAAATACGAT GTAACAACAG GTCAAACTAA AGTAGTGTCT 
GAATTAGCGA ATAACCGTCA AGTGGCGAGC CGTGTCGAGG GAGCGTCGGA GAGTCAGCCG
AAGTGCGGTG TGTTTTTCGG CGGTATGTTA GGGGCGTTTA AGGTTCTGCC GTTGGCGTTG
GTGATGAGTG GGGTGTTGTC GAGCGTTGGG TATGGTTATG ATTTGTGGCT AGACGGTCCA
AAAGACCCAG GTCAATTTAA TAATAGTGAA TTAAATGAAG GAACAAAAGT GTGGTTTGGC
TACCGGACTA ATAAAAAACA TACCGTTCAA AATGAATCAA CAATATTAAC AAGTACAATG
AATAAAACAG GGGCTAATGC TTCATTTAGA AATAGGGACT TTGACCAAAC AGTAGCTATT
GGATCACGTG CGGTTGCAGG AGGTAATAGT TCAACAGCAA TAGGATTTCA AGCTATTACT
GGAGATAATG TGAAAAGGGA TACTGCTGCA CAACATAACC GTGAAGGTGT TGCTATAGGA
TATAAATCAT TTGCAAGAGG AAAGGAAGCA ACGGCAATGG GAAATGATGT TGTTGCTTGG
GGTGATTCAA GTATAGCTAT AGGGGCTGAT AATGTTTCTG AACCAAAATC GAATAAAAAA
GAAGAGGATC TAAGAAAACC ACTTAGTAGA GACATATTTA GGCTATTCTA TAACGCAAGG
TCTGATTTTA ATGATACAGG TAGTTACGCT GCAGTTCATT TTGCTAAGGC TGGAGAGACT
GTTAACGTCG GTAATGGTGT CCAAAGACCA AAAGAAGTAA TGGCTGATGC TGATGGAATA
TTGTATAGTG AAAATTTTAC TAATTATAAT CATCCTAATG GTGTTCAACT TTCAAGAAAA
GATGGAACTA TATTTATAAG AAAAAATCCT AAAAGCGCTG ATTATGGATA TGGTGATGGA
TACTATACTT ACCAAAATGG GGATTTTTAT GAAAGTAAGG GCTTATATTT GGATGAAGTA
AAAGATGAAG CTGAAATAGA AAAAATAAAA GCATTATCAG ACTGGAAAAG ATATGAAGCA
CATAGAGCAG AGGTAAATAA AGCTTACGCT GAATATATTA TAGACCCAAG AAGATTTAAA
ACTCATACTT GGGCAAGAGG TAACAATGCA ATAGCATTAG GAGCTAGATC AATAGCTTAT
GGAGATTATT CAACAGCATT AGGTACATTA GCTATAGCAA ATAGAGATTA TTCAACAGCA
TTAGGAACAA ATACGATAGC ATTTGGGAAA AAATCATTAG CTATTGGTAA TGAATCGTAT
GTTTATGAAG ATGAATCAGT AGGTGTTGGT AATAATGTAC AAGCTCTATA TGTTGGGTCA
ATGGCATATG GTAGACATGC TTATTCAGGA GGAAGAGGGT CATTAGCTAT TGGATCAAAT
GTTTTTTCAA ATGTTGTTAT GGATGATACA AAACTACCTA ATTTAGAAAA AATATACGAG
GGAGAATCTA ATGACAAAAC ATTAAATGAT TTATTAAATT TAAGTACTCA ATTAAATTAT
GTACCTAAAT TTGAGGACCA GTATGGATCA GGAGAATCAA AGGCTATAAG AGATACAGAT
AAAGATGGTA ACTATAATAA TGGTTCAATA GCATTGGGTT CTTATGCAAT AGCAAACGGA
GATAACTCAT TAGCAATGGG TAGATTTGCG TACGCAAAAA ATGATAAGGC TTTTGCAATA
GGTCAAGCAG CTTATGCAAA AGGTAATAAA AGTTTTGCTA TAGGTTATGG ATCAAAATCA
CTAGAGGATG ACTCAATTTC ATTTGGATCA TTATCAAGAG TTGGAGCTCC TTCATCTATG
GCTTTAGGTA TAGAAGCAAA AGTTTTAAAA GATATTCCTG ATACACTTTT GAGTGAAGAG
AATAAAAAAA ATAGCAAATA CAAAAAAGAT TCTGTTAAAA ACGCAATGGC GATAGGAAAC
TATGCAGAAG CATATTTCCC AGAATCAATA GCATTGGGTG TAAATGCAAA AACTGATTAC
ACTCAAACGG AAATGAAACA AGATGCGTGG GCACCAAAAC ATGCCATATC ATTCCCATCA
TCAGAAAAAA TAGGATATTT ATCAGTTGGT GGAAAGAATG CCGAAAGACG TATAGTTAAT
GTGGCACCTG GTTCGAGCGA TACGGATGCT GTAAACGTGT CACAATTAAG GGCATTAGAA
GAAGCTATTC TTTATGGAAA TACATTAGAC GATGAGAGTG ATATTAATTC CGGAGTTAAG
TATTTATCAG TTAAAGGTTT AGATGACTTA AAAACATTAG TAACAAAAAA ATATGATTAT
GAGAGTTATA CTAAATTAAA GAAAGAATAT TTAAAATTAA AATTAAGAAA AGTTATAAAT
GAAGAAACTA TTAATTTAAC TAAATACGAG GAAAGACTAA AAAAATATCA GGAAAAATAT
GGAGATTTTC AAAATGCTGC ATCTACATTA AAAGAGCTAG ATGAAAGAAT GAGTAGAAAG
AATTTTGGAC CAGAAACAAC AAATACAATG TCTGATAAGG AAAAAGAGGA TTTAAGAAAA
GATTGGTATA ATTCACTTTT CAAAGAAATT GAAACAGCAC AAGTTCGAGA CACATCAGAA
GAAAATATTA AGAATTTAAT ATCTGAAGAA AATGAAGCAA AAATAAAATC ATCTAATTTC
TTTAGTGATG GAGCAAAAAA ATCAGGTTCG ATTGCAATAG GAGTTGGGGC TTTAGCTAAT
TCTAGTGAAT CTATCGTTAT TGGTAGAAAT GCAAAGATAG AAAATGATAA GGCTCAAAAT
ACAGTACTAC TTGGTAATAA TACATCATCT GCTACTGCCA ACGCCGTCGC ATTGGGTAAC
TATTCGGTTG CAGATAGAAA CCCTGAACCG GCAAAAAATT TAACTCCAGA ACTAAGATCT
AATGCTTATA TTTCTAACGA ATTGGCAAAC TTAATAGATG GACATCAGGT GTATGCGGCA
GTGTCTGTTG GTCGTTATGG CGGAGATTTG GAAGATCTTA AGAACAATAC TGATGCAAAA
GCAAAGGACA ACCAAAAAAA ATATGAAAAA TGGGTAAAAG ACAATGTTAC TCTCAAAGAG
ACAGAACCTG ATGAATATGA GAAACAAAGG CTCGAACAAG CAGCGAAGGT TAAAAAATTC
GCCTTAAGAA AAATTACCAA CGTCGCACCA GGTACAAAAG ACACAGACGT TGTTATCCTT
GCTCAATTAA AAGAGGGTGT AAAACAAGCA AGAGGGCATT TTGTGTCTAT AAATAGTACG
GATTCAAAAG CTGGAAATTA CGATAATAAA GGGGCAACTG GAACTAATGC TATAGCTTTA
GGTGTTAATG CAAGTGCAAC TGGGAATTAT GCTATAGCTA TAGGTAATGC TAAATCTTTA
GGTGCTAAAA ACATAGTAAT TGGAGATGAT TCAACAACTA AAGGTACTGC CGAAAATTCT
AAAGCATCAG CATGGGCAAT AGTTATAGGT AATAAATCAA ATGCGTATAA CAACACTAAA
AGTATAAGTG ATGTAGTTGT TTTAGGAGGA AATGCTAATG CGAGTGAAAC AGGAGCAGTA
GCTATAGGTA ATAGAGCGAA TGTAAGCGGT GAAAGAGGTG TTGCGATAGG GTCTGGTGGA
AGTAAGGATA ATCAAGCAGC TAGAGTAACA ATAAGCGATG GAATAGCGCT TGGGTCTTAT
TCTATTTCAA ATAGGATGTT AACTGATGAC AAGTCAAAAG GTTATGATCC GTTAACAGAT
ACTTATAGCA CATCAAGTGA TATGAAATGG AAAGCTAATT TAGGAGCGCT TTCTATAGGA
GATACAACAA CACATACTAG ACAAATAACT GGTGTTGCTG CAGGTTATGC TGATACAGAT
GCAGTGAATG TGGCGCAGTT AAAAAGAGCG GTATCTTTAA TACCAACATT TTATACTCAA
AATACTACAG CTTCAGGTGA TTCCTTAGGA AGTGGGGGAC AAAATGGAAC TTCAAAAATA
GGCACTCAAG TAGGAAATGG TGTAAGTAAA ATAACGTTTG GAAAAGAGTT TAAAGTAACT
GAGAAATCTG TTAATGGAAA CAGTGGAGAA AAATACTTAC TAGTAGAACT AAACGAAAAA
GAAATACAAA ATAATGAAGC GTTAAAAGGA CCTAAGGGAG ATGCAGGACC GAAAGGAGAA
AGAGGACCAC AAGGACCGGC TGGAAAAGAT GGTGAAACTG GACCTATGGG ACCAACAGGA
CCACAAGGAC CTCAAGGTGA GCCTGGACCA CAAGGACCTC AAGGACTACC TGGAGCAAAA
GGTGATACTG GACCGGCGGG ACCACAAGGT ATTCCTGGAC CGCAAGGACC TAGAGGAGAG
CAAGGCTTGC CTGGAGCACC TGGAGCACAA GGACCTAAGG GCGACCCAGG AGCACCAGGA
CCGGTAGGAC CTCAAGGTGC AACAGGACCA GCTGGAAAAG ATGGTGAAAC TGGACCTATG
GGACCAACAG GACCACAAGG ACCTCAAGGT GAGCCTGGAC CACAAGGACC TCAAGGACTA
CCTGGAGCAA AAGGTGATAC TGGACCGGCG GGACCACAAG GTATTCCTGG ACCGCAAGGA
CCTAGAGGAG AGCAAGGCTT GCCTGGAGCA CCTGGAGCAC AAGGACCTAA GGGCGACCCA
GGAGCACCAG GACCGGTAGG ACCTCAAGGT GCAACAGGAC CAGCTGGAAA AGATGGTGAA
ACTGGACCTA TGGGACCAAC AGGACCACAA GGACCTCAAG GTGAGCCTGG ACCACAAGGA
CCTCAAGGAC TACCTGGAGC AAAAGGTGAT ACTGGACCGG CGGGACCACA AGGTATTCCT
GGACCGCAAG GACCTAGAGG AGAGCAAGGC TTGCCTGGAG CACCTGGAGC ACAAGGACCT
AAGGGCGACC CAGGAGCACC AGGACCGGTA GGACCTCAAG GTGCAACAGG ACCAGCTGGA
AAAGATGGTG AAACTGGACC TATGGGACCA ACAGGACCAC AAGGACCTCA AGGTGAGCCT
GGACCACAAG GACCTCAAGG ACTACCTGGA GCAAAAGGTG ATACTGGACC GGCGGGACCA
CAAGGTATTC CTGGACCGCA AGGACCTAGA GGAGAGCAAG GCTTGCCTGG AGCACCTGGA
CCACAAGGAC CTAAGGGCGA CCCAGGAGCA CCAGGACCGG TAGGACCTCA AGGTGCAACA
GGACCAGCTG GAAAAGATGG TGAAACTGGA CCTATGGGAC CAACAGGACC ACAAGGACCT
CAAGGTGAGC CTGGACCACA AGGACCTCAA GGACTACCTG GAGCAAAAGG TGATACTGGA
CCGGCGGGAC CACAAGGTAT TCCTGGACCG CAAGGACCTA GAGGAGAGCA AGGCTTGCCT
GGAGCACCTG GACCACAAGG ACCTCAAGGG CAACCTGGAA AATCAGCCTA TGAAGTATGG
AAAGAAGCAA AAATAAAAGA ACAAGGTAAA GATGATCATA CCAGTGAAAA AGACTTCCTA
AATTCATTAA AAGGAAAAGG CGGTACATTA GAAGGTATAG TATTTGTTGA TAATGATGGG
AATAAATTAG CAAAAGCTAA TGATGACAAA TACTATAAGG AAAGTGATGT TGATAAAAAC
GGAAACGTTA CAAATGGTAG CAATACACCT ACTACACCAT CAACAGGAAC TGCATCTACA
GAAGGAACTA CAACTCCTCG CCCTGTTGAA GGTAAAAAAA TAGCACTTAC AAGTACAGAT
GGTAAGATAG ATAGTCCTAT AGCATTAACA AATCTTGCCG ACGGTTTAGG CTTGCAAAAA
GTACCAGAAG CTAATGATAC CGAAGAGACG AAGAAAAAAG TCGAGGAAGC TAAAGCAGCT
AATAAAGCCA TCCTTGACAA GGTACTTGCC GGAACACCTG AAGAGAATAA GGTAAAAAAT
GCGGTCAATG TTCAAGACTT GTCTGCGGTT GCCAAGGCTA TTGTAGGGGA AGTCACCGCA
CAACACTCCG AAGCAGAGAA AGTGGCGGTG AAATATGATG ATGACACAAA AACATCCATC
ACTCTAGGCG GTAAAGGCAC CAATGGCACT AAGTCATCCC CTGTTGCGAT TGATAACCTT
AAATCCGGTT TGGGTATTGA TGATATTAAA GACAGTGACA TTGCTTCAGC TGCACAAGGC
AAACAAGGTG AGTTGGTAAA ACAACTCGTA GCAGGTGAGC TTGATACCAC GAAAGACGCT
AGCGGTAAAG CAAAAGACAA TCTGCATAAA GCCGTAAATT TAGCGGACTT AAAAGCCGTT
GCACAAGCCG GATTAAACTT TGCCGGCAAC GATGGACAGG ATATCCACAA AAACCTCAGC
GAAAAACTGG AGATTGTGGG GCAAGGCTTG GATAATAAAG ACAAAGTTAC TGCATTCAAA
GGCACAAACG GCAATATTGC GGTGAAAACG GATAATGGTA AATTATCCAT CTCCCTCAAT
GAAGCCTTGA CCGGCTTGAA GTCTGCCGAG TTTATTTCTG AAGAAACAAA TTCTGATGGA
ACTAAACAAC CAAAAACTAA AACAACTATT AATGGAAAAG GAACAACAAT AGTTGAATTA
GGTGATAATG GTAGTGCCAA AGAAAATGGA GATGGGAAAG AAGCCTCCAA AAATAATATC
AATGCGGACG GCATGACGGT CGGCAATCCG AGTGGTACGG ATCAATCCAA TACGCACTAT
GGCAAAGACG GAATGACAGT TAAGGACAAA GACGGTAAAG ATGCGGTGAG CTTGAAAATG
AAAATGTCGG AGAAAAACGG TAAAAGCGTT CCAACCCTTG AATTTGCCAA AGGGGCTGAC
GGTAAATCAG GCACAGGCAC GATCACAGGG CTTGCGGACA TTAAGCCAGA TGAAACTGAT
GGAAGCCTTG TGGCGAATAA AAACTATGTG GACGAAAAAG TCGAGAGCAT TAACGACAAG
TTGAAAAACA ACTTAGGGTT AAAAGAAATC GACAATCCGG ACTATGTTAA AGCGGAAGAA
GACTTAGCGA AAGCGAAAGA GGCATTGGAA AAAGAGAACA ATCTTGCGAA AAAAGCCGAA
TTGCAAAAAG CGGTGACCGA TGCCGAAGCG AAAGTTAACG AGTTAAGCAA AAACAAAAAA
CTGATAGTGA CACCGGACGG ACGGGACGGC AAATCCTACT TGGAAGCAGG AGCAGCAGCA
ACCCACGGAC CGACAGACAA AGACGGGCTT AACGGTAAAA ATGCTACTGA AAAAGTCAAC
GCTTTGCGTA ACGGCGAAGC GGGAGCGGTG GTGTTTACCG ATCAAGACGG TAATCGTCTG
GTCAAAGCTA ATGATGGTAA GTACTATAAA GCGACAGATG TAGATGACAA AGGTAACGTT
AAATCTGCAG CTAATGGTCA AGTTGCACCA GAAGCAGTGG ACAATCCGCA ACTTTCTCTG
GTCAACACCA GTGGTGAAAC CAATAAACCT GTAGTGCTGG GCAACGTGGC AAGTGGTTTG
GGTATTGATG CCGATAAAGC AAAAGAGCAA GCTGAAAACG TGAAAAATGC AGGCGAAGCA
GTGAAAAATG AGGCTAAAGA GGTGACGGCG AAAGTGGGCG AGATGTTGAG CAAACGTCAA
GAAGCGAATG CCCTTGAAAC GGCGAAAAAT GCCCAAGATT CTGCGATTAA CGCCCTTGAA
ACGGCGTGGA ACTTGATGCC TGACAGCACA GAGGAAGAAA AGGTGGCGAA AGCGAAAGCG
AAAGCCAACC TTGATGCTGA AAAAGCCAAA TTGGCAGAGC TTGGAAATGA ATTGACAAAA
GCTAATAAAG CAGTTAAAGC GTTGCAAGCC GAAATTGAAC CGTTGCAAAA ATCCTTGAAA
GATAAGCAAA AAGCGTATCA AATCGCATTA GCCACAAAAG ATGACGCAGT CAATAAGTTG
TTGTCAGATA AGTCCTTAAT TGATGTTAAG CGTGCTGCCA ATCTGCAAGA CTTACAAGCC
TTAGGACAAG CGGGCTTAAA TTTTGAGGGC AATGACGGCG TGCCTGTCCA TAAAAATTTA
GGCGAGAAAT TGACCATCAA AGGCGAGGGT GAGTTTAACA GTGCAACAAC CGCTGCCGGC
AATATTAAAG TGACTGCTTC TGACAGCGGT ATGGAAGTGA AGTTGTCCGA TACCTTGAAA
AATATGACCT CGTTTGAAAC CAAAGAAACC GCAGAGGGGA ATAAATCCCG TCTTGACGGC
AACGGATTGA CAGTGACAGG TAAAAACAAT CAGTCTGCAC ATTATGGTTC GGGGGGGATC
ACTCTCAAAG AAGGTAATAA CAAGGCTACT TTGACATCAA GTGCATTGAC GTTCACAAAC
GGACAAGGTC AGAAAGTTGA GATTGACGGT GCAAAAGGTG AAATCCGTGT GCCTGATTTA
ACCTCGAGTT CATCACCGAA TGCTGTGGCT AATAAACAAT ATGTCGATCT CTTGCAGACA
CATACTGACC AAAAACTGAA CAATCTCGAG CATAAATTTG ATATGTCCAA TAAAAACTTG
CGAGCAGGGA TTGCCGGAGC CAATGCGGCG GCAGGGTTGG CATCAGTCTC TATGCCGGGT
AAATCCATGC TGGCGATTTC TGCAGCAGGC TATGACGGAG AAAACGCAGT AGCGGTTGGG
TACTCTCGCA TGAGTGATAA CGGGAAGGTT ATGCTGAAAC TTCAAGGGAA TAGTAACTCT
CGAGGTAAAG TCGGCGGATC GGTCTCCGTA GGCTATCAAT GGTAA
 
Protein sequence
MNKIFKTKYD VTTGQTKVVS ELANNRQVAS RVEGASESQP KCGVFFGGML GAFKVLPLAL 
VMSGVLSSVG YGYDLWLDGP KDPGQFNNSE LNEGTKVWFG YRTNKKHTVQ NESTILTSTM
NKTGANASFR NRDFDQTVAI GSRAVAGGNS STAIGFQAIT GDNVKRDTAA QHNREGVAIG
YKSFARGKEA TAMGNDVVAW GDSSIAIGAD NVSEPKSNKK EEDLRKPLSR DIFRLFYNAR
SDFNDTGSYA AVHFAKAGET VNVGNGVQRP KEVMADADGI LYSENFTNYN HPNGVQLSRK
DGTIFIRKNP KSADYGYGDG YYTYQNGDFY ESKGLYLDEV KDEAEIEKIK ALSDWKRYEA
HRAEVNKAYA EYIIDPRRFK THTWARGNNA IALGARSIAY GDYSTALGTL AIANRDYSTA
LGTNTIAFGK KSLAIGNESY VYEDESVGVG NNVQALYVGS MAYGRHAYSG GRGSLAIGSN
VFSNVVMDDT KLPNLEKIYE GESNDKTLND LLNLSTQLNY VPKFEDQYGS GESKAIRDTD
KDGNYNNGSI ALGSYAIANG DNSLAMGRFA YAKNDKAFAI GQAAYAKGNK SFAIGYGSKS
LEDDSISFGS LSRVGAPSSM ALGIEAKVLK DIPDTLLSEE NKKNSKYKKD SVKNAMAIGN
YAEAYFPESI ALGVNAKTDY TQTEMKQDAW APKHAISFPS SEKIGYLSVG GKNAERRIVN
VAPGSSDTDA VNVSQLRALE EAILYGNTLD DESDINSGVK YLSVKGLDDL KTLVTKKYDY
ESYTKLKKEY LKLKLRKVIN EETINLTKYE ERLKKYQEKY GDFQNAASTL KELDERMSRK
NFGPETTNTM SDKEKEDLRK DWYNSLFKEI ETAQVRDTSE ENIKNLISEE NEAKIKSSNF
FSDGAKKSGS IAIGVGALAN SSESIVIGRN AKIENDKAQN TVLLGNNTSS ATANAVALGN
YSVADRNPEP AKNLTPELRS NAYISNELAN LIDGHQVYAA VSVGRYGGDL EDLKNNTDAK
AKDNQKKYEK WVKDNVTLKE TEPDEYEKQR LEQAAKVKKF ALRKITNVAP GTKDTDVVIL
AQLKEGVKQA RGHFVSINST DSKAGNYDNK GATGTNAIAL GVNASATGNY AIAIGNAKSL
GAKNIVIGDD STTKGTAENS KASAWAIVIG NKSNAYNNTK SISDVVVLGG NANASETGAV
AIGNRANVSG ERGVAIGSGG SKDNQAARVT ISDGIALGSY SISNRMLTDD KSKGYDPLTD
TYSTSSDMKW KANLGALSIG DTTTHTRQIT GVAAGYADTD AVNVAQLKRA VSLIPTFYTQ
NTTASGDSLG SGGQNGTSKI GTQVGNGVSK ITFGKEFKVT EKSVNGNSGE KYLLVELNEK
EIQNNEALKG PKGDAGPKGE RGPQGPAGKD GETGPMGPTG PQGPQGEPGP QGPQGLPGAK
GDTGPAGPQG IPGPQGPRGE QGLPGAPGAQ GPKGDPGAPG PVGPQGATGP AGKDGETGPM
GPTGPQGPQG EPGPQGPQGL PGAKGDTGPA GPQGIPGPQG PRGEQGLPGA PGAQGPKGDP
GAPGPVGPQG ATGPAGKDGE TGPMGPTGPQ GPQGEPGPQG PQGLPGAKGD TGPAGPQGIP
GPQGPRGEQG LPGAPGAQGP KGDPGAPGPV GPQGATGPAG KDGETGPMGP TGPQGPQGEP
GPQGPQGLPG AKGDTGPAGP QGIPGPQGPR GEQGLPGAPG PQGPKGDPGA PGPVGPQGAT
GPAGKDGETG PMGPTGPQGP QGEPGPQGPQ GLPGAKGDTG PAGPQGIPGP QGPRGEQGLP
GAPGPQGPQG QPGKSAYEVW KEAKIKEQGK DDHTSEKDFL NSLKGKGGTL EGIVFVDNDG
NKLAKANDDK YYKESDVDKN GNVTNGSNTP TTPSTGTAST EGTTTPRPVE GKKIALTSTD
GKIDSPIALT NLADGLGLQK VPEANDTEET KKKVEEAKAA NKAILDKVLA GTPEENKVKN
AVNVQDLSAV AKAIVGEVTA QHSEAEKVAV KYDDDTKTSI TLGGKGTNGT KSSPVAIDNL
KSGLGIDDIK DSDIASAAQG KQGELVKQLV AGELDTTKDA SGKAKDNLHK AVNLADLKAV
AQAGLNFAGN DGQDIHKNLS EKLEIVGQGL DNKDKVTAFK GTNGNIAVKT DNGKLSISLN
EALTGLKSAE FISEETNSDG TKQPKTKTTI NGKGTTIVEL GDNGSAKENG DGKEASKNNI
NADGMTVGNP SGTDQSNTHY GKDGMTVKDK DGKDAVSLKM KMSEKNGKSV PTLEFAKGAD
GKSGTGTITG LADIKPDETD GSLVANKNYV DEKVESINDK LKNNLGLKEI DNPDYVKAEE
DLAKAKEALE KENNLAKKAE LQKAVTDAEA KVNELSKNKK LIVTPDGRDG KSYLEAGAAA
THGPTDKDGL NGKNATEKVN ALRNGEAGAV VFTDQDGNRL VKANDGKYYK ATDVDDKGNV
KSAANGQVAP EAVDNPQLSL VNTSGETNKP VVLGNVASGL GIDADKAKEQ AENVKNAGEA
VKNEAKEVTA KVGEMLSKRQ EANALETAKN AQDSAINALE TAWNLMPDST EEEKVAKAKA
KANLDAEKAK LAELGNELTK ANKAVKALQA EIEPLQKSLK DKQKAYQIAL ATKDDAVNKL
LSDKSLIDVK RAANLQDLQA LGQAGLNFEG NDGVPVHKNL GEKLTIKGEG EFNSATTAAG
NIKVTASDSG MEVKLSDTLK NMTSFETKET AEGNKSRLDG NGLTVTGKNN QSAHYGSGGI
TLKEGNNKAT LTSSALTFTN GQGQKVEIDG AKGEIRVPDL TSSSSPNAVA NKQYVDLLQT
HTDQKLNNLE HKFDMSNKNL RAGIAGANAA AGLASVSMPG KSMLAISAAG YDGENAVAVG
YSRMSDNGKV MLKLQGNSNS RGKVGGSVSV GYQW