Gene Spro_4402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4402 
Symbol 
ID5605959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4862809 
End bp4871931 
Gene Length9123 bp 
Protein Length3040 aa 
Translation table11 
GC content58% 
IMG OID640939964 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001480624 
Protein GI157372635 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.887533 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGTA AACAGAATCA ACCGCTTAAT ACCCCTCAGC GTCTGCTGAG CTACACCCTG 
TGCATGTTAC TGGCCGGGCA GCCGCTGCTG CCGGCGCTGG CCGAAGGCGT TAACGTGGCC
GAAGGTAATA CCCGTGTCGA TCAGGCCGCC AACGGCGTGC CGGTGGTCAA CATTGCCACG
CCGAATCAGG CCGGTATTTC CCATAACAAA TACAACGATT TCAACGTCGG CAAAGAGGGC
ATGATCCTCA ACAACGCCAC CGGTCAGCTC AACCAGAGCC AGCTTGGCGG CCTGATCCAG
AACAACCCCA ACCTGCAGGC CGGCAAGGAA GCGCAGGGCA TCATCAACGA AGTGGTGGCA
CCGAACCCTT CACAGCTTCA GGGCTACCTG GAGGTGGCAG GCAAGCAGGC CAGCGTGATG
GTCGCCAACC CCTATGGCAT CACCTGCGAC GGCTGTGGCT TTATCAACAC CCCTAACGCG
ACGCTGACCA CCGGCAAACC GGTGCTGGGT GCCGACGGCA AGCTGCAGGC GCTGGAAGTC
ACCCAAGGGG CGATCACCAT TCAGGGCAAG GGGCTGGACG CCAGCAAGAG CGACAAATTC
GCCCTGATCG CCCGCGCTAC CGAGATTAAC GCTCAACTGT ATGCCAAAGA TCTCAACATC
ACGCTCGGGG CTAACCGGGT GGACGCTACC GGCAAGGCCA CGGCGATCGC CGGTAATGGC
GACGTGCCCA AGGTGGCCAT CGACACCGGC GCTCTTGGCG GTATGTACGC CAACCGTATT
CATTTGGTGT CCAGTGAAAA AGGGGTCGGA GTCAACCTCG GCAACCTCAC CGCGCGTGAA
GGCGATATTC AGCTGGATGC CAATGGCCAA CTGCGACTGA ATAAAACCCT GGCGCAGGGC
GGCCTGAATG CCACTGCGGC GGGAATAACG CTCAGCGGCA ACCACAAGGC CGGGCAGGCG
CTTACCCTGA ATAGCCGTGG CGATATTGCC ATGAGTCAGG CCACGCTGAG CGCCAAAGGG
GATATGCAAC TGACTGCCGC AGGCAAACTG CAGGCGGGCA AAGCCGATAC GGACGGCCGC
CTGCAAGTGG ATGCTGCCAG CCTGAGCAGC GATAAAAACA GCAGCCTGAC GGCCGAAGGT
GATATCCACC TGACGCTCAG CGGCGACGGT AACTGGCAGG GCACCCTGAC CGCTGGACGC
GATCTGCAGT TGCAGGCCAA TCATCTGACC AACGTTGGCC AGCTGGCTGC CAATCGCGAC
AGCCGTGTTA AGACACAGAA CCTGACCAAC AGCGGCCTGA TCCAGGCTCA GGGCACACAG
AACCTGACCA GCAACCAACT CGCTAACCTT GGGCAGTTGC AAGCGGGGGG AATGCAGACC
ATCACCGCCA ACGGTGTCGA CAATAAGGGG ACGATCGGCT CCCAACAACA ACTCGAACTG
CGGGTACGCG ACAGGCTTAA TGTCGCGGGT TCACTGTCTG CCGATGGCCT GTTGACGGTA
CAGGCGGGTG AATTTTTACT GACCGGAAGC GCCAGCGGCA AGCAGGGTGT CACCCTTGTT
CACACCGGCT CTCTGAAAAC CGAGCTTGGC TCATCGCTGC TGAGCGACGG CAATATCACC
CTCAACGCCG ATGATGTACG GCTCGGCGGG CTGCTTTCCA GTGAACTTGG CCTGACGATA
GAGACAAAAA AACTGCTGTC TACCGCCGGC GCCCGCACCC AGTCCAAACA GGACATGAAG
CTCAAGGTTG CGCAAGACGC GCAGCTGGCC GGCGTGTTCA ACACGCTGGG GGATTTGAAG
TTTTCTGCCG CCACGGCGGA TAACCGCGGC GAAATAGACG CGCGCAATAT TGACTGGGGC
GGAGACTCGC TGAAGCAGCG CGGACACCTG AAGGCCACTG AGAACGCCAC GCTGAAGGTA
AAACAGCTCG ATCAGCAGGG GGATCTGCTG GCTGACCAAC GGCTGGAGCT ACGCGGCGAT
AACCTGGTGA ACAGTGGGCT GATGGGTGCC CAGACGCTGG ATCTGGTGTT AAGCGAATCA
CTGAACAACA CCGGCAACCT GCACGGCGTA CAAAAGCTGG CGTTGCAGTT GGCTAAGGAT
TTGACCAACG CTACTGGGGG CAAGCTGCTC AGCGAAGGCG AATTAACGGC CCAGGCGGCG
ACGGTTAATA ACAGCGGATT ATGGCAGGGG GATCGTATCA CGCTGACTGC GCGGCAGTTG
GATCATAAAG GTACGCTGCA GGCGGGCAAA GGGATCACAC TGGATCTGAG CGGCGATCTT
GATGCCGGTC TCGACAGCAA GATTTTCTCC AATGGCAAAG CGGACCTGAC GGCGCTGACC
TTGAACAACC AGGGTCATAT ACAGGCAGAA GAATTGAAGT TAAGCGCTGC AGACCTGACC
AACAGCGGCA GGCTGCAGGG GCAAAAAAGC CTCGAGGCCA AGCTAAGCGG CATATTCAAT
AACCTGGCCG GCGGCAATGT GCGCAGTCTG GGGACGTTGC AGCTGGAGGC TAAAGAGCTC
AATAACGCCG GGGATCTGCA GGGTGATGGC AGCAGCGCGC TGCAGTTGGG TCTTCAGTTG
ATCAACTCAG GTAACCTGAT GGCAGGTGGT GCGCTGAGTC TGCAAGCACC GACGCTGAAC
AACAGCGGCT TGCTGCAGGC CGACAGCCTG ACGTTCACCG GCACGACGTT GGATAACAGC
GGCACATTGA GCACCTTCGG CGATAACCAG CTCACGCTGG ATACTCTGAA TAATAAAGGG
ACGTTGCAGG GCGGTAATTT AAAACTGAAG GCGGACAGCC TGGATAACAC CGGCACGCTG
CTGGCGACAA GCCAGATGAG CCTCAAAGCG CGGCAGATTG ATAACCAGAA CAGCGGCAAA
CTATTGAGTG GCGGCGATAT TTCACTGGCC AGCACCCAGT TGAATCAGTA CGGCCAATTG
GTGGCGCTGG GCAACATGAC GTTGACGCTG AAAGACGCCT TCACCCAGAG CGGCACCCTG
GCGGTGGGCA AAGCGCTGGT TCTGACCAGC GACGGCGATA TTCTGCTGCA GGGCGCCACT
CAGGGCCAGA GTGTGGATGT TCGCAGCGGC GGTCAGTTGA CCAACGCAGG TACCTTGCGC
GGCGGCAGCG GAGAAATTCG CCTGGAAGCG GCCGGGCTCA CTCAAAACGC CGCCGCCAGC
CTGCAGTCGG GCGGATTGGT GCAGCTGTTG AGCCGTGGCG ACATCAGCAA CAACGGCTTT
ATCGGCACCG CCGGTACTCT GCTGTTGAGT GCCGCCAACT CATTACTCAA CAGCGGCATG
CTGTACGCCG GCGGCGATAT GAAGTTGCTG GCTGATCGTA TCACCAACCA GCGTGGCGAC
ATTCTGGCAG CCAGCAAACT GTGGATGCAG AAAGACGCCG ACGGTAATAC CAACAGCCAG
ATCGCCAACA CCTCGGGCAC CATCGAGACC GAAACCGGCG ATATTCAGAT CAAAACCGCG
CATCTGCTTA ATCAGCGTGA TGGGCTGAAA ACCTCGGTTA CGCAGGAAGA TCTGACCCAA
AAATACGACT GGCTGAACGG GGCCACGGCC AGCATTCCGC TGAGCTTCTT TGACGATGAC
GAGTACGGTT ATTACACCGT GGAGACCATG CGTCAAATGG CGGGGGATGC TGCGCGTGAA
GTGTACAAGA CGTATACCTA CGCCTCGCCG TATGCCACCA CAAAAGAGCT GGCATTGTCG
ACCAGCAAAG TGACTGTCAG CAGCTCGGGC GGCGCGGCGC GCATTGCCGC CGGCCGCAAT
ATGACGTTGA ATGCCACGAC GCTGGATAAT CTGGCCAGCG ATATTTTGGC CGATGGGGAT
ATTGCGCTCA CCGGCAGCAC GCTGAACAAC CAGTCATGGG CTGCGGGTAC AGAAACACGT
TATCAAACCT ATACCTATCA AAAACCGCCG TTACCGGGCT CGGATAATAC GCCGAAGGTG
TCCGCGGTCT CTGCAATCAA TAATTACGCC AAAGGGAAAA TCGACGACAA AAATATCCAC
TATAAAGCCA GCGGAGAAGT GCGCACTGAA CGCACCGACG ACGGTGTTTA TCGTTCAGTG
CTCCAGGCTG GCGGTGCAGT CAACGCCAAG TTTACCGACG ATATCAGCAA TACCACTGTG
ACCCCGAACG CCGGTAGCCT CAGTCACACT CTGGCGCGGC CAACGCTGGA TAGCCTGCAG
CAACCCGATG CCGTCAACGG CGTGGAACAG CAGGCGTTAG CGAAAGATCA AAGCGTAGTG
TTTGGTTCGC CGGAGTGGAA AGACAATCTG GCGGATTACC CGTTGCCAAC CGGTGGCAAC
GGTCGTTTTG TCGTGACCGA AGACCCGAAC AGCCCCTATC TGATCACCAC CAATCCCAAA
CTGGACGGCC TGGGCCAGTT GGACAACAGC CTGTTCAACG ATCTCTACGC CATGCTGGGC
CAACAGCCGG GCGCGGCCCC GCGTGAAAGC GACAGCCGTT TCACCAACGA GAAGCAGTTT
ATCGGCTCGG CCTATTTCCT CGACCGCTTA AAGTTAAACC CCGATTACGA CTACCGTTTC
CTGGGCGATG CGGCGTTCGA TACCCGTTAC ATCAGCAACG CGATGCTCAG CCAGACCGGT
CAGCGCTATC TTAACGGCGT CGGATCCGAA CTGGCGCAGA TGCAGCAGTT AATCGACAAC
GCGACACGGG CGCAGAGCGG CCTGAAGTTG CAGTTTGGCA TCGGCCTGAC ACCGGCACAG
GTTTCACAGC TGGAGCACAG CATCGTCTGG TGGGAGAAGG TGACGGTCAA CGGCCAGACG
GTGCTGGCAC CAAAACTGTA CTTGGCCAAA GCTGACGTCG CCCCGCTGAG CGGCAGCGTG
ATTGCGGGCA ACAAGGTTAA CCTGAACGGT GGCAACATCC GTAACGACGC CAGCACGCTG
CAGGGCGGTG AGCGGCTGAA CGTCAACAGC CAGGCAGGCA TCAGCAATCT TAATCAGGGA
ATGATCAACG CTAAAGATGG GCTGAGTTTG AACGCTATTG GGGATATCAG CAATATCGGC
TCGACCATCA GCGGTCAGCG GGTCGAGCTT GAAAGCCAGG ACGGCAGCAT TATCAATAAA
ACCCAGGCCC GCCAGTGGGA TGCCACCGGT ACCCTGGGTG GCCAATCGCT GACGCTGTCG
CGTACCGAAG TCGGTGATAC TGCCGTGATC CGCGCCGGAG ATACATTGAA CATGCAGGCC
CGTAACAATA TCGATGTTAC CGGGGCGCAG GTTACCTCTG GTGGTGCAAT GGATCTGCGG
GCGGGTGGCG ACATCAACCT GCTTGCCAAT AACACTTCTC GCGTCGATAA GTCAGACGGC
GGCCGCTGGG GAGGTGGGCT AAAAGAAAGT GAAACCCATG GCAGCCTGGC CACTGAAATC
AGCGCGGGTG GGGCGCTAAA CGTCAACGCC GGGCAGGATC TGAACCTGGT TGCCAGCCAA
ATCGGCAGCA AGGGTGATGC GGCATTAACA GCCGGACGTG ACATCAATCT GCAAACCGCC
GAACAGGGGA GCCGTCAGAA AACCGACGGT AGCGAACACA TCAGCAGCGG TGCGACGCGC
AGCACCCTCA CCAGCGGTGG GGATCTGCAG TTGCAGGCCG GTCGCGATTT GAATTCGCAG
GCGGCGGCGC TGGTGGCGGA TAACGACGTT GAACTCCGCG CCGGGCGTGA CGTTAACCTC
AATACTCAAC AAAGCCGGGA ATATCAGGAA AGTCACGGCG GTCGCCAACA GCGAGTAAAC
GAGTCCATAC GCCAGCAGGG CACTGAAATC GCCAGCGGCG GCGACACGCG CATTCAGGCC
GGACGGGACG CGACCCTTAA CGCCACCCAG GCCCAGGCCA GCGGCGACGT GGCGGTGAGT
GCCGGGCGTG ATATCGCGCT GAACAGCGCC ACCGAAAGCG ACTATAGCTT CTTTGAAGAG
ACCAAGGTCA AAAAAGGCCT GATGTCCAAA ACCACCACCC ATACGGTGGA AGAGGATTAC
GCCACCCGCG AGAAAGGCGG GCTACTGAGC GGGAACAATG TCTCACTCAA TGCCGGTAAC
GATCTCAAGG TGCAAGGCTC GACGGTGGTC GGCGACGGTA AGGTAAACCT GCAGGCGGGC
AATAACGTCG ACATTGTGGC AGCCACCGAG GAGCAATCGC GCTACCGGCT GAACGAGAAA
AAAACCAGCG GCATGTTCAG CGGCGGTGGC ATCGGCGTGA CCTTCGGCAG CAAATCTTCA
CGTCATCAGT TGAACGAAGA CGGCACCACC CAAAGCCAGA GCGTCAGCAC CATTGGCTCT
ACCGGTGGCG ACGTGAATAT CGTGGCCGGC GGCAAGGCGC ATATCGGCGG CGCGGACGTG
ATTGCGGATA AGAACCTGTC GGTGACGGGA GATAGCGTAC AAATCGATCC GGGTCAGGAT
ATCCGTCGTC GTGATGAAAC CTTTGAACAG AAACAAAGTG GTCTGAGCCT CGCCTTGTCC
GGCACGGTTG GTAGCGCCAT CAACACGGCG GTAACGACGG CCCAGCAGGC GAAGCAGGAG
ACTGATGGTC GGTTGGCGGC ATTGCAGGGC ACCAAAGCAG CCTTATCTGG CGTGCAGGCC
GTTCAGGCAG GGCAACTGGC ACAAGTCGGC AATACCGATA CCGAAGAAGG CAGCATGGTA
GGTATCAGTG TCTCGCTGGG GGCACAGAAA TCGTCGTCGA AACAGCATCA GGAGCAAACC
TCAGTTACTG GCTCGACGCT CAACGCCGGT AATAACTTGC AGGTGACGGC CACCGGCAAG
GGTAACTCGG CCGACAGTGG CGATATTGCC GTAGTGGGCA GCCAGCTCAA AGCAGGGGGC
GACACCACCC TGAGTGCCGA GCGGGATGTG TTGCTGCTGG GCGCAGCCAA TACGCAGAAG
ACCGAGGGCA GCAACAAGAG CAGCGGCGGT AATATCGGCG TCAGTATTGG CGTAGGGCAA
CAGACCGGAT TGAGCGTGTT CGCCAATGCC AACAAGAGCC AGGGCAGCGA GCATGGCGAT
GGCACGTTTT GGAGTGAAAC GACGATAGAC AGTGGCGGCA CGCTGTCAAT GCATAGCGGT
CGGGATACCC TGTTGTCGGG TGCGCAGGCC AGTGGCGAAA CGGTGAAGGT GGATGCCGGG
CGTAATCTGA CGTTGCAAAG CCAGCAGGAT AGCGATAATT ACGACGCCAA ACAGACCAGC
GTCAGTGGTG GTTTCAGCGT AGCCATCATC GGTGCCGGCG GCTCCGCCAG CCTGAGCATG
AGTCGTGACA AACTGCACAG TAATTATGAC AGCGTGCAGG AGCAAACCGG TCTGTTTGCC
GGCAAGGGCG GCTATGATGT TAAGGTGGGC GAGCATACGC AGTTAGATGG CGCGGTGATT
GCCAGCACGG CTACGGCGGA TAAAAACCAC CTTGATACCG GCACCTTGGG CTTCAGTGAT
ATCCATAATC AGGCGGATTT CAAGGCCGAG CATCAGGGGG GCAGCATCAG CAGCGGTGGC
CCGGTAGGGG CCGATTTGCT AACCAATCTG GCGGGCGCAG CCCTGTCCGG CGCGGGTAAT
AAAGGGCATG CAGAAGGCAC CACGCAGGCG GCAGTGTCGG GCGGTAGCGT GGTCATTCGG
GACCAGGCCA ACCAGCAGCA GGACGTTAAT CAACTCAGTC GCGATACCGA TAACGCCAAC
GGCAGTATCG GACAGATATT CGACAAGGAG AAAGAGCAGA ACCGGCTGAA GCAGGCGCAG
CTGATTGGTG AGATCGCCGG ACAGACGATC GATGTTATCC GCACCCAGGG TGATATTAAT
GGGCTGAAGG CGGCGAAGGA TAAGCATCCG GGGCTGGACG CGAAAGCGTT GCGTAAAACG
CCGGAATATC AAGCCGAGAT GAAGGAATAC GGCACCGGCA GCGATATACA AAGAGCCGCA
CAGGCAGTTA CGGGGGCGCT GCAGGCGCTG GCGGGGAACA ATCTTGCCGG AGCACTGGCG
AGCGGCGCGG CACCGTATCT GGCGAGAGAA ATCAAAGCGC GTATCGGTGA TGACAACGTG
GCCGCCAACG CGATGGCGCA TGCGGTGTTG GGTGCGATAA CCGCGCAGTT GAACAACCAG
TCGGCGGTAG CGGGTGCGGT AGGTGCCGGT GGTGGTGAGC TGGTAGCGCG GGTGATAGTA
GAGATAAGGT TCCCGGGCCG TGACATAAGC AGTCTGACAG AAAGCGAGAA ACAGCAGGTC
AGTGCTTTGA GCCAGCTGGC CGCAGGACTG GCGGGAGGTC TCGTCTCAGA CAGCAGCGCA
GGGGCCGTGA CAGGTTCGCA AGCCGCTAAG AATGCGGTGG AGAATAACTT GTTGGGCGGG
AATGAGGAAA CTCAGACCAA GTTCGTGCAG GAACACGGCA AGAATATTGC GTCTTGCAGC
ACTGACTCAA GTTCAGCATC CTGCCAGAAA GGTTTGGCGA TGAACGATGC GCTGATGGTT
GCTTTGCCAG CCGGGCTTGG TGGTGGCGTT CTGGCCGCTG CGACACCGGA GCTAGCGGCA
GCAGCCAAGG CGGCGATACA GACCTGTGCG GGTAATGTAG TGCTGTGCCT TAATAATGCT
GGTATCCAGA TGTCGGAAGC CATCGTGCCG GGTGGTGTAG GTGCCGGTGG TGCGGTGGGT
ATCGGTAAGA CAGCGGCAGA GGCGACAGCC GCTAAGGCTG AAGCTGTGGC AGCTAATGCT
GCGAAGAATA TAGGAAAGGG TACAACGGGT TCTTTAAGTG GCCAGCCAAC TAAACTGCCG
CCGAATGCCT CAGCAGAAAA TATACGTTCA CTCCAACGAG AAAATGAGGG GGCGACGATT
CTGTCTAAAA ATGGTTATCA TGTTGAGCAG AATCCTGTAA CTCCCGGGGT TAAAAAACCA
GATTATAAAA TTAATGGTGA GGTGTTCGAT AATATCGCCC CTAAAACAAA CTCAGTTCGT
AATATTTATG ACCGAGCTTT AGAGAAAGTG AATAGTGGTC AAACAAACAA TGTCGTTATC
AACTTGGCAG ATACAAAAGC AAGCGTTAGT GAGTTGCAAA AACAATTTAG TGATTGGCCA
ATCAAAGGGT TAGATAAGGT TATAGTTATC GACCAGTCAG GAAAGCCAAT CCGAGTTAAG
TAA
 
Protein sequence
MKRKQNQPLN TPQRLLSYTL CMLLAGQPLL PALAEGVNVA EGNTRVDQAA NGVPVVNIAT 
PNQAGISHNK YNDFNVGKEG MILNNATGQL NQSQLGGLIQ NNPNLQAGKE AQGIINEVVA
PNPSQLQGYL EVAGKQASVM VANPYGITCD GCGFINTPNA TLTTGKPVLG ADGKLQALEV
TQGAITIQGK GLDASKSDKF ALIARATEIN AQLYAKDLNI TLGANRVDAT GKATAIAGNG
DVPKVAIDTG ALGGMYANRI HLVSSEKGVG VNLGNLTARE GDIQLDANGQ LRLNKTLAQG
GLNATAAGIT LSGNHKAGQA LTLNSRGDIA MSQATLSAKG DMQLTAAGKL QAGKADTDGR
LQVDAASLSS DKNSSLTAEG DIHLTLSGDG NWQGTLTAGR DLQLQANHLT NVGQLAANRD
SRVKTQNLTN SGLIQAQGTQ NLTSNQLANL GQLQAGGMQT ITANGVDNKG TIGSQQQLEL
RVRDRLNVAG SLSADGLLTV QAGEFLLTGS ASGKQGVTLV HTGSLKTELG SSLLSDGNIT
LNADDVRLGG LLSSELGLTI ETKKLLSTAG ARTQSKQDMK LKVAQDAQLA GVFNTLGDLK
FSAATADNRG EIDARNIDWG GDSLKQRGHL KATENATLKV KQLDQQGDLL ADQRLELRGD
NLVNSGLMGA QTLDLVLSES LNNTGNLHGV QKLALQLAKD LTNATGGKLL SEGELTAQAA
TVNNSGLWQG DRITLTARQL DHKGTLQAGK GITLDLSGDL DAGLDSKIFS NGKADLTALT
LNNQGHIQAE ELKLSAADLT NSGRLQGQKS LEAKLSGIFN NLAGGNVRSL GTLQLEAKEL
NNAGDLQGDG SSALQLGLQL INSGNLMAGG ALSLQAPTLN NSGLLQADSL TFTGTTLDNS
GTLSTFGDNQ LTLDTLNNKG TLQGGNLKLK ADSLDNTGTL LATSQMSLKA RQIDNQNSGK
LLSGGDISLA STQLNQYGQL VALGNMTLTL KDAFTQSGTL AVGKALVLTS DGDILLQGAT
QGQSVDVRSG GQLTNAGTLR GGSGEIRLEA AGLTQNAAAS LQSGGLVQLL SRGDISNNGF
IGTAGTLLLS AANSLLNSGM LYAGGDMKLL ADRITNQRGD ILAASKLWMQ KDADGNTNSQ
IANTSGTIET ETGDIQIKTA HLLNQRDGLK TSVTQEDLTQ KYDWLNGATA SIPLSFFDDD
EYGYYTVETM RQMAGDAARE VYKTYTYASP YATTKELALS TSKVTVSSSG GAARIAAGRN
MTLNATTLDN LASDILADGD IALTGSTLNN QSWAAGTETR YQTYTYQKPP LPGSDNTPKV
SAVSAINNYA KGKIDDKNIH YKASGEVRTE RTDDGVYRSV LQAGGAVNAK FTDDISNTTV
TPNAGSLSHT LARPTLDSLQ QPDAVNGVEQ QALAKDQSVV FGSPEWKDNL ADYPLPTGGN
GRFVVTEDPN SPYLITTNPK LDGLGQLDNS LFNDLYAMLG QQPGAAPRES DSRFTNEKQF
IGSAYFLDRL KLNPDYDYRF LGDAAFDTRY ISNAMLSQTG QRYLNGVGSE LAQMQQLIDN
ATRAQSGLKL QFGIGLTPAQ VSQLEHSIVW WEKVTVNGQT VLAPKLYLAK ADVAPLSGSV
IAGNKVNLNG GNIRNDASTL QGGERLNVNS QAGISNLNQG MINAKDGLSL NAIGDISNIG
STISGQRVEL ESQDGSIINK TQARQWDATG TLGGQSLTLS RTEVGDTAVI RAGDTLNMQA
RNNIDVTGAQ VTSGGAMDLR AGGDINLLAN NTSRVDKSDG GRWGGGLKES ETHGSLATEI
SAGGALNVNA GQDLNLVASQ IGSKGDAALT AGRDINLQTA EQGSRQKTDG SEHISSGATR
STLTSGGDLQ LQAGRDLNSQ AAALVADNDV ELRAGRDVNL NTQQSREYQE SHGGRQQRVN
ESIRQQGTEI ASGGDTRIQA GRDATLNATQ AQASGDVAVS AGRDIALNSA TESDYSFFEE
TKVKKGLMSK TTTHTVEEDY ATREKGGLLS GNNVSLNAGN DLKVQGSTVV GDGKVNLQAG
NNVDIVAATE EQSRYRLNEK KTSGMFSGGG IGVTFGSKSS RHQLNEDGTT QSQSVSTIGS
TGGDVNIVAG GKAHIGGADV IADKNLSVTG DSVQIDPGQD IRRRDETFEQ KQSGLSLALS
GTVGSAINTA VTTAQQAKQE TDGRLAALQG TKAALSGVQA VQAGQLAQVG NTDTEEGSMV
GISVSLGAQK SSSKQHQEQT SVTGSTLNAG NNLQVTATGK GNSADSGDIA VVGSQLKAGG
DTTLSAERDV LLLGAANTQK TEGSNKSSGG NIGVSIGVGQ QTGLSVFANA NKSQGSEHGD
GTFWSETTID SGGTLSMHSG RDTLLSGAQA SGETVKVDAG RNLTLQSQQD SDNYDAKQTS
VSGGFSVAII GAGGSASLSM SRDKLHSNYD SVQEQTGLFA GKGGYDVKVG EHTQLDGAVI
ASTATADKNH LDTGTLGFSD IHNQADFKAE HQGGSISSGG PVGADLLTNL AGAALSGAGN
KGHAEGTTQA AVSGGSVVIR DQANQQQDVN QLSRDTDNAN GSIGQIFDKE KEQNRLKQAQ
LIGEIAGQTI DVIRTQGDIN GLKAAKDKHP GLDAKALRKT PEYQAEMKEY GTGSDIQRAA
QAVTGALQAL AGNNLAGALA SGAAPYLARE IKARIGDDNV AANAMAHAVL GAITAQLNNQ
SAVAGAVGAG GGELVARVIV EIRFPGRDIS SLTESEKQQV SALSQLAAGL AGGLVSDSSA
GAVTGSQAAK NAVENNLLGG NEETQTKFVQ EHGKNIASCS TDSSSASCQK GLAMNDALMV
ALPAGLGGGV LAAATPELAA AAKAAIQTCA GNVVLCLNNA GIQMSEAIVP GGVGAGGAVG
IGKTAAEATA AKAEAVAANA AKNIGKGTTG SLSGQPTKLP PNASAENIRS LQRENEGATI
LSKNGYHVEQ NPVTPGVKKP DYKINGEVFD NIAPKTNSVR NIYDRALEKV NSGQTNNVVI
NLADTKASVS ELQKQFSDWP IKGLDKVIVI DQSGKPIRVK