Gene RS02101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRS02101 
SymbolRSp1545 
ID1223857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003296 
Strand
Start bp1939063 
End bp1947138 
Gene Length8076 bp 
Protein Length2691 aa 
Translation table11 
GC content66% 
IMG OID637241408 
Productputative hemagglutinin-related protein 
Protein accessionNP_523104 
Protein GI17549764 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.386866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCAGC CGGTCACGGC CCGCGCACGC GCTGCGCTGG AGCAGCCGCG TGCCATTGCA 
CTGCAACAAG AACAACATGG CGCTAGCGCT GGTACCGCTG ATCCCGTCAG CCGCCATGAC
CGCGCCAGGC GCTCGTTTGT AGCGCGGTCC ATCGCCGTCG CCGTTGCGCT CCTCAGCTGG
CTCGGCCCGG TGCAGGTCTC CTGGCAGGCG GCCCGGCAGA GCGCCGCCAC GATTGCACTG
CACGGCACAA CCGTCGACAG CCCATTCACG TCGTGGCGCA CCACCGGCCG CCTGCTGGTG
CGCTGGGGCC TGCGGCAGGC GCAGGCCGGC GCCATCATCG ATCCCACCGC GCCGATCCGC
TTCACGCCCA CCCTTACCCA GACCACGGGA GCGAACGGGG GCGTCCCGGT GATCAACGTG
ACGACGCCGA ACGCAAGCGG CCTGTCATAC AACCTGCTGC GCTCGCTGAC GGTCGATGGC
GTGGGGCTGA TTCTGAACAA CAGCCTGCTC GGCGGCGGCA CGCTGCTGGG CGGCAACGTT
ACCGGCAACG CGAACCTGGC CACGTCCGGC TCGGCCTCGA CGATCCTGAC GCAGGTCACG
GGCACCGATC CGATCCGCAT CAACGGCACG GTGGAGGTGT TCGGCACGCC GGCCAGCGTG
ATCTTTGCCG CGCCGGCGGG CGTCTATACG CAGGGCGCGG GGTTCACCAA TACGCCACGG
GTGACGCTGT CGACCGGCAC GCCGCAGTTT CTGAATGGCA GCGGCGCAAA CGTTGCATTC
GATCAGGCCA CCGCGGTGGG CTTTCTCGTC AACAGCGGGC GCGTTCAGAT CGATCCGGCG
GCAGGCTCCA CGGCCGGTGC GGGGATCGAG GGCACGGTCG GGGCGATCAA CCTGATCGGC
CAGACGGTGG GCGTCAACGC GCCCTTGTTC GCGGGCAACC AGATCAACGT GATCGCGGGG
AACCAGCAAG TGGCGCCGGT GGCGACGGGC ACGGGGCGGG CGGGTTCCGA CTGGCAGGTG
AGCGGCACCG GGGCCAATGG GGCGGCCAAC AGCGCGAGCG CGCAGAACGG CCTGGCGATC
GACGCGACGG CGTTCGGTGC GATGACGGCG GGGCAGATCA AGCTGATCTC GACGGCGCAG
GGGCTGGGGG TGCGCGCGGC TGGTGATCTG GCGGCCAATA CCAGCAACGT CAATATCGAC
GCCAATGGCG ATGTCAGTGT TGGCAATGTG TATGGGCACC AGAATGTGGG TGTCACTTCG
ACCGGCACCG TCACCACCAC CGGTACCGTC AAGGCCCAAC AGGATGTCGC GCTGTCGGCC
AATGGCGATC TGACCGTTGG CGGTACCGCG CAGGCGGGCA ACAACCTGAC CTTGAACGCA
GGCGGCAATC TGACGGGGGC CGGTAACCTG GCCGCCGCCA AGGCGCTGAC CGCCGTGGCC
GGCGACAGCG TCAAGCTGAC CGGTAGCCTG AACGCTACCC ATCTCGCCGT CTTCGCACAG
GGGCAGGACG GCACGGGCGA CGTCGCGCTG GGCGGCAGCG TCTCGTCGCC CAACACCATC
ACGTTGACTG CCGCGCGCGA TCTGTCCGCG GCTGGGCCGA TCACGACGAA CGGCGACCTG
CAAGCCACAG TTGGCCGAGA CATCGCCATT GCGGGCGCCA CGCAGAGTTC GGGCGCCACC
GTGCTCAACG CCGGCCGCGA TATCGAGGTG GCGAACATGG GCTCGGTGGC GGCCGGCACG
ACGCTGTCCG CTACCACAGG TCGCAATCTC ACCGTATCGG GTGGTACCGC TTCTGGTGGC
GATACCACCC TGACCGCTAG CGGTGGAACT TTGGCGACTT CCGGCGCCGT GCTGGCCGGC
GGCAACGTCA CCGCCAATGG ACAAGCGGGC ACGACGCTCA GCGGTACTGT GTATGCGGCC
AAGGGTGTGA CGGCGCAATC GAGCGCGGGC GCCACGAGCG CGACCGGCAG TGTGGTCGCG
CACAGCGGCA AGGTTGCGCT CACGGGTACC AGCGTCGCCG TGTCGGGCTC CACCCAGTCC
GGGGGCGATA CGGCGCTCAG GGCAACGCAG GGCGACGCCA CCATCGATGG TCAGGCCGCC
GCGCTGGGCA AGTTGACCGT CACCGCGTCC CAGGACATCA CAGGGCCGGG CAGCACGGCC
AGCGTCGGCG ACACCGCGCT GACGGCAGGC CACGACATTG CCCTGACCGG CAGCAGCCAG
ACGGCCGGCA ATCTGGCCGC CACGGCTGGC AACCGTCTCG CATTGTCCGC ACTGCCTGTC
GTGGGCGGCG ACGCCACCCT GAATGGGGTC GCTGTCGTGC TGGGCGCAAG CGGCAAGACC
AGCCAGGTCA ATGGCACGCT GACCGCAACC GGGACGCAAA GTCTCGCCAC GGCCGGCACG
ATCAACACCG CAACGGCCAA CCTGACCGGC GGCACCGTCA CGAATTCGGG CACGGTCAGC
GCTTCAAAAA CCCTGACCGT CGCGGGCACC ACGATTACCA ACAGTGGCAC GCTGGGCGGC
GCCACTGCCA CCGTGCACGG CGCGGATGTG GCCAATGCCG GCCTGATCGG CGGCCAGACG
GTCAGCGTCA CGGCGGACAA CACGCTCAGC AACCAGAACG GCACGCTGCT TGGTACCAAG
TCGCTGGCCG TGGCCGCCAA CACGCTGACG AGCAACCGCA ACGGCGTGAT GTTCGCCGGC
AGCCCGTCCG GCACCACGCC CGGGCAAGGC GACCTGAGCG CCACGGTGTC GGGCGGGAAC
GGCAGCTTCA ATAACGCCGG TGGCCAAATC CTGGCCGGCA ACAACGCCAC GATCAACCTG
CGGAACCAGA CCGTCGACGG CGCGAACCTC GGCACCATCA ACGCCAACGG GGCGCTGACT
TACAACGTCG GCGCTGTCGC CAACACCGGC GCATGGACCG TGGGCGGCAA GAGCGCGACC
ATCAACGCAG CCAACGGCAT CGCCAACACT GGCTCGATCC AGCACGCGGG CGATTTGACG
CTGAGCACGC CCGGTGCGGT GACCAACAGC GGACAGATCA TCGCCGGCAA CGATCTGGCG
GTTTCGGGCG GCGGCATCAA CAACGCGGCC GGCGCGACGC TGCACGCCGA TCATGACCTG
TCTGTCACGG GTGCCACCAC CAACCGGGGC ACGGTCGAAG CGCTCAATGA CGTCAAGATT
GCCGGGGCCG GCTATGACAA CGCTGGCGCA CTGACTCAGG CCAACCGCGA CCTCAACGTC
AATGTGTCGG GCAACGTGCT GAACCAGGGG GGCACGATCG GCGCCGGGCG CGATGTGAAT
CTGTCTGCAG GTCAGATCAT CAACGATGCG ACCGCGTCGG GTGGTGCCAG CACGACTGTG
GTGACGGGTC AGGAAATCAA CCCGACGTAC TTGTCGCAGA TTGTGATTGG CAGGGAGCAG
TTGAGTTTCT CGCTCCCAGG CAGTGAGCCA GGTAGTCCGG ACTACATGCC CATCTACAGA
GCGGTCACGA TCGGCGATCT GAAACCGGAT GCGAACGGGG TCATCTACGG CTATCTGGCG
ACCGAACTGG TACGTAGTCT GCATGACCAG TACGTGGACT TCTGGCACCT CGGCAGCATG
GTCGACCATC CTGCTACGGC GCTTGTCACG CTGCCAACCG TGACGCGGAC CGAGACGACG
ACCCAGGCTG GTGTCAGCGG CATCATTCGG GCGGGGCGCA ATCTGGCGGT GACGGCTTCT
ACGCTGTCCA ACAACGGGGG GCAGATCAGC GCAGCGGGCA ATATCACGAT GGCCGTTGGG
GCGCTCAACA ATGGCGCATC GGCTGGCGCA AGCAAGACCA TCACCGAATC CATCGACGGT
GCCGTGCTCA ATAGCTTCAT GGCTCAGCTC TATGATGCTC TGCACTCCGG CACCAAACCG
ATGTTTTCTG ATGCCGTCAT GAGCGACGGT TGCTCGGATG GCTGTCCCAC ACCGCATTGG
TTTGTCATGC AGACCCAGGT GGTGCCCGGC ACATTCCACA CCGTGCCGGC AGCGACGCCT
CCCGCACCGC AAGTCACCGT GCAGCAGACC GCCGGCAAGC AGGGCGTCAT CGCCGCCGGC
GGCAGCATCG ATCTGACGCA GGTTGGCACG CTGAACAACG GCGGACAGAT CGCCGCCGCC
GGCAACATCA GCTTGGGCGG TTCGGTCAAC AATGTCGGCC AACAACTGGT CAACCGCACG
ACGTTGCCCG GCTGCGTCGG GAATCCGGCA ACGTGCACCA ACGCTGTCGG CAACAGCTTT
TCCGGCACTT CCGCCGGGCC GTGGGACAGC CCGACGTACG ACGTCATCGA TCCCAAGCAG
CAGGTCGCCA GCATCGTGGC GGGCGGCACG CTGACCGCCA ACGCCGCCCA GCTGACCAAT
CAGACCGGCA CCATCACCGC GGCCGGGAAT GTCCTGATCA CGGCGCCCAC CGTCACCAAC
ACGGGCGGCA CGATCCAATC GAAGGCCGGT TCGGTCACGA TCAATGCCGC CAATGGCCTG
GTCAACCAGG CCGCACCGAC GACCACGGTT CACCAGAGCC ACGGCTCGGA TGTCGGGCCA
TGCGGCAAGA GCGGCAGCGG CAACTGCGAT ACGGCCACGC AGACCGCCAC CGGCGACGCC
GGCATGATCC TCGCCGCGGG CGACCTGACC GTCAACGCCG GTTCGGTGCG CAACAATGGC
GGCGCCATGG TGGCCGGCGG CAACAACACC ATCACCACGG GCAGCTTTGA TAACAGCCCG
GTCTTCCTGC GCCAGTACTA CCACTGGATG TTCCTGGACC AGGACAGCAA CGCGAGCGAT
CGCTGGGGCT GCGATTCGGC AGGCGACATC TCCGGTTGCC AGCGGGCCTT TGGCGGCAAC
CTTCGCAACG GCACCAACGC CAATGCCGAG AACGCGCCCA CCATCGGCGC GCTGAACTCA
TACGTGAGCG GCGGCAACCT GACCATCCGC TCGGGCGGCG CCATCATCAA TAGCGGCAAC
ATCGAGGGTA CGGCGATTTC GCTCTCGGGC GCGACGATCA CCAACGGTAT CACCAACCCG
TCGATCCAGA CGCCGCCGTC CACCAGCGGC CGGCAGGTGG TGAGCCTGGG CCCGATCGGT
ACCGCCAATG CGCAGTTGCC GGTGACTGGG ACGCCGGACA CCTTCAGCGG TCCGACCACG
GTTGTGCAGC AGGGCGTACC GAACCCGTCC AACCCGGGCA CCGCGAATGG GCGCTGGCAG
TTCAACCCCG TCGTCGTGAC GACCCAGAGC GGTGGCGCGG TGGCATGGCA TTTCAATACG
CCGCTCGATG GCGCGGCGAT GAGCGCGCCG ACGGCGTCCG GCTCCACGGC GCAATACCTG
TCGAATAGCC CCGCCACGGC AGTGCTGGGC GGCGTCGGCC CGCAGACGCT GATCAACGCG
CTGCCGGCCA ATCTGCGCCC CGGCAGCACA CCGTTCTACT ACGACCCGCA AGCCGAGAAC
CAGCGCCTGG ACCAGGCCGC CCTCGCGCAG ACCGGCCGCA CGAGCTTCAT CAACGGCCTG
ACGTACGACA GCCAGACCCA CCTGACGGTG GACGACCAGC AAAAGCTGAT CCTGTACCAG
AACGCCGTCG ACTACGCGAA GGCGCACAAC GTCCAGCTGG GCCAGGCCCT GACGCCGGGC
CAACTGGCCG CGCTGGACAA GCCGATGCTG TGGTACGTGA CGCAGCAGGT GCCGGATCCG
AACTGCCTGA GCGGCGCGTG CCCGATGGTC AGCGCGCTGG TGCCGCAGGT GTACCTGCCG
CAGGGTTACA GCGGGATCGA GCCGGGCGGC AGCATCGTCG CGAGCAAGTC GTTGGAGCTG
CTGGCCGACA GCCCGATCCG CAACACCGGC ACGCTGGGCT CGTACGGCAC GCTGACGAGC
AACACCACCA TCATCAACGA GCAGCGCGCG GCGGAGATGA CGGCGGCGTG GCAGCCGATC
GAGGACGGCT GGGCGCGCAC GACGGGGCAG CAGGGGCAGG CCAACAGCGG CTTCGTGTTC
GCGGCCAACG CGGCGGGCAT CGCTGGGCAG ATCCAGAACA TCAATGGTGT CGTTGCGCAG
CTGAATGCGG ACGGCACGAT GAGCGCGGCG GAGGCTGCAC GGGTGGCTGC GGCCGTGCAG
GCTGGTATGC AGGCTGTGAC CAGCACGCAC ACGGACACCT TTGTGCGCTC GGAAGACTGG
TTCGGGCAAC TGTTCGCCGG CGTGGTGATG GTGGCCATTG GGATCATGAC GGGCGGCGCG
GCGATGGCCG CGTATGCCGG GGTGGGAGCG ACCCTGACCG TGGGTCAGGC CATGGCCCAG
GCGGCGGTTG CGTCGATGAC GACCAACACG ATGCAACAGG CGAGCAGCGG CCAGGGCTTC
AGCTTTGGCG CGCTGGTCAA GGCAGGCGCC ACGTCGGCGC TGACGGCGGG GATTACGCAG
GGGATCACGC TCAATGCGAA TGGCATGCTT GGGACGGTGG ATAGCCTGAA CTCGGTGGCA
TCCGATCGGA GCCTTGCGGC GTTGTCGGGT GCCAAGAACG TCGGCAACGG TCTGACGCAG
GCGGCGGCTT CAAGCGGCAC GCTGGGCGAG CAATTGACCG CGCTGACGTT GGGCATCGGC
ATCAAGGCCG GTGTGACCAC GGCCATCAAC GGCGGCAGCT TTGGCCGGGC ATTGACCAAT
ACCGCCGCCA GCGATATCGG GGCGGTCGCG GCCAACGTAC TCGGCACGCT GACTCCGGGG
ATCGGCGAAG TCAATGCCTC GCCGAACAGT GTCGTCGGCA ACATCCTCGG GCACGTTGCA
CTGGGCTGCG CCACTTCGTC GATGCAGGGG ACGGGTTGCG CGGGCGGCGC AGCGGGCGGT
CTGGCGGGAT CGGTGGTGGC GCCGCTGGTG GGCATGGGCC TCTATGCGGG CACGTCGGGC
ACGAACAGTG CCATCGATGC GGCGACGGTG GCGATTGGCG CGATGGCGGG CGGGGCGATT
GCGCATGCGA TCGGTGGCGA CACCACGGCC GGGGCGTCGA CGGCGCAGAA CGCGGCCATG
AACAACTGGC TGGATCACCG CCGGCCCAAC GCGGTGGTCT ATTCGGAGCA GGAGCGTCGG
GATAACGCAG CAGCGGCTTG TAAGGACGAT CCCACGCAGT GCGATGCTGC CAACCGCTGG
GATGTCGTGT CGAAGCAGCG CAATGCCGAA TTGCAGGCCG CGTGCGCCAA CCTGAGTTCC
GACACCTGCC GTAGCGCGAT GGCGACGGCG CAGGCGGCGG GCAACTACAT CGTGTTTGCC
GGCGGCAAGG TCTACGCGTA CGGCAAGGAG GATCCTGTCG CCCGTTCGTT GGACCCCAGT
CCGGCGGCGA AAACGCTGGA CACGATGGTG GGAAGTCCGC TGGCAGGGAT ATTTGGGGGC
ATTCCGTATT TCAAGTCCAA TGCGGACCCG GCTGCCGGTT ACTACTTCGC TCAGTACGGG
ATGGCGCTGG AAGGGATCGG CGCGGGGGTG CTGGGTCTGC CGACGGGGCC GTTGGCTGGG
CCGGGGTGGC GGGCGACGTT GGAATCGCCG AATACGCTCT ATGTTGGTTC GGGGGCGGGT
AGTGCGTTGC CGACTTGGAC CAACGTGGCG GGGCCTTACT CTGTCATTGG GCAAGGTGGC
GGGGCGGCTA CAACCGTTCA GGCTGGGCCG GGATATTCAA TTGGTGATGT TTTGCGTGGA
AGCCCAAATA GTTCAAGTAA TCTCCCTGCT ACTCAACAGT CGTCTGCAGA TAACATCGTT
GTCAGCGTTC CTGGTCGAGT ACAGTCGCGC ATTAATGTGA CAAACGCTGG CATGGACCAT
ATTGATGCCC GACACTTCGA TTCGACGGTG AATGCGAGCC AGTTTACTCT TAGTGAGAGC
GATCTTGTGA CCTTGTTGCA GAGTCCAAGC ACTGTATCCA CTCCAGTGAT CCGAACTATC
CAAAGCGGTA GCAATATCAA CTACGTTCGA GAAGTGAATG TTGGGAAGGT GGTCGGTACC
GATAAATTTA GTAACTATCA ACCGACGTCA ACAATGACAG TGATTACAGA TAAGTACGGT
AATCTAGTTA CCGCATTTCC CGGGAAATTG AAATGA
 
Protein sequence
MRQPVTARAR AALEQPRAIA LQQEQHGASA GTADPVSRHD RARRSFVARS IAVAVALLSW 
LGPVQVSWQA ARQSAATIAL HGTTVDSPFT SWRTTGRLLV RWGLRQAQAG AIIDPTAPIR
FTPTLTQTTG ANGGVPVINV TTPNASGLSY NLLRSLTVDG VGLILNNSLL GGGTLLGGNV
TGNANLATSG SASTILTQVT GTDPIRINGT VEVFGTPASV IFAAPAGVYT QGAGFTNTPR
VTLSTGTPQF LNGSGANVAF DQATAVGFLV NSGRVQIDPA AGSTAGAGIE GTVGAINLIG
QTVGVNAPLF AGNQINVIAG NQQVAPVATG TGRAGSDWQV SGTGANGAAN SASAQNGLAI
DATAFGAMTA GQIKLISTAQ GLGVRAAGDL AANTSNVNID ANGDVSVGNV YGHQNVGVTS
TGTVTTTGTV KAQQDVALSA NGDLTVGGTA QAGNNLTLNA GGNLTGAGNL AAAKALTAVA
GDSVKLTGSL NATHLAVFAQ GQDGTGDVAL GGSVSSPNTI TLTAARDLSA AGPITTNGDL
QATVGRDIAI AGATQSSGAT VLNAGRDIEV ANMGSVAAGT TLSATTGRNL TVSGGTASGG
DTTLTASGGT LATSGAVLAG GNVTANGQAG TTLSGTVYAA KGVTAQSSAG ATSATGSVVA
HSGKVALTGT SVAVSGSTQS GGDTALRATQ GDATIDGQAA ALGKLTVTAS QDITGPGSTA
SVGDTALTAG HDIALTGSSQ TAGNLAATAG NRLALSALPV VGGDATLNGV AVVLGASGKT
SQVNGTLTAT GTQSLATAGT INTATANLTG GTVTNSGTVS ASKTLTVAGT TITNSGTLGG
ATATVHGADV ANAGLIGGQT VSVTADNTLS NQNGTLLGTK SLAVAANTLT SNRNGVMFAG
SPSGTTPGQG DLSATVSGGN GSFNNAGGQI LAGNNATINL RNQTVDGANL GTINANGALT
YNVGAVANTG AWTVGGKSAT INAANGIANT GSIQHAGDLT LSTPGAVTNS GQIIAGNDLA
VSGGGINNAA GATLHADHDL SVTGATTNRG TVEALNDVKI AGAGYDNAGA LTQANRDLNV
NVSGNVLNQG GTIGAGRDVN LSAGQIINDA TASGGASTTV VTGQEINPTY LSQIVIGREQ
LSFSLPGSEP GSPDYMPIYR AVTIGDLKPD ANGVIYGYLA TELVRSLHDQ YVDFWHLGSM
VDHPATALVT LPTVTRTETT TQAGVSGIIR AGRNLAVTAS TLSNNGGQIS AAGNITMAVG
ALNNGASAGA SKTITESIDG AVLNSFMAQL YDALHSGTKP MFSDAVMSDG CSDGCPTPHW
FVMQTQVVPG TFHTVPAATP PAPQVTVQQT AGKQGVIAAG GSIDLTQVGT LNNGGQIAAA
GNISLGGSVN NVGQQLVNRT TLPGCVGNPA TCTNAVGNSF SGTSAGPWDS PTYDVIDPKQ
QVASIVAGGT LTANAAQLTN QTGTITAAGN VLITAPTVTN TGGTIQSKAG SVTINAANGL
VNQAAPTTTV HQSHGSDVGP CGKSGSGNCD TATQTATGDA GMILAAGDLT VNAGSVRNNG
GAMVAGGNNT ITTGSFDNSP VFLRQYYHWM FLDQDSNASD RWGCDSAGDI SGCQRAFGGN
LRNGTNANAE NAPTIGALNS YVSGGNLTIR SGGAIINSGN IEGTAISLSG ATITNGITNP
SIQTPPSTSG RQVVSLGPIG TANAQLPVTG TPDTFSGPTT VVQQGVPNPS NPGTANGRWQ
FNPVVVTTQS GGAVAWHFNT PLDGAAMSAP TASGSTAQYL SNSPATAVLG GVGPQTLINA
LPANLRPGST PFYYDPQAEN QRLDQAALAQ TGRTSFINGL TYDSQTHLTV DDQQKLILYQ
NAVDYAKAHN VQLGQALTPG QLAALDKPML WYVTQQVPDP NCLSGACPMV SALVPQVYLP
QGYSGIEPGG SIVASKSLEL LADSPIRNTG TLGSYGTLTS NTTIINEQRA AEMTAAWQPI
EDGWARTTGQ QGQANSGFVF AANAAGIAGQ IQNINGVVAQ LNADGTMSAA EAARVAAAVQ
AGMQAVTSTH TDTFVRSEDW FGQLFAGVVM VAIGIMTGGA AMAAYAGVGA TLTVGQAMAQ
AAVASMTTNT MQQASSGQGF SFGALVKAGA TSALTAGITQ GITLNANGML GTVDSLNSVA
SDRSLAALSG AKNVGNGLTQ AAASSGTLGE QLTALTLGIG IKAGVTTAIN GGSFGRALTN
TAASDIGAVA ANVLGTLTPG IGEVNASPNS VVGNILGHVA LGCATSSMQG TGCAGGAAGG
LAGSVVAPLV GMGLYAGTSG TNSAIDAATV AIGAMAGGAI AHAIGGDTTA GASTAQNAAM
NNWLDHRRPN AVVYSEQERR DNAAAACKDD PTQCDAANRW DVVSKQRNAE LQAACANLSS
DTCRSAMATA QAAGNYIVFA GGKVYAYGKE DPVARSLDPS PAAKTLDTMV GSPLAGIFGG
IPYFKSNADP AAGYYFAQYG MALEGIGAGV LGLPTGPLAG PGWRATLESP NTLYVGSGAG
SALPTWTNVA GPYSVIGQGG GAATTVQAGP GYSIGDVLRG SPNSSSNLPA TQQSSADNIV
VSVPGRVQSR INVTNAGMDH IDARHFDSTV NASQFTLSES DLVTLLQSPS TVSTPVIRTI
QSGSNINYVR EVNVGKVVGT DKFSNYQPTS TMTVITDKYG NLVTAFPGKL K