Gene BURPS1106A_1129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1129 
Symbol 
ID4900885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1102888 
End bp1112313 
Gene Length9426 bp 
Protein Length3141 aa 
Translation table11 
GC content61% 
IMG OID640134359 
Productcell surface protein 
Protein accessionYP_001065409 
Protein GI126454898 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACGTCTA AAAAAGCGTT TGCCGTATCC ATGAATAAGA ACCAATATCG TCTGGTGTTC 
AGCCGCGTGC GCGGCATGCT GGTAGCGGCC GAGGAGACCG CTCATGCAAC GGGGAAAGTC
AGCAAGGGAG AGGCACCGCG CTCCGTTGCC AATCGCGCAT TGAGCGCTTT GGCGAGCTTC
GCCTTGCGTC ATGCCGCGTT CGCGGTGTTG ATTGCGGCCG GCGTTACACC GATGTGGGCG
AGCGCTCAGG TCGTGGGCGC CGGTGCCAAC GCACCGTCAG TCATCCAGAC GCAAAACGGC
CTCCAGCAAG TCAACATTAC CAAGCCGAGC GGCGCGGGCG TGTCGCTGAA CACCTATTCG
CAGTTCGACG TGCCCAAACA AGGCGTTATC GTCAATAACT CACCTACGCT GACGAACACA
CAGCAGGCTG GGTATATCAA CGGCAACCCG AACCTCGGCC CGAACGGTTC GGCCAAAATC
ATCATCAACC AGGTCAACAG CAACAACCCT TCACAACTGA AAGGTTATGC CGAGATCGCC
GGCCAGCGCG CCGAGATGAT TATTTCGAAT CCGTCGGGGT TGGTGGTCGA TGGTGGTGGC
TTTATCAACA CCTCGCGCGC AATCCTGACG ACTGGCACGC CGAATCTGAA TGCGGACGGC
TCGCTGGCGG GCTTCAACGT GACGCGTGGC CTGATCACAG TGCAAGGCGC GGGCCTGACC
GCCACGAATG TGGATCAGGT CGACCTCATT TCCAGAGCCG TACAGGCCAA TGCAGCCATC
TATGCATCGA ATCTGAACGT GGTCGCTGGG GCCAATCAGG TCAATCACGA GACGCTCCAG
GCGACCCGTA TTCAGGGGGA TGGTCCGGCT CCGGCCGTTG CAATTGACGT GGGTCAATTG
GGCGGTATGT ACAGTAATCG AATTTTTCTG GTGGGGACCG AAGGGGGCGT CGGGGTCCGC
AATGCAGGGA CGATCGCGGC CGATGCGATG GGCCTGACGT TGACGACGGA CGGCCGCCTG
GTGCAGGCGG GCAAGATCAG TTCGGTCGGG AACGTTGCAG TATCGGCCGC GGGCGGGATC
GAGAATAGCG GTACGACCTA CGCTCAGCAG TCGGTGTCGC TCAACACCGG CGCGGACGTG
GCGAATACCG GTACGCTCGC CGCGCAGCAT AATGTCGGCG TGACGGCGGG TTCGCTGAAT
GCGACGGGCT TGCTGGGTGC GGGTGTGAAC AGTGACGGTA CCGTCACGCA AGCCGGCGAT
CTGCAACTGA CGACGGCGGG CCAGTTGAAC GCGACCGGCA AGAACGTCGC GGGCGGCAAC
GTCTCGGTGA CCGGCGCCGG CGTGAGCCTC GCTGGCAGCA CGACGGCCGC GAACGGCAGC
CTGTCGCTGA GCGCGACGGC TGGCGACGTG AATCTCACGA ACGCGACGAC GAGTGCTCAG
GGGGCCGTCA CGGCGAATGC CGCGGGCACG GTCATCAATG ACCACGGCAA CCTGACGAGC
GGCGGCAGTA CGACGCTGAC GGGCGGCAAC GTATCGAACC AAGGCGGCAC GGTGTCGTCA
CAAGGTCCGT TGTCCGTGAC AGCCGCCGGG CAGATCGCCA ATCAGGCCGG TGTGCTCGTG
TCGGAAAGCA CAATCGCGCT GCGCGGCGGT ACCGTCGCGA ACAACCAGGG CACGATTCAG
AGCGCCGGCC ACGCGACTGT CGACGGCGTG ACGATCGACA ACACGGCCGG CCGCATCACG
TCGCTCAATA CCGATGGGAT GGTGCTGACG GCAACTGGCC AGCTCACGAA CGTGGCAGGC
ACGACCGCGA ACGGCGCACA AGGCGGTGTG ATCGGCGGCA ACGGCGATGT GACCGTCCAA
GGCGGCAATA TCGCGAACCA CGCGACGATT ACGTCCAATA CCAATCTCCA CGTTTCTGGA
CAATCGGTCG ACAACAGCGG CGGGGCGCTC CAAGCCGCGC AGGGTGTGAC GGTCGATGCC
GGCACACACC TCGCCAATGG CGGCGGCAGC ATCGTCGGCC AAACGGCGGC GCTGACGGGC
ACGACACTCG ACAACAGTTC CGGTACCGTG CAGGCGGATC AGGTTTCGTT GACCGCTACC
AACGTGGTGA ACCACGGCGG CACGATCACG CAGACCGGGA ACGGCGCGAT GGCCGTCAAT
GTCACGGGCA CGCTCGACAA CTCGCGCGGT GGCACGCTGC AGACCAACAG CGCCGATCTG
ACGTTGGCTC CCGCAGCGCT GGTGAATGAC GGCGGCACGA TCACGCACGC CGGCACCGGT
ACGCTGACGC TCGGCAATGG GGCGGGTTCG GTGTCGAACG TCGGTGGCAC GATCGCCAGC
AATGGACATA TCTCGGCTCA AAGCGGTTCG CTGAACAACA CGTCGGGTTC GATCAATGCC
CAATCCGGGC TGACGGCCGC AGTGAGTGGC GTGTTGAACA ACACGAACGG CAAGCTGCTT
TCGAATACGG ATCTCGGCAT CACCAGCGGC ACGCTGACGA ATGACGGCGG ACAGATCGGC
GCGAGCACGA ATGCAACGAT CCATACGGGC TCGATGACCA ATCGGGGCGG AACGGTCGTC
GCACCCAACC TGTCCGTGAT CGCCGATTCG ACGCTCGACA ACAGCGGCGG CACGCTCGAA
GCGAACCAGC TCGCGTTGAC CGCGCCGAAC CTGACGAACC ATGGCGGGAC GATCACGCAG
TTTGGATCGT CCGTGATGGG CGTCAATGTG AGCGGCACGC TGGACAACTC CGCGGGCGGC
GTGATTCAGA CCAACAGCAC GGACCTCACG CTGACTCCGG CCCAATTGAA CAATGCGGGC
GGCACGATCA CGCATGCCGG CACGGGCACG CTGACGATTG AGCCCGGCAA CGGCGCGAGC
GCGCTGAACA ATGCAGGCGG CACCATCGTC ACCAAGGGCC AAGCGGTGGT TGACGCGAGC
AGTTGGGACA ACTCGGGCGG TATTCTCGCG GCGCAGGGCG GCATCACCGG AACGATCGCG
GGCGATGTGA ACAACGCGCA AGGGCTGGTG CGTGCCGGAA CGTCACTGTC GCTGACGAAC
GGCGGCGCGC TGGTGAACCA GGGTGGACAT ATTCAGGCCG GCCAGCAGAC GGCGGGCGAT
ACGAGCACGC TCAGCATCCA GTCGGCATCG GTTAACAATG CTGACGGCTC GATTGCCGAC
CTCGGCGCGG GCAAGATGAC CGTGCAGGGC GGCAGCCAGA TCACCAACAG CCACGCGGGC
GGCGTGTCCG GTATGGGCGC GATCACCGGT AACGGTGACG TGACGGTCAG CGCGACGTCG
ATTTCCAACA CCCAGGGCGG CCAACTGAGC GGCGCAAGCC TCCACGTTCA AGGGACGACC
CTGGACAACA GTGGCGGGCA GATCGGCAAC GTCGCGCATT CGAGTGGCGA TGTGGACGTG
ACGACGAGCG GCACGGTCAC GAACACGAAC GGGCAGATCA GTTCCACGCA TGACCTGACA
GTAACGGCTC CGACGCTCCA GGGCGGCGGC ACGTACAGTG CAGCGCACGA TGCCAACGTG
AATCTGCAGG GTGATTTCTC GGTGACGCCC GACTACCAGT TCAACGTCGG TCACGATCTT
GCGTTCGCGT TGCCTGGCAC GTTCGACAAT AGCGGCAACG TGCAATCGGT CAACGACCTG
AACGTCAACG CGGGCAACAT CGTCAACTCG GGCGCACTGT CAGCCGGTGG GTTGCTGCAT
ACGCAATCGG GCAATCTGAC CAACACGGGT GCGATCGTGG GCGGCAGCGT GTCGCTCAAT
GCAACGGGCA CGGTAGCGAA CGTCGGTCCG ACCGCGCTGA TTGGCGCGTC GGACAGCAAC
GGCGCGCTTG AAATTCTCGC GAACGACATC GAGAACCGCG ACGGCACGAC GGCGACTGAT
TCGATGGCGA CGACGGCCAT CTTCGGGATG GGCAAGGTTG TATTGGCCGG TGGCAAGGAC
GCCAGCGGCA AGTACACGAA CGCGGCCCTG ATCAACAACT CGTCGGCGTT GATCCAGTCC
GGCGGCGACA TGGAATTGCA TGCCGACAAG ATCACGAACA CGCGTCGCGT GATGACGACT
TCGACGGGCT CGGTCGATCC CGCGACGCTG GCGCCGTTCG GCGTACCGAT CAAGGGCCAA
ACGGGGCAGG TCGGTGTTAA GGACCCGACG AGCATCGGCG GCGTGTATAC CGATCCGCCC
CACGGCGGCC AGTGGAACAG TACGTATCAA TATACGACCT ACTACGCGGA TAGCGCGACC
GCGACGACCG TGACCAGCAT CAGCCCGGCT GCGCAAATCG TGTCGGGCGG CAGTATCAAT
GCATCAACGG TCGCCAGCCT GCAAAACTAC TGGAGCAGCA TCGCGGCCGT TGGCAATGTC
CAGATGCCGA AGAGCTACGA TGCCAACGGG TGGGCTGCCA CCGGACAACA GGCACCGAGC
GTGACTGTCT CGTATTCCGG GCAGTATCAC TACAACAACT ATGACAACAC CGAGCACGAT
TGGCAATTGC CGTTCGGCAA TGCGCCGTTC GTGACGAGCC GTCCGGGTGG CTATACGCAG
GCGGCGCCGG CATCGGTCAA GCAGTACTCG CTGCCGAGCT ACGATTCGAC CTTGGGTTCG
AACGGGACGA TTTCCGGAAC CGGCGTGAGC ATCAACAACA CGGCGGGCAA CGCGTCGATC
CCGTCGCTGG GCTTGCTGCC CGGGCAAGCC GTACCCGGCC TGACGATCGG TGGACTGAGC
GGTAGCGCGA GCGGCACGAA GTCGGGTGCG TCGGCGGTGC ATGGTGGTGT GACCACGATC
GATCCGGTCA TCGCCAGCGC GACCGCGTTG AACGTGCTGA ACAACCTGAC GATTCCGCAG
GGCGGCTTGT TCAAACCGAA TCCGTCGCCG AACGCGAGCT ACGTGATCGA GACGAACCCG
GCGTTCACGA ACCAGAAGAG TTTCATTTCG AGCGACTACT TCTTCGGTCA GATCGGTGTC
GACCTTACCC ACATTCCGAA GAGGCTGGGT GACGGCTTCT ACGAGCAGCA ACTTATCCGT
AATCAGGTCA CGTCGCTGAC GGGCCGTGCG GTGCTCGGTC CGTACACGGA TCTGCAGTCG
ATGTACAAGT CGCTGATGGC GGCAGGCGCT TCGCTGGAGA AGTCGCTCAA CCTGCCGTTG
GGCGCGAGCC TGTCGGCCGA GCAGGTGTCG CAGCTCACGA GCAACGTGGT CGTGATGGAG
ACGCGCGTGG TTGACGGGCA GTCGGTGCTG GTGCCGGTCG TGTATCTCGC GAAGGCCAAT
CAGCAGAATA TCAACGGTCC GCTGATTACG GCCACCGACA TCGACCTGAA GGATGCGCAG
AACTTCACGA ACAGCGGCAC GGTGAAGGCG GACAACACGC TGTCGATCCA GGGCAAGCAG
ATCGACAACG CATTCGGTGC GCTGCAGAGC GGCGGCCTGA TGTCGTTGAC GACGACCGGG
AATGTCGATT TGACGTCGGC CAAGGTGCAA GCCGGTAGCC TGAACCTGAA TGCGGGTGGC
GACCTGATTC TCGATACCGC AGTGAAGACC GACAAGCGGG TCAGCCGCGA CGGCGCAACA
AGTATCACGA CGATACTCGG ACCGACCGCC CAACTCGACG TGACGGGTAA TGCGGCGATC
AAGACGGGCG GCAACTTCCA GCAGAACGCG GGCAACCTGT CGGTTGGCGG CAATCTCGGT
ATGAATGTCG GGGGCGATTG GACTTTGGGT GCGGTGCAGA CGGGCGAACA CAAGATCGTG
CAGCGCGCGA ACGGTGTGTC GAATACGGAC ATCAACAACG CAGTTGGCAG CTCGGTGAAG
GTCGGCGGGC AATCGAGTAT CGGCGTCGGC GGGGACGTTA CTGCGAGAGG TGCGCAGATC
GATCTCGGTC AGGGGGGCAC GATCGCGGCC AAGGGTACCG TTACGCTTGG CGCTGCGAGT
GCAACCTCGA CGGTGAACAG CAACAGTTCG GGTAGTGATA GTCACGGCAG TTACGCCGAG
ACACTGCACA CATCGGATCA GGCGCTTACC GGAACAATGC TCAAGGGCGG AGACACCATA
ACGCTTGCGT CGGGCAAGGA CATCACGATC AGCGGCAGTA CCATCAACCT CGACAAGGGC
AATGCAAACC TGCTGGCGAA GGGGGACGTG AATGTCGGTG CGGCGACCGA AACGCATACG
TTCGAATCGC ACGAAACGCA TAGCCATAGC AACGTAGTAA GCGGCGTGAA GGTCGCAAGT
GGCACAGACC AGACCGCAAC GTATAGCAAG GGCAGCACAA TTTCGGCGGA TGGGATTACG
GTCGAAAGTG GCCGGGATAT CAACGTGACG GGAAGCAACA TCGTCGGCAC GAACGACGTG
AGCCTTGATG CGGCGCGTAA CGTGAATATC ACCACGTCGC AGGACACGGT GCAATCGTCG
TCGTATTACG ACAAGAAGGA AAGCGGCTTG CTGACCAATG GCGGGCTGTC CGTGACCTTC
GGTAGCCGCT CGATGGGCCA GACGGACCAA TCCAAGCAGG TGACGAACAA CGCAAGCGTC
GTCGGCGCAT CGTCCGGCAA TGTTTCGATC AGCGCGGGCA AGGACGCGAC CATCACCAGC
AGTACCGTAG TCGCCGGTCA AAATCTCGAC GTGACCGGCC AGAACGTTGC TGTGAATTCG
GCCTATGACA CGTACAACGA CGCGCAGTCG CAGCACTTCA GCCAATCGGG CTTGAGCGTC
GGCGTGAACG GCGGTGTGGT GGGCCTTGCT CAGTCGATGG CGAGCACGGT TCGTCAGGGC
GTGCAGTCTG GCGATTCACG TTTGGCGGCA GTGCAAGGTG TAGCGGCAGC CGAGCAGGCT
TATCAGAGCC GTGATGGATT GAAGAATGCG GCCACTGCCC TGTCGAGCGG TAAAGTCAGT
GAAGCCGCGA ACGGTGTACA GGTTCAACTG AGCATCGGAT CGAGCCACAG CAGCAGCAAC
GAGACGACAT CCATCACACA GGCCAAGAAT TCGTCGCTCA TCGGCAACGG CAACGTGCAT
GTGACCGCGA CGGGCACGCC CGACGCGAAC GGCAACGCAC AACCAGGAAC CGGCGATATC
ACGATGACCG GTGCGAACGT GTTGGGTAAG AACGTGTCGC TCAATGCGAA CAACGCGATC
ACGCTGCAAA GCGCACAGAG CACGGAGCAG GACACGAGTT CGAATAGTTC ATCGGGCTGG
AACGCGGGCG TTGGGATCGG CGTCGGCAAG CAGACTGGGA TCAGCGTTTT TGCGAACGGT
ACTAACTCGC ACGGCCAAGG CAACGGTAGC GCCGTGACTC AGACCAACAC GACCATCGCG
GCAGGCAACA CGCTGGTGAT GAAGTCGGGG GGCGATACGA CGTTAGCCGG TGCGCAGGTG
TCGGGCGATA AAGTGAAGGC CGATGTGGGC GGCAATCTCA CGATGACGAG CGTTCAGGAT
ACGTCGAACT ACGCGAGCAA CCAGCATAGT GCGGGTGCGA GCGGTAGCTT TACGTTCGGC
TACGGTGGCG GTGCCGAATT GTCGATTGGA CACACCGGCA TCGATGCGAA CTATGCTTCG
GTCGATCAGC AGACCGGTAT CGTTGCCGGC AAAGGCGGGT TCGACGTGAG CGTGGCGAAC
CATACGCAGC TCAACGGCGC GCAGATCGCG AGCGCCGCCC CTGCTGAAAG CAATACGTTG
ACGACGGGCA GCCTTGGGTT CAGGGACATC CAGAATTCGA TGTCATACTC GGCATCGTCG
GAAGGCTTCT CGACTTCGAG CGGTCCGAGC TTCGCGCATA CGGGTGATAG CGCGAGCGGC
GTGACGAAGG CAGCGGTGAG TCCGGGGACG ATTACCGTCA AGTCCGATCA ACAGAATGGC
ACCGACAGCA CTGCAGGTTT GTCGCGCGAT ACGGCGAACG CGAACCAGAC CGTTAAGAAC
ACGTTCAACT TGCAGCAGAC ACAAAACGAT CTGGCGTTTG CGCAGGCGTT CGGCAAGGCG
GCGACGTTCG CGGTCGCGGA AGCGGCCACG CAGCTTGAGA ACAGCAGTCC GCAGATGAAG
GCGTTGTTCG GCGAAGGCGG CGCGGGACGC GATGCGCTGC ACGCGGCGGT GGCCGCGATC
GGCGCGGCGT TGTCGGGCGG CAACATCGGC GGCGCGGTGG CGGGTTCGCT CGCGGGCGAT
GCGTTGCAGT CGCTGGCGCA GCCGATTATC GATCAGGCGG TAAGCCAGTT GCCGCTGGAT
GCGCAGGCGG CTGCACGCAA GGCACTGAAC GAGGTTGTCG CGACAGCAGG CGGTGCGGCG
GGCGGCGCGC TGGCCGGAGG CGGTTCGTCG GGGATGCTCG CGGGTGCGGG CGCGGCGGCG
AACAATGAGC TTTACAACCG TCAATTGCAC GAAAGCGAAG CGCAAAAACT CCAGCAGCTT
CAGAAAAATC AGTCGCCCCA AGAGCAATAC CGCCTTGCCG CAGCGGAGTG CTCGCTTGTG
CATTGTGCAG ACAACATCCC GGACAGCGAT CCGAACAAGG CCGTGCTGCA AAAGATGCAG
AATGACGGCG CGCAGTTCAC CTACGAGCAA GGGGTACTGA AGAAAGCCGG TGCGTTCGAT
GGGTACGGCA AGCTCGATTC ACTGTCCGAT GCCTATGATC GGAATCAGGT CTCGAATCGT
CTTGTCGGTG CCGTGCAGGG TGTCGGAAGT ACCGCAGCGG GAATCGGCGC TGCAACAGGT
GGCTGCTATA CGCTCGTTGC TTGTATCGCT GGGGCGGCAG TAGCTGGCGT GAGTTTCGAT
TACGCAAAGG CAGGCTTTAC GCAGCTTGTG AACGGTAACC CGACGCCAAC CTATGGCGAA
CTGGCATTGC AGAGTTTGGG GATGAGTCCG AGCGGCGCTG CGTTGACTTA CGCAGGCTTG
GGTCTCGGCG CGGCAGTCGG TAGCGTGGCC GCGAATAATG CGGCTGCACA GGCAGCGGCG
AAGGGCGTGC CGCAATCAGT TGAGTCGATT CAGGCCGGGA TCAAGTACGA CCTGATGCAG
CAAGTTGCTG ATTTGCGCGC ATCGCTGACC GGTACTCCTC GAACAATGGG AAATATGGGG
GTTGCACAGA TTAGCATTCC TGGGGTTCAG TCAGAGATGG CTGCGTCTAG TCAAATCCCC
AATCCAACCG CCGAGCAACG GGCACTTGGG TTTGTTGGGA TGGGACCTGA TATTTTCTCT
AGCACGGTTG TTCCTTTGCC AAACGGATAT CCGTTGCTGC GGAATGTAGA CTCGGAAGCG
AAAATATTGA ACAACGTCGC CGCGCAACTC GGCGACAATA CGTCAGTTAG TGGGGTGATT
AATCTTTTCA CGGAGCGGCC GCCATGCACA AGCTGTTCAA ATGTGATTCA GCAATTTCAA
AACAAATACC CGAATATTAA AATTAACGTC ATGGACAGCA ATGGCGTGTT GAAGCCGTCT
AAATAG
 
Protein sequence
MTSKKAFAVS MNKNQYRLVF SRVRGMLVAA EETAHATGKV SKGEAPRSVA NRALSALASF 
ALRHAAFAVL IAAGVTPMWA SAQVVGAGAN APSVIQTQNG LQQVNITKPS GAGVSLNTYS
QFDVPKQGVI VNNSPTLTNT QQAGYINGNP NLGPNGSAKI IINQVNSNNP SQLKGYAEIA
GQRAEMIISN PSGLVVDGGG FINTSRAILT TGTPNLNADG SLAGFNVTRG LITVQGAGLT
ATNVDQVDLI SRAVQANAAI YASNLNVVAG ANQVNHETLQ ATRIQGDGPA PAVAIDVGQL
GGMYSNRIFL VGTEGGVGVR NAGTIAADAM GLTLTTDGRL VQAGKISSVG NVAVSAAGGI
ENSGTTYAQQ SVSLNTGADV ANTGTLAAQH NVGVTAGSLN ATGLLGAGVN SDGTVTQAGD
LQLTTAGQLN ATGKNVAGGN VSVTGAGVSL AGSTTAANGS LSLSATAGDV NLTNATTSAQ
GAVTANAAGT VINDHGNLTS GGSTTLTGGN VSNQGGTVSS QGPLSVTAAG QIANQAGVLV
SESTIALRGG TVANNQGTIQ SAGHATVDGV TIDNTAGRIT SLNTDGMVLT ATGQLTNVAG
TTANGAQGGV IGGNGDVTVQ GGNIANHATI TSNTNLHVSG QSVDNSGGAL QAAQGVTVDA
GTHLANGGGS IVGQTAALTG TTLDNSSGTV QADQVSLTAT NVVNHGGTIT QTGNGAMAVN
VTGTLDNSRG GTLQTNSADL TLAPAALVND GGTITHAGTG TLTLGNGAGS VSNVGGTIAS
NGHISAQSGS LNNTSGSINA QSGLTAAVSG VLNNTNGKLL SNTDLGITSG TLTNDGGQIG
ASTNATIHTG SMTNRGGTVV APNLSVIADS TLDNSGGTLE ANQLALTAPN LTNHGGTITQ
FGSSVMGVNV SGTLDNSAGG VIQTNSTDLT LTPAQLNNAG GTITHAGTGT LTIEPGNGAS
ALNNAGGTIV TKGQAVVDAS SWDNSGGILA AQGGITGTIA GDVNNAQGLV RAGTSLSLTN
GGALVNQGGH IQAGQQTAGD TSTLSIQSAS VNNADGSIAD LGAGKMTVQG GSQITNSHAG
GVSGMGAITG NGDVTVSATS ISNTQGGQLS GASLHVQGTT LDNSGGQIGN VAHSSGDVDV
TTSGTVTNTN GQISSTHDLT VTAPTLQGGG TYSAAHDANV NLQGDFSVTP DYQFNVGHDL
AFALPGTFDN SGNVQSVNDL NVNAGNIVNS GALSAGGLLH TQSGNLTNTG AIVGGSVSLN
ATGTVANVGP TALIGASDSN GALEILANDI ENRDGTTATD SMATTAIFGM GKVVLAGGKD
ASGKYTNAAL INNSSALIQS GGDMELHADK ITNTRRVMTT STGSVDPATL APFGVPIKGQ
TGQVGVKDPT SIGGVYTDPP HGGQWNSTYQ YTTYYADSAT ATTVTSISPA AQIVSGGSIN
ASTVASLQNY WSSIAAVGNV QMPKSYDANG WAATGQQAPS VTVSYSGQYH YNNYDNTEHD
WQLPFGNAPF VTSRPGGYTQ AAPASVKQYS LPSYDSTLGS NGTISGTGVS INNTAGNASI
PSLGLLPGQA VPGLTIGGLS GSASGTKSGA SAVHGGVTTI DPVIASATAL NVLNNLTIPQ
GGLFKPNPSP NASYVIETNP AFTNQKSFIS SDYFFGQIGV DLTHIPKRLG DGFYEQQLIR
NQVTSLTGRA VLGPYTDLQS MYKSLMAAGA SLEKSLNLPL GASLSAEQVS QLTSNVVVME
TRVVDGQSVL VPVVYLAKAN QQNINGPLIT ATDIDLKDAQ NFTNSGTVKA DNTLSIQGKQ
IDNAFGALQS GGLMSLTTTG NVDLTSAKVQ AGSLNLNAGG DLILDTAVKT DKRVSRDGAT
SITTILGPTA QLDVTGNAAI KTGGNFQQNA GNLSVGGNLG MNVGGDWTLG AVQTGEHKIV
QRANGVSNTD INNAVGSSVK VGGQSSIGVG GDVTARGAQI DLGQGGTIAA KGTVTLGAAS
ATSTVNSNSS GSDSHGSYAE TLHTSDQALT GTMLKGGDTI TLASGKDITI SGSTINLDKG
NANLLAKGDV NVGAATETHT FESHETHSHS NVVSGVKVAS GTDQTATYSK GSTISADGIT
VESGRDINVT GSNIVGTNDV SLDAARNVNI TTSQDTVQSS SYYDKKESGL LTNGGLSVTF
GSRSMGQTDQ SKQVTNNASV VGASSGNVSI SAGKDATITS STVVAGQNLD VTGQNVAVNS
AYDTYNDAQS QHFSQSGLSV GVNGGVVGLA QSMASTVRQG VQSGDSRLAA VQGVAAAEQA
YQSRDGLKNA ATALSSGKVS EAANGVQVQL SIGSSHSSSN ETTSITQAKN SSLIGNGNVH
VTATGTPDAN GNAQPGTGDI TMTGANVLGK NVSLNANNAI TLQSAQSTEQ DTSSNSSSGW
NAGVGIGVGK QTGISVFANG TNSHGQGNGS AVTQTNTTIA AGNTLVMKSG GDTTLAGAQV
SGDKVKADVG GNLTMTSVQD TSNYASNQHS AGASGSFTFG YGGGAELSIG HTGIDANYAS
VDQQTGIVAG KGGFDVSVAN HTQLNGAQIA SAAPAESNTL TTGSLGFRDI QNSMSYSASS
EGFSTSSGPS FAHTGDSASG VTKAAVSPGT ITVKSDQQNG TDSTAGLSRD TANANQTVKN
TFNLQQTQND LAFAQAFGKA ATFAVAEAAT QLENSSPQMK ALFGEGGAGR DALHAAVAAI
GAALSGGNIG GAVAGSLAGD ALQSLAQPII DQAVSQLPLD AQAAARKALN EVVATAGGAA
GGALAGGGSS GMLAGAGAAA NNELYNRQLH ESEAQKLQQL QKNQSPQEQY RLAAAECSLV
HCADNIPDSD PNKAVLQKMQ NDGAQFTYEQ GVLKKAGAFD GYGKLDSLSD AYDRNQVSNR
LVGAVQGVGS TAAGIGAATG GCYTLVACIA GAAVAGVSFD YAKAGFTQLV NGNPTPTYGE
LALQSLGMSP SGAALTYAGL GLGAAVGSVA ANNAAAQAAA KGVPQSVESI QAGIKYDLMQ
QVADLRASLT GTPRTMGNMG VAQISIPGVQ SEMAASSQIP NPTAEQRALG FVGMGPDIFS
STVVPLPNGY PLLRNVDSEA KILNNVAAQL GDNTSVSGVI NLFTERPPCT SCSNVIQQFQ
NKYPNIKINV MDSNGVLKPS K