Gene BURPS668_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1122 
Symbol 
ID4882398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1092565 
End bp1101999 
Gene Length9435 bp 
Protein Length3144 aa 
Translation table11 
GC content61% 
IMG OID640127050 
Productcell surface protein 
Protein accessionYP_001058172 
Protein GI126439023 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.691127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACGTCTA AAAAAGCGTT TGCCGTATCC ATGAATAAGA ACCAATATCG TCTGGTGTTC 
AGCCGCGTGC GCGGCATGCT GGTAGCGGTC GAGGAGACCG CTCATGCAAC GGGGAAAGTC
AGCAAGGGAG AGGCACCGCG CTCCGTTGCC AATCGCGCAT TGAGCGCTTT GGAGAGCTTC
GCCTTGCGTC ATGCCGCGTT CGCGGTGCTG ATTGCGGCCG GCGTTACACC GATGTGGGCG
CGCGCTCAGG TCGTGGGCGC CGGGGCTAAC GCGCCGTCAG TCATCCAGAC GCAAAACGGC
CTCCAGCAAG TCAACATTAC CAAGCCGAGC GGCGCGGGCG TGTCGCTGAA CACCTATTCG
CAGTTCGACG TACCCAAACA AGGCGTTATC GTCAATAACT CACCTACGCT GACGAACACA
CAGCAGGCTG GGTATATCAA CGGCAACCCG AACCTCGGTC CGAACGGTTC GGCCAAAATC
ATCATTAACC AGGTCAACAG CAACAACCCT TCACAGCTGA AAGGTTTTGT CGAGATCGCC
GGCCAACGCG CCGAGATGAT AATTTCCAAT CCGTCGGGAT TGGTGGTCGA TGGTGGTGGC
TTTATCAACA CCTCGCGCGC AATCCTGACG ACTGGCACGC CAAGCCTGAA TGCGGACGGC
TCGCCGGGGG GCTTCAGCGT GACACGCGGC CTCATCACGG TACAAGGCGC GGGCCTGACC
GCCACGAATG TGGATCAGGT CGACCTCATT TCCAGAGCCG TACAGGCCAA TGCAGCCATC
TATGCATCGA ATCTGAACGT GGTCGCTGGG GCCAATCAGG TCAATCACGA GACGCTCCAG
GCGACCCGTA TTCAGGGGGA TGGTCCAGCT CCGGCCGTTG CAATTGACGT GGGTCAATTG
GGCGGTATGT ACAGTAATCG AATTTTTCTG GTGGGGACCG AAGGGGGCGT CGGGGTCCGC
AATGCAGGGA CGATCGCGGC CGATGCGATG GGCCTGACGT TGACGACGGA CGGCCGCCTG
GTGCAGGCGG GCAAGATCAG TTCGGTCGGG AACGTTGCAG TATCGGCCGC GGGCGGGATC
GAGAATAGCG GTACGACCTA CGGTCAGCAG TCGGTGTCGC TCAACACCGG CGCGGACGTG
GCGAATACCG GTACGCTCGC CGCGCAGCAT AACGTCGGCG TGACGGCGGG TTCGCTGAAT
GCGACGGGCT TGCTGGGTGC GGGTGTGAAC AGTGACGGTA CCGTCACGCA AGCCGGCGAT
CTGCAACTGA CGACGGCGGG CCAGTTGAAC GCGACCGGCA AGAACGTCGC GGGCGGCAAC
GTCTCGGCGA CCGGCGCCGG CGTGAGCCTC GCTGGCAGCA CGACGGCCGC GAACGGCAGC
CTGTCGCTGA GCGCGACGGC TGGCGACGTG AATCTCACGA ACGCGACGAC GAGTGCTCAG
GGGGCCGTCA CGGCGAATGC CGCGGGCACG GTCATCAATG ACCACGGCAA CCTGACGAGC
GGCGGCAGTA CGACGCTGAC GGGCGGCAAC GTATCGAACC AGAGCGGCAA TGTGTCGTCA
CAAGGTCCGT TGTCCGTGAC AGCCGCCGGG CAGATCGCCA ATCAGGCCGG GGTGCTCGTG
TCGGAAAGCA CAATCGCGCT GCGCGGCGGT ACCGTCGCGA ACAACCAGGG CACGATCCAG
AGCGCCGGCC ACGCGACTGT CGACGGCGTG ACGATCGACA ACACGGCCGG CCGCATCACG
TCGCTCAATA CCGATGGGAT GGTGCTGACG GCAACTGGCC AGCTCACGAA CGTGGCAGGC
ACGACCGCGA ACGGCGCACA AGGCGGTGTG ATCGGCGGCA ACGGCGATGT GACCGTCCAA
GGCGGCAATA TCGCGAACCA CGCGACGATT ACGTCCAATA CCAATCTCCA CGTTTCTGGA
CAATCGGTCG ACAACAGCGG CGGGGCGCTC CAAGCCGCGC AGGGTGTGAC GGTCGATGCC
GGCACACACC TCGCCAATGG CGGCGGCAGC ATCGTCGGCC AAACGGCGGC GCTGACGGGC
ACGACACTCG ACAACAGTTC CGGTACCGTG CAGGCGGACC AGGTTTCGTT GACTGCTACC
AACGTGGTGA ACCACGGCGG CACGATCACG CAGACCGGGA ACGGCGCGAT GGCCGTCAAT
GTCACGGGCA CGCTCGACAA CTCGCGCGGT GGCACGCTGC AGACCAACAG CGCCGATCTG
ACGTTGGCTC CCGCAGCGCT GGTGAATGAC GGCGGCACGA TCACGCACGC CGGCACCGGT
ACGCTGACGC TCGGCAATGG GTCGGGTTCG GTGTCGAACG TCGGTGGCAC GATCGCCAGC
AATGGACATA TCTCGGCTCA AAGCGGTTCG CTGAACAACG CGTCGGGTTC GATCAATGCC
CAATCCGGGC TGACGGCCGC AGTGAGTGGC GTGTTGAACA ACACGAACGG CAAGCTGCTT
TCGAATACGG ATCTCGGCAT CGCCAGCGGC ACGCTGACGA ATGACGGCGG ACAGATCGGC
GCGAGCACGA ATGCAACGAT CCATACGGGC TCGATGACCA ATCGGGGCGG AACGGTCGTC
GCACCCAACC TGTCCGTGAC CGCCGATTCG ACGCTCGACA ACAGCGGCGG CACGCTCGAA
GCGAACCAGC TCGCGTTGAC CGCGCCGAAC CTGACGAACC ATGGCGGGAC GATCACGCAG
TTTGGATCGT CCGTGATGGG CGTCAATGTG AGCGGCACGC TGGACAACTC CGCGGGCGGC
GTGATTCAGA CCAACAGCAC GGACCTCACG CTGGCTCCGG CCCAATTGAA CAATGCGGGC
GGCACGATTA CGCATGCCGG CACGGGCACG CTGACGATTG AGCCCGGCAA CGGCGCGAGC
GCGCTGAACA ATGCAGGCGG CACCATCGTC ACCAAGGGTC AAGCGGTGGT TGACGCGAGC
AGTTGGGACA ACTCGGGCGG TATCCTTGCC GCACAGGGCA GCATCACCGG GGCGATCGCG
GGCGACGTAA AAAACTCGCA GGGGCTGGTG CGTGCCGGAA CGTCACTGTC GCTGACGAAC
GGCGGCGCGC TGGTGAACCA GGGTGGACAT ATTCAGGCCG GCCAGCAGAC GGCGGGCGAT
ACGAGCACAC TCAGCATCCA GTCGGCATCG GTTAACAATG CTGACGGCTC GATTGCCGAC
CTCGGCGCCG GCAAGATGAC CGTGCAGGGC GGCAGCCAGA TCACCAACAG CCACGCGGGC
GGCGTGTCCG GCATGGGCGC GATCACCGGT AACGGTGACG TGACGGTCAG CGCGACGTCG
ATTTCCAACA CCCAGGGCGG CCAACTGAGC GGCGCAAGCC TCCACGTTCA AGGGACGACC
CTGGACAACA GTGGCGGGCA AATCGGCAAC GTCACGAACT CGAGCGGCAA CGTGGACGTG
ACGACGAGCG GCACAGTCAC GAACACGAAC GGGCAGATCA GCTCCACGCA CGACCTGACG
GTAACGGCTC CGACGCTGCA GGGCGGCGGG ACTTACAGCG CGGCCCACGA TGCCAACGTG
AACCTGCAGG GCGATTTCTC GGTGACGCCC GACTACCAGT TCAACGTCGG TCACGAACTT
GCGTTCACGT TGCCCGGCAC GTTCAACAAT AGCGGCAATG TGCAATCGGT CAACAACCTG
AACGTCAACG CGGGCAACAT CGTCAACTCG GGCGCGCTGT CGGCCGGTGG GTTGCTGCAT
ACGCAGTCGA ACGATCTGAC CAACACAGGC GCGATCGTCG GCGGTAGCGT GTCGCTCAAT
GCGACGGGCA CGGTAGCGAA CGTCGGTCCG ACCGCGCTGA TTGGTGCGTC GGACAGCAAC
GGCACGCTCG AAATTCTCGC GAACGACATC GAGAACCGCG ATGACACCAC TGCGACCGAT
TCGATGGCGA CGACGGCCAT CTTCGGGATG GGCAAGGTTG TCCTGGCCGG TGGCAAGGAC
GCCAGCGGCA AGTACACGAA CGCGGCCCTG ATCAACAACT CGTCGGCGTT GATCCAGTCC
GGCGGCGACA TGGAATTGCA TGCCGACAAG ATCACGAACA CGCGTCGCGT GATGACGACT
TCGACGGGCT CGGTCGATCC CGCGACGCTG GCGCCGTTCG GCGTACCGAT CAAGGGCCAA
ACGGGGCAGG TCGGTGTTAA GGACCCGACG AGCATCGGCG GCGTGTATAC CGATCCGCCC
CACGGCGGTC AGTGGAACAG TACGTATCAA TATACGACCT ACTACGCGGA TAGCGCGACC
GCGACGACCG TGACCAGCAT CAGCCCGGCT GCTCAAATCG TGTCGGGCGG CAGTATCAAT
GCATCGACGG TCGCCAACCT GCAAAACTAC TGGAGCAGCA TCGCGGCCGT TGGCAATGTC
CAGATGCCGA AGAGCTACGA TGCCAACGGG TGGGCTGCCA CCGGACAACA GGCACCGAGC
GTGACCGTCT CGTATTCCGG GCAGTATCAC TACAACAACT ATGACAACAC CGAGCACGAT
TGGCAATTGC CGTTCGGCAA TGCGCCGTTC GTGACGAGCC GTCCGGGTGG CTATACGCAG
GCGGCGCCGG CATCGGTCAA GCAGTACTCG CTGCCGAGCT ACGATTCGAC CTTGGGTTCG
AACGGGACGA TTTCCGGAAC CGGCGTGAGC ATCAACAACA CGGCGGGCAA CGCGTCGATC
CCGTCGCTGG GCTTGCTGCC CGGGCAAGCC GTACCCGGCC TGACGATCGG TGGACTGAGC
GGTAGCGCGA GCGGCACGAA GTCGGGTGCG TCGGCGGTGC ATGGTGGTGT GACCACGATC
GATCCGGTCA TCGCCAGCGC GACCGCGTTG AACGTGCTGA ACAACCTGAC GATTCCGCAG
GGCGGCTTGT TCAAACCGAA CCCGTCGCCG AACGCGAGCT ACGTGATCGA GACGAACCCG
GCGTTCACGA ACCAGAAGAG TTTCATTTCG AGCGACTACT TCTTCGGTCA GATCGGTGTC
GACCTTACCC ACATTCCGAA GAGGCTGGGT GACGGCTTCT ACGAGCAGCA ACTTATCCGT
AATCAGGTCA CGTCGCTGAC GGGCCGTGCG GTGCTCGGTC CGTACACGGA TCTCCAGTCG
ATGTACAAGT CGCTGATGGC GGCAGGTGCT TCGCTGGAGA AGTCACTCAA CCTGCCGTTG
GGCGCGAGCC TGTCGGCCGA GCAGGTGTCG CTGCTCACGA GCAACGTGGT CATGATGGAG
ACGCGCGTGG TTGACGGGCA GTCGGTGCTG GTGCCGGTCG TGTATCTCGC GAAGGCCAAT
CAGCAGAATA TCAACGGTCC GCTGATTACG GCCACCGACA TCGACCTGAA GGACGCACAG
AACTTCACGA ACAGCGGCAC GGTGAAGGCG GACAACACGC TGTCGATCCA GGGCAAGCAG
ATCGACAACG CGTTCGGCGC GCTGCAGAGC GGCGGCCTGA TGTCGTTGAC GACAACCGGC
AATGTCGATT TGACTTCGGC CAAGGTCCAA GCCGGTAGCT TGAACCTGAA TGCAGGCGGC
GACCTGATTC TCGATACCGC AGTGAAGACC GACAAGCGGG TCAGCCGCGA TGGCGCAACA
AGTATCACGA CGACACTCGG ACCGACCGCC CAACTCGACG TGACGGGTAA TGCGGCGATC
AAGACGGGCG GCAACTTCCA GCAGAACGCG GGCAACCTGT CGGTTGGCGG CAATCTCGGT
ATGAATGTCG GGGGCGATTG GATTTTGGGT GCGGTGCAGA CGGGCGAACA CAAGATCGTG
CAGCGCGCGA ACGGTGTGTC GAATACGGAC ATCAACAACG CAGTTGGCAG CTCGGTGAAG
GTCGGCGGGC AATCGAGTAT CGGCGTCGGC GGGGACGTTA CTGCGAGAGG TGCGCAGATC
GATCTCGGTC AGGGGGGCAC GATCGCGGCC AAGGGTACCG TTACGCTTGG CGCTGCGAGT
GCAACCTCGA CGGTGAACAG CAACAGTTCG GGTAGTGATA GTCACGGCAG TTACGCCGAG
ACACTGCACA CATCGGATCA GGCGCTTACC GGAACAATGC TTAAGGGCGG AGACACCATA
ACGCTTGCGT CGGGCAAGGA CATCACGATC AGCGGCAGTA CCATCAACCT CGACAAGGGC
AATGCAGACC TGCTGGCGAA GGGGGACGTG AATGTCGGTG CGGCGACCGA AACGCATACG
TTCGAATCGC ACGAAACGCA TAGCCATAGC AACGTAGTAA GCGGCGTGAA GGTCGCAAGT
GGCACAGACC AGACCGCAAC GTATAGCAAG GGCAGCACAA TTTCGGCGGA TGGGATTACG
GTCGAAAGTG GCCGGGATAT CAACGTGACG GGAAGCAACA TCGTCGGCAC GAACGACGTG
AGCCTTGATG CGGCGCGTAA CGTGAATATC ACCACGTCGC AGGACACGGT GCAATCGTCG
TCGTATTACG ACAAGAAGGA AAGCGGCTTG CTGACCAATG GCGGGCTGTC CGTGACCTTC
GGTAGCCGCT CGATGGGCCA GACGGACCAA TCCAAGCAGG TGACGAACAA CGCAAGCGTC
GTCGGCGCAT CGTCCGGCAA TGTTTCGATC AGCGCGGGCA AGGACGCGAC CATCACCAGC
AGTACCGTAG TCGCCGGTCA AAATCTCGAC GTGACCGGCC AGAACGTTGC TGTGAATTCG
GCCTATGACA CGTACAACGA CGCGCAGTCG CAGCACTTCA GCCAATCGGG CTTGAGCGTC
GGCGTGAACG GCGGTGTGGT GGGCCTTGCT CAGTCGATGG CGAGCACGGT TCGTCAGGGC
GTGCAGTCTG GCGATTCACG TTTGGCGGCA GTGCAAGGTG TAGCGGCAGC CGAGCAGGCT
TATCAGAGCC GTGATGGATT GAAGAATGCG GCCACTGCCC TGTCGAGCGG TAAAGTCAGT
GAAGCCGCGA ACGGTGTACA GGTTCAACTG AGCATCGGAT CGAGCCACAG CAGCAGCAAC
GAGACGACAT CCATCACACA GGCCAAGAAT TCGTCGCTCA TCGGCAACGG CAACGTGCAT
GTGACCGCGA CGGGCACGCC CGACGCGAAC GGCAACGCAC AACCAGGAAC CGGCGATATC
ACGATGACCG GTGCGAACGT GTTGGGTAAG AACGTGTCGC TCAATGCGAA CAACGCGATC
ACGCTGCAAA GCGCACAGAG CACGGAGCAG GACACGAGTT CGAATAGTTC ATCGGGCTGG
AACGCGGGCG TTGGGATCGG CGTCGGCAAG CAGACCGGGA TCAGTATTTT TGCGAACGGC
ACGAACTCGC ACGGCCAAGG AAACGGCAGT GCCGTGACGC AGACCAACAC GACCATCGCG
GCGGGCAACA CGCTGACGAT GAAGTCGGGC GGCGACACAA CGCTGACGGG CGCGCAAGTG
TCAGGCGACA AGGTGAAGGC TGACGTGGGC GGCAATCTCA CGATGACGAG CGTTCAGGAT
ACGTCGAACT ACGCGAGCAA CCAGCATAGC GCGGGCGCGA GCGGTAGCTT TACGTTCGGC
TACGGCGGCG GTGCGGATGT GTCGATCGGG CATACCGGCA TCGATGCGAA CTATGCTTCG
GTCATTCAGC AGACCGGTAT CGTTGCCGGC AAGGGCGGGT TCGACGTGAA CGTGGCGAAC
CATACGCAGC TCAACGGCGC TCAGATCGCA AGCGCCGCCC CTGCCGAAAG CAATACGCTG
ACGACGGGCA GCCTCGGGTT CAAGGACGTC CAGAATTCGA TGTCGTACTC GGCTTCGTCG
GAAGGATTTT CGACTTCGAG CGGGCCGAGC TTTGCGCACA CGGGTGACAG TGCGAGCGGC
GTGACGAAGG CAGCGGTGAG TCCGGCAGCG ATTACCGTCA AGTCCGATCA ACAGAATGGC
ACCGACAGCA CTGCAGGCTT GTCGCGCGAT ACGGCGAACG CGAACCAGAC CGTTAAGAAC
ACGTTCAACT TGCAGCAGAC ACAAAACGAT CTGGCGTTCG CGCAGGCGTT CGGCAAGGCG
GCGACCTTCG CGGTCGCTGA GGCGGCAACG CAGCTCGAGA ACAGCAGCCC GCAAATGAAG
GCGTTGTTCG GTGAAGGCGG CGCGGGACGC GACGCGCTGC ATGCGGCGGT GGCGGCGCTC
GGCGCGGCGC TGTCGGGTGG CAACATCGGC GGCGCGGTGG CGGGTTCGCT CGCGGGCGAT
GCGTTGCAGT CGCTGGCGCA GCCGATTATC GATCAGGCGG TAAGCCAGTT GCCGCTGGAT
GCGCAGTCGG CTGCGCGCAA GGCACTGAAC GAGGTCGTTG CGACAGCAGG TGGTGCGGCG
GGCGGCGCGC TGGCCGGTGG TGGTTCATCG GGGACGCTCG CGGGTGCAGG CGCGGCGGCG
AACAATGAGC TTTACAACCG TCAATTGCAC GAAAGCGAAG CGCAAAAACT CCAGCAGCTT
CAGAAAAATC AGTCGCCCCA AGAGCAATAC CGCCTTGCCG CAGCGGAGTG CTCGCTTGTG
CATTGTGCAG ACAACATCCC GGACAGCGAT CCGAACAAGG CCGTGCTGCA AAAGATGCAG
AATGACGGCG CGCAGTTCAC CTACGAGCAA GGGGTACTGA AGAAAGCCGG TGCGTTCGAT
GGGTACGGCA AGCTCGATTC ACTGTCCGAT GCCTATGATC GGAATCAGGT CTCGAATCGT
CTTGTCGGTG CCGTGCAGGG TGTTGGAAGT ACCGCAGCGG GAATCGGCGC TGCAACAGGT
GGCTGCTATA CGCTCGTTGC TTGTATCGCT GGGGCGGCAG TAGCTGGCGT GAGTTTCGAT
TACGCAAAGG CAGGCTTTAC GCAGCTTGTG AACGGTAACC CGACGCCAAC CTATGGCGAA
CTGGCATTGC AGAGTTTAGG GATGAGTCCG AGCGGCGCTG CGTTGACTTA CGCAGGCTTG
GGTCTCGGCG CGGCAGTCGG TAGCGTGGCC GCGAATAATG CGGCTGCACA GGCAGCGGCG
AAGGGCGTGC CGCAATCAGT TGAGTCGATT CAGGCTGGGA TCAAGTACGA CCTGATGCAG
CAAGTTGCTG ATTTGCGCGC ATCGCTGACC GGTACTCCTC GAACAATGGG AAATATGGGG
GTTGCACAAA TTAGCATCCC CGGGGTTCAA TCAAAGATGG CGGCGTCTAG TCAAATCCCC
GATCCAACCG CCGCGCAACG GGCACTTGGA TTTGTAGGGG AAGTTAATGA GACTTTCCCG
AGCGCGAGTG TTTGGACTGG CGGGGATACA CCGTACTTGC TGAATCGAAA GGTTGATTCG
GAGGCCAAGA TTTTGAACAA TATCGCGGCT CAGCTGGGGG ATAATACTTC AGCAAGCGGA
ACAATCAATC TGTTTACGGA GCGCCCCCCG TGCGAAAGTT GCTCAAATAC AATTATCAAA
TTTCAGGAAA AATATCCAAA CATAAAAATT AATGTTATGG ATAGCAATGG CGTAATTCGT
CCATCAAAAA GATAG
 
Protein sequence
MTSKKAFAVS MNKNQYRLVF SRVRGMLVAV EETAHATGKV SKGEAPRSVA NRALSALESF 
ALRHAAFAVL IAAGVTPMWA RAQVVGAGAN APSVIQTQNG LQQVNITKPS GAGVSLNTYS
QFDVPKQGVI VNNSPTLTNT QQAGYINGNP NLGPNGSAKI IINQVNSNNP SQLKGFVEIA
GQRAEMIISN PSGLVVDGGG FINTSRAILT TGTPSLNADG SPGGFSVTRG LITVQGAGLT
ATNVDQVDLI SRAVQANAAI YASNLNVVAG ANQVNHETLQ ATRIQGDGPA PAVAIDVGQL
GGMYSNRIFL VGTEGGVGVR NAGTIAADAM GLTLTTDGRL VQAGKISSVG NVAVSAAGGI
ENSGTTYGQQ SVSLNTGADV ANTGTLAAQH NVGVTAGSLN ATGLLGAGVN SDGTVTQAGD
LQLTTAGQLN ATGKNVAGGN VSATGAGVSL AGSTTAANGS LSLSATAGDV NLTNATTSAQ
GAVTANAAGT VINDHGNLTS GGSTTLTGGN VSNQSGNVSS QGPLSVTAAG QIANQAGVLV
SESTIALRGG TVANNQGTIQ SAGHATVDGV TIDNTAGRIT SLNTDGMVLT ATGQLTNVAG
TTANGAQGGV IGGNGDVTVQ GGNIANHATI TSNTNLHVSG QSVDNSGGAL QAAQGVTVDA
GTHLANGGGS IVGQTAALTG TTLDNSSGTV QADQVSLTAT NVVNHGGTIT QTGNGAMAVN
VTGTLDNSRG GTLQTNSADL TLAPAALVND GGTITHAGTG TLTLGNGSGS VSNVGGTIAS
NGHISAQSGS LNNASGSINA QSGLTAAVSG VLNNTNGKLL SNTDLGIASG TLTNDGGQIG
ASTNATIHTG SMTNRGGTVV APNLSVTADS TLDNSGGTLE ANQLALTAPN LTNHGGTITQ
FGSSVMGVNV SGTLDNSAGG VIQTNSTDLT LAPAQLNNAG GTITHAGTGT LTIEPGNGAS
ALNNAGGTIV TKGQAVVDAS SWDNSGGILA AQGSITGAIA GDVKNSQGLV RAGTSLSLTN
GGALVNQGGH IQAGQQTAGD TSTLSIQSAS VNNADGSIAD LGAGKMTVQG GSQITNSHAG
GVSGMGAITG NGDVTVSATS ISNTQGGQLS GASLHVQGTT LDNSGGQIGN VTNSSGNVDV
TTSGTVTNTN GQISSTHDLT VTAPTLQGGG TYSAAHDANV NLQGDFSVTP DYQFNVGHEL
AFTLPGTFNN SGNVQSVNNL NVNAGNIVNS GALSAGGLLH TQSNDLTNTG AIVGGSVSLN
ATGTVANVGP TALIGASDSN GTLEILANDI ENRDDTTATD SMATTAIFGM GKVVLAGGKD
ASGKYTNAAL INNSSALIQS GGDMELHADK ITNTRRVMTT STGSVDPATL APFGVPIKGQ
TGQVGVKDPT SIGGVYTDPP HGGQWNSTYQ YTTYYADSAT ATTVTSISPA AQIVSGGSIN
ASTVANLQNY WSSIAAVGNV QMPKSYDANG WAATGQQAPS VTVSYSGQYH YNNYDNTEHD
WQLPFGNAPF VTSRPGGYTQ AAPASVKQYS LPSYDSTLGS NGTISGTGVS INNTAGNASI
PSLGLLPGQA VPGLTIGGLS GSASGTKSGA SAVHGGVTTI DPVIASATAL NVLNNLTIPQ
GGLFKPNPSP NASYVIETNP AFTNQKSFIS SDYFFGQIGV DLTHIPKRLG DGFYEQQLIR
NQVTSLTGRA VLGPYTDLQS MYKSLMAAGA SLEKSLNLPL GASLSAEQVS LLTSNVVMME
TRVVDGQSVL VPVVYLAKAN QQNINGPLIT ATDIDLKDAQ NFTNSGTVKA DNTLSIQGKQ
IDNAFGALQS GGLMSLTTTG NVDLTSAKVQ AGSLNLNAGG DLILDTAVKT DKRVSRDGAT
SITTTLGPTA QLDVTGNAAI KTGGNFQQNA GNLSVGGNLG MNVGGDWILG AVQTGEHKIV
QRANGVSNTD INNAVGSSVK VGGQSSIGVG GDVTARGAQI DLGQGGTIAA KGTVTLGAAS
ATSTVNSNSS GSDSHGSYAE TLHTSDQALT GTMLKGGDTI TLASGKDITI SGSTINLDKG
NADLLAKGDV NVGAATETHT FESHETHSHS NVVSGVKVAS GTDQTATYSK GSTISADGIT
VESGRDINVT GSNIVGTNDV SLDAARNVNI TTSQDTVQSS SYYDKKESGL LTNGGLSVTF
GSRSMGQTDQ SKQVTNNASV VGASSGNVSI SAGKDATITS STVVAGQNLD VTGQNVAVNS
AYDTYNDAQS QHFSQSGLSV GVNGGVVGLA QSMASTVRQG VQSGDSRLAA VQGVAAAEQA
YQSRDGLKNA ATALSSGKVS EAANGVQVQL SIGSSHSSSN ETTSITQAKN SSLIGNGNVH
VTATGTPDAN GNAQPGTGDI TMTGANVLGK NVSLNANNAI TLQSAQSTEQ DTSSNSSSGW
NAGVGIGVGK QTGISIFANG TNSHGQGNGS AVTQTNTTIA AGNTLTMKSG GDTTLTGAQV
SGDKVKADVG GNLTMTSVQD TSNYASNQHS AGASGSFTFG YGGGADVSIG HTGIDANYAS
VIQQTGIVAG KGGFDVNVAN HTQLNGAQIA SAAPAESNTL TTGSLGFKDV QNSMSYSASS
EGFSTSSGPS FAHTGDSASG VTKAAVSPAA ITVKSDQQNG TDSTAGLSRD TANANQTVKN
TFNLQQTQND LAFAQAFGKA ATFAVAEAAT QLENSSPQMK ALFGEGGAGR DALHAAVAAL
GAALSGGNIG GAVAGSLAGD ALQSLAQPII DQAVSQLPLD AQSAARKALN EVVATAGGAA
GGALAGGGSS GTLAGAGAAA NNELYNRQLH ESEAQKLQQL QKNQSPQEQY RLAAAECSLV
HCADNIPDSD PNKAVLQKMQ NDGAQFTYEQ GVLKKAGAFD GYGKLDSLSD AYDRNQVSNR
LVGAVQGVGS TAAGIGAATG GCYTLVACIA GAAVAGVSFD YAKAGFTQLV NGNPTPTYGE
LALQSLGMSP SGAALTYAGL GLGAAVGSVA ANNAAAQAAA KGVPQSVESI QAGIKYDLMQ
QVADLRASLT GTPRTMGNMG VAQISIPGVQ SKMAASSQIP DPTAAQRALG FVGEVNETFP
SASVWTGGDT PYLLNRKVDS EAKILNNIAA QLGDNTSASG TINLFTERPP CESCSNTIIK
FQEKYPNIKI NVMDSNGVIR PSKR