Gene BTH_I2723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I2723 
Symbol 
ID3848805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp3122515 
End bp3131958 
Gene Length9444 bp 
Protein Length3147 aa 
Translation table11 
GC content62% 
IMG OID637842391 
Productfilamentous haemagglutinin 
Protein accessionYP_443237 
Protein GI83721505 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA ACCATTATCG GCTGGTGTTC AGCCGCGTGC ACGGAATGTT GGTCGCAGTG 
GAAGAAACGG CCAGCTCAGC GGGGAAAGCA AGCGCAGGTG AAACGCGGCG TACCCTCGAT
CGCAGCGGTG TGCATGTAGT GACGCGGTTT GCGTTACGTT TTGCGGCATT TGCTGCGCTG
ATTGCCGCGG GCGCGATGCC GATGTGGGTG CACGCACAAA TCGTCGGAGC GGGGCCGAAT
GCACCATCGG TAATCCAGAC GCCCAACGGC CTGCCGCAAG TGAATATCAA CAAGCCGGGT
GGCGCCGGCG TTTCGCTGAA CACCTACAAC CAGTTTGACG TGTCGCATGC CGGCGCCATC
CTGAACAACT CGCCGACGAT CGTCAACACG CAACAAGCGG GCTATATCAA CGGCAACCCG
AACCTGAGTG CGGGGCAGGC GGCGCGCATC ATCGTCAACC AGGTCAACAG CACGGCGGCC
AGCCAGATTA AAGGGTACGT CGAGATAGCG GGAAGCCGTG CGGAGATTGT GCTGGCCAAC
CCTGCCGGTA TCGTCGTGGA CGGTGGCGGG TTCATCAATA CCTCTCGCGC AGTCCTGACA
ACTGGCGTGC CGCAGTTTGG CGCGGACGGA TCGTTGACGG GCTTCAACGT CAATCGTGGT
CTCGTGACAG TGCAAGGTGC GGGCCTCGAT ACCTCGAATG TTGATCAGAC TGACATCATT
GCCCGCGCTG TACATGCCAA TGCCGCGATC TATGCGAAGA ATCTCAATGT AATTGGCGGT
ACGAACCAGG TTAATCACGA CACGCTTGCT GTGACGCAGA TCGCCGGTGA CGGTCCTGCG
CCGGCTGTCG CGATTGACGT AGCTCAGTTA GGTGGCATGT ATGCCAATAG GGTGTTTTTG
GTTGGCAACT CCGCAGGCGT GGGTGTCGCG AACGCCGGGA CGATCGCGGC GCAGGCAGGC
GACCTGACGC TGCAGTCGAA CGGCCGGCTC GTCCTGACGG GTAAGACCAC GGCGAGCGGC
AATATGGCGC TGTCGGCCGC GGGTGGTATC CAGAACAGCG GCACCACGTA TGCGCAGCAG
TCGTTGTCGG CCAGCACAAG TGCCGATCTC ACGAACAGCG GCACGCTCGC GGCGCAGCAG
AATACAGCGG TCAACGCGGG CAGCGTCAAC TCGACCGGCA CGCTCGGCGC CGGCGTGAAC
AACGACGGTA CCGTAACGCA CAGCGGCGAC CTGAACCTGA CGGCGTCGGG CCAACTGACC
GCCACCGGCC AGAATGTCGC GGGCGGCAAC GCATCGCTGA CGGGCGGCAG CGTGAATCTC
GCCGGCAGCC AGACGGCAGC GAATGGCAAT CTGTCGCTGA ACGCGACAGC CGGCGACGTG
AACCTCTCGA ACGCGATGAC GAGCGCGCAA GGCGCCATTC AGGCGAAGGC AGCGAGCACC
GTGATCAACG ATCACGGCAG CCTGTCGAGC GGAGGCAGTA CGACACTGAC CGGTGGCAGT
CTGTCGAACC AGAGTGGCAA GGTGTCATCG CAGGGGCCGT TGTCGGTCAA CGTCGCCGGC
CAGATCGCCA ACCAGTCCGG CGAACTGGTA TCCGAGAGCA CGGCGGACGT GCATGGCGGC
GCCATCGCGA ACAACCAGGG CACCCTTCAA AGCGCGGCCG GCATGACGGT GGCCGGTGCG
TCGCTGGACA ACACGGCGGG CCGAATTACG TCGCTCAACG GCGATGGCCT GTCGGTGACG
ACGAGTGGCC AGTTGACCAA CGTCGCAGGG ACGACCGCGA ACGGCGCGCA AGGCGGTGTG
ATCGGCGGCA ATGGCGATGT GTCGGTCCAA GGCGCAAACG TTGTCAATCG CGGCGCGATC
ACATCCAACA CGAATCTGCG GGTATCGGGC CAGTCGGTCG ACAACGGCAG CGGCACGCTG
CAGGCTGCGC AAAAGGTGGC TGTCGATGCG GGCGCGCGCT TGATCAACAA CGGCGGCTCG
ATCGTCGGTC AGACGGCCGC GCTGACGGGC ACGACCCTCG ACAATAGCGC AGGCGTTGTG
CAGGCCGATC AGGTGTCGTT GAACGCGACC GACCTCGTGA ACCACGGCGG TACGATCACG
CAGACCGGTG CCGGCGCGAT GAGCGTGAAC GTGTCGGGCA CGCTCGAGAA CTCAAATGGC
GGCACGCTGC AAACCAACAG TACCGATCTG ACACTCGCGC CTGCCGCGCT CGTCAACGAT
GGCGGCACGA TCACGCATGC GGGCAACGGC ACGCTGACGC TCGGCGGAGG TACGGGGTCG
GTGTCGAACG TGGGCGGTTC GATCGCCAGC AACGGGCGCG TCGTCGCACA AACCGGAGCG
CTGAACAACA CGGCGGGCTC GATCAACGCG CAGAACGGAC TGGCGGCAAC CGTCGGCGGC
ACGCTCAACA ACGCAAACGG CAAGCTGTTG TCGAACACGG ATCTGAGCGT CACCAGCGGT
ACGCTGTCGA ACGACGGCGG CCAGATCGGC GCCAGCGCGA ACGCGACGAT CCGCACCGGC
TCGATGACGA ACCAGGGCGG ATCGATCGTT GCACCGAACC TGTCGGTTAC CGCCGATTCG
ACGCTCGACA ACAGTGGCGG CAAGCTCGAA GCCAATCAGC TTGCGCTGAC GGCAACGAAC
CTCGTGAACC ACGGCGGCAC GATCACGCAG TATGGATCGT CAGCGATGGG GATGAACGTC
AGCGGCACAC TCGACAACTC GGCCGCCGGG GTGATCCAGA CCAACGCCGC GGATCTGACG
CTGGCGCCAG CCGAGCTGAA TAACGCAGGC GGCACCATTA CGCATGCCGG CACCGGCACG
CTGACTATTG CGCCGGGCAA TGGAGCCAGC GCGCTGAACA ACGCGTCGGG CACCATCGTG
ACGAAGGGAC AAGCCATCGT CAACGCGGCC GCCTGGAACA ACGCGAGCGG TATTCTCGCC
GCGCAGCGAG GCATCAACGC GACCATCACA GGCGACGTGA ACAACACACA GGGCCTGCTG
AGATCGGACG CGTCGCTGTC GTTGAAGAAC GGCGGCGCTC TGTCAAATCG AGGCGGCCAC
ATCCAGGCGG GGCAATTGGT TGCGGGCGAC AGCAGCACGC TCGCCATTGA ATCGACCTCG
ATCGACAACG CTGACGGCGC CATTGTCGAC CTCGGTGCGG GCAAGATGAC GGTGCAAGGC
GGCAGCCAGA TCGCCAACAG CCACGCCAGC GGCGTCGCGG GCATGGGCGC GATTACCGGC
AACGGCGATG TGACGGTCAG CGCCGCGTCA ATCAGCAACA CGCAGAGCGG TCAGCTCAGC
GGTGCATCGC TTCATGTTCA GGGCAACACC TTGGACAACA GCGGCGGCAC GATCGGCAAC
GTCACGAACT CGAACGGCGA CGTGGACGTC AAGACGATCG GCGCGATCAC GAATACAAAT
GGCCAGATCA GCTCGACGCA CGACCTGTCG GTCGCGGCGG CCACGCTGCA GGGCGGCGGC
ACATACAGCG CGACGCATGA TGCCAACGTG AACCTGCAGG GCGATTACAC GGCGGCATCC
GATACGCAGT TCAACGTCGG TCACGATCTT GCCTTCACGC TGCCCGGCAC CTTCACGAAC
AACGCGAACC TGCAATCGGT CAACAACCTG AGCGTCAACG CCGGCAACAT CGTCAACGTG
GGCGCTTTGA CGGCCGGCGG CTTGCTGCAC ACGCAATCGA CCAACCTGAT CAATACCGGC
GCGCTCGTGG GCGCCAGCGC CTCGCTCAAC GCGACGAACA CGATCTCGAA CCTCGGGCCG
ACCGCGCTGA TCGGTGCATC CGACAGCAAC GGCACGCTGG AGATCCTCGC GCACGACATC
GAAAACCGCG ACGACACGAC TGCGACAGAT TCGATGGCGA CGACCGCTAT CTTCGGCATG
GGCAAGGTCG TGCTGGGCGG CGGCAAGGAC GCGAGCGGCA ACTACACGAA CGCCGCGCTG
GTCAACAATG TGTCTGCCCT GATCCAGTCC GAAGGCGACA TGGAACTGCA TGCGGACAAG
ATAACGAGCA CGCGGCGAGT GATGAAGACG TCGACCAGCG CGATCGATCC GTCGCTGCTC
GGCCAGTTCG GCATTCCGAT CAGCGGCCGC ACGGGCCAGG TCGGCGTGAA GGATCCGGAC
AGCATCGGCG GCGTCTACAC CGAGCCGCCT CACGGCGGGC AGTGGAACAG TACGTATCAG
TTCACGACCT ACTATGCGGA CAGCGCGACG GCGACGACCG TAACGGACAT CAGCCCGGCT
GCCCAGATCG TGTCGGGCGG CAAGATCGAT GCGTCGTCGG TCGGCACGCT GCAAAACTAC
TGGAGCAACA TCGCGGCGGT CGGCGACATC AAGATGCCGG CCCGCTACGA CGCGGACGGC
TGGGCGGCGT CCGGTCAGAA CCTGCCGGGC GTGTCAGTCT CCTACTCGGG CCAGTATCAC
TACAACAACT ACGACAACAC CGAACACGAC TGGCAATTGC CGTTCGGCAA CGCGCCGTTC
GTGACCGGCC GCCCGGGCGG CTATACGCAG GCGGCGCCGG CATCGATCAA GGATTACAAG
TTGCCAGGCT ACTTTTCGAC GATGAGCTCG AACGGTACGA TTTCGGGCAC GGGCGTCAGC
GTCAGCAACA CGGCGGGCAA CGCATCGATT CCGTCGCTCG GCTTGCTGCC GGGTCAGTCG
GTGCCGGGTC TCACCCCCAC CAACTTGAGC GGTAACGCGA GCGGTGCAAA ATCGGGTGCG
TCGGCCGTGC ATGGCGGTCA GTCGGCGCCG GTCGATCCGA TCATCGCCAG TGCGACGGCG
CTGAACGTGC TGAACAACCT CACGATTCCG CAGGGCGGGC TCTACCGGCC GACTACCGCC
CCGAATGCGA GCTACGTGAT CGAAACGAAT CCGGCGTTCA CGAACCAGAA GAATTTCATT
TCGAGCGACT ACTTCTTCGG TCAGCTCGGC GTCGACTTCA CGCACATCCC GAAGCGTCTC
GGTGACGGTT TCTACGAGCA GCAACTCGTG CGAAACGAAG TCACGTCGTT GACCGGCAAG
GCGGTACTCG GGCCGTATGC CGACCTGGAG ACGATGTATC AGTCGCTGAT GGCGGCCGGC
GCGGATCTGT CGAAGTCGCT CGATCTGCCG ATAGGCGCGA GCCTGTCGGC CGATCAGGTG
TCGAAGCTGA CCAGCAACGT GGTCATGATG GAAACGCGGG TGGTCGACGG TCAGTCGGTG
CTCGTGCCGG TCGTATATCT CGCGAAGGCC AGCCAGCAGA ATATCGATGG CCCGTTGATC
AGTGCGACGA ATGTCGATTT CCAGAATGCA CAGTCGTTCA CGAACAGCGG CACGATCAAG
GCGGACAACA CGCTGGCGAT CCAGGGCAAG CAGATCGACA ACGCATTCGG CGCGCTGCAA
AGCGGTGGGC TGATGTCGCT GAAGACCGAG AACAACGTCG ACCTGACGTC GGCGAACGTG
AAAGCCGGCA GTTTGCAGCT GGACGCCGGA AAGGATCTGA TTCTCGACAC GGCGACGAAG
ACGAACACGC GTGTGAGCCG CGACGGCGCG ACGAGCGTGG TGACCACGCT CGGGCCGACC
GCCAAACTTG ACGTTGCGGG CGATGCGTCG ATCACGACGG GCGGCAACTT CCAGCAGAAC
GCGGGCAACC TGTCGGTCGG CGGCAATCTC GGCATGAACG TTGGCGGCAA CTGGGATCTC
GGCGCGGTGC AGACGGGCGA GCACAAGATC GTGCAGCGGG CGAACGGCGT GTCGAATACC
GACATCAACA AGGTCACCGG CAGCTCGGTG ACGGTCGGTG GACAGTCGAG CATCGGCGTC
GGCGGAGACC TGACGGCCAA GGGCGCGCAG ATCGACCTTG GCCAGGGCGG GACAATCGCG
GCCAAGGGCA ACGTGACGCT CGGCGCGGCG AGCGCGACAT CGACGGTGAA CAGCAACAGT
TCGGGCAGCG ACAGTCACGG CAGCTATGCG GAGACGCTGC ACACGTCCGA TCAGGCGCTC
ACAGGCACGA CGTTCAAGGG TGGCGATACC GTCACATTGG CGTCTGGCAA GGATCTCACG
ATCAGCGGCA GCACGGTCAG CCTGGATAAA GGCAATGCGA ACCTGATGGC GAGCGGCGAT
GTGAATATCG GCGCGGCAAC CGAAACGCAT GTGCTGAACT CGCACGAAAC GCACAGCCAC
AGCAATGTCG TGAGCGGCGT GCAGGTTGCA AGCGGCATCG ACCAGACGGC GACCTATAGC
CAGGGCAGCA CGGTGTCGGC CGACGGCGTC AACATCGTCA GCAACCGCGA CATCAACGTG
GCGGGTAGCA ACGTCGTTGG CACGAACGAC GTGACGTTGC AGGCGACGCG CAACGTCAAC
ATCACGACCT CGCAAGACAC GACTCAATCG TCCAGCTATT TCGACAAGAA GGAATCGGGT
CTGCTCACGA ATGGTGGTTT GTCAGTGACG GTGGGTTCGC GCTCGGCCGC CCAGCAAGAC
CAAAGCAGCT CGGTGACGAA TAAGGGGAGT GTGATCGGGT CATCGCAGGG CAATGTCACG
ATCCAGGCCG GCAAGGATGC CACGATTACC GGTAGCACGA TTGTGGCGGG CCAGGATGTC
GGAATCGCCG CTCAAAACGT GACGGTAAAT GCTGCGTACG ACACCTACAA GGACGCGCAG
TCGCAGCAGT TTAGCCAGTC GGGTTTGAGT GTCGGATTGG GCGGCGGCCT GGTCGGACTC
GGACAGTCGA TGGCAGGCGC CGTCCGCCAA GGCCAGCAGT CGGGCGATTC GCGCCTCGCC
GCAGTACAGG CCGTGGCGGC TGCCGAACAG GCTTACCAGA ACCGCGGCGG GATCAAGGAC
GCGGCCAATG CCCTGTCGAA CGGGAACGTG AGCGAAGCTG CCAAGGGCGT TCAGGTACAG
TTCAGCATCG GGTCGAGCCA TAGCAGCAGC AATGCGACGA CATCGATCTC GAGCGCGAAA
GGTTCGTCGA TCATAGGCAA CGGCAACGTC TCCATCACCG CGACGGGCAC ACCGGACGCA
AACGGAAACG CTCAGGCGGG CACCGGAAAC ATTGCGATGA CCGGAGCGTC AGTGCTCGGC
AAGAACGTCG CGCTCGACGC CAACAACGCG ATCACGCTGC AAAGCGCGCA GAGCACCGAA
CAGAGTACGA GCTCGAACAG TTCGACCGGC TGGAATGCAG GCGTGGCGAT CGGCGTGGGC
AAGAATACGG GGATCAGCGT CTTCGCGAAT GGCTCGAACT CGCACGGACA GGGCAACGGC
GATAGCGTCA CGCAAACCAA CACGACGGTG GCGGCCGGCA ACACGCTGAC GATGAAGTCG
GGTGGTGACA CGACGTTGTC GGGCGCCAAG GTTTCAGGCG ACAAGGTCAA GGTCGATGTC
GGCGGCGACC TGACGATGAC GAGCCTTCAG GATACGTCGA ACTACAGCAG CAACCAGCAC
AACACGGGCG TCAGCGGCAG CTTTACGTTC GGCTATGGTG GCGGCGTCGA CGCATCGATC
GGCCACACCA GCATCGACGC GAATTATGCG TCGGTGAACC AGCAAACCGG CATCGTGGCT
GGCAAGGAAG GGTTTGATGT CAACGTGGCG GGCCATACGC AGCTCAACGG CGCACAGATC
GCGAGCGCGG CGCCGGCAGA CAGCAATACG CTGACGACCG GCAGCCTCGG ATTTACCGAC
ATCCAGAACA AGATGTCGTA TTCGGGATCG TCGGAAGGTT TTTCGACGTC GGGCGGCCCG
AGCTTTGCAC AGACGGGTGA TAGCGCGAGC GGTGTCACGC GCGCCGCGGT GAGCCCGGCG
AAGATCGTCG TCAAATCGGA TGAGCAGAAC GGCACGGACA GCACGGCCGG GCTGTCGCGC
GACACGGCGA ACGCGAACCA GACGGTGGAG AACACATTCA ACCTGCAGAA GGTCCAGAAC
AATATGGCCT TTGCGCAGGC GTTCGGCAAG GTGGCGACGT TCGCGGTGGC GGAAGCGGCG
ACTCAGCTGG AAAACAGCAG CCCGCAGATG AAGGCATTGT TCGGCGAAGG CGGTGCAGGT
CGCGACGCGC TGCACGCCGC AGTGGCGGCG ATCGGGGCAG CGCTGTCGGG TGGCAACATC
GGCGGAGCGG TCGCGGGTTC ACTGGCCGGT GATGTGCTGC AGTCCCTGGC GCAGCCGATC
ATCGATCAGA CGGTGAGCCA GTTGCCGCTG AGCGCGCAAG CCGCCGCGCG GAATGCGTTG
AACGAGATCG TGGCGACGGC CGGCGGTGCG GCGGCTGGCG CTGTCGCGGG TGGAGGTTCA
TCCGGTGCGC TGGCCGGTGC GGGCTCGGCT GTCAACAACG AGCTTTACAA TCGACAGCTT
CACGTAGAAG AGGTGAAGGT TGTCGAGCAG CTCGCAAAGG AAAAGGCGCA GGCGGTATGC
CGCGGCGATT CAAGCTGCGT GGCGAAGGCA ACGACCTACT GGACCGACAT GCTCGAGCGC
GCGGCGAAGG GAATGGTCGA CGACACGGCG AACAAGGAGA ACATGGCCTA TCTCCAGACG
CTGATCCAGA CGGCGAACAA TCCGACCAGC GAAGGCGCGA TGGGGGGCCT CAGCTCCTAC
CTGACGAACC TTCAGACGGC GCAAGACATG TTGTCGCAGT ATATGGGCAA GCCGATTCTG
GTACGCGGTT CGCCTATCAT CTCCGACGGT TCGGCGCAGA CGTATTTCAG TGCGACACCC
GAGCAACGTT CCAATCAAAG CCTGAATGCC ATATTGGGCT CGCTGCCTGG CTCGATCGTG
CCGGGTGCAA GCCAGCGCGA TCAGAGTCGA GTGGATTCCT TCGCCACGCA AAACGGCTCC
GTGAAACCCG ATTACACGAT CGAGGAAACG GTCATCGGCG GTATCCTTAC AAACAAGATC
GCGTCGACGG CTGCACGCGT GGGCGAGTCG ATTGATGTTT GGTTGGCGGG ACCTGTGAAT
CCGACAGGCA AAGGATTCAT CAGCACCGGC AAGGTGACGA TGGAGAGCAT GCCGGTGAAA
TTGAATACCG CGGAACAAGG CGTGCTGTCA CAACTCGACC AGCTGCCGTC GAAGGATTTG
CAGGGCCAGG CCCGTGAATA TGTGGCGAAT AACTACTTCG TGCGGAATGG TTTTACCCCG
CTTGACGGGA AGTGCGGCGC CAACTGCTTT GACGGAGTCT ACGTTAAAGG AAATACTGTC
TACGTTAACG AGGTCAAGCC TCTTAATGAA TCTGGATCGA TCAGTTTGAA TCCTCCAAAC
AGCGCCACGG GACTTCCGGG GCAGCAGACG GATAACTGGG TCGCATATTC GGTTCAAAGA
CTGAAAGATA CGGGTGACCC CCAACTTATT AAGACGGCCG AAGTTGTCGA GCAAGCGTTT
AGAAATGGAA ATCTCGTCAA AACGGTTTCT GGGGTCAACT CTAATGGGAT GGTGGTTGTC
AAGGTGCCAA GGAATACGCC ATGA
 
Protein sequence
MNKNHYRLVF SRVHGMLVAV EETASSAGKA SAGETRRTLD RSGVHVVTRF ALRFAAFAAL 
IAAGAMPMWV HAQIVGAGPN APSVIQTPNG LPQVNINKPG GAGVSLNTYN QFDVSHAGAI
LNNSPTIVNT QQAGYINGNP NLSAGQAARI IVNQVNSTAA SQIKGYVEIA GSRAEIVLAN
PAGIVVDGGG FINTSRAVLT TGVPQFGADG SLTGFNVNRG LVTVQGAGLD TSNVDQTDII
ARAVHANAAI YAKNLNVIGG TNQVNHDTLA VTQIAGDGPA PAVAIDVAQL GGMYANRVFL
VGNSAGVGVA NAGTIAAQAG DLTLQSNGRL VLTGKTTASG NMALSAAGGI QNSGTTYAQQ
SLSASTSADL TNSGTLAAQQ NTAVNAGSVN STGTLGAGVN NDGTVTHSGD LNLTASGQLT
ATGQNVAGGN ASLTGGSVNL AGSQTAANGN LSLNATAGDV NLSNAMTSAQ GAIQAKAAST
VINDHGSLSS GGSTTLTGGS LSNQSGKVSS QGPLSVNVAG QIANQSGELV SESTADVHGG
AIANNQGTLQ SAAGMTVAGA SLDNTAGRIT SLNGDGLSVT TSGQLTNVAG TTANGAQGGV
IGGNGDVSVQ GANVVNRGAI TSNTNLRVSG QSVDNGSGTL QAAQKVAVDA GARLINNGGS
IVGQTAALTG TTLDNSAGVV QADQVSLNAT DLVNHGGTIT QTGAGAMSVN VSGTLENSNG
GTLQTNSTDL TLAPAALVND GGTITHAGNG TLTLGGGTGS VSNVGGSIAS NGRVVAQTGA
LNNTAGSINA QNGLAATVGG TLNNANGKLL SNTDLSVTSG TLSNDGGQIG ASANATIRTG
SMTNQGGSIV APNLSVTADS TLDNSGGKLE ANQLALTATN LVNHGGTITQ YGSSAMGMNV
SGTLDNSAAG VIQTNAADLT LAPAELNNAG GTITHAGTGT LTIAPGNGAS ALNNASGTIV
TKGQAIVNAA AWNNASGILA AQRGINATIT GDVNNTQGLL RSDASLSLKN GGALSNRGGH
IQAGQLVAGD SSTLAIESTS IDNADGAIVD LGAGKMTVQG GSQIANSHAS GVAGMGAITG
NGDVTVSAAS ISNTQSGQLS GASLHVQGNT LDNSGGTIGN VTNSNGDVDV KTIGAITNTN
GQISSTHDLS VAAATLQGGG TYSATHDANV NLQGDYTAAS DTQFNVGHDL AFTLPGTFTN
NANLQSVNNL SVNAGNIVNV GALTAGGLLH TQSTNLINTG ALVGASASLN ATNTISNLGP
TALIGASDSN GTLEILAHDI ENRDDTTATD SMATTAIFGM GKVVLGGGKD ASGNYTNAAL
VNNVSALIQS EGDMELHADK ITSTRRVMKT STSAIDPSLL GQFGIPISGR TGQVGVKDPD
SIGGVYTEPP HGGQWNSTYQ FTTYYADSAT ATTVTDISPA AQIVSGGKID ASSVGTLQNY
WSNIAAVGDI KMPARYDADG WAASGQNLPG VSVSYSGQYH YNNYDNTEHD WQLPFGNAPF
VTGRPGGYTQ AAPASIKDYK LPGYFSTMSS NGTISGTGVS VSNTAGNASI PSLGLLPGQS
VPGLTPTNLS GNASGAKSGA SAVHGGQSAP VDPIIASATA LNVLNNLTIP QGGLYRPTTA
PNASYVIETN PAFTNQKNFI SSDYFFGQLG VDFTHIPKRL GDGFYEQQLV RNEVTSLTGK
AVLGPYADLE TMYQSLMAAG ADLSKSLDLP IGASLSADQV SKLTSNVVMM ETRVVDGQSV
LVPVVYLAKA SQQNIDGPLI SATNVDFQNA QSFTNSGTIK ADNTLAIQGK QIDNAFGALQ
SGGLMSLKTE NNVDLTSANV KAGSLQLDAG KDLILDTATK TNTRVSRDGA TSVVTTLGPT
AKLDVAGDAS ITTGGNFQQN AGNLSVGGNL GMNVGGNWDL GAVQTGEHKI VQRANGVSNT
DINKVTGSSV TVGGQSSIGV GGDLTAKGAQ IDLGQGGTIA AKGNVTLGAA SATSTVNSNS
SGSDSHGSYA ETLHTSDQAL TGTTFKGGDT VTLASGKDLT ISGSTVSLDK GNANLMASGD
VNIGAATETH VLNSHETHSH SNVVSGVQVA SGIDQTATYS QGSTVSADGV NIVSNRDINV
AGSNVVGTND VTLQATRNVN ITTSQDTTQS SSYFDKKESG LLTNGGLSVT VGSRSAAQQD
QSSSVTNKGS VIGSSQGNVT IQAGKDATIT GSTIVAGQDV GIAAQNVTVN AAYDTYKDAQ
SQQFSQSGLS VGLGGGLVGL GQSMAGAVRQ GQQSGDSRLA AVQAVAAAEQ AYQNRGGIKD
AANALSNGNV SEAAKGVQVQ FSIGSSHSSS NATTSISSAK GSSIIGNGNV SITATGTPDA
NGNAQAGTGN IAMTGASVLG KNVALDANNA ITLQSAQSTE QSTSSNSSTG WNAGVAIGVG
KNTGISVFAN GSNSHGQGNG DSVTQTNTTV AAGNTLTMKS GGDTTLSGAK VSGDKVKVDV
GGDLTMTSLQ DTSNYSSNQH NTGVSGSFTF GYGGGVDASI GHTSIDANYA SVNQQTGIVA
GKEGFDVNVA GHTQLNGAQI ASAAPADSNT LTTGSLGFTD IQNKMSYSGS SEGFSTSGGP
SFAQTGDSAS GVTRAAVSPA KIVVKSDEQN GTDSTAGLSR DTANANQTVE NTFNLQKVQN
NMAFAQAFGK VATFAVAEAA TQLENSSPQM KALFGEGGAG RDALHAAVAA IGAALSGGNI
GGAVAGSLAG DVLQSLAQPI IDQTVSQLPL SAQAAARNAL NEIVATAGGA AAGAVAGGGS
SGALAGAGSA VNNELYNRQL HVEEVKVVEQ LAKEKAQAVC RGDSSCVAKA TTYWTDMLER
AAKGMVDDTA NKENMAYLQT LIQTANNPTS EGAMGGLSSY LTNLQTAQDM LSQYMGKPIL
VRGSPIISDG SAQTYFSATP EQRSNQSLNA ILGSLPGSIV PGASQRDQSR VDSFATQNGS
VKPDYTIEET VIGGILTNKI ASTAARVGES IDVWLAGPVN PTGKGFISTG KVTMESMPVK
LNTAEQGVLS QLDQLPSKDL QGQAREYVAN NYFVRNGFTP LDGKCGANCF DGVYVKGNTV
YVNEVKPLNE SGSISLNPPN SATGLPGQQT DNWVAYSVQR LKDTGDPQLI KTAEVVEQAF
RNGNLVKTVS GVNSNGMVVV KVPRNTP