Gene Bxe_A3647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBxe_A3647 
Symbol 
ID4004424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia xenovorans LB400 
KingdomBacteria 
Replicon accessionNC_007951 
Strand
Start bp902948 
End bp911902 
Gene Length8955 bp 
Protein Length2984 aa 
Translation table11 
GC content62% 
IMG OID637945994 
Productfilamentous haemagglutinin 
Protein accessionYP_557393 
Protein GI91782187 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGGA ACAATTGCCG GCTGGTATTC AGCCGACTGA GAAACATGCT TGTCGCCGTC 
GAGGAAACCG CCACGGCTAT GGGAAAGGAA AGCGGACAGA CCGCGGTTTA CGGTCGCGGT
AACCCGGCAG GTTATACAGC GCTATTCACA CTCAGGCAGA TCGCATTCGC GGCGCTTGCG
TTGCTCGGCG CGTTGCCAAC GTGGTCAGGC GCGCAGATTG TACCGGGCGG CGCGCACGCT
CCTTCCGTCG CGCAAACGCA GAACGGACTG GATCAGGTGG ACATCAACCG GCCTTCCGGC
GGGGGCGTGT CGGTCAACAC GTACAACCAG TTCGATGTGC AGCAGCGCGG CGCAATCCTG
AATAATTCGC CGACCATCGT GCAGACGCAG CAGGCGGGCG TGATAAACGG CAATCCGAAC
CTCGGGGCGG GCCAGTCGGC GCGCGTCATC GTCAATCAGG TGAACAGCAG CGCGGCGAGC
CAGATTAATG GCTATCTCGA AGTCGCCGGA CAGAGAGCCC AGGTCGTCAT CGCCAATGGT
TCCGGTATCA GCGTGAACGG TGGCGGCTTT ATCAATACGT CGCGCGCGAT CCTGACTACC
GGGACGCCGA ACTATGGCGC GGACGGCAGC CTGACGGGCT TCAACGTGAC CGGCGGTAAC
GTCACCATCG ACGGCGCCGG GCTGAACGCG GGCAACGTCG ATCAGGTCGA TCTGATTGCG
CGCGCCGTGC GAGCGAATGC CGCGATCTAC GCGAAGAACC TGAACGTCAT TACGGGCGCG
AACAGTGTCG ATCACGATAC GCTGAACGCC ACGGCCGTCG CCGGCAACGG ACCCGCACCG
GGCGTCTCCA TCGACGTGAG CAGTCTCGGC GGCATGTATG CGAACCGTAT TGTTCTCGTC
GGCACGGAGT ACGGTGTGGG CGTGTCCACT AAAGGCGTAC TTGCCGCGCA GGCGGGCGAT
CTGATTCTGA CCACACAGGG CAAGCTCGTC CTCGCGGGGC AGACCAACGC GAGCGGGAAC
ATTGCGATCA ACGCGCACGA CGGCATCGAT AACAGCGGCA CGACCTACGC GCAGCAAAAC
GTCAGCGCGA ACACATCGGG CGCGCTGAAC AACAGCGGCA CGCTGGTCGC GCAGCAGGAC
GCGACGATCA ACGCCGGGAG CGTTGCATCG GGCGGAACGC TTGGCGCAGG TATCAACGGC
GACGGCTCGA TTGCACAGGC GGGCGAACTA AACCTGAACG CGAGCGGTGC GCTCTCGGCA
ACCGGCCGCA ACGCGGCGGG TGGCAACGCG ACCGTACAGG GCGCATCGGT CAATCTCCCT
GGCAGCAGCA CATCGGCGAA TCGCAACCTC GTCCTGACCG CGACGGGTGG CGACCTGAAC
ACATCGGGCG CCACGACGAC CGCAGGCGGT GCGCTGACGG CCAGCACGGC GGGCGCGCTG
ACCAACGACA ACGGCGTTAT GTCGTCAGGC GGCGCGCAGA CCGTCAACGC CGCCGCACTC
TCGAACCGTA GCGGCCAGAT GGTAGCGGGA GGCACGCTCT CGGAAAACGT CACGGGTGCA
GTCTCGAACA CAGGCGGCAC GATGCAGGCG GCGGGCGCGC TGACAGCAAG CGCCGGATCA
TTCGACAACA CGGCCGGACG TGTCGCGTCG CTCAATGCGG ACGGCCTGAC CCTCGCAAGT
ACGGGCGTGC TGAATAACGG CACGGGCGGC AACATCGGCG GCAATGGCAA CGTCACGGTG
CAGGCCGGGC AGATTGTCAA CGCGGGTTCG ATTACGGCAG TCCAGAACCT GATTGCGGCG
GCGGTTCAAA CCCTGTTCAA CGGCGGCACG CTCGCAGCGA ACGGGAACGT GAACGCATCG
GCCGGCACGA CGCTGACCAA CGCGAACACG ATCACGGCCG GCAGGCAGGC AGCCGTCAGC
GCCGCGACAT TCGATAACAG CGGCGGCTCG GCAAGCGCTG ACCAGTTCAC GCTTTCCGCG
ACGAACCTCG TCAATCACGG CGGCTCAATT ACGCAGACCG GCAACGGTGC GACCAGCGTC
AACGTATCCG GCATGCTCGA CAATACGGGC GGCACGATCC AGACCAACAG TACGGATCTC
GCGCTCGGCC CCGCCACGCT GCTGAACGAC AACGGCAAGA TCGCCAGTTC CGGCAGCGGC
ACGTTGTCAG TGAAAACCGG AACGCTGTCG AACAATGGCG GGACGATTGC GACCAACGGC
GCACTGATGA TTGACGGCGG CGCAGTTTCG AATCGTGGCG GCACACTCGC GGGCCAGTCG
TCGGCCACGC TCAGACTCGT GTCGCTCGAC AACAGCGCGG GCGGCTATAT CGGCGCGCAT
CGCGCGAGTG TGATCGATAC GGGTACGCTC AACAATGCAG GCGGAACGAT CCAGGCCGAC
GATGCGCTCG CGGTATCCGC GCAGTCGGTG ACGAATGACG GCGGCACGAT TGCCAATGGT
GGCACAGGCG CAACGACGGT CAATGCGGCC GGCGCACTGA CGAACACGAA TAACGGCCTG
ATTGGCGGCA ATGGCAACGT GTCGGTATCG GGTGCGAGCA TCGACAACTC GGGCGGCACG
ATCACGGCAG CCGGCGCCAC GACCGTGCAG TCCGGCAGCA CGCTCGGCAA CCGCGCCGGC
ATGATCCAGG GTACGGGCAA CGTGAGCGCA TCGGCACAGG GTGCAATCGA CAACACGGGC
GGCCAGATCG AAGTGGACGG CACGAACGCC ACCATGCAGC TATTGGCTGC GTCGCTCGAC
AACACGAACG GCCGCGTCGC GAACACCGGC AACGGTGCGA CGACAATAAG CGCGGCCGGT
ATCACAAACA GCAACACGGG CGGCGTAGCA GGTGCAGGTG CGATCGGCGG TAATGGCGAC
GTGACTATTA ACGCGACGAC GCTGTCGAAC ACGAACGGCG CGCAACTCGT CACCGGTCAC
GACCTGACGC TGAACATTGC GCAGTTTGCG AACAACACGA ACGCGATACT CTCGGGCGCG
AATAACGTCA CGCTTAACGG TCTGAACGCG GCGGTCATCA ATGCGGGCGG CTCGATCCAC
GGCAACGGCG CGATTGGCCT GAACGTCGCC TCGCTGGACA ACACAACAGG CAGGATCGGC
AACGATGCCG GGAGCGGCGG CAGCATCGCA ATTGCTACGG GCTCGCTTGC CAATCAGGGC
GGCGCAATCG GTAGCGACCA GAATCTGAGC GTTACAACGA ACCAGCTTAC CGGCGACGGC
CGGATCATCG CCGGCAACGA CGGCGCGGTG ACGGTGAACG GCGATTACAC ACTCGACGGT
ACGAACCAGA TCCAGGCGAA TCACGACCTC ACCTTTACCA CGTCGGGTAA TTTCACGAAT
CAGGGTACGC TCGGCGCGGT GAATGCGCTC ACGGTCAACG CGGCCAATGT CGATAACCAG
GCCGGCGCAG ACCTGAATTC GACGAACACA ACGGTCAACG CAGCCGGCTC AATCAGCAAC
GCGGGCCGCA TCGAAGGCGA TAGCGTCACC ACGCACAGCG CTGCGCTTGT GAACGTCGCG
ACCATCGTCG GCAATACCGT GACACTGAAC GCCGGTTCCA TCGCGAACAC CGGCGCGGCG
GCAGCCATTG CAGCGGCCAC GGCAGTGAAC CTGTACTCGC CTGGCGACAT TTCGAACACA
GGCGGCGCGA ACATTTTCAG TCTTGGCGAC ATTAGCATCG CGGCTGACGC CACGCGTGAC
GCCAACGGGT TACTTGCAAA CCGCTCGAAT TCAGTCACGA ACGACCAGTC AACGATTGAA
GCACAGGGCA ACATTGAAGT CGCCACGCAG ACGTTGACCA ATACGCGGCC GGCTCCCGCT
GTCGAAACGG TGACAACCGA TGTTGAAACG GAACATCAGA CGAAGCGCGA CAAATACATG
GCTTGCACGA CAGAAAACGG CGACAAAGGT TTCTGTACGA CGGACATGTG GAATAACGGT
TATCTCGCAC CGATCAATAC GACCTTCTCG AATTCAGATG TTGTGTCGCA GAACTCCGGT
CCCAATGCCA CCGACAAGGT GCTGGTCGTC AACGTCAACG GCCAGCCGCA GACGATCTAC
TACAACTCGA TCACGACCAA CGCTGACGGC ACGATCACGG CTAACTATTG GGACGGCTAC
GACCCGAACG TCAACTACGA CCCGGCCACG GAGTACACCT CACGCAACGA TGCTCATAAG
GGCTATCAGC GCGTCGAGAT AGCACGCGAC ACGACCACCA CCACGCAGCA GGATCAGGTC
ACGGGTCCGC AGGCACAGCA GGCCCAACTA CTCGCGGGCG GCAGCATGAC GCTCGCCAAC
GTCGGGACCG TGAACAATGC GTATAGCGCG ATTGCAGCAG GCAGCGCCAT CGCGATCGGC
AGCGACGCCT ATGGCGGCAC GCTCGTGAAC AACACCGGCC AGACGCTCTA CCAGTACCAG
AAGCAGGACA TCGTTTCGAC CTATGCGTGG AACGAGAACA TTTCGCGCGA CGTGGGAACC
GTGGTCGAGC CGTCGATCGT CCTTTCTCCG GTCGCGATCG GCGGCACGGG CGGCACGATC
ATCGCGAACA ACGCGGTGCA GATCAATGCG ACCGACATCA ACAACACAAA CGTCGCGGCA
GCGAACTCGG CCACGGGTGC AACGGGCGGC ACACTCGGCG CCAATGGCAT AGTCGGCGGC
GTGACAGGCG GCGGCGCGCA GACCGTCAAT CTCGCGACCG GTCAGACGCA GACCATTAAC
GCGCCGCAGT CCATCGCCGG ACCGACTGGC GCGCTGAATA TCACGTTGCC GGCGAGCGGG
CTGTACACAT TCAACACCTC ACCGGGCGCG TCGTATCTCG TCGTGACCGA TCCCCGTCTG
ACCAGTTACA CAAGTTTCAT TTCGAGCGAC TACATGCTCG GCGCACTCGG GCTCGATCCG
TCGAAGACCA TCAAACGGCT CGGCGACGGC GCGTATGAAG AACAGATGGT GCGCAACCAG
ATCACCCAGT TGACCGGGCG TGTCTACCTG CAAGGCTATA CGAACAAAGA GGACGAGTAT
CGCGCGCTGA TGAACAGCGG CGTCAACGTC GCGAAGGAAT TCAATCTTGA GCCGGGCATG
GCGCTCACGG CTGCGCAGAT GGATGCGCTG ACCAGCGACA TCGTATGGAT GGTCAACCAG
ACCGTCACCC TGCCGGACGG TTCGACGCAG ACCGTGCTTG CGCCCGTCGT GTATCTCGCG
CACGCAAACG CCAATGACCT GCAACCGACC GGCGCACTGA TTGCGGCCGA CGATGTCGAG
ATTCACGCCA CCGGCAGCGC GACGAACTCG GGTGTCATCA AGGGCGGTAC GCAGACCGTT
ATCAGCGCGA CCAATATCCT GAATCGGGGC GGCTCGATCG GCAGCAGCGG CGAGAACGGC
ACGACCGTGA TTGCGGCGAC GAACGACGTG GTCAACGCCT CGGGCCGGAT TACCGGCAAC
CGCGTGGCCG TGCTGGCGGG TCATGACATC GTCAATACGA CGCTGGTGGA TACAGTGGGC
GTCAGTTCGA GCGCAGGCAA CAGCAAGGTC AACCAGAGCC TTGTCGGTGC GCAGGGCACG
ATTGCATCGA CGGGCGACAT GGTCATCGCA GCAGGCAATG ACCTGATGGT TCACGGCGCC
AACATCGCGG CCGGCGGCAA CGCGCAGATC GCGGCCGGGC ACGACATTAA CGTCGATACC
GTGCAGTCTG ACACCTCGCA GTCCGTGACG AAGAACGACC AGCACCATTG GGAAGCCAGC
AGTACGCTCA ACCAGACAAG CGGCATCGGC GCGGGCGGCG ACCTCGCGAT GCAGAGCGGC
AACGATATGA CGTTCAGGGG GGCCACTGTC GCGGCGGGCG GCGGTATGGC CGTTGTCGCG
GGTGGCAATC TGACGGCGAC CACCGTAACA AACACTGCGA CCTATAACAA CGTTGCCACT
GACGACAGGA CGCGCAAGCA GGCGGACCGC AGTTATGACG AGCAGGCAGT CGGCACCAGT
TTTACAGCCG GCGGCAGCGG CACCCTTGCT GCACTCGGTA CGGACACCAC GAAAGGCAAT
GTCACGTTGA CCGGTTCGTC GCTTTCGACG GGAACCGGCA CGGCGAACAT CGCCGCAACC
GGCAACGTGA ACATCAACGA AGCGCGCGAG GAACACGACA GCTATACGGC AACCGAGTCG
AAGCGCGGCA GTGCCTTCCA TGGTTCGACC ACGAACACCA GCCAGACCAC GCAGGCGAAT
ATTGGCGTCG CCAGTACCGT TTCAGGGGAC TCGGTCAACG TCAGCGCCGG TCGCGACCTG
ACCGTCAAAG GTGCGACCGT TGCCGGCACG AACGATGTGA ATCTCGCGGC CGGCGGCAAC
GTGGCGATCA CGACCTCGCA GGACACACAG AATACGTCCA CCTACTACCA GAAACATGAA
TCCGGTTTCG GCACGGGTGG CGGTATCGGA ATATCGGTCG GTAGCCAGAC GCAGACCAAC
ACGGGCAGCA TGTCACAGGT GACGAATACT GGTAGCACGA TCGGGTCGCT GAACGGCAAC
CTGAACATCG TCGCCGGCAA CGATCTGCAT GTGACGGGCA GTGACCTGAT TGCAGCGAAG
AACGTCACGG GTACGGGGGC GAATGTCACC ATCGACTCAG CGACTGACAC GTACCGCCAC
GACGAGAAGC AGACGGTCAG CAAGAGCGGC TTCACACTTG CGATCAAGGC GCCGGTTATC
GACGCGATCT CGAACACGGT CGATCAGGCT CATGCGGCAA GCCACAGCCA GGACGATCGC
GCGGCGGCAC TGCACGGCAT CGCGGCGGCG AGCAGTGCAG TCGATGCTCT TGGCGCGGGT
GGCGCTGCTG CTGGCGCGCT CGCAAACGGC GCTCAACCCG AAGCCAAGAT AGAACTCAGT
TACGGCAGTA GTCACAGCGA GAACACCTAT TCGGAAGCCT CAACCACGAA CAGAGGCTCC
AATGTGACGG CGGGCGGCAC GGCTGCATTC GTCGCGACCG GCAACGGCGA ATCTGGAAGC
GGCAACGTGA CGATTGCCGG CTCGAACGTG AACGCCAGTG ACGTGATTCT CGCAGCGAAG
AACCAGGTTA ATCTCGTCAG CACGACCGAC ACTGATTCGA CGCGCAGCAC GAACCAGTCG
AGCAGCGCGA GCGTGGGCAT ATCGTATGGT ACGAGCGGTT TCGGTGTGGA TGCGTCGATG
TCGAAGGCGC ACGGCAACAG TAACGGCGAC ACGGCCATCC AGAACAATAC ACATGTGACG
GCCGCGAATA CCGCGACGAT TATCTCGGGC GGCGATACGA ACATCATCGG CGCGAACGTG
AGCGGCAATC AGGTGAACGC CGACATTGGC GGCAACCTGA ACATCGCGAG TGTGCAGGAC
ACCATGACGA CGGCCGCGCA TCAGGAAAGT ACGGGCGGCG GATTCGCCAT CAGTCAGGGC
GGCGGCAGCG CGAGTTTCAG CCACACGAAT GCGAATGCGA ACGGTAGTTA TGCGGGCGTG
AACGAACAGG CTGGCATCCA GGCGGGTGAC GGCGGTTTCA ACATCAGCGT GAAGGGGAAC
ACGGATCTGC ACGGCGCGGT GATTGCGAGC GATGCGGACG CATCGAAGAA CACGCTTTCG
ACGGGCACGC TGACCTATAG CGACATCCAG AATTCGTCAA GCTACAACGC GCATACGGGC
GGCATCAGTG GCGGCGTGAC GACCGGCGAC GGCGGTGCGA ATTACAGCAC GACCGGTTCA
ACGTCAGGCA GGAATGCAGG CGGCGTGGCG CCGATGTTGA GTCAGAACGA CAGCGGCAGC
GATAGCGCGA CGACGAAAAG CGGCGTCAGT GCGGGCTCCA TCAGCGTGAC CGATGCGGAC
CACCAGACGC AGGACATCGC GAGCCTGAAC CGTGACACGT CGAACACAAA CGGCACGGTT
GCCAAATTGC CGGACGTGAA CAATCTGCTT GATCGTCAGG GGGACATGAT GGCCGCAGCG
GGCGCGGCCG GCGAAGCCGT CTCCCGGCGG ATCGGTGACT TTGCACAGTC GAAATACAAA
GAGGCAGAAG CAAACGGCGA TCAGGCCGGA ATGGATGCAT GGAAGGAAGG CGGAACGGCT
CGCGCAGAAA TGCAGGCGGC GGGCGCCGCA CTTGTGACCG GACTGGCAGG CGGCAACGCA
ATCGGCGGCG CAGCGGGTGC AGGTATTGCG TCGATTGCAG CAGGCAAACT CAACGAACTG
AGCGGCACGA TTGCCGGCTC GAACCCGACC GGCAACGCTT CGATGAATGA AGCCCTCGGC
AACATCGTGG CGGATGCCAT TGCTACAGGC GCGGGCGGTG CGATCGGCGG CGACGCGGGC
GCGTTTTCGG GCTACAACGT TGACCGGTTC AATCGGCAGT TGCACCCCGA CGAATACGCG
CGGGCGAAGA AAGACGCCAA AGTCGTCGCA CAGCAGCTTG GAATCAGCGA ACAGGAGGCT
GAAGGCCGGA TCGTAGCTGA AATGCTACGC AATTCGGACA AACAAACGGC GGATGACTCT
GGCGGCATTC ACGACTATCA GGTGCGGGCC ATCATCGGAT GCCAGAATCT GAACTGCGAC
GGCTACAAGA GCGATCCGCA GTATGCCAAT CACGACTACA ACAGCCAGTA CATTGCCAGC
AACCAGAGCG CGTACAACGC CGGACAGAGT CAGCTTGGCA AGGGCGTAAC TTATAACGAT
CTGGTGAAGA ACAACGTCAA GAACAATCCA GTCAGCACAG CCATCGCTGG CGCGGGCATG
ATGGCGCTTG GTGGGGTAGC TGCTGGTGGA CTGCCGTCGA TTGGAGGCGC GTTGATCGGT
GGCGGGATTG GTGGGACGGT CAACGCAGGC GCACAGTATA TGTACGGCGG GGGTCAGGTT
AGCGTGGTTG ACGTAGCCAT TGCTGGACTG ACCGGAGCGA TCACATATGG AACCAGTCTT
TTGCCGGGCC TTTAA
 
Protein sequence
MNRNNCRLVF SRLRNMLVAV EETATAMGKE SGQTAVYGRG NPAGYTALFT LRQIAFAALA 
LLGALPTWSG AQIVPGGAHA PSVAQTQNGL DQVDINRPSG GGVSVNTYNQ FDVQQRGAIL
NNSPTIVQTQ QAGVINGNPN LGAGQSARVI VNQVNSSAAS QINGYLEVAG QRAQVVIANG
SGISVNGGGF INTSRAILTT GTPNYGADGS LTGFNVTGGN VTIDGAGLNA GNVDQVDLIA
RAVRANAAIY AKNLNVITGA NSVDHDTLNA TAVAGNGPAP GVSIDVSSLG GMYANRIVLV
GTEYGVGVST KGVLAAQAGD LILTTQGKLV LAGQTNASGN IAINAHDGID NSGTTYAQQN
VSANTSGALN NSGTLVAQQD ATINAGSVAS GGTLGAGING DGSIAQAGEL NLNASGALSA
TGRNAAGGNA TVQGASVNLP GSSTSANRNL VLTATGGDLN TSGATTTAGG ALTASTAGAL
TNDNGVMSSG GAQTVNAAAL SNRSGQMVAG GTLSENVTGA VSNTGGTMQA AGALTASAGS
FDNTAGRVAS LNADGLTLAS TGVLNNGTGG NIGGNGNVTV QAGQIVNAGS ITAVQNLIAA
AVQTLFNGGT LAANGNVNAS AGTTLTNANT ITAGRQAAVS AATFDNSGGS ASADQFTLSA
TNLVNHGGSI TQTGNGATSV NVSGMLDNTG GTIQTNSTDL ALGPATLLND NGKIASSGSG
TLSVKTGTLS NNGGTIATNG ALMIDGGAVS NRGGTLAGQS SATLRLVSLD NSAGGYIGAH
RASVIDTGTL NNAGGTIQAD DALAVSAQSV TNDGGTIANG GTGATTVNAA GALTNTNNGL
IGGNGNVSVS GASIDNSGGT ITAAGATTVQ SGSTLGNRAG MIQGTGNVSA SAQGAIDNTG
GQIEVDGTNA TMQLLAASLD NTNGRVANTG NGATTISAAG ITNSNTGGVA GAGAIGGNGD
VTINATTLSN TNGAQLVTGH DLTLNIAQFA NNTNAILSGA NNVTLNGLNA AVINAGGSIH
GNGAIGLNVA SLDNTTGRIG NDAGSGGSIA IATGSLANQG GAIGSDQNLS VTTNQLTGDG
RIIAGNDGAV TVNGDYTLDG TNQIQANHDL TFTTSGNFTN QGTLGAVNAL TVNAANVDNQ
AGADLNSTNT TVNAAGSISN AGRIEGDSVT THSAALVNVA TIVGNTVTLN AGSIANTGAA
AAIAAATAVN LYSPGDISNT GGANIFSLGD ISIAADATRD ANGLLANRSN SVTNDQSTIE
AQGNIEVATQ TLTNTRPAPA VETVTTDVET EHQTKRDKYM ACTTENGDKG FCTTDMWNNG
YLAPINTTFS NSDVVSQNSG PNATDKVLVV NVNGQPQTIY YNSITTNADG TITANYWDGY
DPNVNYDPAT EYTSRNDAHK GYQRVEIARD TTTTTQQDQV TGPQAQQAQL LAGGSMTLAN
VGTVNNAYSA IAAGSAIAIG SDAYGGTLVN NTGQTLYQYQ KQDIVSTYAW NENISRDVGT
VVEPSIVLSP VAIGGTGGTI IANNAVQINA TDINNTNVAA ANSATGATGG TLGANGIVGG
VTGGGAQTVN LATGQTQTIN APQSIAGPTG ALNITLPASG LYTFNTSPGA SYLVVTDPRL
TSYTSFISSD YMLGALGLDP SKTIKRLGDG AYEEQMVRNQ ITQLTGRVYL QGYTNKEDEY
RALMNSGVNV AKEFNLEPGM ALTAAQMDAL TSDIVWMVNQ TVTLPDGSTQ TVLAPVVYLA
HANANDLQPT GALIAADDVE IHATGSATNS GVIKGGTQTV ISATNILNRG GSIGSSGENG
TTVIAATNDV VNASGRITGN RVAVLAGHDI VNTTLVDTVG VSSSAGNSKV NQSLVGAQGT
IASTGDMVIA AGNDLMVHGA NIAAGGNAQI AAGHDINVDT VQSDTSQSVT KNDQHHWEAS
STLNQTSGIG AGGDLAMQSG NDMTFRGATV AAGGGMAVVA GGNLTATTVT NTATYNNVAT
DDRTRKQADR SYDEQAVGTS FTAGGSGTLA ALGTDTTKGN VTLTGSSLST GTGTANIAAT
GNVNINEARE EHDSYTATES KRGSAFHGST TNTSQTTQAN IGVASTVSGD SVNVSAGRDL
TVKGATVAGT NDVNLAAGGN VAITTSQDTQ NTSTYYQKHE SGFGTGGGIG ISVGSQTQTN
TGSMSQVTNT GSTIGSLNGN LNIVAGNDLH VTGSDLIAAK NVTGTGANVT IDSATDTYRH
DEKQTVSKSG FTLAIKAPVI DAISNTVDQA HAASHSQDDR AAALHGIAAA SSAVDALGAG
GAAAGALANG AQPEAKIELS YGSSHSENTY SEASTTNRGS NVTAGGTAAF VATGNGESGS
GNVTIAGSNV NASDVILAAK NQVNLVSTTD TDSTRSTNQS SSASVGISYG TSGFGVDASM
SKAHGNSNGD TAIQNNTHVT AANTATIISG GDTNIIGANV SGNQVNADIG GNLNIASVQD
TMTTAAHQES TGGGFAISQG GGSASFSHTN ANANGSYAGV NEQAGIQAGD GGFNISVKGN
TDLHGAVIAS DADASKNTLS TGTLTYSDIQ NSSSYNAHTG GISGGVTTGD GGANYSTTGS
TSGRNAGGVA PMLSQNDSGS DSATTKSGVS AGSISVTDAD HQTQDIASLN RDTSNTNGTV
AKLPDVNNLL DRQGDMMAAA GAAGEAVSRR IGDFAQSKYK EAEANGDQAG MDAWKEGGTA
RAEMQAAGAA LVTGLAGGNA IGGAAGAGIA SIAAGKLNEL SGTIAGSNPT GNASMNEALG
NIVADAIATG AGGAIGGDAG AFSGYNVDRF NRQLHPDEYA RAKKDAKVVA QQLGISEQEA
EGRIVAEMLR NSDKQTADDS GGIHDYQVRA IIGCQNLNCD GYKSDPQYAN HDYNSQYIAS
NQSAYNAGQS QLGKGVTYND LVKNNVKNNP VSTAIAGAGM MALGGVAAGG LPSIGGALIG
GGIGGTVNAG AQYMYGGGQV SVVDVAIAGL TGAITYGTSL LPGL