Gene Bphy_4291 details

Gene Information

Locus tag: Bphy_4291
Symbol:
ID: 6245815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010623
Strand:
Start bp: 1294023
End bp: 1303262
Gene Length: 9240 bp
Protein Length: 3079 aa
Translation table: 11
GC content: 63%
IMG OID: 642596047
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_001860454
Protein GI: 186473112
COG category:
COG ID:
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
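
The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 1294023
end_bp = 1303262
gene_length_bp = 9240
protein_length_aa = 3079


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under these assumptions, 1303262 - 1294023 + 1 = 9240 bp and 9240 / 3 - 1 = 3079 aa, so the listed values agree with each other.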


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGGA CGACTTACCG GCTTGTGTTC AGCCGGCTTC GCGGGATGCT CGTGGCCGTC 
GAGGAAACCG CCACTGCTTC GGGCAAGTCT GCCGGCGAGA CCCGCGCAAC GGGCCGCGCG
TCAAGGAAGG CAACACGTTC GCCGCTTCGC AATCTGGTTG CGCTCATGCT GGCTGTCGCG
CCCCTGCTGG CGTTCTCGCA GATCGTTCCA GGCGGCGCGC ACGCGCCTGG TCTTAACACG
ACGCAGAACG GTATCCCGCA AGTGAATATC AACAAGCCCT CGGGTGCGGG CGTGTCGATG
AACACCTATA GCCAGTTCGA TGTTCAAAAG AATGGCGCGA TCCTGAACAA CTCGGCGACG
ATCACGAACA CGCAACTTGC GGGGTACATC AACGGTAATC CCAACTTCGG GCCGAACGAT
GCCGCGAAAA TCATCGTTAA TCAGGTCAAC TCCAGCAGTG CAAGCCAGAT CAACGGCTAT
GTCGAAGTCG CCGGGCCGCC TGCTGAAGTC GTGATCGCGA ACGGCTCAGG CATCAGCGTG
AACGGCGGCG GTTTCATCAA TACGTCTCGC TCGATTTTGA CCACAGGTAC GCCCAACTTT
GCGGCGGACG GGTCGCTCGC GGGTTTCAGT GTGACGGGCG GCAACATCAC CGTGCAGGGC
GCGGGCTTCA ATGCGTCGAA TATTGATCAG GTAGACCTGC TGGCGCGTGC GGTCCAGGTC
AATGCCGCGA TCTATGCCGG GAAGAACCTG AACGTCATCA CGGGCGCGAA CAGCATCGAC
CATAACACGC TTGCTGCCAC ACCGATAACG GGAAACGGCC CCGCGCCTGC CGTGTCCATC
GACGTGAGCA ACCTGGGCGG TATGTACTCG GGGAAGATTC TGCTTGCCTC GAATGAGTAC
GGTGTCGGGG TTTCGAACCG GGGCGTGATT GCTGCACAGT CCGGCGACCT GACACTGACA
TCACAGGGCA AGCTGGTTCT CGCTGGACAG ACCAACGCGA GCAGCAACAT CAGCGCAAGC
GCACGCGATG GCATTGACAA CAGCGGCACG ACTTACGCGC AGCAGAACGT CAGCGCGAAC
ACGTCGGGCG CGCTCACCAA CAGCGGCGTG CTCGCCGCGC AGCAGAACAC AACCGTCAAT
GCGGGCAGCG TCGCCTCGAC GGGCACGCTC GGCGCTGGCG TGAACGGCGA CGGCACGATG
GCCAATGCGG GCGACCTGTC GGTGTCGGCG ACGGGTGTGG TCACGGCGGC GGGCCGCAAT
GCGGCGGGCG GCAATGCGTC AGTGAATGGC TCGGCGGTCA ACCTTGCCGG GAGCACGACA
TCGGCCAAAG GCACGATGAC GCTGGCGGCC AACGGCGGCG ACCTGAATCT GTCAGGCGCG
ACGACGACGG CGGGCACGAC GCTCAACGCC CGCGCAACGG GCACCCTCAC AAACGATAGC
GGCGCGCTGT CTTCGGGCGG TGCGCAGGCG ATCACAGCGG GCGCCCTGTC GAATAACAGC
GGCCAGATGG TGTCCGGGGC CACGCAGACG GCAACGGTCA CGGGCGCGGT CTCGAACCAG
AGCGGCACGA TGCAGGCCAA CGGTGCGCTG TCCTTGCAGG CGGGTTCCGC CGACAACACG
GGCGGTCACA TCGCATCGCT CAATGCCGAC GGCCTGAATC TCACCGTGAG CGGCCTGCTG
AAGAACGGCG CGGGCGGCAG CATCGGCGGG AACGGCAACG TGACCATCCG GGCGGGACAG
GTCGCAAACG CCGGTTCGAT CACTGCCGTC CAGAACCTGA TCGCGTCGGC CGTGCAGACG
TTGTTCAGCA GCGGCACGCT CGCGGCGAAC GGCAATGCAA CACTTTCGGC TGGCACGTCG
CTGACGAACT CGGGCAGCAT CACAGCCGCG AGCGCGGCAT CGATCACGGC ATCGACGTTC
GACAACAGCA GCGGCACGAC GCAGACCGCT CAGTTCACGC TGCGCGCGAC GAACCTCGTC
AACCGCAACG GCAGCATCGC ACAGACAGGC ACGGCCGCGA CGACCGTCGA TGTAACCGGC
ACGCTCGATA ATACCGGCGG CACGCTCCAG ACCAATGCCG ACAGCCTCAC GCTCGGCCCG
GCCACACTCA CCAACGACCA CGGGGCGATT GCCAGCGCTG GGACGGGAAC GCTGTCGGTT
GCAACGGGCG CGCTGTCGAA CAACGGCGGA ACAATTGCGA CGAACGGCGC GCTCGACGTG
CGTGCGGGCG CGGTATCGAA TCGCGGCGGC ACGCTCGCGG CGCAGACAGG CGCGACGCTG
CTGGTTGCTT CGCTCGATAA CAGCGCGGGC TACATCGGTG CGCAGACGGT GTCGGTCACG
GACGCGGGCG CACTCAACAA CGCGGGCGGC ACGATCCAGG CGGACGATAC ACTCGCGATG
TCCGCGCAGT CGGTCGCCAA CGACGGCGGT TCAATCGCAA ACGGTGGCAC GGGCGCGACG
AGCGTCACGG CGACGGGCAC GGTCACGAAC ACGCAGAACG GCCTGATTGG CGGCAATGGT
GGCGTATCGG TTGCGGGTGG CAGCGTCGAC AACTCGGGCG GCACAATCAC GGCGGGCGGC
GCGGTATCCG TCACGTCCGG CAGCACGTTC GGCAACCGTG CGGGCATGAT CCAGGGCACG
GGTAACACGA CGGTGTCAGC ACAGGGCGCA ATGGACAACA CGGGCGGGCA GATCGAGGCG
GACGGCACCG GGTCAACGCT GACGGTCACG GGCGCAACGC TTGACAACAC CAACGGTCGC
ATCGCAAACA CAGGTTCCGG CACGACCACG ATCAGCGCCA GCGCGATCAC GAACAGCAAT
ACAGGCGGCG TCGCGGGCGC GGGCACGATT GGCGGCAACG GCGATGTCAC GCTCAACGCG
CAGACGCTGT CCAGCACGAA CGGCGCGCAA CTCGTTGCGG GTCACGACCT GACGCTGACC
ATAGCGCAGC TTGCCGACAA CACGAGCGCG ACGCTCTCGG GCGCGAACAA CGTCACGCTC
AACGGCCCGA ACGCAGTCGT CATCAACGCG GGTGGCTCGA TCCACGGCAA CGGCGTGGTC
ACGCTCAACA CCTCGACGCT GGACAACACG AATGGACGTA TCGGCAACGA TACGGGCAGC
GGCGGCAGTA TCGGCATTAC CACAGGCGCA TTTGCGAACC AGAACGGCGC AATTGGCAGC
GACCAGAACC TGAGCGTCAC GACAGGCTCG CTCACGGGCG ACGGCCGGAT CGTGGCTGGC
AACGACGGCA CGGTCACGGT GAACGGCGAC TACACGCTGA CGGGCGCGAA CCAGATCCAG
GCAAACAACA ACCTCACCTT CACGACAGCG GGCAACTTCA CGAACCAGGG CACGCTTGGC
GCAGTGAATG CACTGACGCT CAACGCTGCG AACGTCGATA ACCAGGCGGG CGCGGATATC
AACTCTGCGA ACACGACGGT CAACGCGGCC AACGCGATCA CGAACGAAGG CCGCATCGAA
GGCGATAGCG TCACGACGCG CAGCGCGTCA CTCTCGAACG TCGCGACCAT CGTCGGCAAC
ACGGTCACAC TGAACGCCGG TTCGATTGCG AACACGGGTG CTGCGGCAGC CATTGCCGCA
GCGTCGGCCG TTAATCTGTT CGGCTCCGAC ATCTCGAACA TGGGCGGCGC GAACATATTC
AGCCTGGGTG ACGTCAATAT CGCGGCCGAC GCCACACGCG ACGGCAACGG GCTGCTCGCG
AATCGCGCGA ACTCGGTGAC GAACGACCAG TCGACCATCG ACGCACAGGG CAACATCGAA
ATCGCGACGC AGACGCTGAC CAACACGCGC CCCGCGCCCA CGGTCGAGAC GGTCACGACC
GATGTCGAGA CGATTCATCA GACCAAGCGC GACAAATACA TGGCCTGCAC GCCGACCAAT
GGCGACAAAG GCTATTGCAC GCAGGACATG TGGGACAAGG GCTACGAGAA TCCGCTAAAC
GCCACGTTCA GCAATGCCGA TGTCGTATCC ACCGCCAGCG GCCCGAATGC GGTAGACCGC
GTGCTTGTCG TGAATCTCAA CGGTCAGCAG CAGACCATCT ACTACAACTC GCTCACGACC
AACGGCGATG GAACGGTCAC GGTCGCCTAT TGGGACGACT ACGACCCGCA CGTCAACTAC
GATCCAGGCA CGGAATATCC TGGCGACAAC CAGGCGCACC ACCACTACCA GCGCATCGAG
TCGGCGCGTG ACACGACGAC CACGACACAG CAGGACCAGG TTACCGGGCC ACAGGCGCAA
GAGGCTCAGA TCATGGCGGG CGGCAACATG GTCCTTGCCA ATGTCGGCAC GCTGAACAAC
AGCTTTAGCG CCATTGGAGC GGGCGGCGCG ATCCCGATCG GCAGCAGCCA GACGGACGGC
GGCGTGGCGA GCGGCAACTA TGGCGGGACG ACCGTCAACA ATACCGGACA GACGCTGTAC
CAGTATCAGA AGCAGGACAT TGTATCGACC TATGCGTGGA ACGAGGACAT TTCGCGCGAC
GTCGGTCAGG TGGTCCAGCC CACTGTGATC CTGACGCCGG TTGCCATCGG TGGCCTGGGC
GGCACGATCA TCGCGAACAA CGCGGTGCAG ATCAACGCAA CCGACATCAA CAACACGAAT
GTCACGGCGG CGAACTCGGC CACGGGCGCA ACGGGCGGCA CGCTCGGCGC GAACGGAACG
GTGGGCGGCA TCACGGGCGG CGGCGCACAG ACCGTGAATC TCGCGACAGG ACAGACGCAG
ACCATCAACG CGCCGCAATC GGTCACAGGG CCAATGGGAG CGCTGAATAT CGCGTCGCCC
AAAAGCGGCC TCTACACGTT CAACACGGCA CCTGGCGCGG CATATCTGAT CGCAACCGAT
CCGCGCCTGA CCAGCTACAC GAGCTTCATT TCGAGCGACT ACATGCTCAG GCAGCTTGGC
TACGACCCGT CGACGGTCGA GAAGCGCATG GGTGACGGGC TGTACGAGAC GACGCTGATC
CGCAACCAGA TTACCCAACT GACGGGTCGC GTGTACCTGC AAGGCTATAC GAACAACGAG
GACGAGTATC GCGCGCTGAT GGACAACGGC GTGAATGTCG CGAAGGAATT CAGCCTGGAA
CCCGGCATGG CGCTGACGGC CGCGCAGATG GACGCGCTCA CTAGCGATAT CGTGTGGATG
GTGAACCAGA CCGTGACGTT GCCGGACGGC AGCACGCAGA ACGTGCTCGC GCCCGTCGTC
TATCTCGCGC ACACGCACGC GAACGACCTG CAACCAACGG GCGCACTCAT TTCTGCCGAT
GATGTCGAGA TTCACGCGAC GGGCAGCGCG ACCAACTCGG GCGTCATCAA AGGCGGCACG
CAGACGGTCA TCAGCGCGAC GAACATTGTC AATCGCGGCG GCTCGATTGG CAGCAGCACC
GACAACGGCA CGACAGTGCT GTCGGCGACG AACGATGTCG TCAACGCATC GGGCCGCATC
ACGGGCAACC GGGTTGCCGT GCTTGCGGGT CACGATATCG TCAATACGAC GCTCGTCGAT
ACGGTTGGCG TGAGTTCCAC GGTAGGCAAC AGCAAGGTCA CGCAAACGCT GGTCGGGGCG
CAAGGTACGA TTGCCTCGAC GGGCGACATG GTGATCGTGG CGGGTAACGA CCTGAACGTG
CATGGTGCGA GCATCGCGGC GGGCGGCAAT GCGCAGATTG CAGCGGGCCA CGATATCAAC
GTCGACACGG TTGAATCGCA CACGTCGCAG TCGGTCATGA AGAACGCCGA CAACTTCATG
CACGCCGATA CGACGCTGAA CCAGACGAGC GGCATCAGTG CAGGCGGCAG TCTCGCGATG
CAAAGCGGCA ACGACATGAC GTTCAGGGGC GCGTCTGTCA GCGCCGGCGG CGATATGGCC
GTTGTGGCAG GCGGCAACCT GACGGCGACA ACGGTGACGA ACACGGCCAG CCGTGACGAT
GTCACGCGCG GCGACAAGAC CCGTAGCGGC GAAGATCGCA GCTACGACGA GCAGGCGGTG
GGAACGTCGT TCAGCGCGGG CGGCAACGGC ACACTCGCTG CGCTCAGTGC GGATACGTCG
CAAGGTAACG TGACGCTCAC AGGTTCGTCC CTCTCGGCGG GTATGGGCGC GGCGAACATT
GCGGCCACCG GCAAAGTCGA TATCAACGAG GCACGCGAGG AACACGACAG CTATTCGGCC
GTCGAGTTCA AACGCGGCAG CTTCGTACAT GGCTCGACGA CGGAACAGAT GCAGGACACG
CAGGCCAATA TCGGGGTCGG CAGTACGGTG TCGGGCGATA CGGTGAACGT CAGCGCGGGT
AAGGATCTGA CCGTCAAAGG CTCGACTGTC GCGGGAACGA ACGACGTGTC CCTGAATGCG
GCCGGAAACG TGAACATCAC CACGTCGCAG GATATGCAGA ACGCGTCCAG CTACTACCAG
AAGCACGAGT CGGGCCTCGG CACGGGCGGC GGTATTGGCA TATCGGTCGG CAGCAAGACG
CAGACGGATA CGATCCATGA CGCGACAGTG ACGAACAACG GCAGCACGGT CGGCTCGCTC
AACGGCAGCC TGAACATCGT CGCGGGCAAC GATCTGCACG TCACGGGCAG CGACCTGGTT
GCTGCGAAAA ACGTAACGGG CACGGGCGCG AATGTGACGA TTGATTCAGC GCTCGACACG
ATGCATCACG ATGAGACGCA CGAGGTCAAG CAAAGCGGCT TCACGCTCGC GATCAAGGCT
CCTGTGATCG ACGCCGTTTC GAATACGGTT GACCAGGCGC GCGCGGCAAG TCGCAGTCAA
AACGACCGTG CGGCAGCGCT GCACGGTATG GCGGCGGCAA GCGGCGCACT TGATTCCATC
GGGGCGGCGA GCGCGGCGCT CGGCGAACTT GCCAACGGTC AGACACCGTC CGCCAAGATC
GAACTCAGTT ACGGCAGCAG CCATAGCAAG AGCACGTACA CGGCGGAAAG CACGACGAAC
CGGGGTTCGA GTGTGACGGC TGGCGGCACG GCCGCATTCG TCGCGACAGG CAATGGACAG
GCCGGAAGCG GCAACGTCAC GATTGCCGGT TCGAACGTCG ACGCGAACGA TGTGATTCTT
GCTGCGAAAA ACCAGGTCAA TCTCGTCAAC ACGACCGATA CGGATTCGAC GCGCAGCACG
AACCAGTCGA GCAGCGCGAG CGTGGGCGTG TCATACGGCA CGCAGGGATT CGGCGTGGAC
GCGTCAGCTT CGAAGGCGCA CGGCAACGCG AACAGCGACG CGACGATGCA GAACAATACG
CATGTCACGG CGGCGAATAC CGCGACCATC ATTTCGGGCG GCGATACCAA CATCGTCGGC
GCGAATGTGA ACGGCCGTCA GGTGAATGCC GATGTCAGCG GTAACCTGAA CATCGCGAGC
GTGCAGGACA CGATGACGAG CAGCGCGCAT CAGGAGAGCA CTGGTGGCGG ATTTGCGATC
AGCCAGGGCG GCGGCAGTGC GAGTTTCAGC CACACCAACG CAAATGCCAA CGGCAGCTAT
GCGGGCGTGA ACGAACAGGC AGGCATCCAG GCGGGCGACG GCGGATTCAA CGTCAACGTC
AAGGGCAACA CGGACCTGAA AGGCGCGACC ATCGCGAGCG ATGCGGACGC GTCGAAGAAC
AACCTGTCGA CGGGCACGCT GACGTACTCC GATATACAGA ACCAGTCGAG TTACAACGCG
CATTCCAGCG GGTTCAGTGC GGGGGCAACG ACGGGCGACG GCGGTTCGAA CTACGCGACG
CACGGCCCCG CTTCCGGCAA GAACGCAGGC GGCGGTGCGC CAATGTTGAG ACAGAGCGAC
AGCGGGAGCG ATAGTGCGAC TACGCGCAGC GGGATCAGCA CGGGCACGAT CAACGTCACG
AACGGCACGC ACCAGACGCA GGACGTGGCA AGCCTGAACC GCGATACGTT AAACACGAAC
GGTACGGTCG TAAAAATGCC GGACGTGAAC AACATACTGA ACAACCAGGC TGACTTGATG
GCGGCGGCGA GCGCGGCGGG CGAAGCCGTT TCGCGGCGAG TGGGTGACTT TGCGCAGTCG
AAGTATGAAG AGGCAAAAGC CAACGGCGAC CAGGCAGGCA TGGACGCGTG GAAGGAAGGC
GGCACGGCGC GGGCGGAAAT GCAGGCGGCG GGCGCGGCAC TCGTGACGGG GCTGGCAGGC
GGCAACGCGC TCGGCGGCGC GGCCGGTGCG GGCATCGCGT CGATTGCGGC GGGCAAGTTG
AACGAACTGA GCGGCGCGAT TGCCGGTTCG AACCCGACCG GCAATGCGGG CATGAACCAG
GCACTCGGCA ACATCGTGGC GAACGCGATT GCTATTGGCG CAAGCGCAGC GGTCGGCGGC
AATGCGGGCG CGTTCTCCGG TTATAACGCC GACCGCTACA ACCGGCAGTT GCATGCACCG
GAAAAGACAA AAGCGCAACA GATCGCTTCA CAGGCAACAG CGCAAGGGTT GAAGAATCCT
GACGGTTCAC CTATCACGGA TGCACAGATT GAAAACGCGA TGCGAGCGGC GAACAACAGT
CGGTATGGCG AGATTGTCGC AACGGGCGTT GTGGTGCCGT TGAACGCCAA CACGCCCGCA
AGTGCTGTCT CTGACACAAC CGGGATGAAG CTTACGAAGG ACGGCGCCGG AAATAACTAT
CTTGTTCAGG ATCCTTCGAT GCTGTCGACG CCTTCGAAGG CAGTGCAAGA CCTGATTGTT
CAGAACACAG GCGGCGCCAA CTCGCCGTAT AGCTGGAACC CGGCATCCGC GCAAACTGTA
AGTACGCCGA CGATTGACCC ATACGGTCCC TTCTCTCCAA GCTGGAACAC GGGAGATTAC
TCGGCGGGAC TTGGGGCTGG CGGGCGTGGA TTGGCACCGG ACTACATGAC CGTAAACGCC
GGGGTACTTT CGGCCAATGT ATCGGGCGTC GTGAACTTGT ACGATGGCTC GCTGTACGCA
GGAGGTGGAG TAGCCATGAC CAATCCTTCG GCGGTGTCGT ATAACCCTGG CGTCAGCACG
ACGTTTGGCT ATATCTTCGG CGCAAAGACC GCGCAGGATG TAACGAACCT TGTCGCGGGT
GACGGGAACC AAGCTTTTGT GTCGATACCA ACGAATATGG GTGTGAACGT TATAGGAGCA
ATTACACACG CCTATGGCGG CGCTACGGCA ATTGAGATAG GTGTGGGCCA GCCTGGCACG
CTCTCGTACG GCATTGTTCC CTGGAGCCAT ACGACTCAGG TGACGGGACA AAGTAAATAA
 
Protein sequence
MNRTTYRLVF SRLRGMLVAV EETATASGKS AGETRATGRA SRKATRSPLR NLVALMLAVA 
PLLAFSQIVP GGAHAPGLNT TQNGIPQVNI NKPSGAGVSM NTYSQFDVQK NGAILNNSAT
ITNTQLAGYI NGNPNFGPND AAKIIVNQVN SSSASQINGY VEVAGPPAEV VIANGSGISV
NGGGFINTSR SILTTGTPNF AADGSLAGFS VTGGNITVQG AGFNASNIDQ VDLLARAVQV
NAAIYAGKNL NVITGANSID HNTLAATPIT GNGPAPAVSI DVSNLGGMYS GKILLASNEY
GVGVSNRGVI AAQSGDLTLT SQGKLVLAGQ TNASSNISAS ARDGIDNSGT TYAQQNVSAN
TSGALTNSGV LAAQQNTTVN AGSVASTGTL GAGVNGDGTM ANAGDLSVSA TGVVTAAGRN
AAGGNASVNG SAVNLAGSTT SAKGTMTLAA NGGDLNLSGA TTTAGTTLNA RATGTLTNDS
GALSSGGAQA ITAGALSNNS GQMVSGATQT ATVTGAVSNQ SGTMQANGAL SLQAGSADNT
GGHIASLNAD GLNLTVSGLL KNGAGGSIGG NGNVTIRAGQ VANAGSITAV QNLIASAVQT
LFSSGTLAAN GNATLSAGTS LTNSGSITAA SAASITASTF DNSSGTTQTA QFTLRATNLV
NRNGSIAQTG TAATTVDVTG TLDNTGGTLQ TNADSLTLGP ATLTNDHGAI ASAGTGTLSV
ATGALSNNGG TIATNGALDV RAGAVSNRGG TLAAQTGATL LVASLDNSAG YIGAQTVSVT
DAGALNNAGG TIQADDTLAM SAQSVANDGG SIANGGTGAT SVTATGTVTN TQNGLIGGNG
GVSVAGGSVD NSGGTITAGG AVSVTSGSTF GNRAGMIQGT GNTTVSAQGA MDNTGGQIEA
DGTGSTLTVT GATLDNTNGR IANTGSGTTT ISASAITNSN TGGVAGAGTI GGNGDVTLNA
QTLSSTNGAQ LVAGHDLTLT IAQLADNTSA TLSGANNVTL NGPNAVVINA GGSIHGNGVV
TLNTSTLDNT NGRIGNDTGS GGSIGITTGA FANQNGAIGS DQNLSVTTGS LTGDGRIVAG
NDGTVTVNGD YTLTGANQIQ ANNNLTFTTA GNFTNQGTLG AVNALTLNAA NVDNQAGADI
NSANTTVNAA NAITNEGRIE GDSVTTRSAS LSNVATIVGN TVTLNAGSIA NTGAAAAIAA
ASAVNLFGSD ISNMGGANIF SLGDVNIAAD ATRDGNGLLA NRANSVTNDQ STIDAQGNIE
IATQTLTNTR PAPTVETVTT DVETIHQTKR DKYMACTPTN GDKGYCTQDM WDKGYENPLN
ATFSNADVVS TASGPNAVDR VLVVNLNGQQ QTIYYNSLTT NGDGTVTVAY WDDYDPHVNY
DPGTEYPGDN QAHHHYQRIE SARDTTTTTQ QDQVTGPQAQ EAQIMAGGNM VLANVGTLNN
SFSAIGAGGA IPIGSSQTDG GVASGNYGGT TVNNTGQTLY QYQKQDIVST YAWNEDISRD
VGQVVQPTVI LTPVAIGGLG GTIIANNAVQ INATDINNTN VTAANSATGA TGGTLGANGT
VGGITGGGAQ TVNLATGQTQ TINAPQSVTG PMGALNIASP KSGLYTFNTA PGAAYLIATD
PRLTSYTSFI SSDYMLRQLG YDPSTVEKRM GDGLYETTLI RNQITQLTGR VYLQGYTNNE
DEYRALMDNG VNVAKEFSLE PGMALTAAQM DALTSDIVWM VNQTVTLPDG STQNVLAPVV
YLAHTHANDL QPTGALISAD DVEIHATGSA TNSGVIKGGT QTVISATNIV NRGGSIGSST
DNGTTVLSAT NDVVNASGRI TGNRVAVLAG HDIVNTTLVD TVGVSSTVGN SKVTQTLVGA
QGTIASTGDM VIVAGNDLNV HGASIAAGGN AQIAAGHDIN VDTVESHTSQ SVMKNADNFM
HADTTLNQTS GISAGGSLAM QSGNDMTFRG ASVSAGGDMA VVAGGNLTAT TVTNTASRDD
VTRGDKTRSG EDRSYDEQAV GTSFSAGGNG TLAALSADTS QGNVTLTGSS LSAGMGAANI
AATGKVDINE AREEHDSYSA VEFKRGSFVH GSTTEQMQDT QANIGVGSTV SGDTVNVSAG
KDLTVKGSTV AGTNDVSLNA AGNVNITTSQ DMQNASSYYQ KHESGLGTGG GIGISVGSKT
QTDTIHDATV TNNGSTVGSL NGSLNIVAGN DLHVTGSDLV AAKNVTGTGA NVTIDSALDT
MHHDETHEVK QSGFTLAIKA PVIDAVSNTV DQARAASRSQ NDRAAALHGM AAASGALDSI
GAASAALGEL ANGQTPSAKI ELSYGSSHSK STYTAESTTN RGSSVTAGGT AAFVATGNGQ
AGSGNVTIAG SNVDANDVIL AAKNQVNLVN TTDTDSTRST NQSSSASVGV SYGTQGFGVD
ASASKAHGNA NSDATMQNNT HVTAANTATI ISGGDTNIVG ANVNGRQVNA DVSGNLNIAS
VQDTMTSSAH QESTGGGFAI SQGGGSASFS HTNANANGSY AGVNEQAGIQ AGDGGFNVNV
KGNTDLKGAT IASDADASKN NLSTGTLTYS DIQNQSSYNA HSSGFSAGAT TGDGGSNYAT
HGPASGKNAG GGAPMLRQSD SGSDSATTRS GISTGTINVT NGTHQTQDVA SLNRDTLNTN
GTVVKMPDVN NILNNQADLM AAASAAGEAV SRRVGDFAQS KYEEAKANGD QAGMDAWKEG
GTARAEMQAA GAALVTGLAG GNALGGAAGA GIASIAAGKL NELSGAIAGS NPTGNAGMNQ
ALGNIVANAI AIGASAAVGG NAGAFSGYNA DRYNRQLHAP EKTKAQQIAS QATAQGLKNP
DGSPITDAQI ENAMRAANNS RYGEIVATGV VVPLNANTPA SAVSDTTGMK LTKDGAGNNY
LVQDPSMLST PSKAVQDLIV QNTGGANSPY SWNPASAQTV STPTIDPYGP FSPSWNTGDY
SAGLGAGGRG LAPDYMTVNA GVLSANVSGV VNLYDGSLYA GGGVAMTNPS AVSYNPGVST
TFGYIFGAKT AQDVTNLVAG DGNQAFVSIP TNMGVNVIGA ITHAYGGATA IEIGVGQPGT
LSYGIVPWSH TTQVTGQSK
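
The sequences above are printed in groups of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: it strips the grouping whitespace, computes a GC fraction, and translates codon by codon. The helper names are hypothetical. The codon table is the standard one in NCBI TCAG order; translation table 11 assigns the same amino acids and differs only in which codons may act as start codons, which this sketch does not model (the gene here starts with ATG).

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; tables 1 and 11 share these
# amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
first_groups = clean("ATGAACAGGA CGACTTACCG GCTTGTGTTC")
print(translate(first_groups))  # MNRTTYRLVF
```

The printed peptide matches the first ten residues of the protein sequence listing. Applying `clean` and `gc_fraction` to the full gene sequence is one way to compare against the GC content reported in the Gene Information table.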