Gene Bphy_7346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphy_7346 
Symbol 
ID6248945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phymatum STM815 
KingdomBacteria 
Replicon accessionNC_010627 
Strand
Start bp116361 
End bp124361 
Gene Length8001 bp 
Protein Length2666 aa 
Translation table11 
GC content65% 
IMG OID642598975 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001863381 
Protein GI186474410 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.550949 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGCT TTCCGTTCGC CGTATTGAGC GCGCCGTCCC GCCCATTGTG GATGCGTGCG 
ACAGCCTGCG TGATGACGGG GGTGATGTAT TTCGCGCCGG CGGTGTTTCT TGCCGATGCA
ACTGCGCATG CTGCGCCAAT CGTCGATCCG CGTGCGCCGG TCCAGTTCCA GCCGACCATC
ACACAGACCA GCGCGGGCGT AGCGGCGGTG AACATTACGG CGCCGAATGC GAACGGCGTC
AGCCTGAACC AGTACCAGTC GTTTAATGTC GACGCTTCCG GGCTGGTGCT GAACAACAGC
CTCATCGCCG GGACACCTCT GCTCGGCGGC ACGCTCGGCG CCAACCCGAA CTTCGTGGGA
CGCGCGGCAA CGACCATCAT CAACCAGGTG ACTTCCACGG GTCCGGCATC GAGCCTGATG
GGGCCGCTGG AAGTGTTCGG AAGTGCGGCC ACGATTGTGG TCAGTTCACC GAACGGTGTC
AGCGTCGGCG GGCTGTCGCT GACAAATGCG CCGGGCTTAG TGCTCACCAC CGGCACGCCG
CAGTTCCTCA CGGGGGGGAG CGGAACGTCC ACCGACTTCG CGCATGCGGG CGCGGTGGCT
TATAGCGTCA ATTCCGGGAG CATCTCGATC AACGGTCCGG CAGGCGTCAA CGGTCCTGGC
GCCGGCATCG AAGGCACGGT CGGCAACATC GATTTGATAG GCCAGACGGT CAATGTCAAC
GCGCCACTGC GCGCGGACCA GCGCGTGAAC ATCATCGCGG GCAACCAGAC CGTTACGCCG
GTGGCATTGG GCTCTGGGGG CACCACCTAC AGCACCGCGT CCAATGGTAC CGCCAATACG
GCTGCGGCGA TAGGCAACAA CGGTGTCGCG ATTGACGCGA ACCGGTACGG CTCCGTCACC
TCGGGGCAGG TGTACATCGT CTCGACGGCA GCGGGTATGG GTGTCAATAC CCAGGGTGCG
CTGTCGGCGA CCGCAGGTAA CGTCAGCGTC ACCTCAAATG GGGACATCGC GGTCGGCAGC
ACGTTCGCGT ACCAGAACGT GAATCTCGCC AGTGTGGGCA GCACGAGCAT TGGCGGCACG
GGTCTGGCCA ACCAGAACTA CACGGTAACC GCGAACGGCG ACATCAACGC GACGGGTACC
GTGTCGGCAG GCCAGAACGT GTCGATGAGT GCCGGTGGCA ATCTTGCCGC TGCATCCGTT
GCCGCGAACG GCAATGCGAC GCTCAACGCG GGCAACTCGA TGACCGTGGG TTCGGTCTCG
GGCCAGAATC TCGCGCTGCA GACAGGGACG GGCGACCTGA CCGTCAATTC ATCGATGACG
GCGCCGGGCA CCATTGCCGC GAGTGCGGGC CGTGACCTGA CGGTGAATGG CTCGGTGCAG
GGTGGCAGCA CCGTGGGCCT GACGGCTGCG CGCAACGCGG TGATCAACGG CGCAGTCTCG
GGTGTCGGCG ACACGGCTGT CACGGCCCGG ACGGGCACGG CCAGCGTAGC GGGCAACGTG
CAGACCGATG GCGCGCTCAC GGTCTCGTCA GCGCAGGGCA CGACGCTGGG CGGCACCGCG
CAGGCCGTGG GGCCTGTGTC GATCAGCGCG CAGTCTGGCT CCGTCACGGG CAACGGGAAT
GTCGCATCGT CGCAGGGCGC GGTCTCGCTG GTGGCAGGGC AGAACATCGG CCTGACCGGC
TCGGTGCAGT CGGGCAGTTC CGTTACCGCG CAGGCGGACG GCAACGCATC GCTCGGCGGA
ACGGTCACCG CACCCGGCGC GATTTCGGTC AGCGCCGGCG GTGACGCCGC ACTCGGGGGC
AATGCCACGA GCGGCAGCAC GCTCACGGTG ACCGGTGGCG GCAACACGAC GATCAACGGC
GCGGCGGCTT CGGTCGGCGA CATGACCCTT ACCTCCACCG GGGGCACGCT CTCCACGACC
GGCAGCGTCA CCACGCTGGG CAATCTCACG GCGAGCGGCC AGCAGGGCGC GAGTCTCGGC
GGCAACGTCT ACAGCCAGGG CAATGCGCAG ATCGCCTCGG GCGCCGGCAG CGTGACCGTC
GGCGGCACGC TGACGACGCC GGGAATGGTC ACGGTCAATG CCGGACAGGA CGCCACCGTA
TCGGGCAGCG TGCACAGCGG CCAGAGCACG GCCATCACGG CGACACGCGA CGCAACGCTC
AACGGCGGCC TCGAGGCCGA CGGCACAGGC AACGCGACGG TGACAGCCGG ACGCGACATC
ACTGGCAGCG GCACGCTTAA CGTCGCAAAC GACACGACGC TGTCAGCCGG ACGCAACCTT
GGCATCACGG GCGCGATCCA GACGGGCAAC AACCTCAACG CGACGGCGGG CACGAACCTC
TCCGTAGGCG CGACCACAGC GGTCGGGACC GAAACGCTGA CGGCCACAAA TGGCAGCGCG
ACGCTCGGTG GCGATGCCCT GTCCGGCGGG CCGATGAGCG TGACGGCCGG CTCGGGCATT
TCCGCGCAGG GCAGCGTCCA GAGTCTGGGT GACCTGAGCC TGAACGCGAA AGGTGGCGAT
CTCACCGCGA ACAGTGCCGT TTCGAGCGCC GGTAACGCGA CGCTCACCGC GGCCGGCAAT
CTCACGCTCA ACGGCCAGAC CACCGTCTCG AAAGACGCCA CGCTCACCGC CAACAACATC
ACCACACAGG GCATGGCCGT CGGCGGCAAT CTGGTTGCGA CTGCAACGAA CAGCATCGAC
ACGTCAGCCG GTCAGCTCAA TGCCGCGTAC AACTCGGGCG CGCCGGCACT TACCGTCGCG
GGCAATGCGA CGCTCTCGGG CGCCAACGTG ACCACGGCCA ACGCGGTGAT CGGCGGCACC
TACAGCGCAA CGGGTACCAC GAGTTTGACG ACGGGCGGCA CTGCCGCGTA CCAGGGCGAT
GCGACACTGG CCGGCGGCAC GGTGACGAAT GTCGGCACGC AGATGGCCAG AGGCAATCTC
ACCGTCTCGG GTCCGGATAT CACGAACCAG GGTGCGCTGT CGTCGCTTGC GACGATGGCG
GTCAATGCCT CGAATCTGAA TAACGCCGGC ACGGTCTACG GGCCGGTGAA CAACCTCAGT
GTGTCGGGCT CGACGACCAA CACCGGCGGG CTGCTCGCAA CCAGCGCGCT CAATCTCGTC
ACCGGCGCAC TCGACAACAG CAACGGGCTG ATTTTCTCGG GCGACGTGAA CAACCCGAGC
GCTGCAACCG GCAACACGTC GGTCACGGTA ACGGGCGGTA ACGGCAGCTT CAACAACTCG
GCCGGCCAGA TCCTCGCGCA GAACATCGCG ACGCTCGCGC TGCCTAACCA GGCGATCGAT
CCGTCGGCCA GCGCATTCGG GACGGTCAAC GGCGGCTACG GGCTGAACCT GTCCGCGCAG
TCGGTGAACA ACACCGGTAC GTGGACGCTG CCGGGCACCA CGGTGACCGT TTCCGCTACC
CAGGGTATCA GCAACAGCGG CACGATCAAC CAGGGCAGCG GCTCGCTCGC GCTGAACGGC
GCCGTTTCCA ACGCCGGGAC GGTCAACGCG CACGACCTGA CGGTCAATGG CTCGCTCGCG
AACCAGACCG GGAGCACGGT GCAGGCAAGC GATGCGTTCA CGCTCAATGG TTCGGGCACG
AACGCGGGCA CCATCGAAGC GCTCAACGCG CTGAGCATTG CCGGTTCGAG CTATGACAAC
TCGAACGGCA TCACGAAGGC AGGCAACGGC ACCAGCGGCG CGGGCAACAT GACGGTTAGC
CTGAGCGGCG ATCTGGGTAA TGCCGGCGGC ACGCTGTCCG CGACGAACGA CCTGTCGATC
ACCGCGAACA ACGTCAATAA CTCGACGGCG TCGGGCTCGA CCACGACCAC CACGACCACG
GTCATCAACA ATCCTGCGCT GGTGATGGCG CTCAACGTCG GCACCGATAC CCTCGACGCC
GCCTTCCTTT ATGGGGCGGA GAATGGCTTC TGCTGCACCA TTACCCAGAC GCCACAGACG
GTGACGCTGG CCGATGCGCT GTCGCCGGAT GGCACGGCGA CTGACATCAC AGACGGCCTG
ACTTTTGTAC TGCCCGGCAG CCCGCCCGTC ACGACCTCGG GGACGGTCAC GTTCGTCGAA
ATCCCGACCG TCGTCGGGAC AAACAATGCC GGCAACAACA TCGTCCAGAA CCTGTGGTTC
GTGCAGACGC CGCAGAATGC GGGCATGGGC ATCGCAACGA AGACGGTTGC GTTGCCAACG
GCCACCGAGA CGACAACCAC GACGGGCTCA CAGTCCGGCG CCAGCTCGGT CATTGCCGCA
GGGCACAACC TCAGTGTGAC GGCAAACGCC CTCAACAACC AGGGCGGCAC CGTCAGCGCC
GGCAATGACG CATCACTGAA CCTGCAGTCG CTCTCCAATG GCGGGTCAAC CTACAGCTCG
ACGGTCACCG ACACCGTTGA CCAGGCCTCG ATCAACAGCT TCCTGTCGCA GGCGCCGTCG
ACTATCTCGG TCTGGAACAA CTTCTACGGA GCGACCGCAG TCGGTCCCTA CACGCAGGGC
TATTACATTG ATCCGGCGGG TATCAGTCTG AGTGCGCCGG GCACCGTGAC GCCGCTTGCG
ACGTCGTCGT CGGTCACTGT GCAGGGTTCG ACGGGCCAGA TCGTCGCCGG GCACAACCTG
AACCTGTCGG GCGGCAACCT GACCAATGCG GGCACGCTCG CGGCCGCCAA CAACGTCAAT
ATCACCGCGT CGGGCTTCAC GAACCAGGGC ACCAATACCG GGACGATGAC GACCACGGCC
GGCTGCGCGT CGGGCTATTC CGGTTGCACG AGCGGCTCCA CGACGAACCC GAATTCGCAG
ACGTACAGCT ACCAGCAGAA CAACGCAACG GTGACCGCAG GCAATGACAT CGTGATTGCC
GCGAACACGG TCAGCAACAC ATACGGCAAT CTCGCCGCCC GGCGCAATGT GGTGATCGGC
GGCGCAGGCA CGAGTGCGAG CGATGCATCG ACGACGCCCT CGTCGCTCAC CCAGGCGGCC
AGCGTCACAA ACACGTCGGG CACGATTGCC GCCGGCAATG ATGTCGACAT CAACGCCGCG
ACGCTGACGA ACACCATTGC CGCGCCGGTG CAGGTGCACC AGAACTACGG CAGCGGCACG
CCCTTCACGG GCTGCACGAG CAACTGCGAG GCGTACGTCG ACGTGCAGTC GGCGAGCCCC
TCGACGATCA CGGCGAACCA CAACGTCAAT CTCGCCGCAG GCAGTTTCAG CAATACCGGG
AGCCTCGTCA CGGCGCTGAA CAATGTCACG ATCAACGCCT CGTCGTCGGC CACGAGCAGT
AACCAGTACC TCTCCGCGTA CTGGAGCAGC GGCTTCACGC ATTACGGCAC GCAGTACGCG
ACATGGGGCT GCGCGAACAA TCCGGCGCTG TGCCAGCAGC TTTACGGCAG CGCGTACAGC
AGCAGCGCTG CGCAGGACCC GGCGGGCCTG CCTTCGTCGG TTGGCCTGCC TGACTTCGTG
CCGGCGACGA TCCAGGCGGG CAATACGCTG TCGGTCAATT CGCCCACGCT GACTAACACC
GGCAACGTGA TCGGCCAGAA CGTGGCCTTG ACCGGCTCAC AGCTCATCAA CGGACTGACG
AACCCGAACG TCTATACGCC GCCGCCCGCC GTCTCCGGGC AGGTGATTAC GCTGGGGCCG
CCATCGGTGC CGGCGAACGC AACGACGACC GTCAACAGTG CGGGGCGGGT CACGACGCTC
AGCGGCCAGC CCACGTCGGT AACGGGCGCC GCGGGCCTGC CGTCGAATAC GCCGATCGGC
GTGACCACGG TTGGCAAGCC GGTGGCGCCG ACCGTCGCGA AGTCGACGGC CCCAGCGGGC
TCGAGCGTGC AGACCCTCAA CGGACAGACC GTATCGGTCA GCTACCTGAC GAACAGTCCG
GCCGCGCAGG TCATGGGCGA TCTTTCGCCT GCGGCGCTGC TCGCGGCGCT TCCGTCGAAT
CTGCAGCCGG GCAACGTCCC CTTCTATTAC GACCCGTACA CCGAAGACCA GCAGATCGAG
CAGGCTGCGC TGCAGGCCAC CGGCAAGGCC AGCTTCTACA GCACCTCATA TGGCACACCC
GGCGCCACCG ACAGCACCAG CCAGGCCTCG ATTGCCAGCC AGGACAAGGC GGCGCTCTAC
GGTGTCGCGC TCCAGTACGC GAAGGAGCAT AACGTCGCGC TTGGCACGCA GCTCAGTCAG
GCGCAGCTCG CGCAGATCGA TGCGCCGATG CTCTGGTACG TCGAGGAGAC GGTGCCTGAG
CCGGGGTGCA CGGCGACCGG CAACGGCACA TGTCCGACCG TCCAGGCGCT GATGCCCGAG
GTGCTGTTGC CGCAGAACTA TGCGGTGGTC AACGCGGATG GCGAGATTAC CGGCGCCAAT
GTCGCGCTCA ACTACGCGAA CAGCATTCTC AACACCGGTT CGATCTCGGC GCAGAACCTC
ACGGTCAACA CGGCGAGCCT GACCAACGAG CAGCGCTCGA CCAACATCGG CACGATCTAT
CAGGAAGTCG ATGGAGGTGT GGCGAAGACG ACCGGGACGG TGGTCCAGCA GGGCGGCTTC
ATGTCGGCGA TGAATTACGA CATGAACGTG CAGACGCTGA ACCAGATCGG CGGCGCCCTG
CAGCAGGTCA ATGCCGATGG TTCGGTGAAC CAGGCGCAGA CGGCGCAACT TCTGTCGAAC
CTGAAAAGCC AGCTCGGCAC GAGCTTCACG CAGTCCACCG TCAGCAATCA CCTTGACACC
ACACTGATCG CGGACGGCGG CATGGGGCCG ATCCTGGTGA TCCAGGAGGT CATCAGCCTG
GCGATATCGA TTGTGACCGC CAACCCTGTA CTGGGGGCGG CCCTGAGTGC CGCGATGAGC
CAGGCCGACA GCGGGCAGTT CAGCCTGTCG GACATCGCGG AGGCGGTGGC GGTCGCTTAC
CTGACGCAGG GCGTCGATGA GGGGCTGGGG CTGGATAACT TCGGCGGTGT CGGCTCGAAC
CTGATCTCGT CGGACACCGC ACAGACGCTG CAAAACATTG GTCAGGCCGC CGTCGCGGTC
GGCGAGCAGA GCGTGGTCAA CGCGGGCATT TCCACCGCCA TCGAAGGGGG CAGCTTCCTG
ATGGCGCTGC GTGACAACGC GGTTGCGGAT GTGGCGGCGA TCGGCTCGGG CGCGATCGGC
GCGAATACGC CGTCTGGCTC GTTCGAAAAT GTGGCGCTGC ACGCAGTGCT GGGATGTGCG
GCGGGCGCGG CGTCGGGCCA GGGATGCGGC GGTGGGGCCG TGGGGGCGGG CGTCAGTGCG
GCGGCGGCGC CGTACATGGT GCAGCAGGCG GGCGGGCTGG ACAGCCTCAC GCCGGACCAG
CGGGCAGCGA TCGTGACCAC GGCGACCCTG CTGGGCGGTC TTGCGGCGGG TGCGCTGGGC
CAGAATGCGC TGGCCGGAGC GGATGCAGCG ACCAACGAGT CGCTGTACAA CTCGACCAAT
CCGCTTGATG AAGCGAACAT GAAGAACCTG ATTCCGCTGG AGGGAGGCGT CGGCGGCGCA
GGCGGGTTCG CGGAGGGCGG TGGCGGCACG ATCAGCATGG GCGAGGCGGC CGAGGCGCTG
GCGCAGGGCG TGGCCGATGC AGCGGTCGGT GCCGAAGCGA AAATGAGCAG CGCCCTCAGC
GATGCCGCAA ACTGGATTGA AACGGAGCTC AGCTTCGCGA CTACGCGGGC CACAAACACA
GGGTCATTGA CAACCGCAGA GCAAATGGGG ATCTTGCACG ACGCAGCAAC TGGTGGTTTT
CTTGGAAAAG GCAACTTCAG TCTCGGTCAA GCGACTGCTA CGGAGGCAAA CGAGCTCGGA
GCTGCATGGG TTGGTCCAGG ATACCGTATA TCGAGCGACG GTTCGTCTTG GGTTAGCAGC
GATGGTCTTC GGATCTATCG CCCGCCGTCA GCGAAGCCGA ACAGTTCGTA TGCCACAACC
GGTGTGCAGG CCAATTTCGA ATCGAAGCTC ACGCCTGGTA GTAGGCCCAT TAGCAACGGA
CACTTGAATA TCACGCCATG A
 
Protein sequence
MQRFPFAVLS APSRPLWMRA TACVMTGVMY FAPAVFLADA TAHAAPIVDP RAPVQFQPTI 
TQTSAGVAAV NITAPNANGV SLNQYQSFNV DASGLVLNNS LIAGTPLLGG TLGANPNFVG
RAATTIINQV TSTGPASSLM GPLEVFGSAA TIVVSSPNGV SVGGLSLTNA PGLVLTTGTP
QFLTGGSGTS TDFAHAGAVA YSVNSGSISI NGPAGVNGPG AGIEGTVGNI DLIGQTVNVN
APLRADQRVN IIAGNQTVTP VALGSGGTTY STASNGTANT AAAIGNNGVA IDANRYGSVT
SGQVYIVSTA AGMGVNTQGA LSATAGNVSV TSNGDIAVGS TFAYQNVNLA SVGSTSIGGT
GLANQNYTVT ANGDINATGT VSAGQNVSMS AGGNLAAASV AANGNATLNA GNSMTVGSVS
GQNLALQTGT GDLTVNSSMT APGTIAASAG RDLTVNGSVQ GGSTVGLTAA RNAVINGAVS
GVGDTAVTAR TGTASVAGNV QTDGALTVSS AQGTTLGGTA QAVGPVSISA QSGSVTGNGN
VASSQGAVSL VAGQNIGLTG SVQSGSSVTA QADGNASLGG TVTAPGAISV SAGGDAALGG
NATSGSTLTV TGGGNTTING AAASVGDMTL TSTGGTLSTT GSVTTLGNLT ASGQQGASLG
GNVYSQGNAQ IASGAGSVTV GGTLTTPGMV TVNAGQDATV SGSVHSGQST AITATRDATL
NGGLEADGTG NATVTAGRDI TGSGTLNVAN DTTLSAGRNL GITGAIQTGN NLNATAGTNL
SVGATTAVGT ETLTATNGSA TLGGDALSGG PMSVTAGSGI SAQGSVQSLG DLSLNAKGGD
LTANSAVSSA GNATLTAAGN LTLNGQTTVS KDATLTANNI TTQGMAVGGN LVATATNSID
TSAGQLNAAY NSGAPALTVA GNATLSGANV TTANAVIGGT YSATGTTSLT TGGTAAYQGD
ATLAGGTVTN VGTQMARGNL TVSGPDITNQ GALSSLATMA VNASNLNNAG TVYGPVNNLS
VSGSTTNTGG LLATSALNLV TGALDNSNGL IFSGDVNNPS AATGNTSVTV TGGNGSFNNS
AGQILAQNIA TLALPNQAID PSASAFGTVN GGYGLNLSAQ SVNNTGTWTL PGTTVTVSAT
QGISNSGTIN QGSGSLALNG AVSNAGTVNA HDLTVNGSLA NQTGSTVQAS DAFTLNGSGT
NAGTIEALNA LSIAGSSYDN SNGITKAGNG TSGAGNMTVS LSGDLGNAGG TLSATNDLSI
TANNVNNSTA SGSTTTTTTT VINNPALVMA LNVGTDTLDA AFLYGAENGF CCTITQTPQT
VTLADALSPD GTATDITDGL TFVLPGSPPV TTSGTVTFVE IPTVVGTNNA GNNIVQNLWF
VQTPQNAGMG IATKTVALPT ATETTTTTGS QSGASSVIAA GHNLSVTANA LNNQGGTVSA
GNDASLNLQS LSNGGSTYSS TVTDTVDQAS INSFLSQAPS TISVWNNFYG ATAVGPYTQG
YYIDPAGISL SAPGTVTPLA TSSSVTVQGS TGQIVAGHNL NLSGGNLTNA GTLAAANNVN
ITASGFTNQG TNTGTMTTTA GCASGYSGCT SGSTTNPNSQ TYSYQQNNAT VTAGNDIVIA
ANTVSNTYGN LAARRNVVIG GAGTSASDAS TTPSSLTQAA SVTNTSGTIA AGNDVDINAA
TLTNTIAAPV QVHQNYGSGT PFTGCTSNCE AYVDVQSASP STITANHNVN LAAGSFSNTG
SLVTALNNVT INASSSATSS NQYLSAYWSS GFTHYGTQYA TWGCANNPAL CQQLYGSAYS
SSAAQDPAGL PSSVGLPDFV PATIQAGNTL SVNSPTLTNT GNVIGQNVAL TGSQLINGLT
NPNVYTPPPA VSGQVITLGP PSVPANATTT VNSAGRVTTL SGQPTSVTGA AGLPSNTPIG
VTTVGKPVAP TVAKSTAPAG SSVQTLNGQT VSVSYLTNSP AAQVMGDLSP AALLAALPSN
LQPGNVPFYY DPYTEDQQIE QAALQATGKA SFYSTSYGTP GATDSTSQAS IASQDKAALY
GVALQYAKEH NVALGTQLSQ AQLAQIDAPM LWYVEETVPE PGCTATGNGT CPTVQALMPE
VLLPQNYAVV NADGEITGAN VALNYANSIL NTGSISAQNL TVNTASLTNE QRSTNIGTIY
QEVDGGVAKT TGTVVQQGGF MSAMNYDMNV QTLNQIGGAL QQVNADGSVN QAQTAQLLSN
LKSQLGTSFT QSTVSNHLDT TLIADGGMGP ILVIQEVISL AISIVTANPV LGAALSAAMS
QADSGQFSLS DIAEAVAVAY LTQGVDEGLG LDNFGGVGSN LISSDTAQTL QNIGQAAVAV
GEQSVVNAGI STAIEGGSFL MALRDNAVAD VAAIGSGAIG ANTPSGSFEN VALHAVLGCA
AGAASGQGCG GGAVGAGVSA AAAPYMVQQA GGLDSLTPDQ RAAIVTTATL LGGLAAGALG
QNALAGADAA TNESLYNSTN PLDEANMKNL IPLEGGVGGA GGFAEGGGGT ISMGEAAEAL
AQGVADAAVG AEAKMSSALS DAANWIETEL SFATTRATNT GSLTTAEQMG ILHDAATGGF
LGKGNFSLGQ ATATEANELG AAWVGPGYRI SSDGSSWVSS DGLRIYRPPS AKPNSSYATT
GVQANFESKL TPGSRPISNG HLNITP