Gene Bphyt_6868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_6868 
Symbol 
ID6280051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp3221291 
End bp3229789 
Gene Length8499 bp 
Protein Length2832 aa 
Translation table11 
GC content61% 
IMG OID642617895 
ProductHaemagluttinin domain protein 
Protein accessionYP_001890531 
Protein GI187921499 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.587057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGT CGTATATCAG CGTCTGGAAT GACGCAACTG AAACCTGGGT GGCTGCGCCT 
GAAACGGCCA CTGCGCACTC CGGCGGGGGC GCGAAGTCGT CCAAGCCGAT CGTGGCCAGC
AGCGAAGCGG CAAAGCGGCC GTCATCCGCA ATGAAGTTCG CGCTCATGCC GATCACCATG
GCTGTTGGTG CTATCGGTAT GGCTGCACTG CCGGGTACGG CGATGGCGGG GCTTTACGTA
TGCTCAAATG ACGGACTTGG TGAAGGGACT TACTACGGGA CGGATGGTCT CTCCACTGGC
GGCGGGTGCC TCGCTTCGGG GGGTACGACA TTCGCTCTCC TTGCTGGCCC GATCGCAGGC
AGTGGTAATG GTGGGGTCGG TATTTGGGCA AACGTTGATC AGACATTGTC TGTTTACGCT
CCGTCCGGTA TCTACCTCAA CAACGACGTC AACATGGTGG GTAATAAGAT TACCCATATG
CATGCCGGTA CTGCCGATGG CGACGCGGTC AACGTCTCGC AGCTAACTGG CGTAACGCAG
GGCATGGGCG GCGGTGCAGG CATTTCCGGC GGCGGCTCGT TTATTGCGCC GACGTACACA
TTCGACGACG GCACCCTGTC ATACAACAAC TCGGGCGATG CGTTCAATCT CGTTTCGAGC
CGCCTCAAGT ACTTCCACGT CAACTCCACG GCGCCGATGT CGACCGCTGG GGGTAAGCAG
GCAATCAGCA TCGGGCCAGT GGCCTCGGCA TCGGGCGACA ACTCGGTGGC CTTGGGTAAT
ACCGCAGTTG CGACGGGTTC CGGCAGTATT GCTGTCGGTA GCAACGCAAC GGCGCTTGTC
AGCAACTCGG TTGCGCTGGG CGCGAACTCC GTGGCGAGTC GTAACAACAG CGTCTCGGTC
GGCTATCTGT CGGCAGACGG CACGTCGCAG TACACAAGAG TCGTCACCAA CCTGACGGCC
GGCTCGGCCG GCACCGATGC GGTCAACGTC AATCAGTTGA ATGCGGCGAT CGCTAGCGTC
AACGGTGGAG GCGGCGGCGG CAATGTTACC AATGCGGTAA CTTACAACGA CTCCGGAATG
TCGTTGATCA CGCTGAAGGG TTCAGCGGGA ACGAAGATCA ACAATCTGAC GGCGGGCGAT
ATCACGTCGG CCTACAGCAT GGACGCGGTG AACGGTTCGC AACTCTATGC CACGAACCAG
AACGTCGCGA ACGTGGCGGG TAATGTCGTC AACATTGCCG GCAATCTGGC CAACGTTACT
AACACGGTCA ACAACATCAC CAACGGTGGC GGCGTCAAGT ACTTCCACAC CAATTCGTCG
CTGGCCGATT CGTCGGCAAC GGGTGCGGAT TCTGTGGCGA TTGGCGGACT GGCTAGCGCG
TCCGCAAACA ACTCCGTTGC GCTGGGTTCG AAGTCGGTCG CAGATCGTGC GAACGCGGTG
TCAGTAGGTG CATCCGGTGC GGAGCGTCAG ATCACCAACG TGGCAGCAGG TACCGCGAAT
ACCGACGCGG TGAATCTGGG GCAGATGAAC ACGGCCATCG CCTCGATTAG CGGCGGTGGC
TCGCTTGACG CGGTGATCTA CGACTCGTCG GCACATAACA AACTGACGCT GGGTGGGTCA
ACGTCGACGG TGCCGGTTGG TCTGACGAAT CTGGCCGCAG GTCAAGTCAC GTCGAGCAGC
AAAGACGCGG TTAACGGCTC GCAGTTGTTC AACACGGCAA GCGGCGTATC GAGCGCGTTG
GGCGGCGGTT CGAGCGTGAG CGCGAGTGGC GCTGTTACGA ACCCGACCTA CACACTGCAA
GGCGGCACAT ACAACGACGT TGGCTCGGCC CTCAGCGGTC TGGATACGGA AATCTCCGGC
CTCACAACCG ATGTCGCTAA CGCGTCGAAA TACTTCAAGG TCGTTTCAGC TTCTTCCGCT
GCAATCGCAA CAGGCGCCGA GACAATTGCG CTCGGCGGCG GCGCGTACGC AAGTGGGTCC
AATTCGGTGG CAATTGGTTC GGGCTCCCGC TCGATGTTCG CCAACTCGGT GGCGCTCGGA
GCGAACTCGC GCGTGAACGC CGCGAATACC GTTTCCATCG GCGACATCGG TACGGAACGC
CGCATCACGA ACGTGGCTAA CGGCGTGTCG GGCACCGACG CCGCGACGAT TGCTCAGTTG
TCTGCACTGC AGACTTCGGT CAACCAGCAG ATCGCAGCGT TGCCGAGTGG TTCGAAATCG
ATGCTCATGG GCGCTTCGAT GCTGGGCGCG ACGCCGGTTA CCGCCTATAT CTCGGTGAGT
TCGAACGTTA CGCCGGGTAC TAACACGAGT ACCGACAATT CGCTGGATGC AATGGCGATT
GGTCCGACGG CCATGGCATT GGGCCAAGGT TCGCTGGCCG TCGGTGCCGG CGCGGGTACG
GCTCTGGCAG GTTCGACAGC AGTGGGTAAC GGCGCAGCGG CTTTGGCGCT GAATACGACT
GTGATCGGTG CCGGCGCGAA TACCAGTAAC AATGCTACAA ACGCTGTGGC AATCGGCTAC
AACGCGGCTG CACAAGGCGC GAACGCGTTG TCGCTCGGAT CGTCCGCCGT GTCGAACGGT
TCGGGCTCGA TTGCGATGGG TGCCAATGCA TTCGTGACCA CGACAGCCTC GAACGCGCTG
GCTTTAGGTG CCGGCGCAAC TGTTAGCCAA GCCAATACCG TGGCATTGGG TGCCAATTCG
GTGGGGGACC GCGCAAACGC CATTTCGGTT GGCAGCAGCA CGGCGCAGCG CCAGATCACC
TACGTGGCCA ACGGTACGAA CAGCACCGAC GCGGTGAACG TTTCCCAATT GACTGGCGTG
ACGAACCTGA TCGGCGGTGG CGCGGGCGTG AACGCCGACG GCACGATCAA GAAGCCGGGC
TTTACGATCG GTGGTCAGAC ATATTCGGAC GTCGGTTCGG CCATCGCGGC TGCGGTATCC
GGCGGTTCGG CCAATGCGGT GCAATACGAC ACGTCGTCGC ACACCAAGGT GACGTTGGGC
GGTACCGGTA CGACCGTGCC GGTGACGCTG ACGAACGTGG CAAACGGCGT GGCGAGCAGC
GACGCAGTGA GCGTTGCACA GCTGAAAGCC ATGGGCGGCA CGATCGACAG CAGCGGTGTG
ATGACGAACG CATTCGTCGC GTACGACGAC ACGACCAAGA ACACGATCAC CTTGAAGGGC
GCGAGCGGTT CGACGAAGAT TACCGGTCTG ACGGCAGGCG CGCTGTCGGC AACGAGCCTG
GACGCGGTCA ACGGTACGCA GCTGTACCAG ACGAACGCGA ACGTGGCCAA TGTTGCAGGC
ACGGTGGCGA ACGTGACCAA CACGGTCAAC AACATCACCA ATGGTGGCGG CATCAAGTAT
TTCCACGCGA ATTCGACGCT GGCGGACTCG TCGGCAGGCG GCACGAACTC GGTGGCGATC
GGCGGTGCGG CAAACGCCTC GGCGAACAAC TCGGTGGCGC TGGGTTCGAA CTCGGTGGCT
AACCGTGCGA ACGCAGTGTC GGTGGGTGCG GTCGGTTCGG AGCGCCAGAT CATCAACGTG
GCGAACGGTA CGAACGGCAC CGATGCAGTG AACGTATCGC AGTTGCAGGC CATGGGTGCG
AACATCACCA GCGGTGTGGT GACGAACGCG TTTGTTGCCT ACGACGACAC GACCAAGGGT
AAGGTCACCT TTGGCGGCAC GGGTGCTACG AAGGCCGTGA CGCTGACGAA CGTGGCAAAC
GGCGTGGCAA GTAGCGACGC AGTGAACGTT GCGCAGCTGC AGGCCATGGG CGGCACGTTC
AACAGCAGCG GTGTGGTGAC GAACGCGTTT GTCGCGTATG ACGACACGAC CAAGAACACG
ATCACGTTGC AGGGCGCGAG CGGTTCTACG AAGATCACGG GTCTGACGGC CGGCGCGCTG
TCGGCGACAA GCCTGGACGC GGTCAACGGT ACGCAGCTGT ATCAGACGAA CGCGAACGTG
GCCAACGTTG CAGGCACGGT GACGAACGTG ACCAACACGG TCAACAACAT CACCAATGGC
GGCGGCATCA AGTATTTCCA CGCGAATTCG ACGCTGGCCG ATTCGTCGGC AGGTGGCACG
AACTCGGTGG CAATCGGCGG AGCGGCGAAC GCCTCGGCGA ACAACTCGGT GGCGCTGGGT
TCGAACTCGG TGGCTAACCG TGCGAACGCA GTGTCGGTGG GTGCGGTGGG TTCGGAGCGC
CAGATCATCA ACGTGGCGAA CGGTACGAAC AGCACCGATG CGGTGAACAT TGCGCAGTTG
CAAGCCATGG GGGCCGGGTT CAACAGCAGC GGTGTGGTGA CGAACGCGTT TGTTGCGTAC
GACGACACGA CGAAGGGTAA GGTGACGTTG GGCGGCACGG GTGCGACGAA GGCCGTGACG
CTGACGAACG TGGCAAACGG CGTGGCAAGT AGCGACGCAG TGAACGTTGC GCAGCTGCAG
GCCATGGGCG GCACGTTCAA CAGCAGCGGT GTGGTGACGA ACGCGTTTGT CGCGTATGAC
GACACGACCA AGAACACGAT CACGTTGCAG GGCGCGAGCG GTTCTACGAA GATCACGGGT
CTGACGGCCG GCGCGCTATC GGCGACAAGC CTGGACGCGG TCAACGGTAC GCAGCTGTAT
CAGACGAACG CGAACGTGGC CAACGTTGCA GGCACGGTGA CGAACGTGAC CAACACGGTC
AACAACATCA CCAATGGCGG CGGCATCAAG TATTTCCACG CGAATTCGAC GCTGGCCGAT
TCGTCGGCAG GCGGCACGAA CTCGGTGGCA ATCGGCGGAG CGGCGAACGC CTCGGCGAAC
AACTCGGTGG CGCTGGGTTC GAACTCGGTG GCTAACCGTG CGAACGCAGT GTCGGTGGGT
GCGGTCGGTT CGGAGCGCCA GATCATCAAC GTGGCGAACG GTACGAACAG CACCGATGCG
GTGAACATTG CGCAGTTGCA AGCCATGGGG GCCGGGTTCA ACAGCAGCGG TGTGGTGACG
AACGCGTTTG TCGCGTACGA CGATACGACG AAGGGTAAGG TGACGTTCGG CGGCACGGGT
GCGACGAAGG CAGTGACGCT GACGAACGTG GCAAACGGCG TGGCGAGCAG CGACGCAGTG
AATATCGCAC AGCTGCAGGC CATGGGCGGC GGGTTCAACA GCAGCGGTGT GGTGACGAAC
GCGTTTGTCG CGTATGACGA CACGACGAAA AACACGATCA CGTTGAAGGG CGCAAGCGGC
ACGACGAAAA TCACGGCTCT CACGGCGGGC GCGCTGTCGG CTTCGAGTTC TGACGCGATC
AACGGTTCGC AGCTGTACCG CACTGCAACC AGCGTGGCGA ACGCGCTGGG TGGTGGTTCC
AGCATGGGCT TGAATGGAAC GGTTTCCGCA CCCAGCTACC AGCTCAGCGG CGGAACGTAC
GCAGACGTCG GTTCGGCTCT TTCTGGACTG GACGGTGAGA TCGGTTCGAT CAACAACAAC
ATTGCTGACA CCACAAAATA CATCAAGGTG GTGTCGAACT CCAGCGCGGC GATTGCGACT
GGCGGTGAAG CAGTGGCGAT CGGCGGCGGG GCATATGCAA GCGGGTTGAA CTCCTTGTCC
CTCGGTGCAG GCGCGCGTTC ACAGTTTGCG AATTCGGTGG CGCTCGGCGC GAACTCACGG
GTGACGGTTG CGAATACCAT TTCCGTTGGC GATGTGGGTA CAGAGCGTCG CATCATGAAC
GTGGCTAACG GCGTCTTGTC TAGTGATGCC GCAACGCTCG GTCAGTTGAA CGCATTGCAG
AACTCGCTAA CGCAAAAAGT CTCCGCTGAG TCGAGCGGCG TGAAGTCCAT GCTGCTGGGC
GCGGTTCCGG TCACCAACTA TATCGCCGTC AGCACGACGA TTACGCCGGG ACTGCCTGCG
ACGACGACGG ACAACACAGA GAACGCCATG GCCATTGGCC CGGGTGCAAT GGCGCAGGGC
GCTGGCTCGG TGGTCGTTGG CGCGGGTTCG GGCTCGTTCC ACGCCGGTTC TACGGTGATC
GGTTCGAGCG CAGTGGCTGG GGCGGTCAAT GCGACGGTAA TTGGCGGAGG AGCAACGACC
AACAATTCCG CCGATAACTC AGTGGTAATC GGCTACATGG CTGCGGCGCA AGGCACGAAT
GCATTGGCGC TCGGTTCTAA TAGCGTGACG AACGCGACGA ACTCGGTCGG ACTGGGTGCA
AACGCCGTGG TCACCACTAC TGGCACGAAC TCCATGGCGC TCGGCTCGGC TTCCACGGTT
AGCGGGGTGA ATGCCGTAGC GATTGGTTAT AACTCGACGG CTGACCGTGC CAACACGGTT
TCCGTCGGCG CGTCAGGTTC GACGCGTCAG ATCGTCAACG TTGCCGCCGG TTCGCAAGAC
ACCGATGCGG TGAACCTGGG GCAGATGAAT ACTGCGATTA ACGCGATTGC AGGTGGCGGA
GCCCCGAATG CTGTGGTCTA TGACACGTCG GCGCACACCA CGCTTACGCT GAATAAGGGT
GCAGGTGCGG TTACGGTGTC GAACGTGGCG GACGGTGTGG CCAACAACGA CGCAGTGAAC
GTGGAGCAGT TGAAGGCGAT GGGCGCAAAC TTCACCAGCG GTGTGGTGAC GAACAGCTTC
GTTGCGTATG ACGACACGTC GAAGAGCAAG ATCACGCTCG GCGGCGCGGG TTCCACCGTG
CCGGTCGCGT TGACCAATGT TGCTGTGGGG CAGGTTTCGT CGACCAGCAA GGACGCAATC
AACGGTTCGC AACTGTACGG CGCGATGAGC AGCACGGCAG CCGCATTGGG TGGTGGCTCG
TCGGTGACGG CAAGCGGCTC GATCACCAAG CCGAGTTACA CGTTGAACGG CCAGACTTAC
ACTGACGTGG GCACGGCATT GGCTGCGGCT GCATCGGGCG GTGGCGGAGT TGATGCCGTG
AAGTACGACA CGTCGGCGCA CACCTCTGTA ACACTGGGCA ATGCAGGCAC GCCGGTGAAG
CTGTCGAACG TGGCCGACGG TGTGGCCAAC AACGACGCCG TGAACGTCGA GCAGTTGAAG
GCAATGGGCG CAAACTTCAC GAGCGGTGTG GTGACGAACA GCTTCGTTGC GTACGACGAC
ACGTCCAAGA GCAAGGTCAC GCTCGGCGGC GCAGGTTCCA CCGTGCCGGT CGCGTTGACC
AATGTTGCTG TGGGGCAGGT TTCGTCGACC AGCAAGGACG CAATCAACGG TTCGCAACTG
TACGGCGCGA TGAGCAGCAC TGCCGCTGCG TTGGGTGGTG GCTCGACCGT CAGCACTAGC
GGTCAGATCT CGAAGCCCAC GTACACTGTC GGTGGCAACT CGTACACTGG TGTCGACTCG
GCAATTGGCG CCCTCAACGC GGCAATCGCT ACCGGCGGCA ACCCGGACGG CGTGATTTAC
GACACGCCGG CGCATGACAA GCTGACGTTG GGCGGCGCGA ACGCCACCAC CCCGGTAACG
ATCGCAAACG TCGCCGCCGC GACGAGCGAC GATCAGGCGG TCAACCTCAA GCAATTGAAG
GAAGCCGGCT TGAACGTCGA TACGTCAGGC AACGTGACCA GCTCGTTCAT CGCTTATGAC
AACGCGTCCA AGAACACGGT GACGTTTGGC GGCGTTGGCT CGACGACTCC GGTTCTGTTG
AAGAACGTGG CAGCCGGCAT TGCTGACTCC GACGCGGTCA ACGTTGGTCA AATGCATTCG
TATGTCGACC AGCAGATTGG CGGCGGCACG GCGAACGGCG TGGCGTACGA CGACTCGACA
AAGAGCAAGG TGACCTTGGG CGGCGTGGGT TCCACGACGC CGGTGACATT GACGAACGTT
GCGGCAGGTA AAACCGCAAC GGACGCGGTC AACTACGGTC AGTTCTCGGC CCTCGAAAAC
GTCGTGGCCA ACATTCAACC GGGCACGGGC AACTCGACCT ACATCAACAT CAACCCGACT
TCGGGCGGCA CTGCTGCAGT GGCGACTGGT TCGGACTCGA TCGCAATCGG CAACGGCGCC
TCGGCTTCCG GCGAGTCCGC GATTGCAATC GGCAAGAACA CGGTGACGTC GGGTGACAAC
TCGGTCGCGA TGGGTGCAGG CGCTTCGGCT CCGAACACCA ACGCTGTTGC ACTGGGTGCG
AACTCGACGA CGGATCGCGA CAACTCGGTG TCGGTGGGTT CCGCCGGCGC AGAGCGTCAG
ATCACCAACG TCGCAGCCGG TACGCAGGGC ACCGATGCGG TCAACCTGAA TCAGTTGAAC
AGTGCCATGG GTAACATGAG CAACTCGATC AACAACGTTG ACCGTAGCGC GGCCAAGGGT
ATCGCGTCGG CCTCGGCGCT GAACATCGTC ACGCCGTATC TGCCGGGTCG TACCACGCTG
AATGCAGGTG TCGCCAACTA CCGTGGCTAC CAGGCAATTG GTTTGGGTGT GTCCCGCTGG
AACGAGAAGG GCACGATCAA CTACAACCTC GGCGTGTCCA CCTCGGGCGG CAATAGCACC
ATCGTTCGCG CTGGTATCGG TATCGTCCTC GGCAACTAA
 
Protein sequence
MNKSYISVWN DATETWVAAP ETATAHSGGG AKSSKPIVAS SEAAKRPSSA MKFALMPITM 
AVGAIGMAAL PGTAMAGLYV CSNDGLGEGT YYGTDGLSTG GGCLASGGTT FALLAGPIAG
SGNGGVGIWA NVDQTLSVYA PSGIYLNNDV NMVGNKITHM HAGTADGDAV NVSQLTGVTQ
GMGGGAGISG GGSFIAPTYT FDDGTLSYNN SGDAFNLVSS RLKYFHVNST APMSTAGGKQ
AISIGPVASA SGDNSVALGN TAVATGSGSI AVGSNATALV SNSVALGANS VASRNNSVSV
GYLSADGTSQ YTRVVTNLTA GSAGTDAVNV NQLNAAIASV NGGGGGGNVT NAVTYNDSGM
SLITLKGSAG TKINNLTAGD ITSAYSMDAV NGSQLYATNQ NVANVAGNVV NIAGNLANVT
NTVNNITNGG GVKYFHTNSS LADSSATGAD SVAIGGLASA SANNSVALGS KSVADRANAV
SVGASGAERQ ITNVAAGTAN TDAVNLGQMN TAIASISGGG SLDAVIYDSS AHNKLTLGGS
TSTVPVGLTN LAAGQVTSSS KDAVNGSQLF NTASGVSSAL GGGSSVSASG AVTNPTYTLQ
GGTYNDVGSA LSGLDTEISG LTTDVANASK YFKVVSASSA AIATGAETIA LGGGAYASGS
NSVAIGSGSR SMFANSVALG ANSRVNAANT VSIGDIGTER RITNVANGVS GTDAATIAQL
SALQTSVNQQ IAALPSGSKS MLMGASMLGA TPVTAYISVS SNVTPGTNTS TDNSLDAMAI
GPTAMALGQG SLAVGAGAGT ALAGSTAVGN GAAALALNTT VIGAGANTSN NATNAVAIGY
NAAAQGANAL SLGSSAVSNG SGSIAMGANA FVTTTASNAL ALGAGATVSQ ANTVALGANS
VGDRANAISV GSSTAQRQIT YVANGTNSTD AVNVSQLTGV TNLIGGGAGV NADGTIKKPG
FTIGGQTYSD VGSAIAAAVS GGSANAVQYD TSSHTKVTLG GTGTTVPVTL TNVANGVASS
DAVSVAQLKA MGGTIDSSGV MTNAFVAYDD TTKNTITLKG ASGSTKITGL TAGALSATSL
DAVNGTQLYQ TNANVANVAG TVANVTNTVN NITNGGGIKY FHANSTLADS SAGGTNSVAI
GGAANASANN SVALGSNSVA NRANAVSVGA VGSERQIINV ANGTNGTDAV NVSQLQAMGA
NITSGVVTNA FVAYDDTTKG KVTFGGTGAT KAVTLTNVAN GVASSDAVNV AQLQAMGGTF
NSSGVVTNAF VAYDDTTKNT ITLQGASGST KITGLTAGAL SATSLDAVNG TQLYQTNANV
ANVAGTVTNV TNTVNNITNG GGIKYFHANS TLADSSAGGT NSVAIGGAAN ASANNSVALG
SNSVANRANA VSVGAVGSER QIINVANGTN STDAVNIAQL QAMGAGFNSS GVVTNAFVAY
DDTTKGKVTL GGTGATKAVT LTNVANGVAS SDAVNVAQLQ AMGGTFNSSG VVTNAFVAYD
DTTKNTITLQ GASGSTKITG LTAGALSATS LDAVNGTQLY QTNANVANVA GTVTNVTNTV
NNITNGGGIK YFHANSTLAD SSAGGTNSVA IGGAANASAN NSVALGSNSV ANRANAVSVG
AVGSERQIIN VANGTNSTDA VNIAQLQAMG AGFNSSGVVT NAFVAYDDTT KGKVTFGGTG
ATKAVTLTNV ANGVASSDAV NIAQLQAMGG GFNSSGVVTN AFVAYDDTTK NTITLKGASG
TTKITALTAG ALSASSSDAI NGSQLYRTAT SVANALGGGS SMGLNGTVSA PSYQLSGGTY
ADVGSALSGL DGEIGSINNN IADTTKYIKV VSNSSAAIAT GGEAVAIGGG AYASGLNSLS
LGAGARSQFA NSVALGANSR VTVANTISVG DVGTERRIMN VANGVLSSDA ATLGQLNALQ
NSLTQKVSAE SSGVKSMLLG AVPVTNYIAV STTITPGLPA TTTDNTENAM AIGPGAMAQG
AGSVVVGAGS GSFHAGSTVI GSSAVAGAVN ATVIGGGATT NNSADNSVVI GYMAAAQGTN
ALALGSNSVT NATNSVGLGA NAVVTTTGTN SMALGSASTV SGVNAVAIGY NSTADRANTV
SVGASGSTRQ IVNVAAGSQD TDAVNLGQMN TAINAIAGGG APNAVVYDTS AHTTLTLNKG
AGAVTVSNVA DGVANNDAVN VEQLKAMGAN FTSGVVTNSF VAYDDTSKSK ITLGGAGSTV
PVALTNVAVG QVSSTSKDAI NGSQLYGAMS STAAALGGGS SVTASGSITK PSYTLNGQTY
TDVGTALAAA ASGGGGVDAV KYDTSAHTSV TLGNAGTPVK LSNVADGVAN NDAVNVEQLK
AMGANFTSGV VTNSFVAYDD TSKSKVTLGG AGSTVPVALT NVAVGQVSST SKDAINGSQL
YGAMSSTAAA LGGGSTVSTS GQISKPTYTV GGNSYTGVDS AIGALNAAIA TGGNPDGVIY
DTPAHDKLTL GGANATTPVT IANVAAATSD DQAVNLKQLK EAGLNVDTSG NVTSSFIAYD
NASKNTVTFG GVGSTTPVLL KNVAAGIADS DAVNVGQMHS YVDQQIGGGT ANGVAYDDST
KSKVTLGGVG STTPVTLTNV AAGKTATDAV NYGQFSALEN VVANIQPGTG NSTYININPT
SGGTAAVATG SDSIAIGNGA SASGESAIAI GKNTVTSGDN SVAMGAGASA PNTNAVALGA
NSTTDRDNSV SVGSAGAERQ ITNVAAGTQG TDAVNLNQLN SAMGNMSNSI NNVDRSAAKG
IASASALNIV TPYLPGRTTL NAGVANYRGY QAIGLGVSRW NEKGTINYNL GVSTSGGNST
IVRAGIGIVL GN