Gene Pars_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0072 
Symbol 
ID5055713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp61054 
End bp69057 
Gene Length8004 bp 
Protein Length2667 aa 
Translation table11 
GC content51% 
IMG OID640467650 
Producthypothetical protein 
Protein accessionYP_001152339 
Protein GI145590337 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAACC AAACTAAGAC AATACTGGCA GTCCTAATAA TAACAGCCCT GGCTACGCTA 
TCCTACGCAC AGTACGACCT GAGACAAGCC CCCTCGGCAT TCGACGTACT AATCCAGATA
AACGACGCAA GAGGGGCATA CGGCACTCCC AACTCTATCT GTACGCCCTG GCCTAAGAGC
CTCCAGAACT ACCCACCCTA CAGCTTCGCG GCTGGCAAAA ACTTCACACT AATAATACGC
GAAATAGCCT CAAGTCCCAA ATACCCGTCG GTATTCCAAG ACTATAGAGT ATCTGCCGTC
GCCAACGCGA CGGGCTTCGT CTACTTCCCA ATAAGCCTGC CCGCAGGCGT AAACGTAACT
AAGAAGACTG ACTGGTACGT GGCCATCGTC GTACAGTGGC CAAAAGACGG CATATACTTC
CTCGTATACA ACCAGACATT CACAAGCACA TCCCTCATCG AGGTTATTGG AAACCTCAGC
GGTAGACCAG ACCTAACTGG AAGCGAAGTC TACACAGGTC CATGGGGCGT CCTAACTGTA
ACGAACGGGA AGGTCAGATA CGGTAACCTA ACTTCCAGCG TTATCCACAC ATTCTACATA
GGCGTCAACA CCCTTGGGAT GAGCTACGAC CTTACGCTCT CGAGAGAGGT CACCATAGCG
GGTACAACCT ATCCTCTGGA GACCAAGATT GCGCCGGCTA AGGGACCAAC GACATATGCT
AGCAACATCG ACCCAGGAAT AACCACCGTC TTTGGCCCAT TCACATACCT AGGCACCTAC
GTTACGTCGC TAAAGGATAG CAAAAACAAA GTAAGCGTAG ATCCGAGCAA GGTTACGTAT
AGGCACTACG TCTATATATC GGTAGAGGGC TATGTCGGAA CCGTCATCCA GAGCAAAAAT
GACATATACA CATACACAGC CCTAAACCCG AACCTAAGAT ACGTGGCCAT AACGCTGAAC
ATGACACAAG GCGCCTCTGT ACCCATGTCC TGCAACTTGG TTGTATGGAG CCTAAGCATG
TACACTGTGA CGATCACAGG CTTGCTCGAC TTGAAGGGCA ACCCAATATT CAACCCCGAG
TACTTCAGGT TCAAGATCCA GATGAAAGTA GGAGACAACT GGGTGACGCT GAGCAGAGCG
CAGGGTAGCT GGCGGCTCGA CCTAGGCGAC GTGAAGGCAG TGCTTAACCA GGTATGTGGG
ACCATAAGCT CGTTCTCCGA CATAATGAAC TGTTACAGGA CGTTAGGCCC CAACGGCTTT
AGAAATGCAG TGCTGAAGCT GTACGAACCC GTCACATTGA CGCGTCTAGC CGGGATGTTA
TCGCTAGAAA ATGATGGGTA CAAGATCACT ACGCAGGACT TGGTGACTAA ACTTGTTGTG
GAGTATTCCT ATGCAGCTGG CCAAGACTCT ATTAGGGCTG TGGTGCTCGA AATGCCGTTG
GCAGAGCCTA GGGACTTCAC AGGCGTGTTG AACGTGTCTA TTCTGCCAAT CCAGATAAGG
CTTTGGAAGT GGAGCGGCCT TGATCTGCCG CCGGGCGGTA ACGCCAGGCA GTGGTTCTAC
ACAGACCCGC TAGACCTCGC ATCGCTACGT TTTAAGGTCA GCGGGGATAC CATGTATGTA
AGCAGAATGG CGTACGACGG TGCCGTTTCC TATGATCCAT GGTTAGGCCA AGTAGTCTGG
ACTCTGCCTC CGTACCAGCC GGTATCTGGG CTTGTTGGTA ATGTCAAAGC CGCCCTACTT
ACATTATTCA ATGCAAGCGG GTACCTCCCG TTGCCCACGC TGGTGGCTAA TCTGACGAGC
ACTGGCGTGA AGTATTACAA CTACACTGAC CTAGCAGCCA ATGCTGACTA CTTTAAGAAG
TTTGATGTCG GCGCCGTAGG TAGCTACGCA TATAAGTTCA GGATCTACAG CGGGGATATA
TTGGTAGGCA CTGCGAACAT CGTAGCCTAC TACCCGACGG TTAACAAGTC TGGCTTGCTG
TACAATATGA CATATGCCAG CTCTGTGTTG GAGGCAGAAT ACGGCGATAC AAAGCACTAC
GACATGTACA GCGTACAAGT ATTGAACCCA GGTAGATTCT ACGATAGGGA GCACGTGCTC
CACATTGCCA TTATGAGAGT GTTCCAGAAC ATACTTATGA GAGATGCTTG TGGCAACCCG
CTAACCGGAG TCAGTGCAGG CCTTGCTGGG GCGAACCTAA TGTTGGTGAT CCAAGTAGGC
GGGAAGAACT ACACAATAAC GAACCTACCA CTCGGCAGCG AGGTTCCCGT GGATATTGCC
ATCCCAATTG ACGAATGGGG CGAGCCGCTT GTCGATGTCA AGAGCGGTAA CCTAACCGCC
TACGCTGTCG TCAACTACTT CGGCTACAAG CTCTACGCCG TAGACAACCC GGCAAAGATT
CCCACTTCAG CGCCCGTATG GTTTAACATC CCGGTCAAGG GCGAGAGGAG TGTAGTAAAG
AAGCCAATAA TCTACCTACC GATCTCCCCG CTGTTGTTCA GGACCTGGAG CCAAGTAGTT
TCCGTAGGCT ACGACCCGCT CAAAGAACCG CTGATGGGCT TCGTGGTGAG AATATTGCCA
ACTACCACGG GTGAGATTGC GAGGAGTATC TCGAACAAAG ACGGCTACGC CTATGTGCCG
AATGTGCCCA TAGGTGTTCC GCTCAGGGTA CAGGTGAGAA CTATAGTGCC ATCATCTGAC
AAACGCTGGC CCTACACCTT TTACCAGATA AATAACAATA ATGACTACGC CAGCTACGCC
ACAGCACTGG GCTTCACGGC AGCAGATAAC GTGTATACCC TGGGCACAAG AGGGGATATT
GATAGCGGCC TAGTAGTATT TGACAAACAA ATAATCATTG ACTCAACTAA TGCAACCAAG
TATATATGTG CAAAACAAGC CATAGACCTG CCTCTGGAGG TTTACGACCT CGTAGTGAGG
GTGTTCGACA AGACGGGTAA GTACCTCTTG AGGAGCCAGC CTGTATTCCT CGGCCCCTAC
CCACAGGCGA CGAGGCCTGT ACTGCTGAAC GTAACGTTGC TTCTAGCAGA CGATTACAGC
CCGTATCGCT ACGCCTCGAT GTGGAGAGAC TTCGCAATGG GCGACTTCAA GATTCTAACA
GACTACCGCG CTATTGGTAT CACCGGCATG AGGAGTATAT ACTTAAGCCT CGCCTCGAAA
TACCTCTCCG AGACACAAAA GGCATTGGGC TGTCCTAAGT ACACCACTGC CAACTACTCC
TCAGCAATTA ACTCATATGC GCTTGCATCT ATGGCCGGTT TCGTGGCCAA CGCCTCGACT
GACAGATATG CTGCCGTATA TCTGTTGTCT TCCCAGCAAC CAAAAGACGT CATGGACATC
TGTGCTTTCA AGACGGATCT AGCTAGCCTG AACAAGATGC AGCCTGGCTC TGCAGAGATC
GCGAGACTAT TCATGAAGGG CCAGAGGCTA CGGTTTGTGG TGTGGTATAT GGGACAGAAG
GTCTTTGACG ACTATGTGAC TATCACAGGG CCATTAGTTG ATATAAAGGC TGACGTCTAT
CCAATAAACG TCACCACCTA CACTAAGAGT ATGAGGTTGC CAGTAGACAC TTTCGTAGGA
TTTACCATCA CCGACGTCTA TCTGGGCCTG GCGCTTAACA AGACTAACGG CATGTTCGCC
AACAAGTCGC TAGTGCCGCA ACTAGTTGCT CCGTTTAACA GCACGTACTC AACCTACAAC
TTAACCTACC TGCTTGCCAA TGAGCTTGCG AAGTCTGTAG GACAACAGTA TAATGACAAC
GTCACTGCCT TTGTGGGCAC TAATAAAGTA GGCCAGTGGC TATATGGATA CTATGTGCCT
GCACAATTCG GCGGCGACTT CGCATATCTA CCCAACATCG TGGTGCTACG CAACGGCACA
GTTGCCAAAT ATGACTTTAC TATCCTGAAC GGGCTAGTCA CAACTCAGTC GACATTTAGA
GTATCCCGTG ATATCAGCGC CGGTAGCGGC GCGATTCAGG TGCCTTCCGG CCAGACGATT
GTCTTTAATT TGACAGGCGC GAAGATTGTA ATAGATCCTG CTTCGAACAT CACCTACACC
AAACTCACAG CGTACAACGG CACTGAGAAT AATAATCCAG TTAAGCCGCA ACTTAATATA
ACCGGATCCC TTGTCTCCAT CAATGGGTCA CTGGTGAATA ACTACCGCGT CAATGTCAAC
GCTAACTACT CAATAGCGAT TACTGCCTCA GCACCTTGTA ACGGTAGCAT TACCATTGCA
CCAGGAGCCC GAGATCTTAC TGTGCAGGTT ACTGTTAAAG CAGGCCGCTG CCCTGCGACT
GTCACTTACA GCGCAACTAA CAGGACCTAC AGCACCCAGA CTGTTGTGTC AACTTCGGCG
CCAGGCGCTG AATATGCTGT GAGCTTTGAT AGGTGGTTCT TAGTTCCCTA CGACTGGTAC
CTTACCCAGT ACAATGTGTT GTACCACGCT ACCAATGTAG TAAAGAGAAA CGACGTAATT
TGGGTGTTGA GCCACACTGG CAAGACGGCT ATGTGTGCGA CACCACAAGG ACAGGTCACT
GCTGGTGAGG ATGACAGGTA TATTTACGCG CTGAATCTGA CTGGGCTCGA AGTAGTCAAT
TATCGCACGC TGAGAGTTGA GTTGCCTTGG TTGGCTACGG GAGGTGGTAA GGCTGTTGTG
AACATCACCG CGTACTTTGC GGATAACAAC ACTCTAATTG ACTGGAAGGT CTACAACCTC
ACCGAAATAC TTGCAGGTAC GAGAGGTACG AGAGTAATAG TTACTCTGCC GCTGAACTTT
GGAAAAGTAG GCGCCAAGGC ATACGACATT GCCACTACCA ACGCCGCTGT GAAGTACGTT
ATAACCTTTG ATATGTACAA TCCAAAGGAG GCGCCCAACA GCGTTTGCGC CACCAAGCTA
GTCCCACTTG CAGGTAGCTA CCAGAATGTA TCAATGTACG AATGCAGTGT TCCTACCACT
CCAGGTCTGC CGGAGCGCAT TGATCCGCAG ACTGTGGTTT ATGCCATTGA CTTACAGAGG
TCGTTTGCCA ACGGCTTCAA CGAAACTTAC AGCGCTTACG GCGGATTCGG CACTCCCATT
ACATATGTGG TGAAGTCGGG TGAAATTGCG CTGTTGCCTT CGTGGTACTA CAAGACCTCA
GTCGCTGGTT CAAGAATTGC TAGGATATGG ATAATTGCCG CTAACAAGGA CCCGGCTCAG
GGACCGGCGT TGGGCACAAA GTATTACTCA TACTCTGTAA AGGATGACAA GGTCACAATT
AACATCTATA AGTTCGAGAA GTACTTGGTT GTAAACTACG TACACAATGT GTGTCCTGCA
GGTTGGACTT CGCAGACGTT CCTCGACGAG TTTGACGGCT CCGGGAGGAT TATCGGCCTT
GGATTTGGTA CATCTGGCAC AAGCACGTTG GTGCTGGGCA ACTACACTGG CGTGAGGATG
TGGAACTCCA CCGACATGTG GCTTTCTGGC GGCGTATTCA AGCTACCCAC CGTAGCGCTT
GATAACTTGA CCGTTGAGAA CACTGCGGAC TTCCCAATTG TCGTCAGCTC GCTTAACGTC
AAGTACGGTG ACTACAAGTA TGAGATACCT ATGTCGCCGC TTCGCGTTGA TGCCGGGAAG
ACCAACACCA CGCGTCTAAA TGGTTATGGC TTTGGAAGGA CGTACATGTT CAACATCTCA
GATGTGTGGA GCTTTACGCT GGTTCAGCCC AACTACGCCT ATGGCCTAAG CGCCTACCAC
GCCGGCTTAG TCGACGCTGC CAGGTACTTC GGCCTTGGTA CTGTTGATGT GACGAAGTAT
CTGAAGCCAT TGGAGTCAAA GTACCTCGTG CAGAACGTGC TATATGCAAG CCACGCTGAG
ACTAGCGACT GGACGTTCAA CATATTATAC GGCAAGATAG CAGAGACTAC TAGAGGTACT
TGGGGCGACT TGACTGTGAG GAGCGACGAC ACTGACTACA AGTACGTATT CAGCTTCCCA
ACTCTGCCAC TAAAGGAGAT CCGCGACTGG AACGACAGGC CTTTGGCTAA CCAGACTATT
GCGCTGTTTG ACAGAACTGG CAGGCTATAC GCAGTAGTCT ACAGCGACAG CCGCGGCAGA
TTGGCCTATC CGTTGCCTGA CATATCGGCT ATTGGCCAGA CTAATGTAGT CAGAGTATCA
TGGTTCAACG GCTACCTAAT AGAGTTGCTG AGAGGTAAGC CGGAGTTCAC AATATGGATA
TACAACCAGC TTATCCAGAG AGATGTCACC GAGCTCGGCG ATGCCACAGC GAGCAGTTAT
ATAAGGACGT ACGTCTATCC ACTGACTGCT ACCGTTAAGG ACGACGCTGG TAGGCCGTTG
ACAAATATGT ACGTGAAGGT CATCGACGTT TCCACCCTCG GTCAATTAGT CAATGCGGCT
AACAAGACCG GTGCGGATGG TAGCGCCCAG GTTGTTGACA TGAGGATATC GAAGTACTCC
AGCGGAGTGA TGTCACAAGT GCCTGCCACG AGCTACTACT ACTACGTCTA CGACCAGAGC
GGCGCCTTAG TGGCTCTCGG GAAGTTTGAA ATTCAGCGCG GTGCTTCCCA GCCTTCGACT
GCGTACAACG TGGTCGCTAC GGTGAGGTAC GCCACTGAGA TACCTGTAAA GAACAGCGCC
ACGCGCGGTT ACCTGTTGAT AAAGGGCGTC GAGTTCCTCA ACGGCACTAA GAAAGACGTC
AAGATACCGT TCACAATATC TGGCGGTGCT ATGACGTTGC AAGGCAAGGT TCCCGTGTCC
GTTGAGTACC CAGTGGATAT ATATGTGACT CATGTTACTT TGGGTGGCCA GGAGGTGCCG
GTCAAGGGCG GCCAGTTCCT AGTATTCAGC GGCAAGACCA CTGACCTGCT GTCAGGCCTA
GACTTCGCTG AGCTTGGCTT AACGAGCCTT GTAACTATCC AAGCTGTTGA CTCGGCAGGA
TCGCCGAGAT CTGACTGGAC GGTCTTGGTG CGCTATGGCA ACTTGACTGT AGCCCAGGGC
GCTGGCCAGT TGCAGCTTGT TTTGCCGCGG AGTGACGTCC TGGAGATGCC TTATACGGTG
AACGTCGTTA CAAACGTCTA TACCCCCGAC GGTAAGGCCC TGGTTAAGAG CCAGACATTA
GACCTCAGGC AGAAGTTCGT TTCGCTTCAG ATCCCGATCG CTACCGTTAA GGCTGTGGTG
CAGGCTGTTG ATGGATTTGG CAATGTGAGG AGTGACTGGC CTGTTGTAAT CGAGAACGTG
GCTACAGGCA TGGGCCAGAT CACGACCGAG CTTGTCGAGG GTGAGCGTTA TGTGGTGAGG
GCCACGGGCC TTGGTTACAC CAACGCTACG CAGATCACCG CTAAGGGCCC GCAGATGGTT
GTCCGCGTCA TGATACCCAC GGGCAAGATC GTGGCCAGTG TTGTTGACGG CTTCGGCAAT
GTGAGAAGCG ACTGGCCTAT TGAGATCGTT GGCGTCACAG CTGGACAGGG CAAGATAGGT
GAGATTGATG TAATTGCTGG GCAATACACG GTGAGGACTA AGGTGTTTGG CAAGGAGTTT
GCCTCCACTG TAAATGTCGG TGTCGGTAAA GTCGCCACCG TTACCTTGCA AGTGCCTACT
GCGAAGCTGA GCATTACAGT CGTTGACGAC GACAAGAAGC CAATCGACAA CTACGTCGCA
GAGGTGCTAG TGTCTGACAT GGCCTTCTCG ACGCCGCCAA AGAATTTGGA GGTGCTTGCT
GGCACCTACA CTGTCAAGGT CTCCGCCTTG GGCAAGGACG CCACCACCCA AGTCGCTGTG
AACGCCGGCG AGACTAAGAA TGTGCAGGTT GTGGTGCCGG GTACTGCTGG GCTTGATTTA
TTCGGGACTA GGATACCGTT GCCTACTCTT GTGCTGTACG GCCTGTTGCT ACTAGTTGTT
GTGTTGATAC TGGCGATAAT AGTTATTGAG TACAACAACT GGAGGAGGAG GCGTCTAATG
CAGATACTGG CCCCGCCGAA GTAA
 
Protein sequence
MHNQTKTILA VLIITALATL SYAQYDLRQA PSAFDVLIQI NDARGAYGTP NSICTPWPKS 
LQNYPPYSFA AGKNFTLIIR EIASSPKYPS VFQDYRVSAV ANATGFVYFP ISLPAGVNVT
KKTDWYVAIV VQWPKDGIYF LVYNQTFTST SLIEVIGNLS GRPDLTGSEV YTGPWGVLTV
TNGKVRYGNL TSSVIHTFYI GVNTLGMSYD LTLSREVTIA GTTYPLETKI APAKGPTTYA
SNIDPGITTV FGPFTYLGTY VTSLKDSKNK VSVDPSKVTY RHYVYISVEG YVGTVIQSKN
DIYTYTALNP NLRYVAITLN MTQGASVPMS CNLVVWSLSM YTVTITGLLD LKGNPIFNPE
YFRFKIQMKV GDNWVTLSRA QGSWRLDLGD VKAVLNQVCG TISSFSDIMN CYRTLGPNGF
RNAVLKLYEP VTLTRLAGML SLENDGYKIT TQDLVTKLVV EYSYAAGQDS IRAVVLEMPL
AEPRDFTGVL NVSILPIQIR LWKWSGLDLP PGGNARQWFY TDPLDLASLR FKVSGDTMYV
SRMAYDGAVS YDPWLGQVVW TLPPYQPVSG LVGNVKAALL TLFNASGYLP LPTLVANLTS
TGVKYYNYTD LAANADYFKK FDVGAVGSYA YKFRIYSGDI LVGTANIVAY YPTVNKSGLL
YNMTYASSVL EAEYGDTKHY DMYSVQVLNP GRFYDREHVL HIAIMRVFQN ILMRDACGNP
LTGVSAGLAG ANLMLVIQVG GKNYTITNLP LGSEVPVDIA IPIDEWGEPL VDVKSGNLTA
YAVVNYFGYK LYAVDNPAKI PTSAPVWFNI PVKGERSVVK KPIIYLPISP LLFRTWSQVV
SVGYDPLKEP LMGFVVRILP TTTGEIARSI SNKDGYAYVP NVPIGVPLRV QVRTIVPSSD
KRWPYTFYQI NNNNDYASYA TALGFTAADN VYTLGTRGDI DSGLVVFDKQ IIIDSTNATK
YICAKQAIDL PLEVYDLVVR VFDKTGKYLL RSQPVFLGPY PQATRPVLLN VTLLLADDYS
PYRYASMWRD FAMGDFKILT DYRAIGITGM RSIYLSLASK YLSETQKALG CPKYTTANYS
SAINSYALAS MAGFVANAST DRYAAVYLLS SQQPKDVMDI CAFKTDLASL NKMQPGSAEI
ARLFMKGQRL RFVVWYMGQK VFDDYVTITG PLVDIKADVY PINVTTYTKS MRLPVDTFVG
FTITDVYLGL ALNKTNGMFA NKSLVPQLVA PFNSTYSTYN LTYLLANELA KSVGQQYNDN
VTAFVGTNKV GQWLYGYYVP AQFGGDFAYL PNIVVLRNGT VAKYDFTILN GLVTTQSTFR
VSRDISAGSG AIQVPSGQTI VFNLTGAKIV IDPASNITYT KLTAYNGTEN NNPVKPQLNI
TGSLVSINGS LVNNYRVNVN ANYSIAITAS APCNGSITIA PGARDLTVQV TVKAGRCPAT
VTYSATNRTY STQTVVSTSA PGAEYAVSFD RWFLVPYDWY LTQYNVLYHA TNVVKRNDVI
WVLSHTGKTA MCATPQGQVT AGEDDRYIYA LNLTGLEVVN YRTLRVELPW LATGGGKAVV
NITAYFADNN TLIDWKVYNL TEILAGTRGT RVIVTLPLNF GKVGAKAYDI ATTNAAVKYV
ITFDMYNPKE APNSVCATKL VPLAGSYQNV SMYECSVPTT PGLPERIDPQ TVVYAIDLQR
SFANGFNETY SAYGGFGTPI TYVVKSGEIA LLPSWYYKTS VAGSRIARIW IIAANKDPAQ
GPALGTKYYS YSVKDDKVTI NIYKFEKYLV VNYVHNVCPA GWTSQTFLDE FDGSGRIIGL
GFGTSGTSTL VLGNYTGVRM WNSTDMWLSG GVFKLPTVAL DNLTVENTAD FPIVVSSLNV
KYGDYKYEIP MSPLRVDAGK TNTTRLNGYG FGRTYMFNIS DVWSFTLVQP NYAYGLSAYH
AGLVDAARYF GLGTVDVTKY LKPLESKYLV QNVLYASHAE TSDWTFNILY GKIAETTRGT
WGDLTVRSDD TDYKYVFSFP TLPLKEIRDW NDRPLANQTI ALFDRTGRLY AVVYSDSRGR
LAYPLPDISA IGQTNVVRVS WFNGYLIELL RGKPEFTIWI YNQLIQRDVT ELGDATASSY
IRTYVYPLTA TVKDDAGRPL TNMYVKVIDV STLGQLVNAA NKTGADGSAQ VVDMRISKYS
SGVMSQVPAT SYYYYVYDQS GALVALGKFE IQRGASQPST AYNVVATVRY ATEIPVKNSA
TRGYLLIKGV EFLNGTKKDV KIPFTISGGA MTLQGKVPVS VEYPVDIYVT HVTLGGQEVP
VKGGQFLVFS GKTTDLLSGL DFAELGLTSL VTIQAVDSAG SPRSDWTVLV RYGNLTVAQG
AGQLQLVLPR SDVLEMPYTV NVVTNVYTPD GKALVKSQTL DLRQKFVSLQ IPIATVKAVV
QAVDGFGNVR SDWPVVIENV ATGMGQITTE LVEGERYVVR ATGLGYTNAT QITAKGPQMV
VRVMIPTGKI VASVVDGFGN VRSDWPIEIV GVTAGQGKIG EIDVIAGQYT VRTKVFGKEF
ASTVNVGVGK VATVTLQVPT AKLSITVVDD DKKPIDNYVA EVLVSDMAFS TPPKNLEVLA
GTYTVKVSAL GKDATTQVAV NAGETKNVQV VVPGTAGLDL FGTRIPLPTL VLYGLLLLVV
VLILAIIVIE YNNWRRRRLM QILAPPK