Gene Pisl_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1111 
Symbol 
ID4617579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1003653 
End bp1011623 
Gene Length7971 bp 
Protein Length2656 aa 
Translation table11 
GC content48% 
IMG OID639784205 
Producthypothetical protein 
Protein accessionYP_930625 
Protein GI119872618 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAACC AAACAAAGAC AATACTGGCA CTCCTACTAC TGACAGTCCT AACAGCAATC 
GCCTACGCTC AATATGGCCC CGGCCAGGCC TTACCTTCGG CATTCAACGT AGTAATGCAG
ATAAACGACG CAAGAGCGGC GGCTGGGTTA GGCTATCCAC TCTGCACGCC CTGGTCCAAG
GGTCTGGCAA ACTACCCGCC ATATAGCTTT GCAGCAGGTA AAAATTTCAC TCTAATCATA
CGTGAAACTG CTTCAAGCCC GGTCTACCCA CCGGTATTCC AAGACTACAG AGTATCTGCA
GTGGCCAACT CTACAGGCTT TGTGACTTTC AACATCAACG TACCGGCTGG TGTAGATGTT
ACAAAGAGGA CAAATTGGTA CGTGGCGATT GTCGTCGAGT GGCCTAGGCC CGGTTACTAT
TTCTTAATCT ACAACCAGAC GTTCACAAAC GCTACGTTTA TAGACGTAAT TGGCAACCTA
AGCGGCAGAC CCGACCTAAC AGATACGAAG ACATACACAG GCCCGTGGGG TGTAATAAAA
GTGACAAACG GAGCAAATAG ATTCGGCAAC CTTTCTTCAA GCGTAATCCA CACATACTAC
ATAGGGATTA ACACACTAGG CAAGCCCTTC ACGTTAACTT TGACAAGAAC TATCACAATA
GCGGGCACTA CAATTACACT TGACGACAAC GTCGGCCCCG CCAGAGGGCC GACTAACTAT
GTGGTGGGCT ACGACCCTAA TGTAAACGTT GTATTTGGGC CATTTGCATA CCTCAGCACG
TATGTAATAT CTCTGGGAGG CCCAGGCAAA ACTGTAACTA TAGACCCCAC AAAGGTGTTC
TTTAACCACT ACGTATACAT CACAATTGAA GGCTTGAGAG GCGTCGTTAT CCAGAGCAAG
AATGACCAAT TTACTGCCTT CGGAGGCCTT AGGTATGTGG CTATTACATT AAACAAGACC
GCAGGCCCTG GCACAAGCGT GAGTGCGCCA ATGTCATGTG GTCTCATAAT ATGGAACATG
ACCATGTATA CGGTAAAAAT TACCGGACTC CTTGACCTAA AGGGCAATCC CATATTTAAC
CCAGAGAACT TTAGATACAA GATCCAGCTT AGAGTTGGAG ACAGCTGGGT GACTTTAGAT
AGGGCACAAG CTTCTTGGAG ATTAGATCTA TCAGATGTAA AAGCTGTATT AAACCAAGTT
TGTGGTACAA TAAAAACATT CACAGATATA ATGTACTGCT ACAGAACTCT AGGCCCCAAT
GAGTTTAGAC AGGCGGTGCT CAAGCTCTAC GAACCTGTAA CACTTGCGAG ATTAGGCGGC
CAGTTACAGT TAACAAATGA CAATGCTAAA GTATCTACTG CGGACTTAAC ATCTAAACTT
GTGGTTGAAT ACTCATATAC GGCGGGCCAG GACTCTGTAA AGGCAGTTGT ACTTGAAGTC
GCACTCGCCG AGCTTGCCAA CTTCCAGGGT GTATTAAATG TGTCTGTCTT GCCGGTTCAG
ATAAGGCTGT GGAGATGGAG TAATTACAGT CTGCCGCCGT CGGTGCCTAC TCCAAGGGAG
TACTTCTATA CAGACCCACT TGACTTGGCA TCTCTAAGAT TTGAAGTAAC AGGCGATACG
ATGTATATTA ATAGAATGGC CTACGACGGC GCTATCTCAT ACGACCCGTG GCTTGGCCAA
GTCATCTGGG TACTACCGCC CTACCAGCCG GTTCCCGGCC TTGTCGGGAA TGTAAAGGCG
TCGTTACTCT CTTTGTTTAA CGCAAGTGGA TACTTGCCAA TGCCGCCTTT AGTCGCCAAC
CTCACAACTG GCGGTGTGAT AAAGTACTTC AATTACTCGG ACGTTGCCGC AAACGCGGAC
TTCTTCAAGC CTTTCCAGAT AGGCCTTGTC GGTAGCTACG CATATAAGTT CAGAATCTAC
AGCGGAGATG CTCTAGTAGG CACAGCCAAC GTAGTCGCCT ACTATCCAGT TATTAATGCA
AGCGGGTTAT TATACCACTC TAAGGCTGGG GACGCCTTAC AAGCTGTATA TGATGATAAT
ATGCATACAG ATATGTACGC AGTGCAAATC GTCACGCCAG GCAGATTCTA TGATAGGGAA
CATGTTATAC ACATAGCAAT CGCGAGGATC TTCCAGAACA TCTTAATGAA AGACGCATGT
GGCAACCCAG TGATGGGCGT GAGCGCTGGG TTTGCAGGCG CCAGTATATC TCTAATAATG
AAGATAGGCG GGAAGAACAT AACTATTGCC AAACTGCCGG TGGGCAGTGA GGTGCCTGTC
GACATAATGG TGCCGATTGA TGAATGGGGC AACCCGCAGT TAGATCTCAA AGGTGGGTAC
ATATCGGCCT ATGTTGTGCT AAATTACTTC GGCTACACGC TGTATCCAGT GGACGACTTA
GCAAAGCTTC CGGCTTCAGC GCCCGTCTGG TTTAACATAC CGATTAAGTT TGGCGTTGTT
AAGAAGCCTG TGATATATCT ACCTATTGCG CCGCTAAACT TCAGAGTTTG GAGCCAAGCT
GTATCTGTCG AATACGACCC ACTGAAGGAG CCTCTAATGG GCTTTGTAGT AAGGATATTT
AGCACTGGAA ATGTGGAGAT CGCCAGGAGT ATTTCTAACA AGGACGGGTA CGCCTATATG
CCAAATGTAC CCATAGGAGT GCCCTTCAAA GTACAGGTAA GGACAATAGT GCCAACTTCT
GACAAGAGGT GGTCGTACAC CTATGAGCAG ATTATCAATA AGAACGACTA CAGCTCCTAC
GCTAAGGCAT TGGGCTTCTC GCCTGGAGAC AACGTCTATA CGCTTGGCAC AAGAGGAGTT
TTTGACAGCG GCTTGGTAGT TTACAGCAAG ACTATGTTAC TTGATGCCTC TAATGCTACT
AAATATATAT GTGCAAAGAG TGCAATAGAT CTGCCAGTAG AAGTTTATGA CATCGTTGTT
AGAGTCTTTG ACAAGACTGG GAAGTATCTA TTAAGAAGCC AGCCAGTGTT CCTCGGCCCG
TACCCACAAG CCACAAGGCC GTTCTTGTTA AATGTAACAC TAGTATTAGC AGATGAATAC
AGTCCATATG ACTATGCCTC AATATGGAGA GATTACTCCA TCGGCGACTT CAAGATCTTG
ACAGACTTCC GCGCCATCGG CATCACTGGG ATGCGTAGTA TCTACCTCAG CTTGGCGTCT
AAGTACCTAG ATGAGACAAA GAAGGCGCTC GGATGTCCAC AGTATACAAC AGCTAACTAC
TCCAGAGCTA TAAACGCCTA TGCCCTGGCC GCAATGGCGG GCTATATTGC CAATGCGTCG
ACAGATAGAT ATGCCGCGGT TTACCTCCTC ACTTCGCAAC AGCCCAAGGA TATAATCGAT
ATATGCCAGA TGAAGCCTTC GCAAGCCGGC GCTGTCGAGA TCGCCAGGCT ATTTATGAAA
GGCCAGAGAC TGAGGTTTGT CGTATGGTAC CTAGGCCAGA AGGTATTTGA CGACTACGTC
ACAATTACAG GACCGTTAGT CGATATAAAG ACAGACGTCT ACCCGATTAA CGTAACGACT
TATACAAAGA GTATGAGACT GCCGGTCGAC ACCTTTGTAG GATTTACAAT TACAGATGTA
TACCTTGGCC TCGCGCTTAA TAAGACAGAT GGAATGTTTG TAAACAAGTC TCTAGTGCCA
CAACTAATAG CGCCGTTCAA TACAACGTAC ACCACATACT CGCTTGCTTA TTTATTAAAC
AACGAATTAG TCAAAGGCAC GGCTCAGCAA TTTAATGACA ACGTCACTGC ATGGGCTGGC
GGGGCTTTGA AAGACAAAGC GCCATATGGA AGTTATGTGC CCGCCCAGTT TGGCGGCGAC
TTTGTATATC TGCCGAACCT AGTCGTCATC CGCAACGCGA CTACGCCCAA GTACCAATTT
ACCATATTAA ACAGATATTC AACAACGACT TACACCTACA GATCATACAG CGATACAACT
GCTAGCGGCT CGCTACAAGT GCCGGCGGGC CAGACTCTAT CAATTACACT AGTAGGCGCC
AAGATAGAGG CGGATCCCTC AAACAACGCG ACATATGTAA AGTTAACTTC ATATAATGGA
GCTGAGAATG TCCAGCCGCA GTTAGTTATC AACGGTAGTC TTGTAGTTAT CAACAACGTC
TATGTGACGA ATTACACGAC AACTATAAAC GCCAACTATA CAATTGTATT GACAATAAAT
GCTAAATGCG GCAGCATACA GATAGCGCCA GGCACAAATG GCCTAAAAAT TACTGTAAAT
GCCGGAACGT GTCCTGCGGC TGTTAGCTAT GTTGCAACTA ATAGGACTTA TACCACTCAG
ACTATTACTG TAACTGCTGT GCCTAGCACA GAATATGCTG TGAGCTTTGA TAGATGGTTC
TTAGTGCCAT ATGACTGGCG TTTTGCGCAG TACAACGTCC TCTACCACGC TACAAACGTC
GTTGAGAGAA ACGATATACT AAGAGTCTTA GCTTACACTG GCTTGACACA GAAGTGTGCC
ACTGCGGCCG GAGTAACACA AGTAGGAGAG GATGACAAAT ACACCTATAC GCTGACGTTG
ACAGGAGTCG AGGTTGTTAA CTACCGCACT TTGAGAGTAG AACTGCCTTG GAAGACGACA
GGCGGCGGGA AGGCCTATGT AAACATAACT GCGTACTTTA CCAACGGAAC TTTAATTGAC
TACAAGGTGT ATAACCTCAC TGAGATACTA GCTGGCGCAA GAGGTACGAG GGTGGTAGTA
ACATTGCCAC TAAACTTTGG AAAAGTGGCT GAGAAGGCCT ATGCAATTGC TACTAAGAAC
CTAGCTGTTA GATATGTAAT CGACTTCTAT ATGTCTGACC CCACCGCCGG GCCTTACGAC
GTCTGTGCAA CTAAGCTGGT ACCTCTCGCA GCCAACTACA AGACTGTGTC TATGTATGAA
TGTGCAGTTC CCTCGGCGCC TGGCGCACTT GAGCGTATTG ATCCAACGAC AGTGGTATAC
GCAATAGACA AGGATCTATC TAATGCCAGC GGATTCCAAG AGACATTCGA CACATATGGT
GGGTTTGGGA CGCCTATCAC CTATGTTGTA AAGGCTGGCG AAATTGCCCT CTTGCCGTCT
TGGTACGGCA AGACCTCTGT CGCTGGCTCT CGCATTGCCA GACTGTGGAT TATTGCGGCT
AATCCAGACT ACGGCCAAGG CCCTGCATTA GGCACTAAGT ATTATCAATA CACTGTCAAA
GATGACAAAG TCACGATTAA CATATACAAA TTCGAGAAGT ATCTAGTAGT CAACTACATA
CCGAATGTAT GTCCAGCCGG ATGGACTACA CAGACCTTCT TAGACGAGTT TGACGGCTTT
GGCAGAATTA TAGGTCTTGG CTTTGGCGCA TCTGGCACAA GCGCATTAGT ATTAAGCAAC
TACACAAAAG TCCCCATGTG GAACTCGACT GCTATGTGGC TTGCTGGTGG AGCCTTCAAA
CTGCCTACTG TTGCGCTTGA TGCATTAACT GTGCAGAACA CAGCTGACTT CCCAATTGTC
GTCAGCTCGC TAAATGTTAG ATACGGCGAC TACAAGTATC CGATACCCAT GCCGTTGTTA
CGCGTAGACG CTGGGAAGAC CAACAGAACT CTGTTAAGTG CCTATGGCTT TGGCAGAACC
TACATGATAA GCGCACAGGA TGTATGGAAC TTCAGACTTA TACAGCCTAA CTATGAATAT
GGCTTAAACG CTTACCACGC CGGACTCTTA GACGCGGCTA AATACTTCGG CCTAGGCGAT
GTAGATGTAG CTAAATACCT AAGGCCGTTG GAGTCTAAGT ACTTCGTCCA GAACGTGCTA
TATGCGAGCC ATGCCGAAAC CTCAGAGTGG ACTTACAATA TACTCGGCAG GAAGATCGCC
GAGTGGACTA GAGGCAGATG GGGCGATCTA ACTGTGAGAA GCGACGACTT TGACTACAAG
TATGTATTCG GATTCCCAAC TTTACCGCTG AAAGAAATAC GTGATTGGAA TGACAGACCG
TTGGCTAACC AGACCGTGGC GCTGTTTGAC AAATCCGGTA GACTCTACGC AGTGGTCTAC
AGCAACAGTC TTGGCAGATT GGTATATCCG TTGCCAGATC TATCAAACAT CGGGCTCTCC
AACGTGGTGA GAGTGGCTTG GTACAATGGC TATCTGGTAG AGTTGTTAAG AGGCAAGCCT
GAGTTCACCA TATGGATTTA CGACCAGTTG ATACAGAGAG ATGTTACCGA ACTTGGCGAT
GCATCTACGA ATAACAAGAT TAGAACCTAC GTCTATCCGT TGACTGTCAC TGTGAAAGAT
GACGCTGGGA GACCGTTGAC GAACATGTAC GTTAAAGTAG TTGATACTTC AACTGTCGGC
CAGTTGGTAA ATGCAGCTAA TAAGACCGGC GCCGACGGTG GCGCACAGGT TGTGGATCTG
AGGATTTCGA AGTACTCAAC TGGCGTGCTG TCGCAGATTC CGCCTACCAG CTACTACTAC
TTCGTCTATG ACCAATCTGG CGCCCTAGTG GCCGCCGGCA GATTTGAGAT ACAACGCGGC
GCCTCGGTGC CGTCCACCGG CTGGAACGTG GTGGCTACTG TGAGGTACGC CACCGAGATA
CCTGTGAAGA ACAGCGCTAC TAGAGGCTAT TTGTTGATTA AGGGCGTGGA GTTCCTCAAC
GGCACTAAGA AAGACATCAA GATTCCGTTT ACCATATCTG GCGGTGTGAT GGTGCTCGGC
GGCAAGGTGC CTGTGTCTGT TGAATACCCG GTTGAGATCT ATGTGACTCA TGTGACGTTG
GGCGGCCAGG AGGTGCCTGT TAAGGGCGGC AAGGCGCTTG TGTTCAGCGG CAAGACCACT
GACTTACTAG CTGGTCTTGA CTTCGCCGAG CTTGGGCTGA CCGGCGTAGT GACGATACAA
GCTGTTGATG CCAGCGGCGC TCCGAGGAGC GACTGGACTG TGCAAGTGCT CTACGGCAAT
CTAACCGCCG CCCAAGGTGC CGGCCAGGTG CAAGTTGTAT TGCCGAGGAC TGATGTGTTA
GACCAGCCGT ATGTCGTGAG AGTGATTACA AACGCAATTG CGCCAAATGG CAAGGCGTTA
GTCAAAGAAC AGACGCTTGA GTTAAAACAG AAGGCGCTGT CTCTACAGAT TCCTGTGTCT
ACTGTGAAGG TGGTTGTGCA AGTGATTGAC GGCTTTGGTA ATGTAAGAAG TGACTGGCCT
GTGGTTGTAG AAAACGTCGC CACTGGCATG GGCCAGGTGG CTACTGAGTT GGTAGATGGT
CAACAGTATG TAGCTCGTGC AACAGGTCTC GGCTACACCA ATACCACAAC CTTTGTGGCC
AAGGGCCCGC AGATGGTGGT GAGGATCAAG ATACCCACTG CTAAGATCGT GGCCCAGGTG
GTGGACGGCT TTGGCAACGT GAGGAGCGAC TGGACTGTCC AAGTCGTCGG CGTAACAAGC
GGTCAGGGTA GCATAGGGCC CGTAGAGGTG CTAGCCGGCA CATACACAGT GAAGACTTCA
GTCTTCGGCA AGGAGTTCAG CCAGACTGTC AACGTCCAGC CGGGCCAGAC CGTGACTGCG
GCTGTGCAAG TGCCCACCGC TAAGCTGAGC GTGACTGCCG TTGACGATGA CAAAAAGCCT
CTTGACCGCT ACGTCTCTTC TGTAGAGCTG ACTGGCCCAC TGTCGTTGAT GTTCTCCACG
CCGCCGAAGG ATGTAGAAGT GCTTGCCGGG ACCTACAGTA TAAAGGTGCA GGCGCTAGGC
AAGGAGGCTT CTGCTCAGAT CACGCTGAAT CCAGGCGAGG TGAAGAACAT ACAGGTGGTG
GTGCCTGGCA CCGCCGGGCT TGACATTGGT GGTACTAGAA TTCCGTTGCC GACGCTGGTG
CTGTACGGCC TGTTGTTGCT GGTGGTCGTT GTGATACTGG CGATATTGAT AATTGAGTAC
AACAACTGGA GGAGGAGACG TCTAATGCAG ATATTAGCCC CGCCGAAGTA A
 
Protein sequence
MHNQTKTILA LLLLTVLTAI AYAQYGPGQA LPSAFNVVMQ INDARAAAGL GYPLCTPWSK 
GLANYPPYSF AAGKNFTLII RETASSPVYP PVFQDYRVSA VANSTGFVTF NINVPAGVDV
TKRTNWYVAI VVEWPRPGYY FLIYNQTFTN ATFIDVIGNL SGRPDLTDTK TYTGPWGVIK
VTNGANRFGN LSSSVIHTYY IGINTLGKPF TLTLTRTITI AGTTITLDDN VGPARGPTNY
VVGYDPNVNV VFGPFAYLST YVISLGGPGK TVTIDPTKVF FNHYVYITIE GLRGVVIQSK
NDQFTAFGGL RYVAITLNKT AGPGTSVSAP MSCGLIIWNM TMYTVKITGL LDLKGNPIFN
PENFRYKIQL RVGDSWVTLD RAQASWRLDL SDVKAVLNQV CGTIKTFTDI MYCYRTLGPN
EFRQAVLKLY EPVTLARLGG QLQLTNDNAK VSTADLTSKL VVEYSYTAGQ DSVKAVVLEV
ALAELANFQG VLNVSVLPVQ IRLWRWSNYS LPPSVPTPRE YFYTDPLDLA SLRFEVTGDT
MYINRMAYDG AISYDPWLGQ VIWVLPPYQP VPGLVGNVKA SLLSLFNASG YLPMPPLVAN
LTTGGVIKYF NYSDVAANAD FFKPFQIGLV GSYAYKFRIY SGDALVGTAN VVAYYPVINA
SGLLYHSKAG DALQAVYDDN MHTDMYAVQI VTPGRFYDRE HVIHIAIARI FQNILMKDAC
GNPVMGVSAG FAGASISLIM KIGGKNITIA KLPVGSEVPV DIMVPIDEWG NPQLDLKGGY
ISAYVVLNYF GYTLYPVDDL AKLPASAPVW FNIPIKFGVV KKPVIYLPIA PLNFRVWSQA
VSVEYDPLKE PLMGFVVRIF STGNVEIARS ISNKDGYAYM PNVPIGVPFK VQVRTIVPTS
DKRWSYTYEQ IINKNDYSSY AKALGFSPGD NVYTLGTRGV FDSGLVVYSK TMLLDASNAT
KYICAKSAID LPVEVYDIVV RVFDKTGKYL LRSQPVFLGP YPQATRPFLL NVTLVLADEY
SPYDYASIWR DYSIGDFKIL TDFRAIGITG MRSIYLSLAS KYLDETKKAL GCPQYTTANY
SRAINAYALA AMAGYIANAS TDRYAAVYLL TSQQPKDIID ICQMKPSQAG AVEIARLFMK
GQRLRFVVWY LGQKVFDDYV TITGPLVDIK TDVYPINVTT YTKSMRLPVD TFVGFTITDV
YLGLALNKTD GMFVNKSLVP QLIAPFNTTY TTYSLAYLLN NELVKGTAQQ FNDNVTAWAG
GALKDKAPYG SYVPAQFGGD FVYLPNLVVI RNATTPKYQF TILNRYSTTT YTYRSYSDTT
ASGSLQVPAG QTLSITLVGA KIEADPSNNA TYVKLTSYNG AENVQPQLVI NGSLVVINNV
YVTNYTTTIN ANYTIVLTIN AKCGSIQIAP GTNGLKITVN AGTCPAAVSY VATNRTYTTQ
TITVTAVPST EYAVSFDRWF LVPYDWRFAQ YNVLYHATNV VERNDILRVL AYTGLTQKCA
TAAGVTQVGE DDKYTYTLTL TGVEVVNYRT LRVELPWKTT GGGKAYVNIT AYFTNGTLID
YKVYNLTEIL AGARGTRVVV TLPLNFGKVA EKAYAIATKN LAVRYVIDFY MSDPTAGPYD
VCATKLVPLA ANYKTVSMYE CAVPSAPGAL ERIDPTTVVY AIDKDLSNAS GFQETFDTYG
GFGTPITYVV KAGEIALLPS WYGKTSVAGS RIARLWIIAA NPDYGQGPAL GTKYYQYTVK
DDKVTINIYK FEKYLVVNYI PNVCPAGWTT QTFLDEFDGF GRIIGLGFGA SGTSALVLSN
YTKVPMWNST AMWLAGGAFK LPTVALDALT VQNTADFPIV VSSLNVRYGD YKYPIPMPLL
RVDAGKTNRT LLSAYGFGRT YMISAQDVWN FRLIQPNYEY GLNAYHAGLL DAAKYFGLGD
VDVAKYLRPL ESKYFVQNVL YASHAETSEW TYNILGRKIA EWTRGRWGDL TVRSDDFDYK
YVFGFPTLPL KEIRDWNDRP LANQTVALFD KSGRLYAVVY SNSLGRLVYP LPDLSNIGLS
NVVRVAWYNG YLVELLRGKP EFTIWIYDQL IQRDVTELGD ASTNNKIRTY VYPLTVTVKD
DAGRPLTNMY VKVVDTSTVG QLVNAANKTG ADGGAQVVDL RISKYSTGVL SQIPPTSYYY
FVYDQSGALV AAGRFEIQRG ASVPSTGWNV VATVRYATEI PVKNSATRGY LLIKGVEFLN
GTKKDIKIPF TISGGVMVLG GKVPVSVEYP VEIYVTHVTL GGQEVPVKGG KALVFSGKTT
DLLAGLDFAE LGLTGVVTIQ AVDASGAPRS DWTVQVLYGN LTAAQGAGQV QVVLPRTDVL
DQPYVVRVIT NAIAPNGKAL VKEQTLELKQ KALSLQIPVS TVKVVVQVID GFGNVRSDWP
VVVENVATGM GQVATELVDG QQYVARATGL GYTNTTTFVA KGPQMVVRIK IPTAKIVAQV
VDGFGNVRSD WTVQVVGVTS GQGSIGPVEV LAGTYTVKTS VFGKEFSQTV NVQPGQTVTA
AVQVPTAKLS VTAVDDDKKP LDRYVSSVEL TGPLSLMFST PPKDVEVLAG TYSIKVQALG
KEASAQITLN PGEVKNIQVV VPGTAGLDIG GTRIPLPTLV LYGLLLLVVV VILAILIIEY
NNWRRRRLMQ ILAPPK