Gene Pcal_0034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_0034 
Symbol 
ID4909755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp31940 
End bp39925 
Gene Length7986 bp 
Protein Length2661 aa 
Translation table11 
GC content54% 
IMG OID640123787 
Producthypothetical protein 
Protein accessionYP_001054940 
Protein GI126458662 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAACC AAGTAAAGTT AGTACTAGCA CTCCTAATAA TAACCGCCCT GGGAATAATG 
GCATACGCCC AGTATGGGCC TGGCCAAGCA CTGCCCTCAG CCTTCAATGT GGTGATGCAA
ATAAATGATG CTAGGGCGGC TGCGGCCCTC GGATATCCGT TGTGCGCTCC TTGGGAAAAG
GGCTTGGCTC ACTACCCTCC CTATAAATTC GCAGCGGGTA AAAACTTCAC CCTAATAATA
CGTGAGGTAG CCCAAAACCC CGTCTACCCG CCGCAGTTCC AGGACTACAG AGTTTCCTCA
GTTGCCAATG AGACAGGCTT CGTTACGTTT AACATAAACG TGCCAGCTGG CGTAGATGTA
ACAAAGAGAT CTAAGTGGTA TGTGGCAATA ATAGTCGAAT GGCCTAGAAA AGGCTATTAC
TTCTTCGTCT ACGGCCGGGT GCTTGATGGT GCAACTCTGG TAGATGTTAT AGGCAACCTC
AGCGGGAGAC CCGATAAGAC GGGCTTCGAC ACATACGTAA CGCCAGGGGG TGTACTACGT
GTATACCATG GCGATAACAG ATATGGCGTC TTGTCCTCCA GCTATATCCA CACATTCTAT
GTTGGCATTA ACACCTTAGG GAACCCCTAT GACCTCACCT TCACAAGACA AGTAACAATA
AATGGTACCA CCATAACGCT TGAGGACTCG ATAGGACCGG CTTCTGGCCC AACAAACTAT
GTAAAAAGTA TAGATGGAAA AATTACGGCA GTATTCGGAC CTTTCACATA TTTAGCAACA
TACGTTGTAT CACTGGGGAA AGCTGGCACA AGTCCAGAAA CATACCTGGT AACAATAGCC
CCAGGTAAAG TAGCTTACAA TCACTATGTT TACATAACAA TTGAGGGAAT GCGCGGGCTA
GTGATTCAGA GTAAAAATGA TGCATTCGTA GAGCCCCTAG GTTTCGTAGC GTTCACTTTG
AATAGGACAG CTGGGCCTGG TACCGGCGAA CCCGTGCCGC TGTCCTGCGG TTTGGTAGTG
TGGAACATGA CCCTCTACAC AGTCACAGTC ACTGGCCTCC TAGACCTCAA GGGTAACCCC
ATATTCAACC CAGAGAACTT TAGGTACAAG ATCCAGATGA AGGTAGGCGA CAGCTGGGTG
ACATTTAGCA GAGCACAAGC ATCGTGGAAG CTCACCCTAG GCGATGCCAA GGCCGTATTA
AACGAGGTCT GCGGTACCAT TAGCTCCTTC AGCGATATCA TGAACTGCTA TAGGTCGTTG
GGACCCGACG GCTTTAGAAG CGCAGTTCTA CGCCTATATG AACCAGTCAC GCTTGCAAGA
CTAGGGGGGC TGCTCAAGTT GACAAACACA AATGCCGAAA TATCTACCCA AGACTTAATC
TCCAAGCTTG TAGTAGAGTA CTCATACACG GCGGGCAACG ACAGCGTTAA GGCAATAGTC
CTCGAGGCGG CGTTGCTAGA GCCGGCCAAT TTCCAGGGCG TGTTAAACGT GTCTGTGCTC
CCCATTCAGA TAAGGCTATG GAGGTGGGGC AACGACTCTC TGCCTCTGGC ATCGAGAGAG
TTCTACTACA CCGATCCACT CGACTTGGCA TCGCTGAGGT TTGTAGTTAC TGGCGACACA
ATGTACCTAG ACAGAATGGC GTACGACGGA GCTGTAACCT ATGACCCGTG GCTGGGCCAG
GTGATATGGG CACTTCCGCC ATACCAAGTG GTGCCCGGTC TAGTGGGCAA CGTTAAAGCC
CACTTGCTCT CGCTATTCAA CACAAGCGGC TATCTCCCAC TGCCCACCCT AGTCGCCAAT
CTCACAAACG TAGGCGGCAG AGTGACACTC GAGTACTACA ACTACTCAGA CGCTCTTGCA
AATGGAGACT TCTTCGCCAA GTTTGGAATA GGCTTAGTAC AGAAGTACGG ATACAAGTTT
AGGATTTACA GCGGGCAAGT TCTCGTCGGT ACAGCCAACC TAGAGGCCTA TTACCCAGTG
ATAGACAAGT ACGGCTTATT GATACACGCA AGGGCAGGCG ACGCATTACA GGAGAAATAT
GGCGATGACG AACGCAAAGA CATGTATGCC CTGCAAATAT TGCAACCTGG CCGCTTCTAC
GATAGAGAGC ATGTGTTACA CATCGCAATC GTCAGGGTGT TCCAGAATAT CTTGCTCAAG
GACGCTTGTG GCAACCCGGT GTACGGTGTT GGCGCCGGCT TGTCTGGAGC TAGCCTATCG
CTGGTAATAA CAGTGGGCGG GAGAAACTAC ACCATTGCCA AGTTGCCGCT TGGTAGCGAG
GTGCCAGTTG ACTTGTTCAT ACCCATCGAC GAGTGGGGCA ATCCGCAGAT CGACCTCACC
GGCGGGTATG TACAAGCCTA CGCCGTATTG AACTACTACG GCTACACACT ATACCCCGTG
GATAATATAA CAAAGATACC GTCGTCGCAG CCCACGTGGT TCAACATACC CATCAAATTC
GGCGTTGTCA AAAAGCCAGT CCTCTACTTG CCCATAGCGC CATTGCAGTT CAGAGTGTGG
AGTCAGGCGG TTTCTGTGGA CTACGACCCG TTGAAGGAGC CACTGATGGG CTTCGTCGTT
AGAGTATTCA GCACAGCCAC GGGCGACGAG ATTGCCCGCA GTATCTCAAA TAAGGACGGC
TACGCCTATG AGCCAAATGT GCCCATAGGC GTGCCGTTTA GAGTGCAAGT CCGCACAATA
GTACCTACAA GCGACAAGAG GTGGAGCTAC ACCTACGACC AGATTACAAG AAAGAACGAC
TATGCCTCAT ACGCCAAGGC GCTTGGCTTC ACGCCGGCTG ACAATGTCTA TACGCTGGGC
ACAAGAGGCG CGATTGACAG CGGCTTAGTG GTCTACTCCA AGACGATGGC GTTGACGGCT
GAGAATGCTA CTCGGTATAT CTGTGCCAAG AACGCCATAG ACTTGCCCGT GGAGGTGTTC
GACTTGGTGG TGAGAGTGTT TGACAAGACC GGCAAGTACT TGCTGAGGAG CCAGCCCGTC
TTCCTAGGCC CGTACCCACA GGCCACGAGA CCCGTCTTGT TGAACGTGAC GCTGTTGCTT
GCCGACGATT ATAGCCCATA CGCCTACGCC TCGATCTGGA GAGACTTCGG CAACGGCGAC
TTCAAGATCT TAACCGACTT CCGCGCCATA GGCATCACTG GTATGCGCTC CATCTACTTG
AACTTGGCCA GCAAGTATTT GGACGCGGCT AAGAAGGCGC TTGGATGCCC GCAGTATACC
ACTGCCAACT ACTCCGCCGC GATTAACGCG TACGCCCTTG CGGCCATGGC TGGATACGTC
GCCAACGCCT CGACCGACAG ATACGCAGCG GTCTATCTCT TGACGTCGCA GCAGCCCAAG
GACATTATCA ACCTCTGTGA CATGAAGCCG TCGCAGGCTG GTACCGCGGA GATAGCCAGG
CTCTTCATGA AGGGCCAGAG GTTTAGGTTC GTGGTGTGGT ACATGGGCCA GAAGGTCTTC
GACGACTATG TCACAATAAC TGGTCCACTG GTCGACATAA AGGCCGACGT ATACCCAGTG
AACGTCACCA CCTACACCAA GAGCATGAGG CTACCCGTTG ACACCTTCGT GGGCTTCACA
ATCACAGACG TCTACCTAGG GCTAGCCCTC AATAAGACCG ACGGCATGTT CGCCAACAAG
TCGCTAGTGC CGCAGTTAAT TGCGCCGTTT GATACAATGT ACAAGACTTA CAACCTCTAC
TACTTGATAA GGGACGAGTT AGCCAAGGGC ACGGCCACGT TGTACAATGA CAATGTTACA
GCGTTTGTTA CTCCGCCAGC TTCCGTATAT TATGGTGCCT ACAAGCCTGC ACAGTTTGGC
GGCGACTTCG TATACCTGCC CAACCTGGCA ATTCTGAGAA ACGCCACCGC GCCCAAGTAC
CTGTCTACTG TGCTTAACAG TTACATCTCC ACCACTATAC CAATACGCCA GTTGAACACC
ATACCGGCTG GTAGCGGCAC TGTCACCATA CCGGCTGGCG GCTCCGCGAG CATCACCATT
GTTGGCGCAA AGATAGTGCC AGATCCGAAT AACAACGCGA CGTACTTCAA GGTGACTTCG
TACAATGGCA CTGAAAATAG CAATATCGTT CAGCCACAGT TAATAATAAC TGCAGAGTTA
GCGATAGTAG GTGGTTCGTA TGTAAAGAAC TATGTAGCTA CCGTCGATGC CAACTACACA
ATTGACTTGA CGGCTATAGC AAAGTGTGGT AATATAGTGG TTGCGCTCGG TGGGGGCGAT
TTAACATTGC AAATTACTGT GAGTGCTGGC TCATGCCCCG CTACCATCAC CTATGCCGCC
ACTAACGCCA GCTACACGTG GAAGGCGTTA ACATCTTCTG CGGCGCCTTC GACCGAGTAT
GTGGTGAGCT TTGACAGGTG GTTCCTAGTG CCGTACGACT GGCTGTTTGC ACAGTACAAT
GTGCTATACC ATGCAACCAA CGTAGTGGAG CGCAACGACG TGCTACAAGT GTTGGCATAT
CCGGGCGCTA CACAGATATG CGCACAACCC GCTGGACAAG TAGCAGCGGG CGAGGACGAT
GAGTATAAGT ACAAGTTGAC ACTGACAGGC GTTAATGTGG CAAACTACCG CACGCTGGCT
GTGGAGTTGC CTTGGAAGGC TGGTGGCGGC GGCAAGGCCT TGGTGAATAT CACTGCGTAC
TTCGCCAATG GCACTAAGAT TGACAGTATT GTGTACAACT TGACTGACAT GCTGAAAGAC
GCGAGGGGCA CAAGGGTTAA GGTGTTGTTG CCGTTGAACT TCGGCAAAGT TGGCGCAAAG
GCCTACGACA TAGCTGTAAG CAAGGCCACC GTCCGCTTTG ACATTACGTT CATGATGTAC
GACCCGAAGA CTGGTCCATA CAGCGTCTGC GCAGTGAAGC TGGTGCCTCT CGCCGCCAAC
TACACCAGCG TCTCCGAGTA TGAGTGCACT GTGCCAACTG CACCGGGCGT GCCGGAGCGC
ATAGACCCAG CCACTGTGGT ATATGCCATT GATCCGATGC TGTCGAGCGC CAACGGTTTT
GATGAAACGT TCGATGCCTA CGGCGGCTTT GGCACGCCGA TTACCTATGT GGTGAAGTCT
GGCGAGGTTG CTCTACTGCC GTCGTGGTAC TACAAGACCG CTGTGGGTGG CTCCCGCATT
GCTAGAGTCT GGATAATTGC GGCTAGCGAC GACCCGAGCA AGGGCCCGGC GCTTGGCACT
AAGTACTACA GCTACACTGT GAAAGACGAC AAGGTGACGA TCAACGTGTA CAAGTTTGAG
AAGTACTTAG TGGTGAACTT CGTGCCCAAC GTCTGCCCGG CCGGCTATGT GAGTCAGACG
TTCTTAGACG AGTTCGACGG CTCTGGCCGC ATCACTGGCC TAGGCTTTGG CACAGGCGGC
ACCAGCGCTC TGGTGTTGAG CAACTACACG CGTGTACAGA TGTGGAACTC CACTGCCATG
TGGCTAGCCG GCGGCGTGTT TAAGCTGCCC ACCGTCGCGC TCGACGCCTT GACTGTGCAG
AACAACGCCA AGTTCCCAAT CGTCGTTGAC TCGCTGAACG TGAAATACCG CGACTACAAG
TACAGCATAC CCATGGCGCC TGTCCGCGTC AACGCCAGCG AGACTAGGAC CGTGTTGCTG
AACAGCTACG GCTTCGGCAG AACCTACATG TTCAACATAA GCGACGTGTG GGCGTTTAAG
CTTGTACAGC CCAACTACGA GTACGGTCTC AACGTCTACC ATGCCGGGTT GCTAGACGCG
TTGAAGCACT TTGGCATATC CACTGACAAG CCGCTGTCAG CCTACTTGAA GCCGCTGACC
TCTGCCTACT ATGTGCAGAA CGTGCTGTAC GCCAGCCACG CCGAGACCTC CGACTGGACG
TTCGGCATAT TAAACGGCAA GATCACTGAG ATCACGAGAG GTACTTGGGG CGACTTGAGG
ACAGACGCCA GCGACTACGA CTACAAGTAC GTCTTCAACT ACCCGACTCT GCCGCTTAAG
GAGATACGCG ACTGGAACGA CAGGCCGCTT GCCAACCAGA CTGTGGCTCT ATTTGACGGA
AGCGGCGCCT TATACGCCGT CGTGTACACC GGCGCTAACG GCCAGTTGAT CTACCCGTTG
CCCGCCATCG CCAGCCCAGT GGTGCGCGTG GCTTGGTACA ATGGCTACCT TGTAACGCTG
ATTAAGGGCT TGCCGGAGTT TACCATATGG ATCTACGACC AGACCATTAG CCGCGACGTC
ACTGAGTTAG GCGACGCCCG CGTTGTTGAC AAAATTAGGA CGTACGTCTA TCCGGTGACT
GTGTCTGTGT ACGACGAGGC TGGGAGGCCG TTGAACAACA TGTGGGTGCG GGTGATAGAC
GCTGGCACCT CTGGCAACCT AGTTACATCG CTGAACAGTA CTGCCTCCGA TGGCGGTGCG
CAGATTGTTG ACTTGAGAAT TTCGAAGTAC GCCAGTGGCG TGATGTCGCA GATACCTGCC
ACGTCGTATT ACTACTACGT GTATGACCAG AGCGGCGCCT TGGTGGCGGC TGGCAAGTTT
GACATAGAGC GTGGCGCATC TGTTCCATCG ACTGGTTACA ACGTCCAGGG CAAAGTGATT
ACGTACTCGC AGGTGCCTGT GAAGAACTCG GCGACTCGCG GCTACATAGT TGTAAAGGGC
GTGCAGTTTG TCAACGGCAC AGTCAAGGAC GTCGTGTATC CGTTCACAGT GTCTGGCGGT
GTGATGACTA TAAGTGGCAA GTTGCCGCTG AGCTACTCGT ACCCGGTGGA GATCTACGTA
ACTCACGTGA CACTTGGCGG CCAGGAGGTG CCTGTGAAGG GCGGAAGGTT CCTCGTGTAT
AGGGGCACTA CCACCGACCT CGCCGCTGGC CTAGACTTCG CAGAGCTTGG CCTGACTGGC
GTGGTGTCGA TCTCCGCCGT TGATACAACC GGCGCGCCGA GGTCCGACTG GACTGTGCAA
GTGCTCTACG GCAACATAAC TGCGGCGGAG GGCAAGGGCT CTGTCAACGT CGTCCTGCCG
CGCACCGACG TGCTAGGCCA GCCGTACACG GTGAGAGTGG TGACTAATGT CATTACGCCG
GAGGGCAAGG CTCTTGTGAA GGAGCAGGCG CTTGAGGTAA CGCAGAAGGC CCTCGCTGTG
CAGATACCCA TATCCACTGT CCGCGTTGTG GTGCAGGCGG TGGATGGCTT TGGGAACGTT
AGACAAGACT GGCCAGTTGT GCTCGAGAAT GTTGCCTCTG GCATGGGCCA AGTATCTGCA
GAGGTGGTGG AGGGTCAGCG CTATGTGGCT AGAGCCACTG GCTTAGGCTT TACTAACACT
ACCGCCTTCA CGGCGAGGGG TCCGCAGATG GTTGTCGCAA TTAAGATACC CACTGCCAAG
ATCACTGCCC AGGCGAAGGA CGGCTTTGGT AAAGTGAGGA GCGATTGGCC AGTCGAAATT
GTTGGCGTGG CGGCTGGGCA GGGCACTGTT GGGCCAGTGG AGGTGCTTGC TGGGCAGTAC
ACTGTGAAGA CGTCTGTCTT CGGCAAAGAC TTTACGCAGA CTGTCACACT GCAGCCTGGG
CAGTCGCAGA CAGTTGTTGT GCAAGTGCCC ACGGCCGTGT TGAGCGTCAC TGCTGTGGAC
GACGATAGGA AGCCAATTGA TAGATATGTG ACCGCTGTTC AGATCAGTGG CCCAGTGTCG
CAGAGCTTCT CCACGTCGCC TAAGAACCTC GAGGTGCTTG CTGGGCAGTA CACGGTGACA
GTCTCTGCCT TGAACAAGCA GGCCTCGACG CAAGTGACAC TACAGCCTGG CCAGACTGCG
AATGTAGAAG TTGTTGTGCC GGGCACTGCT GGGCTAGACT TCTTGGGCAC GAGGATTCCG
CTTCCAACGC TGGTGCTCTA CGCGCTGTTG CTGTTGGTGA TCGTGGTGAT TCTGGCGATT
ATAATCATTG AGTACAACAA CTGGAGGAGG AGACGCTTAA TGCAGATTTT GGCCCCGCCG
AAGTAA
 
Protein sequence
MHNQVKLVLA LLIITALGIM AYAQYGPGQA LPSAFNVVMQ INDARAAAAL GYPLCAPWEK 
GLAHYPPYKF AAGKNFTLII REVAQNPVYP PQFQDYRVSS VANETGFVTF NINVPAGVDV
TKRSKWYVAI IVEWPRKGYY FFVYGRVLDG ATLVDVIGNL SGRPDKTGFD TYVTPGGVLR
VYHGDNRYGV LSSSYIHTFY VGINTLGNPY DLTFTRQVTI NGTTITLEDS IGPASGPTNY
VKSIDGKITA VFGPFTYLAT YVVSLGKAGT SPETYLVTIA PGKVAYNHYV YITIEGMRGL
VIQSKNDAFV EPLGFVAFTL NRTAGPGTGE PVPLSCGLVV WNMTLYTVTV TGLLDLKGNP
IFNPENFRYK IQMKVGDSWV TFSRAQASWK LTLGDAKAVL NEVCGTISSF SDIMNCYRSL
GPDGFRSAVL RLYEPVTLAR LGGLLKLTNT NAEISTQDLI SKLVVEYSYT AGNDSVKAIV
LEAALLEPAN FQGVLNVSVL PIQIRLWRWG NDSLPLASRE FYYTDPLDLA SLRFVVTGDT
MYLDRMAYDG AVTYDPWLGQ VIWALPPYQV VPGLVGNVKA HLLSLFNTSG YLPLPTLVAN
LTNVGGRVTL EYYNYSDALA NGDFFAKFGI GLVQKYGYKF RIYSGQVLVG TANLEAYYPV
IDKYGLLIHA RAGDALQEKY GDDERKDMYA LQILQPGRFY DREHVLHIAI VRVFQNILLK
DACGNPVYGV GAGLSGASLS LVITVGGRNY TIAKLPLGSE VPVDLFIPID EWGNPQIDLT
GGYVQAYAVL NYYGYTLYPV DNITKIPSSQ PTWFNIPIKF GVVKKPVLYL PIAPLQFRVW
SQAVSVDYDP LKEPLMGFVV RVFSTATGDE IARSISNKDG YAYEPNVPIG VPFRVQVRTI
VPTSDKRWSY TYDQITRKND YASYAKALGF TPADNVYTLG TRGAIDSGLV VYSKTMALTA
ENATRYICAK NAIDLPVEVF DLVVRVFDKT GKYLLRSQPV FLGPYPQATR PVLLNVTLLL
ADDYSPYAYA SIWRDFGNGD FKILTDFRAI GITGMRSIYL NLASKYLDAA KKALGCPQYT
TANYSAAINA YALAAMAGYV ANASTDRYAA VYLLTSQQPK DIINLCDMKP SQAGTAEIAR
LFMKGQRFRF VVWYMGQKVF DDYVTITGPL VDIKADVYPV NVTTYTKSMR LPVDTFVGFT
ITDVYLGLAL NKTDGMFANK SLVPQLIAPF DTMYKTYNLY YLIRDELAKG TATLYNDNVT
AFVTPPASVY YGAYKPAQFG GDFVYLPNLA ILRNATAPKY LSTVLNSYIS TTIPIRQLNT
IPAGSGTVTI PAGGSASITI VGAKIVPDPN NNATYFKVTS YNGTENSNIV QPQLIITAEL
AIVGGSYVKN YVATVDANYT IDLTAIAKCG NIVVALGGGD LTLQITVSAG SCPATITYAA
TNASYTWKAL TSSAAPSTEY VVSFDRWFLV PYDWLFAQYN VLYHATNVVE RNDVLQVLAY
PGATQICAQP AGQVAAGEDD EYKYKLTLTG VNVANYRTLA VELPWKAGGG GKALVNITAY
FANGTKIDSI VYNLTDMLKD ARGTRVKVLL PLNFGKVGAK AYDIAVSKAT VRFDITFMMY
DPKTGPYSVC AVKLVPLAAN YTSVSEYECT VPTAPGVPER IDPATVVYAI DPMLSSANGF
DETFDAYGGF GTPITYVVKS GEVALLPSWY YKTAVGGSRI ARVWIIAASD DPSKGPALGT
KYYSYTVKDD KVTINVYKFE KYLVVNFVPN VCPAGYVSQT FLDEFDGSGR ITGLGFGTGG
TSALVLSNYT RVQMWNSTAM WLAGGVFKLP TVALDALTVQ NNAKFPIVVD SLNVKYRDYK
YSIPMAPVRV NASETRTVLL NSYGFGRTYM FNISDVWAFK LVQPNYEYGL NVYHAGLLDA
LKHFGISTDK PLSAYLKPLT SAYYVQNVLY ASHAETSDWT FGILNGKITE ITRGTWGDLR
TDASDYDYKY VFNYPTLPLK EIRDWNDRPL ANQTVALFDG SGALYAVVYT GANGQLIYPL
PAIASPVVRV AWYNGYLVTL IKGLPEFTIW IYDQTISRDV TELGDARVVD KIRTYVYPVT
VSVYDEAGRP LNNMWVRVID AGTSGNLVTS LNSTASDGGA QIVDLRISKY ASGVMSQIPA
TSYYYYVYDQ SGALVAAGKF DIERGASVPS TGYNVQGKVI TYSQVPVKNS ATRGYIVVKG
VQFVNGTVKD VVYPFTVSGG VMTISGKLPL SYSYPVEIYV THVTLGGQEV PVKGGRFLVY
RGTTTDLAAG LDFAELGLTG VVSISAVDTT GAPRSDWTVQ VLYGNITAAE GKGSVNVVLP
RTDVLGQPYT VRVVTNVITP EGKALVKEQA LEVTQKALAV QIPISTVRVV VQAVDGFGNV
RQDWPVVLEN VASGMGQVSA EVVEGQRYVA RATGLGFTNT TAFTARGPQM VVAIKIPTAK
ITAQAKDGFG KVRSDWPVEI VGVAAGQGTV GPVEVLAGQY TVKTSVFGKD FTQTVTLQPG
QSQTVVVQVP TAVLSVTAVD DDRKPIDRYV TAVQISGPVS QSFSTSPKNL EVLAGQYTVT
VSALNKQAST QVTLQPGQTA NVEVVVPGTA GLDFLGTRIP LPTLVLYALL LLVIVVILAI
IIIEYNNWRR RRLMQILAPP K