Gene YpsIP31758_1519 details

Gene Information

Locus tag: YpsIP31758_1519
Symbol:
ID: 5385391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 1762407
End bp: 1770173
Gene Length: 7767 bp
Protein Length: 2588 aa
Translation table: 11
GC content: 57%
IMG OID: 640864501
Product: hemagglutinin/adhesin repeat-containing protein
Protein accession: YP_001400497
Protein GI: 153949798
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
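
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1762407
end_bp = 1770173
gene_length_bp = 7767
protein_length_aa = 2588

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 7767 0 2588
```

Under those assumptions the computed span matches the listed gene length, the span is a whole number of codons, and the codon count minus the stop codon matches the listed protein length.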


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAGA ACCTGTACCG TATTATTTTT AACAAAGTGC GGGGGATGAT GATCGTCGTC 
GCAGATATTG CTGCTTCTGG TCGGGCGTCC TCTTCGCCTT CATCAGGATT AGGGCACACG
CAACACCGCC GTATCAGTGC CTTGTCCACG CTAAGTTTCA GCCTGTTACT GGCGCTGGGC
TGTGTCTCGC TCTCCGTTCA GGCGGCGATT GTCGCCGATG CCAGTGCGCC GGGCAACCAG
CAACCCACCA TTATCAACAG CGCCAACGGC ACGCCACAGG TCAATATCCA GGCCCCCAGC
AGCGGCGGGG TTTCACGTAA CGTTTACAGC CAGTTTGATG TTGATGGCCG CGGCGTGATC
CTCAATAACG GCCACGGTGT CAACCAGACC GAGCTCGGTG GTTTTATCGA CGGCAACCCG
TGGCTGGCGC GGGGTGAGGC CAGCATTATC CTCAACGAAG TCAACAGCCG CGACCCCAGT
AAACTCAATG GGTATATCGA GGTGGCGGGG CGCAAGGCAC AGGTGGTTAT CGCTAACTCG
GCGGGCATTA CCTGCGAGGG CTGCGGTTTT ATCAACGCCA ACCGTGTGAC CCTGACGACC
GGTCAGGCGC AGCTTAATAA CGGCCAGCTC ACCGGTTACG ATGTGGAGCG GGGGGACATT
GTTATTCAGG GCACCGGCAT GGACAGCAGC CGTCAGGATC ATACCGACCT GATCGCCCGC
TCCGTCAAAG TGAATGCCGG GATCTGGGCC AACGAGCTGA GTGTCACCAC CGGGCGCAAT
CAGGTGGATG CCGCCCATCA AAACATTAAC GCCAAAGCCG CCGATGGCAG CCCGCGGCCA
ACCGTGGCGG TGGATGTCGC CCATTTGGGG GGCATGTACG CCGGTAAAAT TCGCCTGATT
GGCACTGAAA GCGGTGTCGG TGTGCACAAT GCGGGCGAGA TAGGGGCGTC GGCGGGCGAT
ATTACGATAA CGGCCGATGG CATGCTGATG AACAGCGGCC AGATCAACAG CAGCCAACAG
TTGGTGGTCA ATACCGCCGC GGATATAGAG AACACCGGTG TGCTCTATGC ACAGGGCAAT
ACCCAACTCA CCACGGCGGG TACACTCAGC AACAGCGGCA CCCTCGCGGC GGCGGGGGAC
ACCTCTGTCC GAGCGGCGGA GGTGAACAGC ACCCGCAATT CTGTTCTGGG GGCGGGTGTG
AAGTCCGACA ACAGCGCCAT TACCAGTGGC ACACTGAGCG TTGAAGCTAG CGGGAAGATC
ACCGCCCAGG GAAAAAATAT CAGCGGCACG GCGCAGCGTT TCACCGCACA CCGTCTTGAT
CTGAGTGGCA GCCAGACGCA AAGCCGTGAT ATCACGCTCA CTGCAAAAGG GGGTGAGATC
GATCTGACCG GCGCTGAACT GTTAGCCAGT GATCGCCTGT CGGCTGCAAC CACCGCGTTA
TTACGTACCG ATAACGCCAG CCTGATCGCC GAACAGATTA CGCTCGACGC GCAGGCACTC
TCTAATGTCG GGGGCCTGAT AGCCCACACC GGGACGACAG ATTTTAATCT GAATCTGCCG
GGCGATGTCG ATAACCGGGG CGGCACCCTC CTCTCAAGCG GCACCCTCTC GCTACAGGCG
GAAAGCCTGA ACAGCAACGG CAACAGCCTG CTTGGGGCGG GAGTACAAAG TGATGGCCGC
CTGACGGAGA TCGGTGACCT GAGGGTGACC ACCCGTCAGG ACCTGATCGC ACACGGGCAG
ACCCTTGCCG CGGGTACCAT GGCGTTAACC GGCAGCCGGG TTGATCTGGC CGACAGTTAC
ACGCAAGCCC GTGAGATGAC CCTCACCGCC AACCGTGGCG ATATCAGCAC CCAGCGCGCC
ACTGTACTCG CCCTTGACAC ACTGAGCATC AACACCGCTC AAACCCTCAA TAATCAGGGG
GGCACGCTGG CGGGCAACAC GCTCGCGTTG GATCTGGGGC AGTTTGATAA CCAAGGTGGC
CAGGTGACGG CCAGCCAGGA TCTGACCATC GATTTACAGC GTGATTTCAG CCACCAGGCG
GGCTCGACCC TTCAGGCTGG GCGTGATTTA ACCTTGACAT CCCTGGGCGC GGTCACCAAT
GACGGGCACC TGGTGGCGGG GGGCACACTC AGTACCCACT CGGACAGCTT ACTGAATAGC
GGCAACCTGA TCGCGACGCA AGCAGAGCTC AACGCCACCG GCGCATTGAT TAACCATGGC
GAGATATTGA CCCTCGGTGG GCTTGATACC GACTCCAACA CCCTGTTCAA CACGGGCAGT
ATTATTAGCG CCGAAGCCAC ACTTAACGCA CGGGAGCGTA TTACCAACTC CGGCCCTGAC
GCCCTGATCG GTGCTACCGA TGAAAACGGC ACCTTAGCCC TGCTGGCCCC GGTGATTGAA
AACAGCGATA CCGTCACTCA CACTGACACC GCGCCGACCA CCACGATTTT AGGCATGGGC
ACGGTTATTC TGGCCGGCGG GCAAGCGAGT GATGGCCATT ACACGTCTGC GGCCCAGGTG
CTCAACCTTT CCGGCCTGAT CGAATCCGGC AAAGACATGT TGATTTACGC CACGACGCTG
ACCAACAGCC GCCATATTTT GACCGCCAAC ACCGACTTTA TCGTGGCCGA TACGGTGACA
GGCACGGCTG TCTGGACGGC AGAAAACCCC GATATTCCAG GCGGGCGCTA TGCTGAACCG
CCGAATGGCG GTGCCGATAA CAGCGATTAT ATCGGCACAG AGTATACGTC GGTTATCGCC
TATAACGGCA TCGATCAGAT CAGCCCGGAA GCGCAACTGC TGGCGGGGGG AAACCTGACA
CCGCAGGTGG GCACGCTGGA GAATTTCTGG AGCAAAGTGA GTGCACAGGG CGAGATTGAT
CTCACCGGCG TCACCCTGCA ACAGGATGGC TGGGGTGACC AGCAACGCCT GATGGAGCAG
ACCACCTCCA GCGGTGTCTG GCGCTACCGA ACCTACAAAG GCGGTTTGTG GGCATGGGCG
TGGGGACCTG AAGTCAGTGA GCGCGCCACC AGTGAATATG CCTCAAGTTT TACCGCAAAA
ACACTCAGCG GCAGTGGCAC GACCATTAAC AATGGGGCCA ACCCCGGTGC CATCGCACCG
CCTGCCGATC GCGATAATAG CGGCAAAGAT CTGGCGATCG AATTTAACGG GATCTCGCTG
ACACCGCCGA ATGGCGGGCT GTATCAGTTC ACAACCGACC ACACCGTCGG CGGTGGCGGT
TATCTGATCG AAACCCACCC GGCGTTTGCC AACCTGAATA ACTGGCGCGG GTCAGATTAC
GTGCTCCAGC AGTTGAACAA TGACCCGGAT GTGATATTCA AACGTCTGGG GGATAACGCC
TATGAACAGC GGCTGGTGCG GGATCAGGTG CTGGCATTGA CAGGCCAGGC GGTGGCCAGT
GATTACCGCA GTGCACAAGA GCAGTTCGAG GCACTGTTTG CGGCGGGCCT TGAGTACAGC
AAGGCGTTCA ATATTGCCCT TGGTACCCAC CTCAGTGCGG AGCAGATGGC GGCCCTGACC
CACAATATCG TGCTGATGGA AACCCGTGAC GTCGCCGGGC AAACCGTATT AGTCCCCGTG
GTCTATCTGG CGGGGGTTAA ACCGGGCGAT CTGCAGGCCA ACGGGGCATT GATCGCGGCA
GAGAATATCA GCCTGACCGA GGTTCAGGGG TTCACCAATG CGGGGGCGAT AACCGCCACG
AATGACCTGA AAATCAGCAT GGCGCAAGAT ATCACGCTGA ATAACCGTGG TGGCTTGCTT
CAGGCGGGCG GCGATATGCA GCTCAGCACA CTGAACAGCG ATATCGACCT GACCAGCGCG
CGGATCAATG CCACCAACCT GCAACTGGAC AGCGGCCGCG ATGTGATATT GCGTACCGAC
AGTGCGCAGC TCAGTAGCGA CAATGGCGCA GTCTCGCGGG ATCAAACGAT CCTGGGGCCG
CTGGCCAGCA TCAATGTCAG CAATAATGCG ACTATCAATA CCGGGCGTGA TTTTATCATG
CAAGGCGCAA GCCTCAATGT CGGTCAGGAT CTGCAGGTCA CGACTGGCGG CGACTGGCAA
CTGGAGACGG TACAAACACG CGACCAGATA AGCACCCATG ATGGCCGTGG CAGTGCGACC
AGTGAGCATA TTCGCCATCT GGGCAGTGAA GTGAATGTCG GCGGCGCGCT GACCGCCAAC
GTCGACAATC TGACGGCGGT AGGGGCCAAC ATTAATGCCG CTACCCTTGA GGTGCAGGCG
CAGAACATCA GCCTCAGCGC GGCCACCGAC AGCCTGCACG TTACCGGCGA ATCGTCGAGC
AAGCGGCATA CCAGCTCGGT GAACCTCTAT GATGAAACCC TGCTTGGCAG CCAGTTGAAT
GCCACGGGCG ATATCAATTT GCAGGCGGCG CAAGACATCA CCCTGCGAGC CAGTGCGGTA
CAAACCGATG GCGCGCTGAC ACTGGCAGCG GGCGGGGATG TGCTCCTGAC CACCCAGACT
GAGCAGCATG ACGAACAGCG CAATCATACC GGTCTCAGCA AAGGGATTGC ATCCAGCACC
CTGACACGCA CCGAAGACAG TCTTAGCCAG ACACTGGCGG TGGGCTCGAT GCTCTCGGCG
GGATCTATTG ATGTCAGCGG TAAAAATATC GCGGTGATGG GCAGCAACGT GGTGGCCGAC
CAGGATATCA GCCTGCGTGC GCAGGAGAAC ATCACCGTCG GCACGGCGCA GCAGAGCGAG
AGCGAATCGC ACCTGTTCGA ACAGAAAAAA TCGGGCCTGA TGAGCACCGG CGGTATCGGT
GTCACGGTGG GCAGCAGCAG TACCAAAATG ACCGATTCTG GTCAATCGAT TTCCAGCGTG
GGCAGCACGG TGGGCAGCGT ACTGGGCAAT GTCAGCATGA CCGCCGGTGA AGACCTGAGG
GTGCAAGGTG CCGAGGTGTT GGCCGGTAAA GACATCAATC TGACCGGTAA AAACGTCAGT
ATTCTGGCGG CGGAGAATCA GCTTACCCAG AGCCACACCG TCGAGCAAAA ACAGAGCGGC
CTGACACTGG CACTGTCCGG TGCGGTGGGC AGTGCCGTCA ATACCGCAGT GACCACCGCG
AAAGCGGCCA GCGAAGAGAG CAGTGGCCGC TTGGGGGCAT TGCAGGGGGT TAAAGCGGCG
CTCAATGGCG TGCAGGCGGT GCAGGCTGGG CAGTTGGTGC AGGCGGAGGG GGGCGATGCT
GCCAGCATGT TCGGCATCAG TGCGTCCTTG GGCTCACAAA AATCGTCCTC GGAGCAACAT
CAGGAACAGA CCCACGTGAC GGGCTCGACC CTGACGGCAG GCAACAATCT GACCATCAAT
GCCACCGGTG AGGGGAATGC GGCAAACAGC GGCGATATTG TGGTGCAAGG CAGCCAGCTC
CAGGCCGGTG GCGATACCAC GCTGGATGCG GCGCGTGATG TGCTGCTACT CGGCGCTGCT
AACACACAAA AAACCGACGG CAGCAACAGC AGCAGTGGCG GCAGTGTTGG CGTCAGTCTG
GGTGTCAGTG GGGCCAGCAG TGGTCTGAGT ATTTTTGCCA ACGCCAATAA AGGTCAGGGA
AGTGAGCACG GCGACGGCAT CTCCTGGACT GAAACGACCC TTGACAGCGG CGGCACGCTG
TCGCTGTACA GTGGCCGCGA TACCTCACTG GTCGGTGCGC AGGTCAGCGG CGAAACGGTG
AAGGTGGAGG TGGGCCGCGA CCTGTTGCTG CAAAGCCAGC AGGACAGCGA TAACTATGAT
GCGAAGCAGC AAAGTAGCAG TGTTGGCGGC AGTTTCAGCC CTGGCTCCAT GACGGGCAGT
ATCAGTATCA ATGGCAGCCA GGACAAGCTG AACAGCAACT TTGACTCGGT GCAGGAGCAG
ACGGGTATTT TTGCCGGTTC GGGCGGCTTT GATATCACGG TGGGTGGACA TACCCAGCTT
GACGGTGCGG TGATTGGCAG CACGGCGACG GCCGATAAAA ACACGCTGGA TACCGGGACA
CTGGGCTTCA GTGATATCGA TAATCAAGCC GATTTCAAGG TTGAACATCA AAGTGTGGGT
ATCAGCACCG GGGGGAATAT TGGCAGTCAG TTTGTTGGCA ATATGGCCAA CGGCTTGCTG
GTCGGGGCCA ATAACGAAGG CCACGCCGAC AGCACCACCC ATGCGGCCGT TTCTGAAGGT
ACGATCACGG TGCGCGACAC GGATAACCAG CAGCAGAATG TTGATGACCT GAGCCGTGAC
GTGGAGCAGG CCAACAATGC CCTTTCCCCT ATCTTTGATA AAGAGAAAGA ACAAAACCGG
CTGAAGGAAG CGCAGCTTAT CGGCGAGATA GGCAGTCAGG TGGGTGATGT GTTCCGAACG
CAAGGGCAGA TTATCGCCAC CCAGGCGGCG AATGAAAAAA TGCAGGGGGT GAGTGAGGCT
GATCGTGAGG CGGCGAAAGC CAACTGGGAA AAAGCCAATC CGGGTCAGAT TGCAACGGCT
GAAGATATCA ACGGTCAGGT TTATAAAACG GCCTATGATC AGGCATTCAA TGCATCGGGT
TACGGCACCG GGGGTAAATT CCAGCAGGCG GTACAAGCGG CGACAGCGGC CCTCCAGGGG
CTGGCGGGCG GAGATATAGC CAAAGCGATA GCGGGAGGCA GTGCGCCGTA TCTGGCGGAA
GTGATTAAGC AAAGCACGGG TGATAACGAA GAAGCGCGAC TGGCGGCACA TGCGGTGGTC
GGTTCTGTTC TGGCACATCT ACAGGGCAAT AGCGCGGTTG CGGGAGGCGC AGGTGCCTTG
ACGGGTGAGA TAGCGGCTGA TTTAATCATG CAGCAGTTGT ACCCGGGAAA AATGGTTAGC
GAACTCAGCG AGACAGAAAA ACAGACCATC AGCGCGTTAA GTACATTAGC AGCAGGGCTG
GCGGGGGGCT TGACGGGAGA CAGCAGCGCC GACGCGGTTG CGGGTGCACA GGCGGGGAAA
AATGCGGTAG AGAATAACTC GCTGAACCCG AACGACTTCG GTAAGGGCAT GGCAGACATA
GGGATGTCGC AAACCTCGCT CGGTGCTTCC ATGCTGCAAA GTGGAGCTTC ACCGGATGAA
ATCGCAGCGG CCCTGATCAA AAATGCCCAG GGAGATATGC CGGAAGGTCA GGATGCAGTT
AAAGGCCTAT TGATCGCCTG GGGCGAGTTC TTCGGGGTGC CGGTCAGTGC GTTGACGGCA
AATGGAGAAA TGACGCCAGA GAAAGCGGCG GAAATCCTAG CCAGTGGAGT GCCGACCAGT
GAAGCTAAAC TGGTTCAATA TGTATTTGCG AAAGCGTTTT TGTCAGTGAC GAAAGCTGTC
TATCCTGAAG GGATTAGTTT CAAGATCACC CAACCCGAAC ATTTAGCGAA ATTGGATGGA
TATTCGCAGA GAAAGGGTAT TAGTGGCACG CATAATGCGG ATGCATTCTA TTCAACAGTC
AATGATAAAG GGGTAAAAGT CATTGGCGAA ACTCAATCTA ATATTAAAGG TATAAATGAA
GTTAAATACC AGATACCTTC CTATGATAGA GCTGGTAATG TTATAGGATA TAAAGCTCAA
GTGTTTACTA AAACGATCTA TGATCCTAAG GTCTTCACTG ATCAAAAAAT ATTAGATTTG
GGACAGCAGG CTGCAAGTAG TGGCTATAAG GCTGCTATTG CTTCGGGCCA ACGAGAATAT
ACAGCATCAG CTGGTGGAAT TCAGTTCCAA GTCTACTTAG ATAAAAAAAC AGGAATAGTA
GAAAACTTCT TCCCGGTGAC TAACTGA
 
Protein sequence
MNKNLYRIIF NKVRGMMIVV ADIAASGRAS SSPSSGLGHT QHRRISALST LSFSLLLALG 
CVSLSVQAAI VADASAPGNQ QPTIINSANG TPQVNIQAPS SGGVSRNVYS QFDVDGRGVI
LNNGHGVNQT ELGGFIDGNP WLARGEASII LNEVNSRDPS KLNGYIEVAG RKAQVVIANS
AGITCEGCGF INANRVTLTT GQAQLNNGQL TGYDVERGDI VIQGTGMDSS RQDHTDLIAR
SVKVNAGIWA NELSVTTGRN QVDAAHQNIN AKAADGSPRP TVAVDVAHLG GMYAGKIRLI
GTESGVGVHN AGEIGASAGD ITITADGMLM NSGQINSSQQ LVVNTAADIE NTGVLYAQGN
TQLTTAGTLS NSGTLAAAGD TSVRAAEVNS TRNSVLGAGV KSDNSAITSG TLSVEASGKI
TAQGKNISGT AQRFTAHRLD LSGSQTQSRD ITLTAKGGEI DLTGAELLAS DRLSAATTAL
LRTDNASLIA EQITLDAQAL SNVGGLIAHT GTTDFNLNLP GDVDNRGGTL LSSGTLSLQA
ESLNSNGNSL LGAGVQSDGR LTEIGDLRVT TRQDLIAHGQ TLAAGTMALT GSRVDLADSY
TQAREMTLTA NRGDISTQRA TVLALDTLSI NTAQTLNNQG GTLAGNTLAL DLGQFDNQGG
QVTASQDLTI DLQRDFSHQA GSTLQAGRDL TLTSLGAVTN DGHLVAGGTL STHSDSLLNS
GNLIATQAEL NATGALINHG EILTLGGLDT DSNTLFNTGS IISAEATLNA RERITNSGPD
ALIGATDENG TLALLAPVIE NSDTVTHTDT APTTTILGMG TVILAGGQAS DGHYTSAAQV
LNLSGLIESG KDMLIYATTL TNSRHILTAN TDFIVADTVT GTAVWTAENP DIPGGRYAEP
PNGGADNSDY IGTEYTSVIA YNGIDQISPE AQLLAGGNLT PQVGTLENFW SKVSAQGEID
LTGVTLQQDG WGDQQRLMEQ TTSSGVWRYR TYKGGLWAWA WGPEVSERAT SEYASSFTAK
TLSGSGTTIN NGANPGAIAP PADRDNSGKD LAIEFNGISL TPPNGGLYQF TTDHTVGGGG
YLIETHPAFA NLNNWRGSDY VLQQLNNDPD VIFKRLGDNA YEQRLVRDQV LALTGQAVAS
DYRSAQEQFE ALFAAGLEYS KAFNIALGTH LSAEQMAALT HNIVLMETRD VAGQTVLVPV
VYLAGVKPGD LQANGALIAA ENISLTEVQG FTNAGAITAT NDLKISMAQD ITLNNRGGLL
QAGGDMQLST LNSDIDLTSA RINATNLQLD SGRDVILRTD SAQLSSDNGA VSRDQTILGP
LASINVSNNA TINTGRDFIM QGASLNVGQD LQVTTGGDWQ LETVQTRDQI STHDGRGSAT
SEHIRHLGSE VNVGGALTAN VDNLTAVGAN INAATLEVQA QNISLSAATD SLHVTGESSS
KRHTSSVNLY DETLLGSQLN ATGDINLQAA QDITLRASAV QTDGALTLAA GGDVLLTTQT
EQHDEQRNHT GLSKGIASST LTRTEDSLSQ TLAVGSMLSA GSIDVSGKNI AVMGSNVVAD
QDISLRAQEN ITVGTAQQSE SESHLFEQKK SGLMSTGGIG VTVGSSSTKM TDSGQSISSV
GSTVGSVLGN VSMTAGEDLR VQGAEVLAGK DINLTGKNVS ILAAENQLTQ SHTVEQKQSG
LTLALSGAVG SAVNTAVTTA KAASEESSGR LGALQGVKAA LNGVQAVQAG QLVQAEGGDA
ASMFGISASL GSQKSSSEQH QEQTHVTGST LTAGNNLTIN ATGEGNAANS GDIVVQGSQL
QAGGDTTLDA ARDVLLLGAA NTQKTDGSNS SSGGSVGVSL GVSGASSGLS IFANANKGQG
SEHGDGISWT ETTLDSGGTL SLYSGRDTSL VGAQVSGETV KVEVGRDLLL QSQQDSDNYD
AKQQSSSVGG SFSPGSMTGS ISINGSQDKL NSNFDSVQEQ TGIFAGSGGF DITVGGHTQL
DGAVIGSTAT ADKNTLDTGT LGFSDIDNQA DFKVEHQSVG ISTGGNIGSQ FVGNMANGLL
VGANNEGHAD STTHAAVSEG TITVRDTDNQ QQNVDDLSRD VEQANNALSP IFDKEKEQNR
LKEAQLIGEI GSQVGDVFRT QGQIIATQAA NEKMQGVSEA DREAAKANWE KANPGQIATA
EDINGQVYKT AYDQAFNASG YGTGGKFQQA VQAATAALQG LAGGDIAKAI AGGSAPYLAE
VIKQSTGDNE EARLAAHAVV GSVLAHLQGN SAVAGGAGAL TGEIAADLIM QQLYPGKMVS
ELSETEKQTI SALSTLAAGL AGGLTGDSSA DAVAGAQAGK NAVENNSLNP NDFGKGMADI
GMSQTSLGAS MLQSGASPDE IAAALIKNAQ GDMPEGQDAV KGLLIAWGEF FGVPVSALTA
NGEMTPEKAA EILASGVPTS EAKLVQYVFA KAFLSVTKAV YPEGISFKIT QPEHLAKLDG
YSQRKGISGT HNADAFYSTV NDKGVKVIGE TQSNIKGINE VKYQIPSYDR AGNVIGYKAQ
VFTKTIYDPK VFTDQKILDL GQQAASSGYK AAIASGQREY TASAGGIQFQ VYLDKKTGIV
ENFFPVTN
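
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way such blocks might be cleaned, translated, and summarised by GC fraction. It is not a tool used by the source database, and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is the standard one, whose amino-acid assignments translation table 11 shares. Alternative start codons, which table 11 allows, are not handled here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence, as printed in this record.
first_row = clean("ATGAATAAGA ACCTGTACCG TATTATTTTT")
last_row = clean("GAAAACTTCT TCCCGGTGAC TAACTGA")

print(translate(first_row))  # MNKNLYRIIF
print(translate(last_row))   # ENFFPVTN
```

The two printed fragments match the first ten and last eight residues of the protein sequence listed in this record. Passing the whole cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.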