Gene YPK_1622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_1622 
Symbol 
ID6089746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp1795416 
End bp1803053 
Gene Length7638 bp 
Protein Length2545 aa 
Translation table11 
GC content58% 
IMG OID641596692 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001720368 
Protein GI170023863 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGA ACCTGTACCG TATTATTTTT AACAAAGTGC GGGGGATGAT GATCGTCGTC 
GCAGATATTG CTGCTTCTGG TCGGGCGTCC TCTTCGCCTT CATCAGGATT AGGGCACACG
CAACACCGCC GTATCAGTGC CTTGTCCACG CTAAGTTTCA GCCTGTTACT GGCGCTGGGC
TGTGTCTCGC TCTCCGTTCA GGCGGCGATT GTCGCCGATG CCAGTGCGCC GGGCAACCAG
CAACCCACCA TTATCAACAG CGCCAACGGC ACGCCACAGG TCAATATCCA GGCCCCCAGC
AGCGGCGGGG TTTCACGTAA CGTTTACAGC CAGTTTGATG TTGATGGCCG CGGCGTGATC
CTCAATAACG GCCACGGTGT CAACCAGACC GAGCTCGGTG GTTTTATCGA CGGCAACCCG
TGGCTGGCGC GGGGTGAGGC CAGCATTATC CTCAACGAAG TCAACAGCCG CGACCCCAGT
AAACTCAATG GGTATATCGA GGTGGCGGGG CGCAAGGCAC AGGTGGTTAT CGCTAACTCG
GCGGGCATTA CCTGCGAGGG CTGCGGTTTT ATCAACGCCA ACCGTGTGAC CCTGACGACC
GGTCAGGCGC AGCTTAATAA CGGCCAGCTC ACCGGTTACG ATGTGGAGCG GGGGGACATT
GTTATTCAGG GCACCGGCAT GGACAGCAGC CGTCAGGATC ATACCGACCT GATCGCCCGC
TCCGTCAAAG TGAATGCCGG GATCTGGGCC AACGAACTGA GTGTCACCAC CGGGCGCAAT
CAGGTGGATG CCGCCCATCA AAACATTAAC GCCAAAGCCG CCGATGGCAG CCCGCGGCCA
ACCGTGGCGG TGGATGTCGC CCATTTGGGG GGCATGTACG CCGGTAAAAT TCGCCTGATT
GGCACTGAAA GCGGTGTCGG TGTGCACAAT GCGGGCGAGA TAGGGGCGTC GGCGGGCGAT
ATTACGATAA CGGCCGATGG CATGCTGATG AACAGCGGCC AGATCAACAG CAGCCAACAG
TTGGTGGTCA ATACCGCCGC GGATATAGAG AACACCGGTG TGCTCTATGC ACAGGGCAAT
ACCCAACTCA CCACGGCGGG TACACTCAGC AACAGCGGCA CCCTCGCGGC GGCGGGGGAC
ACCTCTGTCC GAGCGGTGGA GGTGAACAGC ACCCGCAATT CTGTTCTGGG GGCGGGTGTG
AAGTCCGACA ACAGCGCCAT TACCAGTGGC ACACTGAGCG TTGAAGCTAG CGGGAAGATC
ACCGCCCAGG GAAAAAATAT CAGCGGCACG GCGCAGCGTT TCACCGCACA CCGTCTTGAT
CTGAGTGGCA GCCAGACGCA AAGCCGTGAT ATCACGCTCA TTGCACAAGG GGGTGAGATC
GATCTAACCG GCGCTGAACT GTTAGCCAGT GATCGCCTGT CGGCTGCAAC CACCGCGTTA
TTGCGTACCG ATAACGCCAG CCTGATCGCC GAACAGATCA CGCTCGACGC GCAGGCACTA
TCCAATGTCG GGGGCCTGAT AGCCCACACC GGGACGACAG ATTTTAATCT GAATCTGCCG
GGCGATGTCG ATAACCGGGG CGGCACCCTC CTCTCAAGCG GCACCCTCTC GCTACAGGCG
GAAAGCCTGA ACAGCAACGG CAACAGCCTG CTGGGGGCGG GAGTACAAAG TGATGGCCGC
CTGACGGAGA TCGGTGACCT GAGGGTGACC ACCCGTCAGG ACCTGATCGC ACACGGGCAG
ACCCTTGCCG CGGGTACCAT GGCGTTAACC GGCAGCCGGA TTGATCTGGC CGACAGTTAC
ACGCAAGCCC GTGAGATGAC CCTCACCGCC AACCGTGGCG ATATCAGCAC CCAGCGCGCC
ACTGTACTCG CCCTTGACAC ACTGAGCATC AACACCGCTC AAACCCTCAA TAATCAGGGG
GGCACGCTGG CGGGCAACAC GCTCGCGTTG GATCTGGGCC AGTTTGATAA CCAAGGCGGC
CAGGTGACGG CCAGCCAGGA TCTGACCATC GATTTACAGC GTGACTTCAG CCACCAGGCG
GGCTCGACCC TTCAGGCCGG GCGTGATTTA ACCTTGACAT CCCTGGGCGC GGTCACCAAT
GACGGGCAAC TGGTGGCGGG GGGCACACTC AGTACCCACT CGGACAGCTT ACTGAATAGC
GGCAACCTGA TCGCGACGCA AGCAGAGCTC AACGCCACCG GCGCATTGAT TAACCATGGC
GAGATATTGA CCCTCGGTGG GCTTGATACC GACTCCAACA CCCTGTTCAA CACGGGCAGT
ATTATTAGCG CCGAAGCCAC ACTTAACGCA CGGGAGCGTA TTACCAACTC CGGCCCTGAC
GCCCTGATCG GTGCTACCGA TGAAAACGGC ACCCTAGCCC TGCTGGCCCC GGTGATTGAA
AACAGCGATA CCGTCACCCA CACTGACACC GCGCCGACCA CCACGATTTT AGGCATGGGC
ACGGTTATTC TGGCCGGCGG GCACGCGAGT GATGGCCATT ACGCCTCTGC GGCTCAGGTG
CTCAACCTTT CCGGCCTGAT CGAATCCGGC AAAGACATGT TGATTTACGC CACGACGCTG
ACCAACAGCC GCCATATTTT GACCGCCAAC ACCGACTTTA TCGTGGCCGA TACGGTGACA
GGCACGGCTG TCTGGACGGC AGAAAACCCC GATATTCCAG GCGGGCGCTA TGCTGAACCG
CCGGATGGCG GTGCCGATAA CAGCGATTAT ATCGGCACAG AGTATACGTC GGTTATCGCC
TATAACGGCA TCGATCAGAT CAGCCCGGAA GCGCAACTGC TGGCGGGGGG AAACCTGACA
CCGCAGGTGG GCACGCTGGA GAATTTCTGG AGCAAAGTGA GTGCACAGGG CGAGATTGAT
CTCACCGGCG TCATCCTGCA ACAGGATGGC TGGGGTGACC AGCAACGCCT GATGGAGCAG
ACCACCTCCA GCGGTGTCTG GCGCTACCGA ACCTACAAAG GCGGTTTGTG GGCATGGGCG
TGGGGACCTG AAGTCAGTGA GCGCGCCACC AGTGAATATG CCTCAAGTTT TACCGCAAAA
ACACTCAGCG GCAGTGGCAC GACCATTAAC AATGGGGCCA ACCCCGGTGC CATCGCACCG
CCTGCCGATC GCGATAATAG CGGCAAAGAT CTGGCGATCG AATTTAACGG GATCTCGCTG
ACACCGCCGA ATGGCGGGCT GTATCAGTTC ACAACCGACC ACACCGTCGG CGGTGGCGGT
TATCTGATCG AAACCCACCC GGCGTTTGCC AACCTGAATA ACTGGCGCGG GTCAGATTAC
GTGCTCCAGC AGTTGAACAA TGACCCGGAT GTGATATTCA AACGTCTGGG GGATAACGCC
TATGAACAGC GGCTGGTGCG GGATCAGGTG CTGGCATTGA CCGGCCAGGC GGTGGCCAGT
GATTACCGCA GTGCACAAGA GCAGTTCGAG GCACTGTTTG CGGCGGGCCT TGAGTACAGC
AAGGCGTTCA ATATTGCCCT TGGTACCCAC CTCAGTGCGG AGCAGATGGC GGCCCTGACC
CACAATATCG TGCTGATGGA AACCCGTGAC GTCGCCGGGC AAACCGTATT AGTCCCCGTG
GTCTATCTGG CGGGGGTTAA ACCGGGCGAT CTGCAGGCCA ACGGGGCATT GATCGCGGCA
GAGAATATCA GCCTGACCGA GGTTCAGGGG TTCACCAATG CGGGGGCGAT AACCGCCACG
AATGACCTGA AAATCAGCAT GGCGCAAGAT ATCACGCTGA ATAACCGTGG TGGCTTGCTT
CAGGCGGGCG GCGATATGCA GCTCAGCACA CTGAACAGCG ATATCGACCT GACCAGCGCG
CGGATCAATG CCACCAACCT GCAACTGGAC AGCGGCCGCG ATGTGATATT GCGTACCGAC
AGTGCGCAGC TCAGTAGCGA CAATGGCGCA GTCTCGCGGG ATCAAACGAT CCTGGGGCCG
CTGGCCAGCA TCAATGTCAG CAATAATGCG TCTATCAATA CCGGGCGTGA TTTTATCATG
CAAGGTGCAA GCCTCAATGT CGGTCAGGAT CTGCAGGTCA CGACTGGCGG CGACTGGCAA
CTGGAGACGG TACAAACACG CGACCAGATA AGCACCCATG ATGGCCGTGG CAGTGCGACC
AGTGAGCATA TTCGCCATCT GGGCAGTGAA GTGAATGTCG GCGGCGCGCT GACGGCCAAC
GTCGACAATC TGACGGCGGT GGGGGCCAAC ATTAATGCCG CTACCCTTGA GGTGCAGGCG
CAGAACATCA GCCTCAGCGC GGCCACCGAC AGCCTGCACG TTACCGGCGA ATCGTCGAGC
AAGCGGCATA CCAGCTCGGT GAACCTCTAT GATGAAACGC TGCTTGGCAG CCAGTTGAAT
GCCACGGGCG ATATCAATTT GCAGGCAGCG CAAGACATCA CCCTGCGAGC CAGTGCGGTA
CAAACCGATG GCGCGCTGAC GCTGGCGGCG GGCGGGGATG TGCTCCTGAC CACCCAGACC
GAGCAGCATG ACGAACAGCG CAATCATACC GGTCTCAGCA AAGGGATTGC ATCCAGCACC
CTGACACGCA CCGAAGACAG TCTTAGCCAG ACACTGGCGG TGGGCTCGAT GCTCTCGGCG
GGATCTATTG ATGTCAGCGG TAAAAATATC GCGGTGATGG GCAGCAACGT GGTGGCCGAC
CAGGATATCA GCCTGCGTGC GCAGGAGAAC ATCACCGTCG GCACGGCGCA GCAGAGCGAG
AGCGAATCGC ACCTGTTCGA ACAGAAAAAA TCGGGCCTGA TGAGCACCGG CGGTATCGGT
GTCACGGTGG GCAGCAGCAG TACCAAAATG ACCGATTCTG GTCAATCGAT TTCCAGCGTG
GGCAGCACGG TGGGCAGCGT ACTGGGCAAT GTCAGCATGA CCGCCGGTGA AGACCTGAGG
GTGCAAGGTG CCGAGGTGTT GGCCGGTAAA GACATCAATC TGACCGGTAA AAACGTCAGT
ATTCTGGCGG CGGAGAATCA GCTTACCCAG ATCCACACCG TCGAGCAAAA ACAGAGCGGC
CTGACGCTGG CACTGTCCGG TGCGGTGGGC AGTGCCGTCA ATACCGCGGT GACCACCGCG
AAAGCGGCCA GCGAAGAGAG CAGTGGCCGC TTGGGGGCAT TGCAGGGGGT TAAAGCGGCG
CTCAATGGCG TACAGGCGGT GCAGGCCGGG CAGTTGGTGC AGGCGGAGGG GGGTGATACC
GCCAGCATGT TCGGCATCAG TGCGTCCTTG GGCTCACAAA AATCGTCCTC GGAGCAACAT
CAGGAACAGA CCCACGTGAC GGGCTCGACG CTGACGGCAG GCAACAATCT GACCATCAAT
GCCACCGGTG AGGGGAATGC GGCAAACAGC GGCGATATTG TGGTGCAAGG CAGCCAGCTC
CAGGCCGGTG GCGATACCAC GCTGGATGCG GCGCGTGATG TGCTGTTACT CGGCGCTGCC
AACACACAAA AAACCGACGG CAGCAACAGC AGCAGTGGCG GCAGTGTTGG CGTCAGTCTG
GGCATCAGTG GGGCCAGCAG TGGTCTGAGT ATTTTTGCCA ACGCCAATAA AGGTCAGGGA
AGTGAGCACG GCGACGGCAT CTCCTGGACT GAAACGACCC TTGACAGCGG CGGCACGCTG
TCGGTGCACA GTGGCCGCGA TACCTCACTG GTCGGTGCGC AGGTCAGCGG CGAAACGGTG
AAGGTGGAGG TAGGCCGCGA CCTGTTGCTG CAAAGCCAGC AGGACAGCGA TAACTATGAC
GCCAAACAGC AAAATAGCAG CGTTGGCGGC AGCTTCAGCC CTGGCTCCAT GACGGGCAGT
ATCAGTATCA ATGGCAGCCA GGACAAGCTG AACAGCAACT TTGACTCGGT GCAGGAGCAG
ACGGGTATCT TTGCCGGTTC GGGCGGCTTT GATATCACGG TGGGTGGACA TACCCAGCTT
GACGGTGCGG TGATTGGCAG CACGGCGACA GCCGATAAAA ACACGCTGGA TACCGGGACA
CTGGGCTTCA GTGATATCGA TAATCAAGCC GATTTCAAGG TTGAACATCA AAGTGTGGGT
ATCAGCACCG GGGGGAATAT TGGCAGTCAG TTTGTTGGCA ATATGGCCAA CGGCTTGCTG
GTCGGGGCCA ATAACGAAGG CCACGCCGAC AGCACCACCC ATGCGGCCGT TTCTGAAGGT
ACGATCACGG TGCGCGACAC GGATAACCAG CAGCAGAATG TTGATGACCT GAGCCGTGAT
GTGGAGCATG CCAACAATGC CCTTTCCCCT ATCTTTGATA AAGAGAAAGA GCAAAACCGG
CTGAAGGAAG CGCAGCTTAT CGGCGAGATA GGCAGTCAGG TGGGGGATGT GTTCCGCACA
CAGGGGCAGA TTATCGCCAC CCAGGCGGCG ACTGAAAAAA TGCAGGAGGT GAGTGAGGCT
GATCGTGAGG CGGCAAAATC CAACTGGGAA AAAGCCAATC CGGGTCAGAT TGCAACGGCT
GAAGATATCA ACGGTCAGGT TTATAAAACG GCCTATGATC AGGCATTCAA TGCATCGGGT
TACGGCACCG GGGGTAAATT CCAGCAGGCG GTACAAGCGG CGACAGCGGC CCTCCAGGGG
CTGGCGGGCG GAGATATAGC CAAAGCGATA GCGGGAGGCA GTGCGCCGTA TCTGGCGGAA
GTGATAAAGC AAAGCACGGG TGATAACGAA GAAGCGCGAC TGGCGGCACA TGCGGTGGTC
GGTTCTGTTC TGGCACATCT ACAGGGCAAT AGCGCGGTTG CGGGAGGCGC AGGTGCCTTG
ACGGGTGAGA TAGCGGCTGA TTTAATCATG CAGCAGTTGT ACCCGGGACA AATGGTTAGT
GAACTCAGCG AGACAGAAAA ACAGACCATC AGCGCGTTAA GTACATTAGC AGCAGGGCTG
GCGGGGGGCT TGACGGGAGA CAGCAGCGCC GACGCGGTTG CGGGTGCGCA GGCGGGGAAA
AATGCGGTAG AGAATAATGC GTTGGGTGCG AATGACTTCG GAAAAGGCAT GGCAGATTAC
GGTCAATCTG TCGCTTCTTA CGCGCAATAT GCACAGGACA ATAATTTGCC ACCTGAGCAA
ATCAAAGCTG ACATGGAACG AATGGTCAAA GGTGACTTGC CGGAAGGTTC TGACATTATC
AAAGCGATTC TGGAGAACAA CCCCGGAACC GATACGATAA TGGCGTTGCT ATCAGCGGAA
GATGCGAAAG ATTATGCTCT TGCATTACTA TCATCCATCC CAGCAGAACG AGTACTGGCG
GTTGTAGGTA AGGCTACGAA TGTCATCACG AATAAGATGC TGATCAGTGC TGCGGAGAAG
ATCTCGACGG CGAAACCCGG TGTGCAATCG CCAGTTCCTA GAGATTTGAA TGAGCAAATT
GTTTGGAAGC AGGTGCAGGA AAACCCTGCT AAAGGAGAAA TACTGCCTGG AATGAATAAT
GATCCTCGTT TTCCTGCAAG TGCAGGATTC CAAAAAATGC AGGTAGTCCA GAAAAATGCT
AATGGAGAAT CAATCACTGT TCATTATCAG TACAACTCTA CTACTGGCAA ATCCTACGAC
ATGAAAATTG ATACTCCTCA GCGGGTTAAC TCTAATCCGG CAGATGTGAT CGAGAATATT
AAAGGGCAGA TTAAATGA
 
Protein sequence
MNKNLYRIIF NKVRGMMIVV ADIAASGRAS SSPSSGLGHT QHRRISALST LSFSLLLALG 
CVSLSVQAAI VADASAPGNQ QPTIINSANG TPQVNIQAPS SGGVSRNVYS QFDVDGRGVI
LNNGHGVNQT ELGGFIDGNP WLARGEASII LNEVNSRDPS KLNGYIEVAG RKAQVVIANS
AGITCEGCGF INANRVTLTT GQAQLNNGQL TGYDVERGDI VIQGTGMDSS RQDHTDLIAR
SVKVNAGIWA NELSVTTGRN QVDAAHQNIN AKAADGSPRP TVAVDVAHLG GMYAGKIRLI
GTESGVGVHN AGEIGASAGD ITITADGMLM NSGQINSSQQ LVVNTAADIE NTGVLYAQGN
TQLTTAGTLS NSGTLAAAGD TSVRAVEVNS TRNSVLGAGV KSDNSAITSG TLSVEASGKI
TAQGKNISGT AQRFTAHRLD LSGSQTQSRD ITLIAQGGEI DLTGAELLAS DRLSAATTAL
LRTDNASLIA EQITLDAQAL SNVGGLIAHT GTTDFNLNLP GDVDNRGGTL LSSGTLSLQA
ESLNSNGNSL LGAGVQSDGR LTEIGDLRVT TRQDLIAHGQ TLAAGTMALT GSRIDLADSY
TQAREMTLTA NRGDISTQRA TVLALDTLSI NTAQTLNNQG GTLAGNTLAL DLGQFDNQGG
QVTASQDLTI DLQRDFSHQA GSTLQAGRDL TLTSLGAVTN DGQLVAGGTL STHSDSLLNS
GNLIATQAEL NATGALINHG EILTLGGLDT DSNTLFNTGS IISAEATLNA RERITNSGPD
ALIGATDENG TLALLAPVIE NSDTVTHTDT APTTTILGMG TVILAGGHAS DGHYASAAQV
LNLSGLIESG KDMLIYATTL TNSRHILTAN TDFIVADTVT GTAVWTAENP DIPGGRYAEP
PDGGADNSDY IGTEYTSVIA YNGIDQISPE AQLLAGGNLT PQVGTLENFW SKVSAQGEID
LTGVILQQDG WGDQQRLMEQ TTSSGVWRYR TYKGGLWAWA WGPEVSERAT SEYASSFTAK
TLSGSGTTIN NGANPGAIAP PADRDNSGKD LAIEFNGISL TPPNGGLYQF TTDHTVGGGG
YLIETHPAFA NLNNWRGSDY VLQQLNNDPD VIFKRLGDNA YEQRLVRDQV LALTGQAVAS
DYRSAQEQFE ALFAAGLEYS KAFNIALGTH LSAEQMAALT HNIVLMETRD VAGQTVLVPV
VYLAGVKPGD LQANGALIAA ENISLTEVQG FTNAGAITAT NDLKISMAQD ITLNNRGGLL
QAGGDMQLST LNSDIDLTSA RINATNLQLD SGRDVILRTD SAQLSSDNGA VSRDQTILGP
LASINVSNNA SINTGRDFIM QGASLNVGQD LQVTTGGDWQ LETVQTRDQI STHDGRGSAT
SEHIRHLGSE VNVGGALTAN VDNLTAVGAN INAATLEVQA QNISLSAATD SLHVTGESSS
KRHTSSVNLY DETLLGSQLN ATGDINLQAA QDITLRASAV QTDGALTLAA GGDVLLTTQT
EQHDEQRNHT GLSKGIASST LTRTEDSLSQ TLAVGSMLSA GSIDVSGKNI AVMGSNVVAD
QDISLRAQEN ITVGTAQQSE SESHLFEQKK SGLMSTGGIG VTVGSSSTKM TDSGQSISSV
GSTVGSVLGN VSMTAGEDLR VQGAEVLAGK DINLTGKNVS ILAAENQLTQ IHTVEQKQSG
LTLALSGAVG SAVNTAVTTA KAASEESSGR LGALQGVKAA LNGVQAVQAG QLVQAEGGDT
ASMFGISASL GSQKSSSEQH QEQTHVTGST LTAGNNLTIN ATGEGNAANS GDIVVQGSQL
QAGGDTTLDA ARDVLLLGAA NTQKTDGSNS SSGGSVGVSL GISGASSGLS IFANANKGQG
SEHGDGISWT ETTLDSGGTL SVHSGRDTSL VGAQVSGETV KVEVGRDLLL QSQQDSDNYD
AKQQNSSVGG SFSPGSMTGS ISINGSQDKL NSNFDSVQEQ TGIFAGSGGF DITVGGHTQL
DGAVIGSTAT ADKNTLDTGT LGFSDIDNQA DFKVEHQSVG ISTGGNIGSQ FVGNMANGLL
VGANNEGHAD STTHAAVSEG TITVRDTDNQ QQNVDDLSRD VEHANNALSP IFDKEKEQNR
LKEAQLIGEI GSQVGDVFRT QGQIIATQAA TEKMQEVSEA DREAAKSNWE KANPGQIATA
EDINGQVYKT AYDQAFNASG YGTGGKFQQA VQAATAALQG LAGGDIAKAI AGGSAPYLAE
VIKQSTGDNE EARLAAHAVV GSVLAHLQGN SAVAGGAGAL TGEIAADLIM QQLYPGQMVS
ELSETEKQTI SALSTLAAGL AGGLTGDSSA DAVAGAQAGK NAVENNALGA NDFGKGMADY
GQSVASYAQY AQDNNLPPEQ IKADMERMVK GDLPEGSDII KAILENNPGT DTIMALLSAE
DAKDYALALL SSIPAERVLA VVGKATNVIT NKMLISAAEK ISTAKPGVQS PVPRDLNEQI
VWKQVQENPA KGEILPGMNN DPRFPASAGF QKMQVVQKNA NGESITVHYQ YNSTTGKSYD
MKIDTPQRVN SNPADVIENI KGQIK