Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_1622 |
Symbol | |
ID | 6089746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | + |
Start bp | 1795416 |
End bp | 1803053 |
Gene Length | 7638 bp |
Protein Length | 2545 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641596692 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001720368 |
Protein GI | 170023863 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA ACCTGTACCG TATTATTTTT AACAAAGTGC GGGGGATGAT GATCGTCGTC GCAGATATTG CTGCTTCTGG TCGGGCGTCC TCTTCGCCTT CATCAGGATT AGGGCACACG CAACACCGCC GTATCAGTGC CTTGTCCACG CTAAGTTTCA GCCTGTTACT GGCGCTGGGC TGTGTCTCGC TCTCCGTTCA GGCGGCGATT GTCGCCGATG CCAGTGCGCC GGGCAACCAG CAACCCACCA TTATCAACAG CGCCAACGGC ACGCCACAGG TCAATATCCA GGCCCCCAGC AGCGGCGGGG TTTCACGTAA CGTTTACAGC CAGTTTGATG TTGATGGCCG CGGCGTGATC CTCAATAACG GCCACGGTGT CAACCAGACC GAGCTCGGTG GTTTTATCGA CGGCAACCCG TGGCTGGCGC GGGGTGAGGC CAGCATTATC CTCAACGAAG TCAACAGCCG CGACCCCAGT AAACTCAATG GGTATATCGA GGTGGCGGGG CGCAAGGCAC AGGTGGTTAT CGCTAACTCG GCGGGCATTA CCTGCGAGGG CTGCGGTTTT ATCAACGCCA ACCGTGTGAC CCTGACGACC GGTCAGGCGC AGCTTAATAA CGGCCAGCTC ACCGGTTACG ATGTGGAGCG GGGGGACATT GTTATTCAGG GCACCGGCAT GGACAGCAGC CGTCAGGATC ATACCGACCT GATCGCCCGC TCCGTCAAAG TGAATGCCGG GATCTGGGCC AACGAACTGA GTGTCACCAC CGGGCGCAAT CAGGTGGATG CCGCCCATCA AAACATTAAC GCCAAAGCCG CCGATGGCAG CCCGCGGCCA ACCGTGGCGG TGGATGTCGC CCATTTGGGG GGCATGTACG CCGGTAAAAT TCGCCTGATT GGCACTGAAA GCGGTGTCGG TGTGCACAAT GCGGGCGAGA TAGGGGCGTC GGCGGGCGAT ATTACGATAA CGGCCGATGG CATGCTGATG AACAGCGGCC AGATCAACAG CAGCCAACAG TTGGTGGTCA ATACCGCCGC GGATATAGAG AACACCGGTG TGCTCTATGC ACAGGGCAAT ACCCAACTCA CCACGGCGGG TACACTCAGC AACAGCGGCA CCCTCGCGGC GGCGGGGGAC ACCTCTGTCC GAGCGGTGGA GGTGAACAGC ACCCGCAATT CTGTTCTGGG GGCGGGTGTG AAGTCCGACA ACAGCGCCAT TACCAGTGGC ACACTGAGCG TTGAAGCTAG CGGGAAGATC ACCGCCCAGG GAAAAAATAT CAGCGGCACG GCGCAGCGTT TCACCGCACA CCGTCTTGAT CTGAGTGGCA GCCAGACGCA AAGCCGTGAT ATCACGCTCA TTGCACAAGG GGGTGAGATC GATCTAACCG GCGCTGAACT GTTAGCCAGT GATCGCCTGT CGGCTGCAAC CACCGCGTTA TTGCGTACCG ATAACGCCAG CCTGATCGCC GAACAGATCA CGCTCGACGC GCAGGCACTA TCCAATGTCG GGGGCCTGAT AGCCCACACC GGGACGACAG ATTTTAATCT GAATCTGCCG GGCGATGTCG ATAACCGGGG CGGCACCCTC CTCTCAAGCG GCACCCTCTC GCTACAGGCG GAAAGCCTGA ACAGCAACGG CAACAGCCTG CTGGGGGCGG GAGTACAAAG TGATGGCCGC CTGACGGAGA TCGGTGACCT GAGGGTGACC ACCCGTCAGG ACCTGATCGC ACACGGGCAG ACCCTTGCCG CGGGTACCAT GGCGTTAACC GGCAGCCGGA TTGATCTGGC CGACAGTTAC ACGCAAGCCC GTGAGATGAC CCTCACCGCC AACCGTGGCG ATATCAGCAC CCAGCGCGCC ACTGTACTCG CCCTTGACAC ACTGAGCATC AACACCGCTC AAACCCTCAA TAATCAGGGG GGCACGCTGG CGGGCAACAC GCTCGCGTTG GATCTGGGCC AGTTTGATAA CCAAGGCGGC CAGGTGACGG CCAGCCAGGA TCTGACCATC GATTTACAGC GTGACTTCAG CCACCAGGCG GGCTCGACCC TTCAGGCCGG GCGTGATTTA ACCTTGACAT CCCTGGGCGC GGTCACCAAT GACGGGCAAC TGGTGGCGGG GGGCACACTC AGTACCCACT CGGACAGCTT ACTGAATAGC GGCAACCTGA TCGCGACGCA AGCAGAGCTC AACGCCACCG GCGCATTGAT TAACCATGGC GAGATATTGA CCCTCGGTGG GCTTGATACC GACTCCAACA CCCTGTTCAA CACGGGCAGT ATTATTAGCG CCGAAGCCAC ACTTAACGCA CGGGAGCGTA TTACCAACTC CGGCCCTGAC GCCCTGATCG GTGCTACCGA TGAAAACGGC ACCCTAGCCC TGCTGGCCCC GGTGATTGAA AACAGCGATA CCGTCACCCA CACTGACACC GCGCCGACCA CCACGATTTT AGGCATGGGC ACGGTTATTC TGGCCGGCGG GCACGCGAGT GATGGCCATT ACGCCTCTGC GGCTCAGGTG CTCAACCTTT CCGGCCTGAT CGAATCCGGC AAAGACATGT TGATTTACGC CACGACGCTG ACCAACAGCC GCCATATTTT GACCGCCAAC ACCGACTTTA TCGTGGCCGA TACGGTGACA GGCACGGCTG TCTGGACGGC AGAAAACCCC GATATTCCAG GCGGGCGCTA TGCTGAACCG CCGGATGGCG GTGCCGATAA CAGCGATTAT ATCGGCACAG AGTATACGTC GGTTATCGCC TATAACGGCA TCGATCAGAT CAGCCCGGAA GCGCAACTGC TGGCGGGGGG AAACCTGACA CCGCAGGTGG GCACGCTGGA GAATTTCTGG AGCAAAGTGA GTGCACAGGG CGAGATTGAT CTCACCGGCG TCATCCTGCA ACAGGATGGC TGGGGTGACC AGCAACGCCT GATGGAGCAG ACCACCTCCA GCGGTGTCTG GCGCTACCGA ACCTACAAAG GCGGTTTGTG GGCATGGGCG TGGGGACCTG AAGTCAGTGA GCGCGCCACC AGTGAATATG CCTCAAGTTT TACCGCAAAA ACACTCAGCG GCAGTGGCAC GACCATTAAC AATGGGGCCA ACCCCGGTGC CATCGCACCG CCTGCCGATC GCGATAATAG CGGCAAAGAT CTGGCGATCG AATTTAACGG GATCTCGCTG ACACCGCCGA ATGGCGGGCT GTATCAGTTC ACAACCGACC ACACCGTCGG CGGTGGCGGT TATCTGATCG AAACCCACCC GGCGTTTGCC AACCTGAATA ACTGGCGCGG GTCAGATTAC GTGCTCCAGC AGTTGAACAA TGACCCGGAT GTGATATTCA AACGTCTGGG GGATAACGCC TATGAACAGC GGCTGGTGCG GGATCAGGTG CTGGCATTGA CCGGCCAGGC GGTGGCCAGT GATTACCGCA GTGCACAAGA GCAGTTCGAG GCACTGTTTG CGGCGGGCCT TGAGTACAGC AAGGCGTTCA ATATTGCCCT TGGTACCCAC CTCAGTGCGG AGCAGATGGC GGCCCTGACC CACAATATCG TGCTGATGGA AACCCGTGAC GTCGCCGGGC AAACCGTATT AGTCCCCGTG GTCTATCTGG CGGGGGTTAA ACCGGGCGAT CTGCAGGCCA ACGGGGCATT GATCGCGGCA GAGAATATCA GCCTGACCGA GGTTCAGGGG TTCACCAATG CGGGGGCGAT AACCGCCACG AATGACCTGA AAATCAGCAT GGCGCAAGAT ATCACGCTGA ATAACCGTGG TGGCTTGCTT CAGGCGGGCG GCGATATGCA GCTCAGCACA CTGAACAGCG ATATCGACCT GACCAGCGCG CGGATCAATG CCACCAACCT GCAACTGGAC AGCGGCCGCG ATGTGATATT GCGTACCGAC AGTGCGCAGC TCAGTAGCGA CAATGGCGCA GTCTCGCGGG ATCAAACGAT CCTGGGGCCG CTGGCCAGCA TCAATGTCAG CAATAATGCG TCTATCAATA CCGGGCGTGA TTTTATCATG CAAGGTGCAA GCCTCAATGT CGGTCAGGAT CTGCAGGTCA CGACTGGCGG CGACTGGCAA CTGGAGACGG TACAAACACG CGACCAGATA AGCACCCATG ATGGCCGTGG CAGTGCGACC AGTGAGCATA TTCGCCATCT GGGCAGTGAA GTGAATGTCG GCGGCGCGCT GACGGCCAAC GTCGACAATC TGACGGCGGT GGGGGCCAAC ATTAATGCCG CTACCCTTGA GGTGCAGGCG CAGAACATCA GCCTCAGCGC GGCCACCGAC AGCCTGCACG TTACCGGCGA ATCGTCGAGC AAGCGGCATA CCAGCTCGGT GAACCTCTAT GATGAAACGC TGCTTGGCAG CCAGTTGAAT GCCACGGGCG ATATCAATTT GCAGGCAGCG CAAGACATCA CCCTGCGAGC CAGTGCGGTA CAAACCGATG GCGCGCTGAC GCTGGCGGCG GGCGGGGATG TGCTCCTGAC CACCCAGACC GAGCAGCATG ACGAACAGCG CAATCATACC GGTCTCAGCA AAGGGATTGC ATCCAGCACC CTGACACGCA CCGAAGACAG TCTTAGCCAG ACACTGGCGG TGGGCTCGAT GCTCTCGGCG GGATCTATTG ATGTCAGCGG TAAAAATATC GCGGTGATGG GCAGCAACGT GGTGGCCGAC CAGGATATCA GCCTGCGTGC GCAGGAGAAC ATCACCGTCG GCACGGCGCA GCAGAGCGAG AGCGAATCGC ACCTGTTCGA ACAGAAAAAA TCGGGCCTGA TGAGCACCGG CGGTATCGGT GTCACGGTGG GCAGCAGCAG TACCAAAATG ACCGATTCTG GTCAATCGAT TTCCAGCGTG GGCAGCACGG TGGGCAGCGT ACTGGGCAAT GTCAGCATGA CCGCCGGTGA AGACCTGAGG GTGCAAGGTG CCGAGGTGTT GGCCGGTAAA GACATCAATC TGACCGGTAA AAACGTCAGT ATTCTGGCGG CGGAGAATCA GCTTACCCAG ATCCACACCG TCGAGCAAAA ACAGAGCGGC CTGACGCTGG CACTGTCCGG TGCGGTGGGC AGTGCCGTCA ATACCGCGGT GACCACCGCG AAAGCGGCCA GCGAAGAGAG CAGTGGCCGC TTGGGGGCAT TGCAGGGGGT TAAAGCGGCG CTCAATGGCG TACAGGCGGT GCAGGCCGGG CAGTTGGTGC AGGCGGAGGG GGGTGATACC GCCAGCATGT TCGGCATCAG TGCGTCCTTG GGCTCACAAA AATCGTCCTC GGAGCAACAT CAGGAACAGA CCCACGTGAC GGGCTCGACG CTGACGGCAG GCAACAATCT GACCATCAAT GCCACCGGTG AGGGGAATGC GGCAAACAGC GGCGATATTG TGGTGCAAGG CAGCCAGCTC CAGGCCGGTG GCGATACCAC GCTGGATGCG GCGCGTGATG TGCTGTTACT CGGCGCTGCC AACACACAAA AAACCGACGG CAGCAACAGC AGCAGTGGCG GCAGTGTTGG CGTCAGTCTG GGCATCAGTG GGGCCAGCAG TGGTCTGAGT ATTTTTGCCA ACGCCAATAA AGGTCAGGGA AGTGAGCACG GCGACGGCAT CTCCTGGACT GAAACGACCC TTGACAGCGG CGGCACGCTG TCGGTGCACA GTGGCCGCGA TACCTCACTG GTCGGTGCGC AGGTCAGCGG CGAAACGGTG AAGGTGGAGG TAGGCCGCGA CCTGTTGCTG CAAAGCCAGC AGGACAGCGA TAACTATGAC GCCAAACAGC AAAATAGCAG CGTTGGCGGC AGCTTCAGCC CTGGCTCCAT GACGGGCAGT ATCAGTATCA ATGGCAGCCA GGACAAGCTG AACAGCAACT TTGACTCGGT GCAGGAGCAG ACGGGTATCT TTGCCGGTTC GGGCGGCTTT GATATCACGG TGGGTGGACA TACCCAGCTT GACGGTGCGG TGATTGGCAG CACGGCGACA GCCGATAAAA ACACGCTGGA TACCGGGACA CTGGGCTTCA GTGATATCGA TAATCAAGCC GATTTCAAGG TTGAACATCA AAGTGTGGGT ATCAGCACCG GGGGGAATAT TGGCAGTCAG TTTGTTGGCA ATATGGCCAA CGGCTTGCTG GTCGGGGCCA ATAACGAAGG CCACGCCGAC AGCACCACCC ATGCGGCCGT TTCTGAAGGT ACGATCACGG TGCGCGACAC GGATAACCAG CAGCAGAATG TTGATGACCT GAGCCGTGAT GTGGAGCATG CCAACAATGC CCTTTCCCCT ATCTTTGATA AAGAGAAAGA GCAAAACCGG CTGAAGGAAG CGCAGCTTAT CGGCGAGATA GGCAGTCAGG TGGGGGATGT GTTCCGCACA CAGGGGCAGA TTATCGCCAC CCAGGCGGCG ACTGAAAAAA TGCAGGAGGT GAGTGAGGCT GATCGTGAGG CGGCAAAATC CAACTGGGAA AAAGCCAATC CGGGTCAGAT TGCAACGGCT GAAGATATCA ACGGTCAGGT TTATAAAACG GCCTATGATC AGGCATTCAA TGCATCGGGT TACGGCACCG GGGGTAAATT CCAGCAGGCG GTACAAGCGG CGACAGCGGC CCTCCAGGGG CTGGCGGGCG GAGATATAGC CAAAGCGATA GCGGGAGGCA GTGCGCCGTA TCTGGCGGAA GTGATAAAGC AAAGCACGGG TGATAACGAA GAAGCGCGAC TGGCGGCACA TGCGGTGGTC GGTTCTGTTC TGGCACATCT ACAGGGCAAT AGCGCGGTTG CGGGAGGCGC AGGTGCCTTG ACGGGTGAGA TAGCGGCTGA TTTAATCATG CAGCAGTTGT ACCCGGGACA AATGGTTAGT GAACTCAGCG AGACAGAAAA ACAGACCATC AGCGCGTTAA GTACATTAGC AGCAGGGCTG GCGGGGGGCT TGACGGGAGA CAGCAGCGCC GACGCGGTTG CGGGTGCGCA GGCGGGGAAA AATGCGGTAG AGAATAATGC GTTGGGTGCG AATGACTTCG GAAAAGGCAT GGCAGATTAC GGTCAATCTG TCGCTTCTTA CGCGCAATAT GCACAGGACA ATAATTTGCC ACCTGAGCAA ATCAAAGCTG ACATGGAACG AATGGTCAAA GGTGACTTGC CGGAAGGTTC TGACATTATC AAAGCGATTC TGGAGAACAA CCCCGGAACC GATACGATAA TGGCGTTGCT ATCAGCGGAA GATGCGAAAG ATTATGCTCT TGCATTACTA TCATCCATCC CAGCAGAACG AGTACTGGCG GTTGTAGGTA AGGCTACGAA TGTCATCACG AATAAGATGC TGATCAGTGC TGCGGAGAAG ATCTCGACGG CGAAACCCGG TGTGCAATCG CCAGTTCCTA GAGATTTGAA TGAGCAAATT GTTTGGAAGC AGGTGCAGGA AAACCCTGCT AAAGGAGAAA TACTGCCTGG AATGAATAAT GATCCTCGTT TTCCTGCAAG TGCAGGATTC CAAAAAATGC AGGTAGTCCA GAAAAATGCT AATGGAGAAT CAATCACTGT TCATTATCAG TACAACTCTA CTACTGGCAA ATCCTACGAC ATGAAAATTG ATACTCCTCA GCGGGTTAAC TCTAATCCGG CAGATGTGAT CGAGAATATT AAAGGGCAGA TTAAATGA
|
Protein sequence | MNKNLYRIIF NKVRGMMIVV ADIAASGRAS SSPSSGLGHT QHRRISALST LSFSLLLALG CVSLSVQAAI VADASAPGNQ QPTIINSANG TPQVNIQAPS SGGVSRNVYS QFDVDGRGVI LNNGHGVNQT ELGGFIDGNP WLARGEASII LNEVNSRDPS KLNGYIEVAG RKAQVVIANS AGITCEGCGF INANRVTLTT GQAQLNNGQL TGYDVERGDI VIQGTGMDSS RQDHTDLIAR SVKVNAGIWA NELSVTTGRN QVDAAHQNIN AKAADGSPRP TVAVDVAHLG GMYAGKIRLI GTESGVGVHN AGEIGASAGD ITITADGMLM NSGQINSSQQ LVVNTAADIE NTGVLYAQGN TQLTTAGTLS NSGTLAAAGD TSVRAVEVNS TRNSVLGAGV KSDNSAITSG TLSVEASGKI TAQGKNISGT AQRFTAHRLD LSGSQTQSRD ITLIAQGGEI DLTGAELLAS DRLSAATTAL LRTDNASLIA EQITLDAQAL SNVGGLIAHT GTTDFNLNLP GDVDNRGGTL LSSGTLSLQA ESLNSNGNSL LGAGVQSDGR LTEIGDLRVT TRQDLIAHGQ TLAAGTMALT GSRIDLADSY TQAREMTLTA NRGDISTQRA TVLALDTLSI NTAQTLNNQG GTLAGNTLAL DLGQFDNQGG QVTASQDLTI DLQRDFSHQA GSTLQAGRDL TLTSLGAVTN DGQLVAGGTL STHSDSLLNS GNLIATQAEL NATGALINHG EILTLGGLDT DSNTLFNTGS IISAEATLNA RERITNSGPD ALIGATDENG TLALLAPVIE NSDTVTHTDT APTTTILGMG TVILAGGHAS DGHYASAAQV LNLSGLIESG KDMLIYATTL TNSRHILTAN TDFIVADTVT GTAVWTAENP DIPGGRYAEP PDGGADNSDY IGTEYTSVIA YNGIDQISPE AQLLAGGNLT PQVGTLENFW SKVSAQGEID LTGVILQQDG WGDQQRLMEQ TTSSGVWRYR TYKGGLWAWA WGPEVSERAT SEYASSFTAK TLSGSGTTIN NGANPGAIAP PADRDNSGKD LAIEFNGISL TPPNGGLYQF TTDHTVGGGG YLIETHPAFA NLNNWRGSDY VLQQLNNDPD VIFKRLGDNA YEQRLVRDQV LALTGQAVAS DYRSAQEQFE ALFAAGLEYS KAFNIALGTH LSAEQMAALT HNIVLMETRD VAGQTVLVPV VYLAGVKPGD LQANGALIAA ENISLTEVQG FTNAGAITAT NDLKISMAQD ITLNNRGGLL QAGGDMQLST LNSDIDLTSA RINATNLQLD SGRDVILRTD SAQLSSDNGA VSRDQTILGP LASINVSNNA SINTGRDFIM QGASLNVGQD LQVTTGGDWQ LETVQTRDQI STHDGRGSAT SEHIRHLGSE VNVGGALTAN VDNLTAVGAN INAATLEVQA QNISLSAATD SLHVTGESSS KRHTSSVNLY DETLLGSQLN ATGDINLQAA QDITLRASAV QTDGALTLAA GGDVLLTTQT EQHDEQRNHT GLSKGIASST LTRTEDSLSQ TLAVGSMLSA GSIDVSGKNI AVMGSNVVAD QDISLRAQEN ITVGTAQQSE SESHLFEQKK SGLMSTGGIG VTVGSSSTKM TDSGQSISSV GSTVGSVLGN VSMTAGEDLR VQGAEVLAGK DINLTGKNVS ILAAENQLTQ IHTVEQKQSG LTLALSGAVG SAVNTAVTTA KAASEESSGR LGALQGVKAA LNGVQAVQAG QLVQAEGGDT ASMFGISASL GSQKSSSEQH QEQTHVTGST LTAGNNLTIN ATGEGNAANS GDIVVQGSQL QAGGDTTLDA ARDVLLLGAA NTQKTDGSNS SSGGSVGVSL GISGASSGLS IFANANKGQG SEHGDGISWT ETTLDSGGTL SVHSGRDTSL VGAQVSGETV KVEVGRDLLL QSQQDSDNYD AKQQNSSVGG SFSPGSMTGS ISINGSQDKL NSNFDSVQEQ TGIFAGSGGF DITVGGHTQL DGAVIGSTAT ADKNTLDTGT LGFSDIDNQA DFKVEHQSVG ISTGGNIGSQ FVGNMANGLL VGANNEGHAD STTHAAVSEG TITVRDTDNQ QQNVDDLSRD VEHANNALSP IFDKEKEQNR LKEAQLIGEI GSQVGDVFRT QGQIIATQAA TEKMQEVSEA DREAAKSNWE KANPGQIATA EDINGQVYKT AYDQAFNASG YGTGGKFQQA VQAATAALQG LAGGDIAKAI AGGSAPYLAE VIKQSTGDNE EARLAAHAVV GSVLAHLQGN SAVAGGAGAL TGEIAADLIM QQLYPGQMVS ELSETEKQTI SALSTLAAGL AGGLTGDSSA DAVAGAQAGK NAVENNALGA NDFGKGMADY GQSVASYAQY AQDNNLPPEQ IKADMERMVK GDLPEGSDII KAILENNPGT DTIMALLSAE DAKDYALALL SSIPAERVLA VVGKATNVIT NKMLISAAEK ISTAKPGVQS PVPRDLNEQI VWKQVQENPA KGEILPGMNN DPRFPASAGF QKMQVVQKNA NGESITVHYQ YNSTTGKSYD MKIDTPQRVN SNPADVIENI KGQIK
|
| |