Gene Information |
Locus tag | YpsIP31758_1519 |
Symbol | |
ID | 5385391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 1762407 |
End bp | 1770173 |
Gene Length | 7767 bp |
Protein Length | 2588 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640864501 |
Product | hemagglutinin/adhesin repeat-containing protein |
Protein accession | YP_001400497 |
Protein GI | 153949798 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies); [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
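The coordinate and length fields in the record above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the reported gene span includes the stop codon; the function name `check_cds_lengths` is hypothetical and not part of any database tool.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and an unspliced CDS whose
    span includes the terminal stop codon.
    """
    # Inclusive coordinates: the span covers both the start and end base.
    span = end_bp - start_bp + 1
    # A complete CDS is a whole number of codons.
    codons, remainder = divmod(span, 3)
    # The last codon is the stop codon and encodes no amino acid.
    expected_protein_length = codons - 1
    return (
        span == gene_length_bp,
        remainder == 0,
        expected_protein_length == protein_length_aa,
    )


# Values copied from the Gene Information fields above.
print(check_cds_lengths(1762407, 1770173, 7767, 2588))  # (True, True, True)
```

For this record the span is 7767 bp, which is 2589 codons: 2588 amino acids plus one stop codon.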
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA ACCTGTACCG TATTATTTTT AACAAAGTGC GGGGGATGAT GATCGTCGTC GCAGATATTG CTGCTTCTGG TCGGGCGTCC TCTTCGCCTT CATCAGGATT AGGGCACACG CAACACCGCC GTATCAGTGC CTTGTCCACG CTAAGTTTCA GCCTGTTACT GGCGCTGGGC TGTGTCTCGC TCTCCGTTCA GGCGGCGATT GTCGCCGATG CCAGTGCGCC GGGCAACCAG CAACCCACCA TTATCAACAG CGCCAACGGC ACGCCACAGG TCAATATCCA GGCCCCCAGC AGCGGCGGGG TTTCACGTAA CGTTTACAGC CAGTTTGATG TTGATGGCCG CGGCGTGATC CTCAATAACG GCCACGGTGT CAACCAGACC GAGCTCGGTG GTTTTATCGA CGGCAACCCG TGGCTGGCGC GGGGTGAGGC CAGCATTATC CTCAACGAAG TCAACAGCCG CGACCCCAGT AAACTCAATG GGTATATCGA GGTGGCGGGG CGCAAGGCAC AGGTGGTTAT CGCTAACTCG GCGGGCATTA CCTGCGAGGG CTGCGGTTTT ATCAACGCCA ACCGTGTGAC CCTGACGACC GGTCAGGCGC AGCTTAATAA CGGCCAGCTC ACCGGTTACG ATGTGGAGCG GGGGGACATT GTTATTCAGG GCACCGGCAT GGACAGCAGC CGTCAGGATC ATACCGACCT GATCGCCCGC TCCGTCAAAG TGAATGCCGG GATCTGGGCC AACGAGCTGA GTGTCACCAC CGGGCGCAAT CAGGTGGATG CCGCCCATCA AAACATTAAC GCCAAAGCCG CCGATGGCAG CCCGCGGCCA ACCGTGGCGG TGGATGTCGC CCATTTGGGG GGCATGTACG CCGGTAAAAT TCGCCTGATT GGCACTGAAA GCGGTGTCGG TGTGCACAAT GCGGGCGAGA TAGGGGCGTC GGCGGGCGAT ATTACGATAA CGGCCGATGG CATGCTGATG AACAGCGGCC AGATCAACAG CAGCCAACAG TTGGTGGTCA ATACCGCCGC GGATATAGAG AACACCGGTG TGCTCTATGC ACAGGGCAAT ACCCAACTCA CCACGGCGGG TACACTCAGC AACAGCGGCA CCCTCGCGGC GGCGGGGGAC ACCTCTGTCC GAGCGGCGGA GGTGAACAGC ACCCGCAATT CTGTTCTGGG GGCGGGTGTG AAGTCCGACA ACAGCGCCAT TACCAGTGGC ACACTGAGCG TTGAAGCTAG CGGGAAGATC ACCGCCCAGG GAAAAAATAT CAGCGGCACG GCGCAGCGTT TCACCGCACA CCGTCTTGAT CTGAGTGGCA GCCAGACGCA AAGCCGTGAT ATCACGCTCA CTGCAAAAGG GGGTGAGATC GATCTGACCG GCGCTGAACT GTTAGCCAGT GATCGCCTGT CGGCTGCAAC CACCGCGTTA TTACGTACCG ATAACGCCAG CCTGATCGCC GAACAGATTA CGCTCGACGC GCAGGCACTC TCTAATGTCG GGGGCCTGAT AGCCCACACC GGGACGACAG ATTTTAATCT GAATCTGCCG GGCGATGTCG ATAACCGGGG CGGCACCCTC CTCTCAAGCG GCACCCTCTC GCTACAGGCG GAAAGCCTGA ACAGCAACGG CAACAGCCTG CTTGGGGCGG GAGTACAAAG TGATGGCCGC CTGACGGAGA TCGGTGACCT GAGGGTGACC ACCCGTCAGG ACCTGATCGC ACACGGGCAG ACCCTTGCCG CGGGTACCAT GGCGTTAACC GGCAGCCGGG TTGATCTGGC CGACAGTTAC 
ACGCAAGCCC GTGAGATGAC CCTCACCGCC AACCGTGGCG ATATCAGCAC CCAGCGCGCC ACTGTACTCG CCCTTGACAC ACTGAGCATC AACACCGCTC AAACCCTCAA TAATCAGGGG GGCACGCTGG CGGGCAACAC GCTCGCGTTG GATCTGGGGC AGTTTGATAA CCAAGGTGGC CAGGTGACGG CCAGCCAGGA TCTGACCATC GATTTACAGC GTGATTTCAG CCACCAGGCG GGCTCGACCC TTCAGGCTGG GCGTGATTTA ACCTTGACAT CCCTGGGCGC GGTCACCAAT GACGGGCACC TGGTGGCGGG GGGCACACTC AGTACCCACT CGGACAGCTT ACTGAATAGC GGCAACCTGA TCGCGACGCA AGCAGAGCTC AACGCCACCG GCGCATTGAT TAACCATGGC GAGATATTGA CCCTCGGTGG GCTTGATACC GACTCCAACA CCCTGTTCAA CACGGGCAGT ATTATTAGCG CCGAAGCCAC ACTTAACGCA CGGGAGCGTA TTACCAACTC CGGCCCTGAC GCCCTGATCG GTGCTACCGA TGAAAACGGC ACCTTAGCCC TGCTGGCCCC GGTGATTGAA AACAGCGATA CCGTCACTCA CACTGACACC GCGCCGACCA CCACGATTTT AGGCATGGGC ACGGTTATTC TGGCCGGCGG GCAAGCGAGT GATGGCCATT ACACGTCTGC GGCCCAGGTG CTCAACCTTT CCGGCCTGAT CGAATCCGGC AAAGACATGT TGATTTACGC CACGACGCTG ACCAACAGCC GCCATATTTT GACCGCCAAC ACCGACTTTA TCGTGGCCGA TACGGTGACA GGCACGGCTG TCTGGACGGC AGAAAACCCC GATATTCCAG GCGGGCGCTA TGCTGAACCG CCGAATGGCG GTGCCGATAA CAGCGATTAT ATCGGCACAG AGTATACGTC GGTTATCGCC TATAACGGCA TCGATCAGAT CAGCCCGGAA GCGCAACTGC TGGCGGGGGG AAACCTGACA CCGCAGGTGG GCACGCTGGA GAATTTCTGG AGCAAAGTGA GTGCACAGGG CGAGATTGAT CTCACCGGCG TCACCCTGCA ACAGGATGGC TGGGGTGACC AGCAACGCCT GATGGAGCAG ACCACCTCCA GCGGTGTCTG GCGCTACCGA ACCTACAAAG GCGGTTTGTG GGCATGGGCG TGGGGACCTG AAGTCAGTGA GCGCGCCACC AGTGAATATG CCTCAAGTTT TACCGCAAAA ACACTCAGCG GCAGTGGCAC GACCATTAAC AATGGGGCCA ACCCCGGTGC CATCGCACCG CCTGCCGATC GCGATAATAG CGGCAAAGAT CTGGCGATCG AATTTAACGG GATCTCGCTG ACACCGCCGA ATGGCGGGCT GTATCAGTTC ACAACCGACC ACACCGTCGG CGGTGGCGGT TATCTGATCG AAACCCACCC GGCGTTTGCC AACCTGAATA ACTGGCGCGG GTCAGATTAC GTGCTCCAGC AGTTGAACAA TGACCCGGAT GTGATATTCA AACGTCTGGG GGATAACGCC TATGAACAGC GGCTGGTGCG GGATCAGGTG CTGGCATTGA CAGGCCAGGC GGTGGCCAGT GATTACCGCA GTGCACAAGA GCAGTTCGAG GCACTGTTTG CGGCGGGCCT TGAGTACAGC AAGGCGTTCA ATATTGCCCT TGGTACCCAC CTCAGTGCGG AGCAGATGGC GGCCCTGACC CACAATATCG TGCTGATGGA AACCCGTGAC GTCGCCGGGC AAACCGTATT AGTCCCCGTG GTCTATCTGG 
CGGGGGTTAA ACCGGGCGAT CTGCAGGCCA ACGGGGCATT GATCGCGGCA GAGAATATCA GCCTGACCGA GGTTCAGGGG TTCACCAATG CGGGGGCGAT AACCGCCACG AATGACCTGA AAATCAGCAT GGCGCAAGAT ATCACGCTGA ATAACCGTGG TGGCTTGCTT CAGGCGGGCG GCGATATGCA GCTCAGCACA CTGAACAGCG ATATCGACCT GACCAGCGCG CGGATCAATG CCACCAACCT GCAACTGGAC AGCGGCCGCG ATGTGATATT GCGTACCGAC AGTGCGCAGC TCAGTAGCGA CAATGGCGCA GTCTCGCGGG ATCAAACGAT CCTGGGGCCG CTGGCCAGCA TCAATGTCAG CAATAATGCG ACTATCAATA CCGGGCGTGA TTTTATCATG CAAGGCGCAA GCCTCAATGT CGGTCAGGAT CTGCAGGTCA CGACTGGCGG CGACTGGCAA CTGGAGACGG TACAAACACG CGACCAGATA AGCACCCATG ATGGCCGTGG CAGTGCGACC AGTGAGCATA TTCGCCATCT GGGCAGTGAA GTGAATGTCG GCGGCGCGCT GACCGCCAAC GTCGACAATC TGACGGCGGT AGGGGCCAAC ATTAATGCCG CTACCCTTGA GGTGCAGGCG CAGAACATCA GCCTCAGCGC GGCCACCGAC AGCCTGCACG TTACCGGCGA ATCGTCGAGC AAGCGGCATA CCAGCTCGGT GAACCTCTAT GATGAAACCC TGCTTGGCAG CCAGTTGAAT GCCACGGGCG ATATCAATTT GCAGGCGGCG CAAGACATCA CCCTGCGAGC CAGTGCGGTA CAAACCGATG GCGCGCTGAC ACTGGCAGCG GGCGGGGATG TGCTCCTGAC CACCCAGACT GAGCAGCATG ACGAACAGCG CAATCATACC GGTCTCAGCA AAGGGATTGC ATCCAGCACC CTGACACGCA CCGAAGACAG TCTTAGCCAG ACACTGGCGG TGGGCTCGAT GCTCTCGGCG GGATCTATTG ATGTCAGCGG TAAAAATATC GCGGTGATGG GCAGCAACGT GGTGGCCGAC CAGGATATCA GCCTGCGTGC GCAGGAGAAC ATCACCGTCG GCACGGCGCA GCAGAGCGAG AGCGAATCGC ACCTGTTCGA ACAGAAAAAA TCGGGCCTGA TGAGCACCGG CGGTATCGGT GTCACGGTGG GCAGCAGCAG TACCAAAATG ACCGATTCTG GTCAATCGAT TTCCAGCGTG GGCAGCACGG TGGGCAGCGT ACTGGGCAAT GTCAGCATGA CCGCCGGTGA AGACCTGAGG GTGCAAGGTG CCGAGGTGTT GGCCGGTAAA GACATCAATC TGACCGGTAA AAACGTCAGT ATTCTGGCGG CGGAGAATCA GCTTACCCAG AGCCACACCG TCGAGCAAAA ACAGAGCGGC CTGACACTGG CACTGTCCGG TGCGGTGGGC AGTGCCGTCA ATACCGCAGT GACCACCGCG AAAGCGGCCA GCGAAGAGAG CAGTGGCCGC TTGGGGGCAT TGCAGGGGGT TAAAGCGGCG CTCAATGGCG TGCAGGCGGT GCAGGCTGGG CAGTTGGTGC AGGCGGAGGG GGGCGATGCT GCCAGCATGT TCGGCATCAG TGCGTCCTTG GGCTCACAAA AATCGTCCTC GGAGCAACAT CAGGAACAGA CCCACGTGAC GGGCTCGACC CTGACGGCAG GCAACAATCT GACCATCAAT GCCACCGGTG AGGGGAATGC GGCAAACAGC GGCGATATTG TGGTGCAAGG CAGCCAGCTC CAGGCCGGTG GCGATACCAC 
GCTGGATGCG GCGCGTGATG TGCTGCTACT CGGCGCTGCT AACACACAAA AAACCGACGG CAGCAACAGC AGCAGTGGCG GCAGTGTTGG CGTCAGTCTG GGTGTCAGTG GGGCCAGCAG TGGTCTGAGT ATTTTTGCCA ACGCCAATAA AGGTCAGGGA AGTGAGCACG GCGACGGCAT CTCCTGGACT GAAACGACCC TTGACAGCGG CGGCACGCTG TCGCTGTACA GTGGCCGCGA TACCTCACTG GTCGGTGCGC AGGTCAGCGG CGAAACGGTG AAGGTGGAGG TGGGCCGCGA CCTGTTGCTG CAAAGCCAGC AGGACAGCGA TAACTATGAT GCGAAGCAGC AAAGTAGCAG TGTTGGCGGC AGTTTCAGCC CTGGCTCCAT GACGGGCAGT ATCAGTATCA ATGGCAGCCA GGACAAGCTG AACAGCAACT TTGACTCGGT GCAGGAGCAG ACGGGTATTT TTGCCGGTTC GGGCGGCTTT GATATCACGG TGGGTGGACA TACCCAGCTT GACGGTGCGG TGATTGGCAG CACGGCGACG GCCGATAAAA ACACGCTGGA TACCGGGACA CTGGGCTTCA GTGATATCGA TAATCAAGCC GATTTCAAGG TTGAACATCA AAGTGTGGGT ATCAGCACCG GGGGGAATAT TGGCAGTCAG TTTGTTGGCA ATATGGCCAA CGGCTTGCTG GTCGGGGCCA ATAACGAAGG CCACGCCGAC AGCACCACCC ATGCGGCCGT TTCTGAAGGT ACGATCACGG TGCGCGACAC GGATAACCAG CAGCAGAATG TTGATGACCT GAGCCGTGAC GTGGAGCAGG CCAACAATGC CCTTTCCCCT ATCTTTGATA AAGAGAAAGA ACAAAACCGG CTGAAGGAAG CGCAGCTTAT CGGCGAGATA GGCAGTCAGG TGGGTGATGT GTTCCGAACG CAAGGGCAGA TTATCGCCAC CCAGGCGGCG AATGAAAAAA TGCAGGGGGT GAGTGAGGCT GATCGTGAGG CGGCGAAAGC CAACTGGGAA AAAGCCAATC CGGGTCAGAT TGCAACGGCT GAAGATATCA ACGGTCAGGT TTATAAAACG GCCTATGATC AGGCATTCAA TGCATCGGGT TACGGCACCG GGGGTAAATT CCAGCAGGCG GTACAAGCGG CGACAGCGGC CCTCCAGGGG CTGGCGGGCG GAGATATAGC CAAAGCGATA GCGGGAGGCA GTGCGCCGTA TCTGGCGGAA GTGATTAAGC AAAGCACGGG TGATAACGAA GAAGCGCGAC TGGCGGCACA TGCGGTGGTC GGTTCTGTTC TGGCACATCT ACAGGGCAAT AGCGCGGTTG CGGGAGGCGC AGGTGCCTTG ACGGGTGAGA TAGCGGCTGA TTTAATCATG CAGCAGTTGT ACCCGGGAAA AATGGTTAGC GAACTCAGCG AGACAGAAAA ACAGACCATC AGCGCGTTAA GTACATTAGC AGCAGGGCTG GCGGGGGGCT TGACGGGAGA CAGCAGCGCC GACGCGGTTG CGGGTGCACA GGCGGGGAAA AATGCGGTAG AGAATAACTC GCTGAACCCG AACGACTTCG GTAAGGGCAT GGCAGACATA GGGATGTCGC AAACCTCGCT CGGTGCTTCC ATGCTGCAAA GTGGAGCTTC ACCGGATGAA ATCGCAGCGG CCCTGATCAA AAATGCCCAG GGAGATATGC CGGAAGGTCA GGATGCAGTT AAAGGCCTAT TGATCGCCTG GGGCGAGTTC TTCGGGGTGC CGGTCAGTGC GTTGACGGCA AATGGAGAAA TGACGCCAGA GAAAGCGGCG 
GAAATCCTAG CCAGTGGAGT GCCGACCAGT GAAGCTAAAC TGGTTCAATA TGTATTTGCG AAAGCGTTTT TGTCAGTGAC GAAAGCTGTC TATCCTGAAG GGATTAGTTT CAAGATCACC CAACCCGAAC ATTTAGCGAA ATTGGATGGA TATTCGCAGA GAAAGGGTAT TAGTGGCACG CATAATGCGG ATGCATTCTA TTCAACAGTC AATGATAAAG GGGTAAAAGT CATTGGCGAA ACTCAATCTA ATATTAAAGG TATAAATGAA GTTAAATACC AGATACCTTC CTATGATAGA GCTGGTAATG TTATAGGATA TAAAGCTCAA GTGTTTACTA AAACGATCTA TGATCCTAAG GTCTTCACTG ATCAAAAAAT ATTAGATTTG GGACAGCAGG CTGCAAGTAG TGGCTATAAG GCTGCTATTG CTTCGGGCCA ACGAGAATAT ACAGCATCAG CTGGTGGAAT TCAGTTCCAA GTCTACTTAG ATAAAAAAAC AGGAATAGTA GAAAACTTCT TCCCGGTGAC TAACTGA
|
Protein sequence | MNKNLYRIIF NKVRGMMIVV ADIAASGRAS SSPSSGLGHT QHRRISALST LSFSLLLALG CVSLSVQAAI VADASAPGNQ QPTIINSANG TPQVNIQAPS SGGVSRNVYS QFDVDGRGVI LNNGHGVNQT ELGGFIDGNP WLARGEASII LNEVNSRDPS KLNGYIEVAG RKAQVVIANS AGITCEGCGF INANRVTLTT GQAQLNNGQL TGYDVERGDI VIQGTGMDSS RQDHTDLIAR SVKVNAGIWA NELSVTTGRN QVDAAHQNIN AKAADGSPRP TVAVDVAHLG GMYAGKIRLI GTESGVGVHN AGEIGASAGD ITITADGMLM NSGQINSSQQ LVVNTAADIE NTGVLYAQGN TQLTTAGTLS NSGTLAAAGD TSVRAAEVNS TRNSVLGAGV KSDNSAITSG TLSVEASGKI TAQGKNISGT AQRFTAHRLD LSGSQTQSRD ITLTAKGGEI DLTGAELLAS DRLSAATTAL LRTDNASLIA EQITLDAQAL SNVGGLIAHT GTTDFNLNLP GDVDNRGGTL LSSGTLSLQA ESLNSNGNSL LGAGVQSDGR LTEIGDLRVT TRQDLIAHGQ TLAAGTMALT GSRVDLADSY TQAREMTLTA NRGDISTQRA TVLALDTLSI NTAQTLNNQG GTLAGNTLAL DLGQFDNQGG QVTASQDLTI DLQRDFSHQA GSTLQAGRDL TLTSLGAVTN DGHLVAGGTL STHSDSLLNS GNLIATQAEL NATGALINHG EILTLGGLDT DSNTLFNTGS IISAEATLNA RERITNSGPD ALIGATDENG TLALLAPVIE NSDTVTHTDT APTTTILGMG TVILAGGQAS DGHYTSAAQV LNLSGLIESG KDMLIYATTL TNSRHILTAN TDFIVADTVT GTAVWTAENP DIPGGRYAEP PNGGADNSDY IGTEYTSVIA YNGIDQISPE AQLLAGGNLT PQVGTLENFW SKVSAQGEID LTGVTLQQDG WGDQQRLMEQ TTSSGVWRYR TYKGGLWAWA WGPEVSERAT SEYASSFTAK TLSGSGTTIN NGANPGAIAP PADRDNSGKD LAIEFNGISL TPPNGGLYQF TTDHTVGGGG YLIETHPAFA NLNNWRGSDY VLQQLNNDPD VIFKRLGDNA YEQRLVRDQV LALTGQAVAS DYRSAQEQFE ALFAAGLEYS KAFNIALGTH LSAEQMAALT HNIVLMETRD VAGQTVLVPV VYLAGVKPGD LQANGALIAA ENISLTEVQG FTNAGAITAT NDLKISMAQD ITLNNRGGLL QAGGDMQLST LNSDIDLTSA RINATNLQLD SGRDVILRTD SAQLSSDNGA VSRDQTILGP LASINVSNNA TINTGRDFIM QGASLNVGQD LQVTTGGDWQ LETVQTRDQI STHDGRGSAT SEHIRHLGSE VNVGGALTAN VDNLTAVGAN INAATLEVQA QNISLSAATD SLHVTGESSS KRHTSSVNLY DETLLGSQLN ATGDINLQAA QDITLRASAV QTDGALTLAA GGDVLLTTQT EQHDEQRNHT GLSKGIASST LTRTEDSLSQ TLAVGSMLSA GSIDVSGKNI AVMGSNVVAD QDISLRAQEN ITVGTAQQSE SESHLFEQKK SGLMSTGGIG VTVGSSSTKM TDSGQSISSV GSTVGSVLGN VSMTAGEDLR VQGAEVLAGK DINLTGKNVS ILAAENQLTQ SHTVEQKQSG LTLALSGAVG SAVNTAVTTA KAASEESSGR LGALQGVKAA LNGVQAVQAG QLVQAEGGDA ASMFGISASL GSQKSSSEQH QEQTHVTGST LTAGNNLTIN ATGEGNAANS GDIVVQGSQL 
QAGGDTTLDA ARDVLLLGAA NTQKTDGSNS SSGGSVGVSL GVSGASSGLS IFANANKGQG SEHGDGISWT ETTLDSGGTL SLYSGRDTSL VGAQVSGETV KVEVGRDLLL QSQQDSDNYD AKQQSSSVGG SFSPGSMTGS ISINGSQDKL NSNFDSVQEQ TGIFAGSGGF DITVGGHTQL DGAVIGSTAT ADKNTLDTGT LGFSDIDNQA DFKVEHQSVG ISTGGNIGSQ FVGNMANGLL VGANNEGHAD STTHAAVSEG TITVRDTDNQ QQNVDDLSRD VEQANNALSP IFDKEKEQNR LKEAQLIGEI GSQVGDVFRT QGQIIATQAA NEKMQGVSEA DREAAKANWE KANPGQIATA EDINGQVYKT AYDQAFNASG YGTGGKFQQA VQAATAALQG LAGGDIAKAI AGGSAPYLAE VIKQSTGDNE EARLAAHAVV GSVLAHLQGN SAVAGGAGAL TGEIAADLIM QQLYPGKMVS ELSETEKQTI SALSTLAAGL AGGLTGDSSA DAVAGAQAGK NAVENNSLNP NDFGKGMADI GMSQTSLGAS MLQSGASPDE IAAALIKNAQ GDMPEGQDAV KGLLIAWGEF FGVPVSALTA NGEMTPEKAA EILASGVPTS EAKLVQYVFA KAFLSVTKAV YPEGISFKIT QPEHLAKLDG YSQRKGISGT HNADAFYSTV NDKGVKVIGE TQSNIKGINE VKYQIPSYDR AGNVIGYKAQ VFTKTIYDPK VFTDQKILDL GQQAASSGYK AAIASGQREY TASAGGIQFQ VYLDKKTGIV ENFFPVTN
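Both sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to join such blocks and translate an excerpt, using the first 30 bases of the gene sequence. The helper names are hypothetical, and the codon lookup is deliberately partial: it contains only the codons that occur in the excerpt, which have the same meaning in translation table 11 as in the standard genetic code.

```python
# Partial codon lookup: only the codons present in the excerpt below.
# A full translation would need the complete table 11 codon set.
CODONS = {
    "ATG": "M", "AAT": "N", "AAG": "K", "AAC": "N", "CTG": "L",
    "TAC": "Y", "CGT": "R", "ATT": "I", "TTT": "F",
}


def join_blocks(blocked):
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(blocked.split()).upper()


def translate(dna):
    """Translate complete codons; codons missing from CODONS become 'X'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
excerpt = "ATGAATAAGA ACCTGTACCG TATTATTTTT"
dna = join_blocks(excerpt)
print(translate(dna))  # MNKNLYRIIF
```

The translated excerpt matches the first block of the protein sequence. `gc_fraction` applied to the excerpt describes only those 30 bases; the GC content field in the record refers to the full gene.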