Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0341 |
Symbol | |
ID | 5387008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 400246 |
End bp | 404829 |
Gene Length | 4584 bp |
Protein Length | 1527 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640863311 |
Product | YD repeat-/RHS repeat-containing protein |
Protein accession | YP_001399335 |
Protein GI | 153950146 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTCG CACAAAGTAT TGGCACCGGG TATGGCGCGG CTGGCGCGCA ATCGGCATTA CGTCAAACCG CGCTGGGGCA ACAGAGCCCG GCGCGGACGG ATTATCAGGT CAGTAATCCC AATGTGGGGA ATATCGCGCG AGCGAGTGAT TCTCTGCTGA ATGTGGCGGA AAGTCAGCAG TTTAACACTC TGGTGACCGC CGGTTTTGGG ATGCAGGCGA TTGCCGCCGG GGTGGCCGTT CCCTTTACCG TCGGGCTGGC CGCCGGGATG GCCGGGGGAT ATGTCGGTGC CAAAATCGGC AATAAGTTAG GTCACGGTAT TGCCCGTGCG CTTAATTTTA ATCAAGTGGC GACGGAAGGA GAAAGCCCCG CGCACCTGGG CCATCCCATT GCCCATCAGA AAAAAGATTG GGGAGTGTGG GGGGCCATTG GCGGTATTTT ACTGGGGGCA GCGGCGGCGG CATTGGTGGT CGTCACCTTT GGTACGGGGT TGGTGGTGAT TGCCGCGGCG GCGGCCGCAG CCGGATTAAT CGGCGGTATC GCCGCCGCGA CCGGGGCGGC ATTAGGACAA TACGGTGATA ACAAAGGCGT GATTGCTGAA GGCTCCGCCA ACGTCTTCTT TGAAGGCCAG CCGGTGGCCC GGGTGGGGGA TAAGATCCAA TGCAGCGATC ACCCCAGTTC ACCGCCGCCG ATGATTGCCG AGGGCGCAAA AACCGTGTTT GCCAACCAAA AGCAGATCGC GCGTCTGGGG CACCGTACCA CCTGCGATGG CAATATCAAT GCGGGATGCG GTTCGCTCGC CATCACGCAG GAAACGGCTT ATGTGTATGA AGTGGCAGAC AGCCGTAATC CGTATTTACG CTGGTCAGCG GTTATTCTGA GTTTCTTGCC AATACAGAAA AAATTTGAGC AAGGGTTTCG CTCCTTAAAA AAGCCGCCCA ATACCGCGGT CAACGCTACC CATAACTGCC CGACCGGCAG TGATCCTGTT GATATGGTCT CCGGCGATTA CCTTCAGGTC TGGCCCGTGA TTGATATTGC CGGTGTTTTG CCTGTCCGGT TACAGCGTAC TTATCGTTCC GGCGATTATT TCACTGTCTC GGGTTGCTTT GGTCATAAAT GGGCGGATAG CTGGTCACAA CACTTGGTCG TGCACGAAGA CAACATTGAT TACATTGACG AGGAAGGGGT GGGGCTGAGC TTCTTTACGC CAGAGAATAA AGTTCAGGCC GTCAATCTGT ATAACCTGCG TTATGAGTTA GTGGGGGAGC GTCATGGCGA ACTCAGGGTT TTTGACCGGT CAACTCAGCA GACACTGCAT TTTAATCAAC AGCAACAACA GCAGCGTTAT TTATCGGCTA TCACCGACCG CAAAGGGAAC CGAATCGATT TTCGTTACCA ACAGGGGGAA CTGATATCGG TCGAGCACAG TGACGGCTAT GTGCTGGAGA TCGACAGCCG TGGCAGGACA ATACACGCGG TCGAGCTTGT CACGCAGGAG AAAAGACAAA AGCTGTTGCA GAGTACATTC AGTGAGCGCG GTTACCTCGT TCAGTGCCAG AGTTTCCAGT ATGGCACGCT GTCGCATGAA TATGATCCGA AGGGGTATAT GGTGCGCTGG CGGGATACGG ATAGCACGGA TGTGGCTGTC CGTTATGATA TTTCAGGGCG GGTGGTGGCA CTAAAAACAT CGACCGGTTT TTTTGCTGAC CATTTTATTT ATCATGACAA AGAACGCTAT ACGATCTATC GCGATGGGGA AGGGGGAGAA ACCGGTTATC ACTATAATGA AAACAATCTG CTGATAAAGT TAGTCGATCC GTTAGGCAAT ATCACCCTCA CCGACTGGGA CTTGACCCAG AAAATAAAAG AAACCGATGC ATTAGGGCGT ATCACCCGGT TTATTTACAA TGAGCGCGGT GATTTAACCG CCGTGATCCA CCCGGATGAA ACCCGTACCG AATATGAGTA TAACCCGTCT GGTGTGGTCA CTGCGTTTAC GTCTTCGGCG GGGGATAGCT GGCAATATCA GTATGACAGG CAGGGATTAT TACGACAGGT AACTTATCCT TCCGGCCAGA CGATGTCGTT TCGCTACGGC AAGAAGGGAG AAGTGCTGCG CAAGATCATC GCGGAAGATC AGGTGTGGCG TTACCACTAT GATCACCACG GTTGTTTAAG CACCATTATC GATCCGAAGG GGAACAGTAC AGCCGTCACA CTGGATGTGC TGGGGCGGTT GTTTTCTCAC CAAAACGCCC TCGGGGAGCT CACCCGTTAC ACACACAGTG ACGCGCATGC CAGTCCGGCG GGCAGCGTCA CAAAAATGGT CATGCCGGAT GGTGTGGAGC AGGCCATTGC CTATGACAGC GAGAAGCGTA TTGCCGCGCT GACCGATGGC GCGGGGAAAA CCACCCGTTA CGAATACGGG GGTTTTGATT TACTGACCGG CCTGATACGC CCGGACGGGC AGCGGCTGAC CTTCGGCTAT GACACACTGA CGCGCCTGAA TCAGGTGACC AATGCGTCAG GTGACACCTA TCGTTATACC CGTGATCGGG CCGGGCAGGT GATCAGCGAA ACCGATTTTA CCGGGCGCAC GGTTCATTAT CAGTATGATG CGGTGGGGCG GCGTATCGGG GCGCGTTACC CGGATCAGCG CCTTGTTCGC TGGCACTACT CGATGCAGGA TCAGGTGCTG GCGCAGCAGA CCTGGCACTG TGATGCACTG AGTTCCACGC TGGTTGGCAC GGTCAGCTAC GGTTATGACC GTGCCGGTCG TTTACTGAGC GCGACCAATG CGGATGCGGT GGTGGAGTTT GATTACGACG AGGCGGGTCA GTTGGTTGCG GAGCGGCTGA ATGGCCGGGA GGTGCGGCAT CAGTGGGATG CGCTAAACGG CACCCCTGTT GCCCGACAGG TGGGTGAGCT GGGGCTGACG TTTGTGTACG GCGCACAGGG CGAACTGACC CGGCTACAGC TGGCGGGCCA TCAGCCCCTT CAGTTGCAGC ATGACCGGTT GGGGCGAGAG ACGGTACGTG AAAGCGCGGC CGGGTTTATT CAGGCCTGCA ATTATACCCC GAGCGGGCTG TTGGCCCATC AGGCGGCCGG GCGTAACTCG GCATTATTCC AGCAACAGTT GATCGCCCCG GAGAGCCCGG CCTTGCACGG CAGTGCGGTC AACCGCAGTT GGCAGTATGA CCGGGCTTAT AACGTGGTGG GGATGGATGA CGGGCGTTGG GGTAAAACGC AGTACCAATA TGACCGCAAT GACCAGGTCG TCCGCGCTGA TTTTGGTGGT TTTTTGCCGT TGCAGGAGCA GTTTAGCTAC GATGTGAACC AGAACCTGCG CGAGCATCGC TGTTTACCGC GTGGGGCACA GGCGGTTTTG GCGCAGGCGA GCCAGCAACA GCAGGCGGGC CGGGTGGTGA AGCGCGGTGA CAGTGAGTAC CGTTACGATG CTGGCGGGCG ACTGGTGGAG AAACGCAGCC AAAAAGACGG CTACCGGCCA CAGCTGTGGC GCTACCGCTG GAATGAGCAG GACCAGTTAT CCGAATTGAT CACGCCGACG GGGGCGCGCT GGCGCTACGG TTATGATGCC TTTGGGCGAC GTATTCGAAA ACTGCGGGTA GTGGATACCC CGCCGCTCAA TGAGATGGAC GCTCCTTCCA CCGGTCCGGC CACGGCATCA CTGGCCGGTT ACGCGTACCT GTGGAGCGGC GACCAATTAA TCGAAGAGGT GCCGGTGTAT GCCGACGGCA CGGTGGCGTA TGAGCAGGGG ATCCACTGGC TGTATGCGCC GGGCGGATTA ACGCCGATGG CCCGCTACGC GCAAGGCAAA CTGCATTATG TGGTGGCGGA TCATCTGGGT ACGCCGCGGG AGTTGTTGAA TGAGCAGGGC AAGGTGGTGT GGGCGAGCCG TCTGAGCACG TGGGGGCAGG CGGAATTATG GCGACAGGCG GCGAATGAGG AGGATCGGGT CAGCTGTAAC CTGCGTTTTG CGGGTCAGTA CGCGGATGCA GAATCCGGGT TGCATTACAA CCGGTTCCGC TACTATGATG GCGAGACGGG ACAATATCTA TGCCCGGACC CGATAGGGCT GGCGGGGGGA TTAAACCCGT ACGGGTATGT GCATAATCCG GTGAAGTATG TGGATCCGCT GGGGTTGTGT AAAACTGATG TAGCAAGAGA GCGCCAAGCC CAAATGCTGC AAGATGATGT AGGTTATAAT ATAAGCCCTA AGAGTTGGGA TCAGTTCCCT TCAATTGGTA GAGATGGTTC GTTTATTACG GATAAAAAAG GGGCCCTTAA ATATTTTAAT GGTATGCAAA CTGGTAATGT AACTATTTCT AAGTCAGTAG CAGCATCTAT TGAAAAGGAT ATGGGATTAA GTCTTGGTTC TTTAAACGGT GGATTTAATA TAAGAAAAAT TGACGGCATT TCTAATATGC AACCACGAAG CCCATTGAGT GGTAATGATT ACTTCCTCGG TCCAGGTCAG CATTTACCGG GGGGCGCTCC TGAAATGGTT ATAAACTCTG TGCCGACATC AACACCAGTC ACTATAAGGG TAAGTGTAAA ATGA
|
Protein sequence | MTVAQSIGTG YGAAGAQSAL RQTALGQQSP ARTDYQVSNP NVGNIARASD SLLNVAESQQ FNTLVTAGFG MQAIAAGVAV PFTVGLAAGM AGGYVGAKIG NKLGHGIARA LNFNQVATEG ESPAHLGHPI AHQKKDWGVW GAIGGILLGA AAAALVVVTF GTGLVVIAAA AAAAGLIGGI AAATGAALGQ YGDNKGVIAE GSANVFFEGQ PVARVGDKIQ CSDHPSSPPP MIAEGAKTVF ANQKQIARLG HRTTCDGNIN AGCGSLAITQ ETAYVYEVAD SRNPYLRWSA VILSFLPIQK KFEQGFRSLK KPPNTAVNAT HNCPTGSDPV DMVSGDYLQV WPVIDIAGVL PVRLQRTYRS GDYFTVSGCF GHKWADSWSQ HLVVHEDNID YIDEEGVGLS FFTPENKVQA VNLYNLRYEL VGERHGELRV FDRSTQQTLH FNQQQQQQRY LSAITDRKGN RIDFRYQQGE LISVEHSDGY VLEIDSRGRT IHAVELVTQE KRQKLLQSTF SERGYLVQCQ SFQYGTLSHE YDPKGYMVRW RDTDSTDVAV RYDISGRVVA LKTSTGFFAD HFIYHDKERY TIYRDGEGGE TGYHYNENNL LIKLVDPLGN ITLTDWDLTQ KIKETDALGR ITRFIYNERG DLTAVIHPDE TRTEYEYNPS GVVTAFTSSA GDSWQYQYDR QGLLRQVTYP SGQTMSFRYG KKGEVLRKII AEDQVWRYHY DHHGCLSTII DPKGNSTAVT LDVLGRLFSH QNALGELTRY THSDAHASPA GSVTKMVMPD GVEQAIAYDS EKRIAALTDG AGKTTRYEYG GFDLLTGLIR PDGQRLTFGY DTLTRLNQVT NASGDTYRYT RDRAGQVISE TDFTGRTVHY QYDAVGRRIG ARYPDQRLVR WHYSMQDQVL AQQTWHCDAL SSTLVGTVSY GYDRAGRLLS ATNADAVVEF DYDEAGQLVA ERLNGREVRH QWDALNGTPV ARQVGELGLT FVYGAQGELT RLQLAGHQPL QLQHDRLGRE TVRESAAGFI QACNYTPSGL LAHQAAGRNS ALFQQQLIAP ESPALHGSAV NRSWQYDRAY NVVGMDDGRW GKTQYQYDRN DQVVRADFGG FLPLQEQFSY DVNQNLREHR CLPRGAQAVL AQASQQQQAG RVVKRGDSEY RYDAGGRLVE KRSQKDGYRP QLWRYRWNEQ DQLSELITPT GARWRYGYDA FGRRIRKLRV VDTPPLNEMD APSTGPATAS LAGYAYLWSG DQLIEEVPVY ADGTVAYEQG IHWLYAPGGL TPMARYAQGK LHYVVADHLG TPRELLNEQG KVVWASRLST WGQAELWRQA ANEEDRVSCN LRFAGQYADA ESGLHYNRFR YYDGETGQYL CPDPIGLAGG LNPYGYVHNP VKYVDPLGLC KTDVARERQA QMLQDDVGYN ISPKSWDQFP SIGRDGSFIT DKKGALKYFN GMQTGNVTIS KSVAASIEKD MGLSLGSLNG GFNIRKIDGI SNMQPRSPLS GNDYFLGPGQ HLPGGAPEMV INSVPTSTPV TIRVSVK
|
| |