Gene Information |
Locus tag | YpsIP31758_0333 |
Symbol | |
ID | 5387593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 391335 |
End bp | 395591 |
Gene Length | 4257 bp |
Protein Length | 1418 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640863303 |
Product | RHS/YD repeat-containing protein |
Protein accession | YP_001399327 |
Protein GI | 153949101 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
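The coordinate and length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(391335, 395591)
print(length)                     # 4257, matching "Gene Length"
print(protein_length_aa(length))  # 1418, matching "Protein Length"
```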
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGAAG CGGCCCGTGT TGATGACAAG CTTTATCATT CCAGTGCCTT AGCGGGTTTT ATTATTGGCT CCATTATTGG TGCCGCCGTG ATTTTTGCGG CCGCGGCTTA CGCCGCCTCC ATTGTTCTCA CCGGCGGGGC GACGCTGGTC GCTACCGGCT TTATTGTGGG TATGGGGGTG ACCACGCTGG GCGTCGTTGC CGGTGGGTTA ATACGCTCCG TGGGCGAAAA AATAGGGAGC ATGTGCCATC ACGATGTCGG ACAAATTACG ACAGGGTCCA AAAACGTTAA AGTGAACAGT AAACGGGCGG CGCATGTCGA GCTCAGTACC GTGGCCTGTA AAGATGACTC CGCCATTCAG CGCATGGCCG AAGGTTCGTC AAATATCTTT ATTAACAGTA AAGCCGCCGT TCGTCTGGAA GATAAAACGA CCTGTGATGC GGTTGTCGAT TCCGCTTCCA GCAATGTGAC GTTTGGTGGG GGGCGCGTTC AGTATCTCGA TATTAAACGC GAGATTTCTG ATGAAATGCG TGATTTGTCA GAGAAGCTGT TTATTGTCGC CGGGCTGGCG GGCGGCATAT TTGGGGCGGC AAAACAGGCG GGGTGTTTCG GCCTTAAATG CCTGAGCAAG ATTGCGTTGG GTGAGATGGC CGGGGCGGCT GCCGGGTATG GGCTGGAAAA AGGGGTTGGG GCCATCGCCG GTTATTTCGG TTACCCGGTT GATGTGATCA GTGGACAGAA ATTGCTGACA GGTGAGGGCG ATGATACCGA TTTTATTCTG CCGGGTATCT TCCCGCTGCA CTGGAGCCGG ATTTATCGCA GTGAAAATCA CCATGTCGGG GCGCTGGGAC AAGGCTGGTC TCTGGTATGG GAGCGTTCAT TACGCAAAGA AGATGACAGC ATTGTTTATC AGAATGATGA AGGTCGGGAG ATTGTCTTTC CCCTGATTAA ACGTGGAGAG CGCTATTTCT CCCCCACGGA GCATATCTGG CTGGCACGTA CCGAGCGTGA TACCTATGCC ATCAGCAGCC CGTTTGAAAC CTGTTTTATT TTTGAGGCCT TTTCTGAGGC TGGCGTTGCG AAATTAGCCA GCCTCGAAGA TCTCAATGGT CATGCCCTGT ATTTCTCTTA TGACGATATC GGGCAACTGA AAAAAATATC GACCACCAGC GGTTATGGGG TGTATTGCCA GTATGAAAAA GGGCGTCTGG TGTCCGTTGC CTGCGTCAAG GGCGGTACGC CGGGCACACT GGTCCGCTAC CAGTATAATG AACAGCACCA GTTGGTCAGC GTCACTAACC GTGAGGGGCA AATCACCCGC CAGTTTGGTT ACCATGGCCA TCTGATCAAT AAACTGGCGG ATGTCAGGGG GCTGGAGTGC CGTTACACAT GGGCTGATAT CGGCGGAACC CCGCGAATTA CGCACAGTGC CACCAATCTG GGGGAGCAGT GGCAGTTTGA TTATGATATC GACAATCAAC AGACCACCCT GACGGACCTC AATACCGGGC AGACCGCCTG CTGGGGATAT AACGCCCAAC ATTTAATTAC CGACTATCGG GATTTTGATG GCGGGAAATA TGCATTTGAC TACAACGACC TCAATATGCC GGTACGCGTT GTGCTGGCAG GCGAGAGAAC GCTCGTTCTG GCTTACGATG CACTGGCGCG CCCGATCCAG ATCACCGATC CGCTAAAACG TGAAATCCAC ATTGATTATC ACCGTAACAG TCTGCGGGTG ATGCGCCGTC AGTACCCTGA CGGGCAGGTC TGGAAGGGGG AATATGACCG AACCGGCCGT 
TTGCTGAAAG AGAACGCGCC GGATGGCGGG GTGACGCGTT ATCATTATCC GGGGGCCTCA TCCCTTCCTG AACGCATAAC CAATGCCGTA GGGGCGCAGA CACACTTTGG CTGGGAAAGG CACGGGCAAC TGACGGAGCA CACCGACTGT TCGGGTAAAC TGACCCGCTA CGAATATGAT ATCGATGGCC ATCTGCTGAC GGTCATCGAT GCTGAAAACC ATGCAACACA TTACAGCTAC AACCGTCTCG GGCAGCTCAC CGGGGTCAGG TACGCCGATG GCCGCAAAGA GCAGTTGCGG TATAACGCTC AGGGACTGGT TGAACAGTTT ACCGATCCTG TCGGGCGGCA GTTGCACTGG CGTTATAACC TGCGGGGTCA GCCGGTCAGC TTTACTGATC GTCTGCAACG GCAATACCGT TACCGCTATG ACTGCCATGG GCAGATGATT GAGCTGGATA ATGCCAATGG TGGCCAGTAT CACTTCCGGT GGAGCAGCGG CGGGCAATTG GTGGAAGAGC AGTATCCCGA TAACCTTGTC CGGCGTTATC GCTATGGGGA GAGCGGGATG CTGATGGCGC TGGAGACCAC CGCGCCCACG GTTGACGATC TTACCGTCTC CCGGCAGGTC AGTTTTGACT ATGATGCGGG CGGGCGAATG ACGCAGCGCC TGACGGGCAT GAGTGCGACC CGGTATGACT GGGACATTAT GGACCGTTTA TTGCTGGCCG AGCGTGTGCC AACGGCGGTG GGCGAACAGG CGGGGATCGT CGGTCATGGT GTTCGTTTGG CGTATGACAA GGCCGGGCAT TTACTGACGG AAAGCGGTGA CCTGGGTGCG GTGACGTATC AGTGGGATCC GCTGCATCAC CTGGCCGCCC TGACGCTGCC CGATGGGCAG ACGCTGTCAT GGTTGCGTTA CGGTGCGGGC CATGTCAGTG CCATTCGTCA TGGTGATACG CTTATTTCCG AGTTCAGCCG GGATAATCTT CATCGGGAAG TGAGCCGGAC CCAGGGTATT TTGACGCAGT ATCGTGATTA TGACGCGATG GGGCGGCGGT TGTGGCAATC GGCGGGTTCT GATGCGCCGA CAGTGGCGGC CGATCTGCTG CCCCGTCAGG GGGATATCTG GCGTAAATTT AGCTTTGACA CTGCCGGTGA ACTGAGCATG GCCACCGATT TTATCCGGGG TGAGCAGCAG TACCGTTATG ATGCGGAAGG GCGGCTGACT GACAGCCGGG AGCGTCATCA GTTATCCGTT GCGGAGGATT TTGCTTACGA CAATGCGGAT AACCTGCTGA ACCTGAGGAA ACTGCCGTTT GACACCGTCG ATCCACTGTA CGATACACCG GTCGCCAACA ACCGTTTGAC GCAATGGCAG CATTACCGTT TTGAGTATGA TGCCTGGGGA AACATGACCA CGCGGCATGC CGGTGGTCGG ATGCAACATT TTGCCTATGA CGATGATAAC CGGCTGCTGC GGGCCTGGGG AACCGGGCCG TTAGGGGAGC ATGACAGCCA CTATCGGTAT GATGCGCTGG GGCGGCGTAT CCACAAATCG GTGACGATAA AGCGCGGCGC AGAAAAAACC ACCCGTCAGA CCGATTTTAT CTGGCAGGGG TTGCGGTTAT TGCAGGAGCA ACATGCGGAC GGTAACGCGA CCTATATTTA CGACCCGAAC GAAAGTTATA CGCCGCTGGC GCGGGTCGAT CAGCGTCATG GCGAGACAGA AAGTCAGGTG TATTATTTTC ATACGGATAT CAACGGTACC CCGCTGGATG TCACGGACGG AGAGGGTAAG CACCGTTGGT 
CAGGGAAATA CCACGCCTGG GGCAAAGTTA CCCGGCAGAA TGTCAGCGAT CCAAGGCAAA GCACGGTCAG CCGGTTCGCG CAGCCGCTGC GTTATCCGGG GCAATACAGT GATGACGAGA CGGGTTTGCA CTACAATACG TTCAGGTACT ATGACCCGGA GATAGGGCGA TTTAGTACGC AGGACCCGAT AGGGCTGGCG GGGGGGATAA ATCTTTATCA GTATGGGCCA AATCCGCTAG GTTGGGTGGA TCCTTTAGGA TGGATGCCTT GGGCGTGGAA TCCAAATGGT ATGGGGCATC ACCTTATTCC TCGGAATAAA GCTAATAGCA TTGGACTTAC TGAGCTAGGA ACGAAATTAA ATACGCCTAC TTTCTTCCCA GACCCTTATC AGGCTGGTAT GCATGAGGAA CTGCATAGAG CAATTAAAAA CGATATAGGG AAAATTCAAG GTCCTTGGAA AGGTTCTGCA GCCGATTTAT TTGAAGCTAC TGGTAGAAAT TTAGATTCCG TCTCTCATAT TCGAGGGGAT TTACGTATTC CTTCAACTGG AGAAGTTATT GCTAGAAATG TCACTCCTAA AGAAGCTCAT TCAAGATTAA CTGAATGGTT TAATAATAAA AAGTCAGGTG GTGGAGGTGG TTGTTAA
|
Protein sequence | MFEAARVDDK LYHSSALAGF IIGSIIGAAV IFAAAAYAAS IVLTGGATLV ATGFIVGMGV TTLGVVAGGL IRSVGEKIGS MCHHDVGQIT TGSKNVKVNS KRAAHVELST VACKDDSAIQ RMAEGSSNIF INSKAAVRLE DKTTCDAVVD SASSNVTFGG GRVQYLDIKR EISDEMRDLS EKLFIVAGLA GGIFGAAKQA GCFGLKCLSK IALGEMAGAA AGYGLEKGVG AIAGYFGYPV DVISGQKLLT GEGDDTDFIL PGIFPLHWSR IYRSENHHVG ALGQGWSLVW ERSLRKEDDS IVYQNDEGRE IVFPLIKRGE RYFSPTEHIW LARTERDTYA ISSPFETCFI FEAFSEAGVA KLASLEDLNG HALYFSYDDI GQLKKISTTS GYGVYCQYEK GRLVSVACVK GGTPGTLVRY QYNEQHQLVS VTNREGQITR QFGYHGHLIN KLADVRGLEC RYTWADIGGT PRITHSATNL GEQWQFDYDI DNQQTTLTDL NTGQTACWGY NAQHLITDYR DFDGGKYAFD YNDLNMPVRV VLAGERTLVL AYDALARPIQ ITDPLKREIH IDYHRNSLRV MRRQYPDGQV WKGEYDRTGR LLKENAPDGG VTRYHYPGAS SLPERITNAV GAQTHFGWER HGQLTEHTDC SGKLTRYEYD IDGHLLTVID AENHATHYSY NRLGQLTGVR YADGRKEQLR YNAQGLVEQF TDPVGRQLHW RYNLRGQPVS FTDRLQRQYR YRYDCHGQMI ELDNANGGQY HFRWSSGGQL VEEQYPDNLV RRYRYGESGM LMALETTAPT VDDLTVSRQV SFDYDAGGRM TQRLTGMSAT RYDWDIMDRL LLAERVPTAV GEQAGIVGHG VRLAYDKAGH LLTESGDLGA VTYQWDPLHH LAALTLPDGQ TLSWLRYGAG HVSAIRHGDT LISEFSRDNL HREVSRTQGI LTQYRDYDAM GRRLWQSAGS DAPTVAADLL PRQGDIWRKF SFDTAGELSM ATDFIRGEQQ YRYDAEGRLT DSRERHQLSV AEDFAYDNAD NLLNLRKLPF DTVDPLYDTP VANNRLTQWQ HYRFEYDAWG NMTTRHAGGR MQHFAYDDDN RLLRAWGTGP LGEHDSHYRY DALGRRIHKS VTIKRGAEKT TRQTDFIWQG LRLLQEQHAD GNATYIYDPN ESYTPLARVD QRHGETESQV YYFHTDINGT PLDVTDGEGK HRWSGKYHAW GKVTRQNVSD PRQSTVSRFA QPLRYPGQYS DDETGLHYNT FRYYDPEIGR FSTQDPIGLA GGINLYQYGP NPLGWVDPLG WMPWAWNPNG MGHHLIPRNK ANSIGLTELG TKLNTPTFFP DPYQAGMHEE LHRAIKNDIG KIQGPWKGSA ADLFEATGRN LDSVSHIRGD LRIPSTGEVI ARNVTPKEAH SRLTEWFNNK KSGGGGGC
|
| |
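The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This is a sketch only: it uses the amino acid assignments that translation table 11 shares with the standard code, ignores table 11's alternative start codons (this gene begins with ATG, so that does not matter here), and the names `CODON_TABLE` and `translate` are hypothetical. Whitespace between the 10-base blocks is stripped before translating.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino acid string used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTTTGAAG CGGCCCGTGT TGATGACAAG"))  # MFEAARVDDK
```

The printed residues match the first block of the protein sequence field. For routine work an established library is a better choice than a hand-built table.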