Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0330 |
Symbol | |
ID | 5387952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 385627 |
End bp | 389835 |
Gene Length | 4209 bp |
Protein Length | 1402 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640863300 |
Product | RHS/YD repeat-containing protein |
Protein accession | YP_001399324 |
Protein GI | 153949971 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.581458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGAAG CGGCCCGTGT TGATGACAAG CTTTATCATT CCAGTGCCTT AGCGGGTTTT ATTATTGGCT CCATTATTGG TGCCGCCGTG ATTTTTGCGG CCGCGGCTTA CGCCGCCTCC ATTGTTCTCA CCGGCGGGGC GACGCTGGTC GCTACCGGCT TTATTGTGGG TATGGGGGTG ACCACGCTGG GCGTCGTTGC CGGTGGGTTA ATACGCTCCG TGGGCGAAAA AATAGGGAGC ATGTGCCATC ACGATGTCGG ACAAATTACG ACAGGGTCCA AAAACGTTAA AGTGAACAGT AAACGGGCGG CGCATGTCGA GCTCAGTACC GTGGCCTGTA AAGATGACTC CGCCATTCAG CGCATGGCCG AAGGTTCGTC AAATATCTTT ATTAACAGTA AAGCCGCCGT TCGTCTGGAA GATAAAACGA CCTGTGATGC GGTTGTCGAT TCCGCTTCCA GTAATGTGAC GTTTGGTGGG GGGCGCGTTC AGTATCTCGA TATTAAACGC GAGATTTCTG ATGAAATGCG TGATTTGTCA GAGAAGCTGT TTATTGTCGC CGGGCTGGCG GGCGGCATAT TTGGGGCGGC AAAACAGGCG GGGTGTTTCG GCCTTAAATG CCTGAGCAAG ATTGCGTTGG GTGAGATGGC CGGGGCGGCT GCCGGGTATG GGCTGGAAAA AGGGGTTGGG GCCATCGCCG GTTATTTCGG TTACCCGGTT GATGTGATCA GTGGACAGAA ATTGCTGACA GGTGAGGGCG ATGATACCGA TTTTATTCTG CCGGGTATCT TCCCGCTGCA CTGGAGCCGG ATTTATCGCA GTGAAAATCA CCATGTCGGG GCGCTGGGAC AAGGCTGGTC TCTGGTATGG GAGCGTTCAT TACGCAAAGA AGATGACAGC ATTGTTTATC AGAATGATGA AGGTCGGGAG ATTGTCTTTC CCCTGATTAA ACGTGGAGAG CGCTATTTCT CCCCCACGGA GCATATCTGG CTGGCACGTA CCGAGCGTGA TACCTATGCC ATCAGCAGCC CGTTTGAAAC CTGTTTTATT TTTGAGGCCT TTTCTGAGGC TGGCGTCGCG AAATTAGCCA GCCTCGAAGA TATCAATGGT CATGCCCTGT ATTTCTTTTA TGACGATATC GGGCAACTGA AAAAAATATC GACCACCAGC GGCTATGGGG TGTATTGCCA GTATGAAAAA GGGCGTCTGG TGTCCGTTGC CTGTGTTAAG GGCGGTACGC CGGGCACACT GGTCCGCTAC CAGTATAATG AACAGCACCA GTTGGTTAGC GTCACTAACC GTGAGGGGCA AATCACCCGC CAGTTTGGTT ACCATGGCCA TCTGATCAAT AAACTGGCGG ATGTCAGGGG GCTGGAGTGC CGTTACACAT GGGCTGATAT CGGCGGAACC CCGCGAATTA CGCACAGTGC CACCAATCTG GGGGAGCAGT GGCAGTTTGA TTATGATATC GACAATCAAC AGACCACCCT GACGGACCTC AATACCGGGC AGACCGCCTG CTGGGGATAT AACGCCCAAC ATTTAATTAC CGACTATCGG GATTTTGATG GCGGGAAATA TGCATTTGAC TACAACGACC TCAATATGCC GGTACGCGTT GTGCTGGCAG GCGAGAGAAC GCTCGTTCTG GTTTACGATG CACTGGCGCG CCCGATCCAG ATCACCGATC CGCTAAAACG TGAAACCCAC ATTGATTATC ACCGTAACAG TCTGCGGGTG GTGCGCCGTC AGTACCCTGA CGGGCAGGTC TGGAAGGGGG AATATGACCG TACCGGCCGT TTGCTGAAAG AGAACGCGCC GGATGGCGGG GTGACGCTTT ATCATTATCC AGGGGCCTCA TCCCTTCCTG AACGCATAAC CAATGCCGTA GGGGCGCAGA CACACCTTGG TTGGGAAAGG CACGGGCAAC TGACGGAGCA CACCGACTGC TCGGGTAAAC TGACCCGCTA CGAATATGAT ATCGATGGCC ATCTGCTGAC GGTCATCGAT GCTGAAAACC ATTCAACACA TTACAGCTAC AACCGTCTCG GGCAGCCCAC CGGGGTCAGG TACGCCGATG GCCGCAAAGA GCAGTTGCGG TATAACGCTC AGGGGCTGGT TGAACAGTTT ACCGATCCTG TCGGGCGGCA GTTGCACTGG CGTTATAACC TGCGGGGTCA GCCGGTCAGC TTTACTGATC GTCTGCAACG GGAATACCGT TACCGCTATG ACTGCCATGG GCAGATGATT GAGCTGGATA ATGCCAATGG GGGCCAGTAT CACTTCCGGT GGAGCAGCGG CGGGCAATTG GTGGAAGAGC AGTATCCCGA TAACCTTGTC CGGCGTTATC GCTATGGGGA GAGCGGGATG CTGATGGCGC TGGAGACCAC CGCGCCCACG GTTGACGATC TTACCGTCTC CCGGCAGGTC AGTTTTGACT ATGATGCGGG CGGGCGAATG ACGCAGCGCC TGACGGGCAT GAGTGCGACC CGGTATGACT GGGACATCAT GGACCGTTTA TTGCTGGCCG AGCGTGTGCC AACGGCGGTG GGCGAACAGG CGGGGATCGT CGGTAATGGT GTTCGTTTGG CGTATGACAA GGCCGGGCAT TTACTGACGG AAAGCGGTGA CCTGGGTGCG GTGACGTATC AGTGGGATCC GCTGCATCAT CTGGCCGCCC TGACGCTGCC CGATGGTCAG ACGCTGTCAT GGTTGCGTTA CGGTGCGGGC CATGTCAGTG CCATTCGTCA TGGTGATACG CTTATTTCCG AGTTCAGCCG GGATAATCTT CATCGGGAAG TGAGCCGGAC CCAGGGTATT TTGACGCAGT ATCGTGATTA TGACGCGATG GGGCGGCGGT TGTGGCAATC GGCGGGTTCT GATGCGCCGA CAGTGGCGGC CGATCTGCTG CCCCGTCAGG GGGATATCTG GCGTAAATTT AGCTTTGACA CTGCCGGTGA ACTGAGCATG GCCACCGATT TTATCCGGGG TGAGCAGCAG TACCGTTATG ATGCGGAAGG GCGGCTGACT GACAGCCGGG AGCGTCATCA GTTATCCGTT GCGGAGGATT TTGCTTACGA CAATGCGGAT AACCTGCTGA ACCTGAGGAA ACTGCCGTTT GACACCGTCG ATCCACTGTA CGATACACCG GTCGCCAACA ACCGTTTGAC GCAATGGCAG CATTACCGTT TTGAGTATGA TGCCTGGGGA AACATGACCA CGCGGCATGC CGGTGGTCGG ATGCAACATT TTGCCTATGA CGATGATAAC CGGCTGCTGC GGGCCTGGGG AACCGGGCCG TTAGGGGAGC ATGACAGCCA CTATCGGTAT GATGCGCTGG GGCGGCGTAT CCACAAATCG GTGACGATAA AGCGCGGCGC AGAAAAAACC ACCCGTCAGA CCGATTTTAT CTGGCAGGGA CTGCGGTTAT TGCAGGAGCA ACATGCGGAC GGCAACGCGA CCTATATTTA CGACCCGAAC GAAAGTTATA CGCCGCTGGC GCGGGTCGAT CAGCGTCATG GCGAGACAGA AAGTCAGGTG TATTATTTTC ATACGGATAT CAACGGTACC CCGCTGGATG TCACGGACGG AGAGGGTAAG CACCGCTGGT CAGGGAAATA CCACGCCTGG GGCAAAGTTA CCCGGCAGAA TGTCAGCGAT CCAAGGCAAA GCACGGTCAG CCGGTTCGCG CAGCCGCTGC GTTATCCGGG GCAATACAGT GATGACGAGA CGGGTTTGCA CTACAATACG TTCAGGTACT ATGACCCGGA GATAGGGCGA TTTAGTACGC AGGACCCGAT AGGGCTGGCG GGGGGGGTGA ATCTTTATCA GTATGGGCCA AATCCGTTAA CGTGGATCGA TCCTTGGGGT TATACAGGAA CATATATTTT TACTGACGGT GTTGTATCTT ATATAGGTAA GGGCCCGTTA GGACGAATGG TAGCGTCTAT GGGACAAAGA ATTGGCGGTT CTTTGAATGC AATACAGTCG GCTCATTTGG ACTTTGGTAG TGATAAGTTA GGATTCATGG TAGAACATCG GATGATGGAA AAGTATGGTG CTCGTTATTC TCCCGACTTT GCTAACAGTG AACGCGTTGG TTCACCGGGA AAAAAATTAT ATGATGCCGC CGATTTGAAA ACACAAAAAA AGGTTGACCG TCTAGCTAAT AAATTAGATA AGAATTTTAA GTCATCTAAA GGATGTTAA
|
Protein sequence | MFEAARVDDK LYHSSALAGF IIGSIIGAAV IFAAAAYAAS IVLTGGATLV ATGFIVGMGV TTLGVVAGGL IRSVGEKIGS MCHHDVGQIT TGSKNVKVNS KRAAHVELST VACKDDSAIQ RMAEGSSNIF INSKAAVRLE DKTTCDAVVD SASSNVTFGG GRVQYLDIKR EISDEMRDLS EKLFIVAGLA GGIFGAAKQA GCFGLKCLSK IALGEMAGAA AGYGLEKGVG AIAGYFGYPV DVISGQKLLT GEGDDTDFIL PGIFPLHWSR IYRSENHHVG ALGQGWSLVW ERSLRKEDDS IVYQNDEGRE IVFPLIKRGE RYFSPTEHIW LARTERDTYA ISSPFETCFI FEAFSEAGVA KLASLEDING HALYFFYDDI GQLKKISTTS GYGVYCQYEK GRLVSVACVK GGTPGTLVRY QYNEQHQLVS VTNREGQITR QFGYHGHLIN KLADVRGLEC RYTWADIGGT PRITHSATNL GEQWQFDYDI DNQQTTLTDL NTGQTACWGY NAQHLITDYR DFDGGKYAFD YNDLNMPVRV VLAGERTLVL VYDALARPIQ ITDPLKRETH IDYHRNSLRV VRRQYPDGQV WKGEYDRTGR LLKENAPDGG VTLYHYPGAS SLPERITNAV GAQTHLGWER HGQLTEHTDC SGKLTRYEYD IDGHLLTVID AENHSTHYSY NRLGQPTGVR YADGRKEQLR YNAQGLVEQF TDPVGRQLHW RYNLRGQPVS FTDRLQREYR YRYDCHGQMI ELDNANGGQY HFRWSSGGQL VEEQYPDNLV RRYRYGESGM LMALETTAPT VDDLTVSRQV SFDYDAGGRM TQRLTGMSAT RYDWDIMDRL LLAERVPTAV GEQAGIVGNG VRLAYDKAGH LLTESGDLGA VTYQWDPLHH LAALTLPDGQ TLSWLRYGAG HVSAIRHGDT LISEFSRDNL HREVSRTQGI LTQYRDYDAM GRRLWQSAGS DAPTVAADLL PRQGDIWRKF SFDTAGELSM ATDFIRGEQQ YRYDAEGRLT DSRERHQLSV AEDFAYDNAD NLLNLRKLPF DTVDPLYDTP VANNRLTQWQ HYRFEYDAWG NMTTRHAGGR MQHFAYDDDN RLLRAWGTGP LGEHDSHYRY DALGRRIHKS VTIKRGAEKT TRQTDFIWQG LRLLQEQHAD GNATYIYDPN ESYTPLARVD QRHGETESQV YYFHTDINGT PLDVTDGEGK HRWSGKYHAW GKVTRQNVSD PRQSTVSRFA QPLRYPGQYS DDETGLHYNT FRYYDPEIGR FSTQDPIGLA GGVNLYQYGP NPLTWIDPWG YTGTYIFTDG VVSYIGKGPL GRMVASMGQR IGGSLNAIQS AHLDFGSDKL GFMVEHRMME KYGARYSPDF ANSERVGSPG KKLYDAADLK TQKKVDRLAN KLDKNFKSSK GC
|
| |