Gene YpsIP31758_2640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2640 
SymbolrumB 
ID5385608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2994144 
End bp2995274 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content52% 
IMG OID640865629 
Product23S rRNA methyluridine methyltransferase 
Protein accessionYP_001401605 
Protein GI153949749 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR02085] 23S rRNA (uracil-5-)-methyltransferase RumB 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0000831969 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTGCG CACAGTATGC GGCAGGTCGC TGCCGTTCTT GTCAGTGGTT GGATAAACCC 
TATCCACAAC AACTCGCTGA TAAACAGCAT CATCTGGAAA GCCTGTTGGC TGGGCATGCG
GTCACTCAGT GGCTAGCACT GGTTTTTGGG CGCGAAAGTG CCTTTCGTAA CAAGGCGAAA
ATGGTCGTCA GCGGCAGTGT GGAACGCCCG TTACTGGGTA TGCTGCATCG TGACGGTACA
CCGGTTGATT TATGCGCATG TCCACTTTAT CCGCCCAGCT TCGAACCGGT ATTTACGGTA
CTGAAAACCT TTATTGCCAG GGCGGGTTTG ACTCCCTATA ACGTTGCTCG CAAGCGTGGC
GAACTTAAAT TCCTGTTACT CACGGAAAGT ACCTACAACG GTGAATTGAT GCTGCGTTTT
GTGCTGCGTT CTGAAACTAA ATTAGCGCAG TTAACTGCTG CGTTGCCGTG GCTGCAACAA
CAGTTGCCGC AGTTGGCGGT GATCTCGGCT AATATTCAGC CAGTGCATAT GGCCATTCTG
GAAGGGGAGC GGGAGATCCC GCTGACCGAA CAACAGGCCC TGCCTGAGCG GTTTAATCAG
GTGCCGTTGT ATATCCGCCC ACAAAGTTTT TTCCAGACCA ATCCACCGGT GGCGGCTTCG
TTGTATGCAA CAGCACGGCA GTGGGTGCAG GAGCATGAGG TTCACAGTAT GTGGGATCTG
TTCTGTGGTG TGGGCGGCTT TGGTTTACAT TGTGCGGGGC CAGAGACTCA ATTGACCGGT
ATTGAAATCA GTGCTGAAGC TATCGCCTGT GCCCGCCAGT CGGCTGAGCA GTTAGGGCTA
AAAAATGTCA GTTTCGCCGC GCTGGATTCT ACCCGCTTTG CTACCGCTGA AGCCCAAATA
CCTGAACTGG TCTTGGTGAA TCCACCACGG CGGGGGATCG GCCGCGAGTT ATGTGATTAC
CTGAGCCAGA TGGCACCTAA ATTTATTCTC TATTCAAGTT GTAATGCAGA GACGATGGCG
AAAGATATCA GTTTGCTTGC GGGTTACCAC ATTGAACGGG TACAGCTGTT TGATATGTTC
CCGCATACCA GTCACTACGA AGTGCTCACC TTGCTGGCTC TCCGTCGCTA G
 
Protein sequence
MHCAQYAAGR CRSCQWLDKP YPQQLADKQH HLESLLAGHA VTQWLALVFG RESAFRNKAK 
MVVSGSVERP LLGMLHRDGT PVDLCACPLY PPSFEPVFTV LKTFIARAGL TPYNVARKRG
ELKFLLLTES TYNGELMLRF VLRSETKLAQ LTAALPWLQQ QLPQLAVISA NIQPVHMAIL
EGEREIPLTE QQALPERFNQ VPLYIRPQSF FQTNPPVAAS LYATARQWVQ EHEVHSMWDL
FCGVGGFGLH CAGPETQLTG IEISAEAIAC ARQSAEQLGL KNVSFAALDS TRFATAEAQI
PELVLVNPPR RGIGRELCDY LSQMAPKFIL YSSCNAETMA KDISLLAGYH IERVQLFDMF
PHTSHYEVLT LLALRR