Gene EcSMS35_0887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0887 
SymbolrumB 
ID6145152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp891150 
End bp892277 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content54% 
IMG OID641615775 
Product23S rRNA methyluridine methyltransferase 
Protein accessionYP_001742967 
Protein GI170681470 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR02085] 23S rRNA (uracil-5-)-methyltransferase RumB 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.848351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTGCG CACTTTACGA CGCGGGTCGC TGTCGTTCCT GTCAGTGGAT AACGCAGCCG 
ATTCCAGAGC AACTCTCCGC TAAAACCGCC GATCTTAAAA ATCTGCTCGC CGATTTTCCG
GTTGAGGAAT GGTGTGCGCC GGTGTCAGGC CCGGAACAAG GGTTTCGTAA TAAAGCCAAA
ATGGTGGTGA GTGGTAGTGT TGAAAAACCA CTGCTCGGTA TGCTGCATCG AGATGGTACA
CCGGAAGACC TTTGTGACTG CCCGCTTTAT CCAGCCTCAT TTGCGCCCGT TTTTGCGGCG
CTAAAACCCT TCATCGCCCG TGCGGGGTTA ACACCTTACA ACGTGGCGCG TAAACGTGGT
GAACTGAAAT ACATTCTGCT GACTGAAAGC CAGAGCGATG GCGGCATGAT GCTGCGTTTT
GTACTGCGTT CTGATACCAA ACTGGCGCAA CTGCGTAAGG CGCTGCCGTG GTTACAGGAA
CAATTACCGC AGCTGAAAGT TATTACCGTC AATATTCAGC CGGTACATAT GGCGATTATG
GAAGGGGAGA CGGAGATCTA CCTGACCGAA CAACAGGCGC TGGCGGAGCG TTTTAACGAT
GTGCCGCTGT GGATCCGTCC GCAAAGTTTC TTCCAGACCA ATCCGGCGGT CGCCAGCCAG
CTTTACGCTA CCGCGCGCGA CTGGGTGCGG CAACTACCGG TAAACCATAT GTGGGATCTC
TTCTGCGGCG TGGGGGGCTT TGGTTTACAC TGCGCGACGC CTGACATTCA GTTAACCGGG
ATCGAAATTG CACCAGAGGC CATTGCCTGT GCGAAGCAGT CAGCCGCTGA ACTGGGCTTA
ACGCGTTTGC AATTTCAGGC GCTGGACTCT ACGCAGTTTG CCACCGCCCA GGGGGAAGTG
CCGGAGCTGG TGCTGGTTAA CCCGCCGCGC CGCGGCATTG GTATACCGCT GTGTGATTAT
CTCTCAACGA TGGCACCGCG TTTTATCATA TACTCCAGCT GTAACGCCCA AACCATGGCG
AAAGATATCC GCGAACTGCC AGGTTACCGT ATTGAACGGG TACAGCTTTT TGATATGTTC
CCGCACACCG CGCACTATGA AGTGCTGACG CTGCTGGTGA AGCAATAA
 
Protein sequence
MQCALYDAGR CRSCQWITQP IPEQLSAKTA DLKNLLADFP VEEWCAPVSG PEQGFRNKAK 
MVVSGSVEKP LLGMLHRDGT PEDLCDCPLY PASFAPVFAA LKPFIARAGL TPYNVARKRG
ELKYILLTES QSDGGMMLRF VLRSDTKLAQ LRKALPWLQE QLPQLKVITV NIQPVHMAIM
EGETEIYLTE QQALAERFND VPLWIRPQSF FQTNPAVASQ LYATARDWVR QLPVNHMWDL
FCGVGGFGLH CATPDIQLTG IEIAPEAIAC AKQSAAELGL TRLQFQALDS TQFATAQGEV
PELVLVNPPR RGIGIPLCDY LSTMAPRFII YSSCNAQTMA KDIRELPGYR IERVQLFDMF
PHTAHYEVLT LLVKQ