Gene EcSMS35_2923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2923 
SymbolrumA 
ID6146651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2996951 
End bp2998252 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content51% 
IMG OID641617792 
Product23S rRNA 5-methyluridine methyltransferase 
Protein accessionYP_001744947 
Protein GI170681551 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR00479] 23S rRNA (uracil-5-)-methyltransferase RumA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00164007 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAAT TCTACTCTGC AAAACGACGC ACGACGACGC GTCAGATCAT AACCGTTTCA 
GTCAATGACC TCGACTCTTT TGGTCAGGGC GTGGCGCGAC ATAACGGCAA AACGTTATTT
ATTCCTGGAT TATTGCCACA GGAAAACGCG GAAGTTACTG TTACTGAAGA TAAAAAACAG
TACGCCCGCG CTAAAGTCGT ACGCCGGTTA AGCGATAGCC CGGAACGCGA AACGCCACGC
TGCCCTCATT TTGGCGTATG CGGTGGCTGT CAGCAACAAC ACGCCAGCGA GGATTTACAG
CAGCGAAGCA AAAGTGCGGC ACTCGCCCGA TTAATGAAAC ACGAAGTCTC TGAAGTGATC
GCCGATGTTC CCTGGGGCTA TCGCCGTCGC GCGCGTTTAA GTTTGAACTA CTTACCGAAA
ACACAGCAAC TTCAGATGGG GTTTCGCAAA GCGGGCTCCA GTGACATTGT CGACGTTAAA
CAATGCCCCA TTTTAGTGCC CCAACTTGAA GCATTGCTGC CCAAAGTCAG GGCATGCCTG
GGCAGCTTAC AAGCTATGCG CCATCTTGGT CATGTTGAAC TGGTACAGGC AACCAGCGGC
ACGCTGATGA TTTTGCGCCA TACCGCACCG CTAAGTTCGG TAGATCGCGA AAAACTGGAA
CGCTTTTCGC ATTCTGAAGG CCTGGATCTG TATCTCGCCC CCGATAGTGA GATACTCGAA
ACCGTCTCTG GTGAGATGCC CTGGTATGAC TCAAACGGGT TGCGCTTAAC TTTTAGCCCG
CGCGATTTTA TTCAGGTCAA TGCGGGTGTG AACCAAAAAA TGGTAGCGCG TGCGTTGGAA
TGGCTGGATG TACAACCTGA AGATCGCGTA CTGGATCTGT TCTGCGGTAT GGGCAACTTT
ACACTGCCAT TGGCGACACA AGCTGCCAGT GTGGTGGGTG TAGAAGGCGT TCCGGCGCTG
GTGGAAAAAG GCCAGCAGAA TGCGCGTCTT AATGGCTTAC ACAATGTGAC GTTTTATCAC
GAAAATCTTG AAGAAGATGT CACAAAGCAG CCGTGGGCGA AAAACGGCTT CGATAAAGTG
TTGCTGGACC CGGCGCGAGC AGGTGCCGCA GGTGTTATGC AGCAAATTAT AAAACTGGAA
CCTATTCGTA TAGTTTATGT ATCCTGTAAT CCTGCAACGC TGGCTCGGGA TAGCGAAGCG
TTATTAAAAG CAGGATATAC CATTGCGCGA CTGGCGATGC TGGATATGTT CCCACACACG
GGACATCTGG AATCGATGGT ACTTTTCTCG CGCGTTAAAT AG
 
Protein sequence
MAQFYSAKRR TTTRQIITVS VNDLDSFGQG VARHNGKTLF IPGLLPQENA EVTVTEDKKQ 
YARAKVVRRL SDSPERETPR CPHFGVCGGC QQQHASEDLQ QRSKSAALAR LMKHEVSEVI
ADVPWGYRRR ARLSLNYLPK TQQLQMGFRK AGSSDIVDVK QCPILVPQLE ALLPKVRACL
GSLQAMRHLG HVELVQATSG TLMILRHTAP LSSVDREKLE RFSHSEGLDL YLAPDSEILE
TVSGEMPWYD SNGLRLTFSP RDFIQVNAGV NQKMVARALE WLDVQPEDRV LDLFCGMGNF
TLPLATQAAS VVGVEGVPAL VEKGQQNARL NGLHNVTFYH ENLEEDVTKQ PWAKNGFDKV
LLDPARAGAA GVMQQIIKLE PIRIVYVSCN PATLARDSEA LLKAGYTIAR LAMLDMFPHT
GHLESMVLFS RVK