Gene ECH74115_4045 details

Gene Information

Locus tag: ECH74115_4045
Symbol: rumA
ID: 6971254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3740495
End bp: 3741796
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 51%
IMG OID: 643387807
Product: 23S rRNA 5-methyluridine methyltransferase
Protein accession: YP_002272250
Protein GI: 209399664
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase
TIGRFAM ID: [TIGR00479] 23S rRNA (uracil-5-)-methyltransferase RumA


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000364507
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.378409
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAAT TCTACTCTGC AAAACGACGC ACGACGACGC GTCAGATCAT AACCGTTTCA 
GTCAACGACC TCGACTCTTT TGGTCAGGGC GTGGCGCGAC ATAACGGCAA AACGCTATTT
ATCCCCGGAT TATTGCCGCA GGAAAACGCG GAAGTTACTG TTACTGAGGA TAAAAAACAG
TATGCCCGCG CTAAAGTCGT ACGCCGGTTA AGCGATAGCC CGGAACGCGA AACGCCACGC
TGTCCTCATT TTGGCGTATG CGGCGGCTGT CAGCAACAAC ACGCCAGCGT GGATTTACAG
CAGCGAAGCA AAAGTGCGGC ACTCGCCCGA TTAATGAAAC ACGAAGTCTC TGAAGTGATC
GCCGATGTTC CCTGGGGCTA TCGCCGTCGC GCGCGTTTAA GTCTGAACTA CTTACCGAAA
ACACAGCAAC TTCAGATGGG GTTTCGCAAA GCGGGCTCCA GTGACATTGT CGACGTTAAA
CAATGTCCCA TTTTAGTGCC CCAACTTGAA GCATTGCTGC CCAAAGTCAG GGCATGCCTG
GGCAGCTTAC AAGCTATGCG CCATCTTGGT CATGTTGAAC TGGTACAGGC AACCAGCGGC
ACGCTGATGA TTTTGCGCCA TACCGCACCG CTAAGTTCGG CAGATCGCGA AAAACTGGAA
CGCTTTTCGC ATTCTGAAGG CCTGGATCTG TATCTCGCCC CCGATAGTGA GATACTCGAA
ACCGTCTCTG GTGAGATGCC CTGGTATGAC TCAAACGGGT TGCGCTTAAC TTTTAGCCCG
CGCGATTTTA TTCAGGTTAA TGCGGGTGTG AACCAAAAAA TGGTAGCGCG TGCGTTGGAA
TGGCTGGAGG TGGAACCTGA AGATCGCGTA CTGGATCTGT TCTGCGGTAT GGGCAACTTT
ACACTACCAT TGGCGACACA AGCTGCCAGT GTGGTGGGTG TAGAAGGTGT TCCGGCGCTG
GTGGAAAAAG GCCAGCAGAA TGCGCGTCTT AACTGCTTAC AGAATGTGAC GTTTTATCAC
GAAAATCTTG AAGAAGATGT CACAAAGCAG CCGTGGGCGA AAAACGGCTT CGATAAAGTG
TTGCTGGACC CGGCGCGAGC AGGTGCCGCA GGTGTTATGC AGCAAATTAT AAAACTGGAA
CCTATTCGTA TAGTTTATGT ATCCTGTAAC CCTGCAACGC TGGCTCGGGA TAGCGAAGCG
TTATTAAAAG CAGGATATAC CATTGCGCGA CTGGCGATGC TGGATATGTT CCCACACACG
GGACATCTGG AATCGATGGT ACTTTTCTCG CGCGTTAAAT AG
 
Protein sequence
MAQFYSAKRR TTTRQIITVS VNDLDSFGQG VARHNGKTLF IPGLLPQENA EVTVTEDKKQ 
YARAKVVRRL SDSPERETPR CPHFGVCGGC QQQHASVDLQ QRSKSAALAR LMKHEVSEVI
ADVPWGYRRR ARLSLNYLPK TQQLQMGFRK AGSSDIVDVK QCPILVPQLE ALLPKVRACL
GSLQAMRHLG HVELVQATSG TLMILRHTAP LSSADREKLE RFSHSEGLDL YLAPDSEILE
TVSGEMPWYD SNGLRLTFSP RDFIQVNAGV NQKMVARALE WLEVEPEDRV LDLFCGMGNF
TLPLATQAAS VVGVEGVPAL VEKGQQNARL NCLQNVTFYH ENLEEDVTKQ PWAKNGFDKV
LLDPARAGAA GVMQQIIKLE PIRIVYVSCN PATLARDSEA LLKAGYTIAR LAMLDMFPHT
GHLESMVLFS RVK
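
The protein sequence and GC content can be recomputed from the gene sequence as a check. The sketch below is one possible way to do this, not the method used by the source database. It assumes the sequence is pasted as plain text in 10-base blocks, as shown above, and it builds the codon map from the standard NCBI amino-acid ordering. Translation table 11 assigns the same amino acids as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGGCGCAAT TCTACTCTGC AAAACGACGC ACGACGACGC GTCAGATCAT AACCGTTTCA"
print(translate(first_line))  # MAQFYSAKRRTTTRQIITVS
```

The 20 residues printed match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.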