Gene EcSMS35_4918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4918 
SymbolrsmC 
ID6143662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5035035 
End bp5036066 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content55% 
IMG OID641619721 
Product16S ribosomal RNA m2G1207 methyltransferase 
Protein accessionYP_001746825 
Protein GI170681708 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2813] 16S RNA G1207 methylase RsmC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00252655 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCAT TTACCCCGGC AAGTGAAGTC TTGCTGCGTC ACAGTGATGA TTTCGAACAA 
AGCCGTATTC TGTTTGCCGG AGACTTACAG GATGACCTGC CCGCGCGTTT AGATACCGCG
GCCAGCCGTG CTCATACCCA GCAATTCCAC CACTGGCAGG TGTTAAGCCG CCAGATGGGG
GATAACGCCC GTTTTAGTCT GGTCGCCACG GCGAATGATG TCGCAGATTG CGATACGCTG
ATTTACTACT GGCCGAAGAA CAAACCGGAA GCCCAGTTCC AGTTGATGAA TTTACTTTCT
CTGCTGCCAG TGGGCACGGA TATTTTTGTC GTTGGCGAGA ACCGCAGCGG CGTGCGCAGC
GCCGAGCAGA TGCTTGCAGA TTATGCACCG TTGAATAAGG TGGACAGCGC CCGTCGCTGT
GGCCTCTATT TTGGTCGTCT GGAAAAACAG CCGGTATTTG ATGCCAATAA ATTCTGGGGC
GAATATAACG TCGATGGCCT GACGGTCAAA ACGCTGCCTG GCGTGTTTAG CCGCGACGGT
CTGGATGTCG GTAGCCAGTT GCTGCTCTCG ACGTTAACCC CGCACACGAA AGGTAAAGTG
CTGGATGTCG GCTGTGGCGC GGGCGTACTT TCGGTTGCCT TTGCGCGCCA CTCACCGAAG
ATTCGTCTCA CGTTGTGCGA TGTTTCTGCG CCAGCAGTTG AAGCCAGCCG TGCAACACTT
GCGGCCAACG GTATTGAAGG TGAAGTCTTT GCCAGCAACG TCTTTTCTGA GGTGAAAGGT
CGTTTTGATA TGATCATCTC CAACCCGCCG TTCCACGATG GGATGCAAAC CAGCCTCGAT
GCGGCGCAAA CGCTGATTCG CGGCGCGGTG CGTCATCTTA ATAGCGGCGG CGAGCTGCGA
ATTGTAGCGA ACGCCTTCCT GCCTTACCCG GACGTGCTGG ATGAGACATT TGGCTTCCAC
GAAGTGATCG CGCAAACCGG GCGCTTCAAG GTGTATCGCG CCATTATGAC CCGCCAGGCG
AAGAAAGGTT GA
 
Protein sequence
MSAFTPASEV LLRHSDDFEQ SRILFAGDLQ DDLPARLDTA ASRAHTQQFH HWQVLSRQMG 
DNARFSLVAT ANDVADCDTL IYYWPKNKPE AQFQLMNLLS LLPVGTDIFV VGENRSGVRS
AEQMLADYAP LNKVDSARRC GLYFGRLEKQ PVFDANKFWG EYNVDGLTVK TLPGVFSRDG
LDVGSQLLLS TLTPHTKGKV LDVGCGAGVL SVAFARHSPK IRLTLCDVSA PAVEASRATL
AANGIEGEVF ASNVFSEVKG RFDMIISNPP FHDGMQTSLD AAQTLIRGAV RHLNSGGELR
IVANAFLPYP DVLDETFGFH EVIAQTGRFK VYRAIMTRQA KKG