Gene EcolC_3686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3686 
SymbolrsmC 
ID6067053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4035275 
End bp4036306 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content56% 
IMG OID641603101 
Product16S ribosomal RNA m2G1207 methyltransferase 
Protein accessionYP_001726624 
Protein GI170021670 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2813] 16S RNA G1207 methylase RsmC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.217552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00016948 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGCAT TTACCCCGGC AAGTGAAGTC TTGCTGCGTC ACAGTGATGA TTTCGAACAA 
AGCCGTATTC TGTTTGCCGG AGACTTACAG GATGACCTGC CCGCGCGTTT AGATACCGCG
GCCAGCCGTG CTCATACCCA GCAATTCCAC CACTGGCAGG TATTAAGCCG CCAGATGGGG
GATAACGCCC GTTTCAGTCT GGTCGCCACG GCGGATGACG TCGCAGATTG CGATACGCTG
ATTTACTACT GGCCGAAGAA CAAACCGGAA GCCCAGTTCC AGTTGATGAA TTTACTTTCT
CTGCTGCCAG TGGGGACAGA TATTTTTGTC GTTGGCGAGA ACCGCAGCGG CGTGCGCAGC
GCCGAGCAGA TGCTGGCAGA TTATGCGCCG TTGAATAAAG TCGACAGCGC TCGTCGCTGT
GGCCTCTATT TTGGTCGTCT GGAAAAACAG CCGGTATTTG ATGCCGATAA ATTCTGGGGC
GAATACAGCG TCGATGGCCT GACGGTCAAA ACGCTGCCTG GCGTGTTTAG CCGCGACGGT
CTGGATGTCG GTAGCCAGTT GCTGCTCTCG ACGTTAACCC CGCACACGAA AGGTAAAGTG
CTGGATGTCG GCTGTGGCGC GGGTGTGCTT TCAGTTGCCT TTGCGCGCCA TTCGCCGAAA
ATTCGTCTCA CCTTGTGCGA TGTCTCTGCG CCAGCGGTAG AAGCCAGCCG CGCAACACTT
GCGGCCAACG GTGTTGAAGG TGAAGTCTTT GCCAGCAACG TCTTTTCCGA GGTGAAAGGT
CGTTTTGATA TGATCATCTC CAACCCGCCG TTCCACGATG GGATGCAAAC CAGCCTGGAT
GCGGCGCAAA CGCTGATTCG CGGCGCGGTG CGTCATCTTA ATAGCGGCGG CGAGCTGCGA
ATTGTAGCGA ACGCCTTCCT GCCTTACCCG GACGTGCTGG ATGAGACATT TGGCTTCCAT
GAAGTGATTG CGCAAACCGG GCGTTTCAAG GTGTATCGCG CCATTATGAC CCGCCAGGCG
AAGAAAGGTT AA
 
Protein sequence
MSAFTPASEV LLRHSDDFEQ SRILFAGDLQ DDLPARLDTA ASRAHTQQFH HWQVLSRQMG 
DNARFSLVAT ADDVADCDTL IYYWPKNKPE AQFQLMNLLS LLPVGTDIFV VGENRSGVRS
AEQMLADYAP LNKVDSARRC GLYFGRLEKQ PVFDADKFWG EYSVDGLTVK TLPGVFSRDG
LDVGSQLLLS TLTPHTKGKV LDVGCGAGVL SVAFARHSPK IRLTLCDVSA PAVEASRATL
AANGVEGEVF ASNVFSEVKG RFDMIISNPP FHDGMQTSLD AAQTLIRGAV RHLNSGGELR
IVANAFLPYP DVLDETFGFH EVIAQTGRFK VYRAIMTRQA KKG