Gene EcSMS35_1352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1352 
SymbolyebU 
ID6147046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1338431 
End bp1339876 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content53% 
IMG OID641616230 
ProductrRNA (cytosine-C(5)-)-methyltransferase RsmF 
Protein accessionYP_001743410 
Protein GI170683942 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.312061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0948873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGTGG CCCAACACAC CGTTTATTTC CCGGACGCCT TTCTGACACA AATGCGCGAA 
GCTATGCCTT CGACGCTCTC TTTTGATGAT TTTCTTGCCG CCTGTCAGCG CCCGTTGCGC
CGCAGCATTC GCGTTAATAC GCTGAAAATC TCCGTTGCTG ATTTCCTGCA ATTAACCGCT
CCTTATGGCT GGACGCTTAC GCCAATTCCG TGGTGTGAAG AAGGTTTCTG GATTGAACGC
GACGATGAAG ATGCATTGCC ATTGGGTAGT ACCGCCGAGC ATTTAAGCGG CCTGTTTTAT
ATTCAGGAAG CCAGTTCAAT GTTGCCCGTT GCCGCCTTGT TTGCTGACGG TAATGCACCA
CAGCGGGTGA TGGATGTCGC TGCCGCGCCC GGCTCCAAAA CGACGCAAAT TGCCGCGCGG
ATGAATAACG AAGGGGCGAT CCTTGCCAAT GAGTTTTCCG CCAGTCGGGT AAAAGTGTTA
CATGCCAATA TCAGCCGCTG TGGCATCAGT AATGTTGCGC TCACACATTT TGATGGCCGC
GTGTTTGGTG CGGCAGTGCC AGAAATGTTC GATGCCATTT TGCTGGACGC TCCCTGCTCC
GGCGAAGGCG TGGTGCGTAA AGATCCCGAT GCGCTAAAAA ACTGGTCACC AGAAAGCAAT
CAGGAAATCG CAGCGACCCA ACGGGAACTG ATCGACAGCG CCTTTCATGC ATTACGCCCT
GGCGGTACGC TGGTTTACTC GACCTGTACC TTAAACAGGG AAGAAAACGA AGCCGTTTGC
CTGTGGCTGA AAGAGACTTA CCCCGACGCA GTAGAGTTTT TACCGCTTGG CGATCTCTTC
CCTGGTGCAA ATAAGGCGCT GACCGAAGAA GGCTTTTTGC ATGTTTTCCC ACAAATTTAC
GACTGCGAAG GCTTCTTCGT TGCTCGTCTG CGTAAAACTC AGGCGATCCC CGTCTTACCC
GCCCCAAAAT ACAAAGTGGG CAATTTCCCG TTTAGCCCGG TGAAAGATCG CGAAGCCGGT
CAAATTCGTC AGGCGGCTGC AGGTGTTGGC TTAAACTGGG ATGGAAACCT GCGACTCTGG
CAACGCGACA AAGAACTGTG GTTGTTCCCG GTAGGCATTG AAGCCCTGAT CGGTAAAGTC
CGATTTTCTC GGTTGGGGAT TAAACTTGCC GAAACGCATA ACAAAGGTTA TCGCTGGCAG
CATGAAGCGG TTATTGCCCT TGCCTCCCCC GACAATGTGA ACGCTTTTGA ACTGACACCG
CAGGAAGCGG AAGAGTGGTA TCGCGGGCGC GATGTTTACC CGCAAGCCGC GCCAGTAGCG
GATGATGTAT TGGTTACTTT CCAGCATCAG CCGATTGGTT TAGCCAAACG AATTGGTTCG
CGACTGAAAA ACAGCTACCC GCGTGAACTG GTGCGAGACG GGAAACTTTT TACCGGTAAC
GCCTGA
 
Protein sequence
MLVAQHTVYF PDAFLTQMRE AMPSTLSFDD FLAACQRPLR RSIRVNTLKI SVADFLQLTA 
PYGWTLTPIP WCEEGFWIER DDEDALPLGS TAEHLSGLFY IQEASSMLPV AALFADGNAP
QRVMDVAAAP GSKTTQIAAR MNNEGAILAN EFSASRVKVL HANISRCGIS NVALTHFDGR
VFGAAVPEMF DAILLDAPCS GEGVVRKDPD ALKNWSPESN QEIAATQREL IDSAFHALRP
GGTLVYSTCT LNREENEAVC LWLKETYPDA VEFLPLGDLF PGANKALTEE GFLHVFPQIY
DCEGFFVARL RKTQAIPVLP APKYKVGNFP FSPVKDREAG QIRQAAAGVG LNWDGNLRLW
QRDKELWLFP VGIEALIGKV RFSRLGIKLA ETHNKGYRWQ HEAVIALASP DNVNAFELTP
QEAEEWYRGR DVYPQAAPVA DDVLVTFQHQ PIGLAKRIGS RLKNSYPREL VRDGKLFTGN
A