Gene EcE24377A_2064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2064 
SymbolyebU 
ID5590679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2047814 
End bp2049259 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content52% 
IMG OID640925734 
ProductrRNA (cytosine-C(5)-)-methyltransferase RsmF 
Protein accessionYP_001463137 
Protein GI157157684 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.2344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGTGG CCCAACACAC CGTTTATTTC CCGGACGCCT TTCTGACACA AATGCGCGAA 
GCGATGCCTT CGACGCTCTC ATTTGATGAT TTTCTTGCCG CCTGTCAGCG CCCGTTGCGC
CGCAGCATTC GCGTTAATAC GCTGAAAATC TCCGTTGCTG ATTTCCTGCA ATTAACCGCT
CCTTATGGCT GGACGCTTAC GCCAATTCCG TGGTGTGAAG AAGGTTTCTG GATTGAACGC
GACAATGAAG ATGCATTGCC ATTGGGTAGT ACCGCCGAGC ATTTAAGCGG CCTGTTTTAT
ATTCAGGAAG CCAGTTCAAT GTTGCCCGTT GCCGCCTTGT TTGCTGACGA TAATGCACCA
CAGCGGGTGA TGGATGTCGC AGCTGCGCCA GGCTCCAAAA CGACGCAAAT TGCCGCGCGG
ATGAATAACG AAGGGGCAAT CCTTGCCAAT GAGTTTTCCG CCAGTCGGGT AAAAGTGTTA
CATGCCAATA TCAGCCGCTG TGGCATCAGT AATGTTGCGC TCACACATTT TGATGGCCGC
GTGTTTGGTG CGGCAGTGCC AGAAATGTTC GATGCCATTT TGCTGGACGC TCCCTGCTCT
GGCGAAGGCG TGGTGCGTAA AGATCCCGAT GCGCTAAAAA ACTGGTCACC AGAAAGCAAT
CAGGAAATCG CAGCTACACA ACGGGAGCTT ATCGACAGCG CCTTTCATGC ATTACGTCCT
GGTGGTACGC TGGTTTACTC GACCTGTACC TTAAACCAGG AAGAAAACGA AGCCGTTTGC
CTGTGGCTGA AAGAGACTTA CCCCGACGCA GTAGAGTTTT TACCACTTGG CGATCTCTTC
CCTGGTGCAA ACAAAGCGCT GACCGAAGAA GGCTTTTTGC ATGTTTTCCC ACAAATTTAC
GACTGCGAAG GCTTCTTCGT TGCTCGTCTG CGTAAAACTC AGGCGATCCC CGCCTTACCC
GCCCCCAAAT ACAAAGTCGG TAATTTCCCG TTCAGCCCGG TGAAAGATCG CGAAGCCGGA
CAAATTCGTC AGGCGGCTGC AAGTGTTGGC TTAAACTGGG ATGGAAACCT GCGACTCTGG
CAACGCGACA AAGAACTGTG GTTGTTCCCG GTGGGCATTG AAGCCCTGAT CGGTAAAGTC
CGATTTTCTC GCTTGGGGAT TAAACTTGCC GAAACGCACA ACAAAGGTTA TCGCTGGCAG
CATGAAGCAG TTATTGCCCT TGCCACCCCC GACAATGTGA ACGCTTTTGA ACTGACACCG
CAGGAAGCGG AGGAGTGGTA TCGCGGGCGC GATGTTTACC CGCAAGCCGC GCCAGTGGCG
GATGACGTGT TGGTTACTTT CCAGCATCAA CCGATTGGTT TAGCCAAACG GATTGGTTCG
CGATTGAAAA ACAGCTATCC GCGTGAACTG GTGCGCGATG GGAAACTTTT TACCGGTAAC
GCCTGA
 
Protein sequence
MLVAQHTVYF PDAFLTQMRE AMPSTLSFDD FLAACQRPLR RSIRVNTLKI SVADFLQLTA 
PYGWTLTPIP WCEEGFWIER DNEDALPLGS TAEHLSGLFY IQEASSMLPV AALFADDNAP
QRVMDVAAAP GSKTTQIAAR MNNEGAILAN EFSASRVKVL HANISRCGIS NVALTHFDGR
VFGAAVPEMF DAILLDAPCS GEGVVRKDPD ALKNWSPESN QEIAATQREL IDSAFHALRP
GGTLVYSTCT LNQEENEAVC LWLKETYPDA VEFLPLGDLF PGANKALTEE GFLHVFPQIY
DCEGFFVARL RKTQAIPALP APKYKVGNFP FSPVKDREAG QIRQAAASVG LNWDGNLRLW
QRDKELWLFP VGIEALIGKV RFSRLGIKLA ETHNKGYRWQ HEAVIALATP DNVNAFELTP
QEAEEWYRGR DVYPQAAPVA DDVLVTFQHQ PIGLAKRIGS RLKNSYPREL VRDGKLFTGN
A