Gene EcolC_1797 details


Gene Information

Locus tag: EcolC_1797
Symbol: yebU
ID: 6066012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1994252
End bp: 1995697
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 52%
IMG OID: 641601212
Product: rRNA (cytosine-C(5)-)-methyltransferase RsmF
Protein accession: YP_001724774
Protein GI: 170019820
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
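
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1994252, 1995697)
print(length_bp)                  # 1446
print(protein_length(length_bp))  # 481
```

Both values agree with the Gene Length and Protein Length fields of this record.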


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.396046
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0585336
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGTGG CCCAACACAC CGTTTATTTC CCGGACGCCT TTCTGACGCA AATGCGCGAA 
GCGATGCCTT CGACGCTCTC TTTTGATGAT TTTCTTGCCG CCTGTCAGCG CCCGTTGCGC
CGCAGCATTC GCGTTAATAC GCTGAAAATC TCCGTTGCTG ATTTCCTGCA ATTAACCGCT
CCTTATGGCT GGACGCTTAC GCCAATTCCG TGGTGTGAAG AAGGTTTCTG GATTGAACGC
GACAATGAAG ATGCATTGCC ATTGGGTAGT ACCGCCGAGC ATTTAAGCGG CCTGTTTTAT
ATTCAGGAAG CCAGTTCAAT GTTGCCCGTC GCCGCCTTGT TTGCTGACGG TAATGCACCA
CAGCGGGTGA TGGATGTCGC TGCTGCGCCA GGCTCCAAAA CGACGCAAAT TGCCGCACGG
ATGAATAACG AAGGGGCAAT CCTTGCCAAT GAGTTTTCCG CCAGTCGGGT AAAAGTGTTA
CATGCCAATA TCAGCCGCTG TGGCATCAGT AATGTTGCGC TCACACATTT TGATGGCCGC
GTGTTTGGTG TGGCAGTGCC AGAAATGTTC GATGCCATTT TGCTGGACGC TCCCTGCTCT
GGCGAAGGCG TGGTGCGTAA AGATCCCGAT GCGCTAAAAA ACTGGTCACC AGAAAGCAAT
CAGGAAATCG CAGCTACACA ACGGGAGCTT ATCGACAGCG CCTTTCATGC ATTACGTCCT
GGTGGTACGC TGGTTTACTC GACCTGTACC TTAAACAGGG AAGAAAACGA AGCCGTTTGC
ATGTGGCTGA AAGAGACTTA CCCTGACGCA GTAGAGTTTT TACCACTTGG CGAGCTCTTC
CCTGCTGCAA ACAAAGCGCT GACCGAAGAA GGCTTTTTGC ATGTTTTCCC ACAAATTTAC
GACTGCGAAG GCTTCTTCGT TGCTCGTCTG CGTAAAACTC AGGCCATTCC CGCCTTACCC
GCCCCCAAAT ACAAAGTCGG TAATTTTCCG TTCAGCCCGG TGAAAGATCG CGAAGCTGGA
CAAATTCGTC AGGCGGCTGC AGGTGTTGGC TTAAACTGGG ATGAAAACCT GCGCCTCTGG
CAGCGTGACA AAGAACTGTG GTTGTTCCCG GTGGGCATTG AAGCCCTGAT CGGTAAAGTC
CGATTTTCTC GCTTGGGGAT TAAACTTGCC GAAACGCACA ACAAAGGTTA TCGCTGGCAG
CATGAAGCAG TTATTGCCCT TGCCACCCCC GACAATGTGA ACGCTTTTGA ACTGACACCG
CAGGAAGCGG AGGAGTGGTA TCGCGGGCGC GATGTTTACC CGCAAGCCGC GCCAGTGGCG
GATGACGTGT TGGTTACTTT CCAGCATCAG CCGATTGGTT TAGCCAAACG GATTGGTTCG
CGACTGAAAA ACAGCTACCC GCGTGAACTG GTGCGGGACG GGAAACTTTT TACCAGTAAC
GCATGA
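
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be stripped before any computation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be recomputed from such text; `gc_content` is a hypothetical helper, and the rounding used by the database is not stated in this record.

```python
def gc_content(seq_text: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full block for the whole gene.
first_line = "ATGCTCGTGG CCCAACACAC CGTTTATTTC CCGGACGCCT TTCTGACGCA AATGCGCGAA"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```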
 
Protein sequence
MLVAQHTVYF PDAFLTQMRE AMPSTLSFDD FLAACQRPLR RSIRVNTLKI SVADFLQLTA 
PYGWTLTPIP WCEEGFWIER DNEDALPLGS TAEHLSGLFY IQEASSMLPV AALFADGNAP
QRVMDVAAAP GSKTTQIAAR MNNEGAILAN EFSASRVKVL HANISRCGIS NVALTHFDGR
VFGVAVPEMF DAILLDAPCS GEGVVRKDPD ALKNWSPESN QEIAATQREL IDSAFHALRP
GGTLVYSTCT LNREENEAVC MWLKETYPDA VEFLPLGELF PAANKALTEE GFLHVFPQIY
DCEGFFVARL RKTQAIPALP APKYKVGNFP FSPVKDREAG QIRQAAAGVG LNWDENLRLW
QRDKELWLFP VGIEALIGKV RFSRLGIKLA ETHNKGYRWQ HEAVIALATP DNVNAFELTP
QEAEEWYRGR DVYPQAAPVA DDVLVTFQHQ PIGLAKRIGS RLKNSYPREL VRDGKLFTSN
A
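
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; `CODON_TABLE` and `translate` are hypothetical names, and alternative start codons are not handled.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq_text: str) -> str:
    """Translate a DNA sequence, ignoring the spaces used for block layout."""
    seq = "".join(seq_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence above.
print(translate("ATGCTCGTGG CCCAACACAC CGTTTATTTC CCGGACGCCT TTCTGACGCA AATGCGCGAA"))
# MLVAQHTVYFPDAFLTQMRE
print(translate("GCATGA"))  # A*
```

The first 20 residues and the final alanine followed by a stop codon agree with the protein sequence listed in this record.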