Gene EcHS_A1926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1926 
SymbolyebU 
ID5592630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1939194 
End bp1940639 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content52% 
IMG OID640921069 
ProductrRNA (cytosine-C(5)-)-methyltransferase RsmF 
Protein accessionYP_001458620 
Protein GI157161302 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value0.79578 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGTGG CCCAACACAC CGTTTATTTC CCGGACGCCT TTCTGACGCA AATGCGCGAA 
GCGATGCCTT CGACGCTCTC TTTTGATGAT TTTCTTGCCG CCTGTCAGCG CCCGTTGCGC
CGCAGCATTC GCGTTAATAC GCTGAAAATC TCCGTTGCTG ATTTCCTGCA ATTAACCGCT
CCTTATGGCT GGACGCTTAC GCCAATTCCG TGGTGTGAAG AAGGTTTCTG GATTGAACGC
GACAATGAAG ATGCATTGCC ATTGGGTAGT ACCGCCGAGC ATTTAAGCGG CCTGTTTTAT
ATTCAGGAAG CCAGTTCAAT GTTGCCCGTC GCCGCCTTGT TTGCTGACGG TAATGCACCA
CAGCGGGTGA TGGATGTCGC TGCTGCGCCA GGCTCCAAAA CGACGCAAAT TGCCGCACGG
ATGAATAACG AAGGGGCAAT CCTTGCCAAT GAGTTTTCCG CCAGTCGGGT AAAAGTGTTA
CATGCCAATA TCAGCCGCTG TGGCATCAGT AATGTTGCGC TCACACATTT TGATGGCCGC
GTGTTTGGTG TGGCAGTGCC AGAAATGTTC GATGCCATTT TGCTGGACGC TCCCTGCTCT
GGCGAAGGCG TGGTGCGTAA AGATCCCGAT GCGCTAAAAA ACTGGTCACC AGAAAGCAAT
CAGGAAATCG CAGCTACACA ACGGGAGCTT ATCGACAGCG CCTTTCATGC ATTACGTCCT
GGTGGTACGC TGGTTTACTC GACCTGTACC TTAAACAGGG AAGAAAACGA AGCCGTTTGC
ATGTGGCTGA AAGAGACTTA CCCTGACGCA GTAGAGTTTT TACCACTTGG CGAGCTCTTC
CCTGCTGCAA ACAAAGCGCT GACCGAAGAA GGCTTTTTGC ATGTTTTCCC ACAAATTTAC
GACTGCGAAG GCTTCTTCGT TGCTCGTCTG CGTAAAACTC AGGCCATTCC CGCCTTACCC
GCCCCCAAAT ACAAAGTCGG TAATTTTCCG TTCAGCCCGG TGAAAGATCG CGAAGCTGGA
CAAATTCGTC AGGCGGCTGC AGGTGTTGGC TTAAACTGGG ATGAAAACCT GCGCCTCTGG
CAGCGTGACA AAGAACTGTG GTTGTTCCCG GTGGGCATTG AAGCCCTGAT CGGTAAAGTC
CGATTTTCTC GCTTGGGGAT TAAACTTGCC GAAACGCACA ACAAAGGTTA TCGCTGGCAG
CATGAAGCAG TTATTGCCCT TGCCACCCCC GACAATGTGA ACGCTTTTGA ACTGACACCG
CAGGAAGCGG AGGAGTGGTA TCGCGGGCGC GATGTTTACC CGCAAGCCGC GCCAGTGGCG
GATGACGTGT TGGTTACTTT CCAGCATCAG CCGATTGGTT TAGCCAAACG GATTGGTTCG
CGACTGAAAA ACAGCTACCC GCGTGAACTG GTGCGGGACG GGAAACTTTT TACCAGTAAC
GCATGA
 
Protein sequence
MLVAQHTVYF PDAFLTQMRE AMPSTLSFDD FLAACQRPLR RSIRVNTLKI SVADFLQLTA 
PYGWTLTPIP WCEEGFWIER DNEDALPLGS TAEHLSGLFY IQEASSMLPV AALFADGNAP
QRVMDVAAAP GSKTTQIAAR MNNEGAILAN EFSASRVKVL HANISRCGIS NVALTHFDGR
VFGVAVPEMF DAILLDAPCS GEGVVRKDPD ALKNWSPESN QEIAATQREL IDSAFHALRP
GGTLVYSTCT LNREENEAVC MWLKETYPDA VEFLPLGELF PAANKALTEE GFLHVFPQIY
DCEGFFVARL RKTQAIPALP APKYKVGNFP FSPVKDREAG QIRQAAAGVG LNWDENLRLW
QRDKELWLFP VGIEALIGKV RFSRLGIKLA ETHNKGYRWQ HEAVIALATP DNVNAFELTP
QEAEEWYRGR DVYPQAAPVA DDVLVTFQHQ PIGLAKRIGS RLKNSYPREL VRDGKLFTSN
A