Gene EcolC_4051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4051 
Symbol 
ID6065170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4468804 
End bp4469904 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content51% 
IMG OID641603474 
ProducttRNA (uracil-5-)-methyltransferase 
Protein accessionYP_001726977 
Protein GI170022023 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR02143] tRNA (uracil-5-)-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.378635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00385339 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCCCG AACACCTTCC AACAGAACAG TATGAAGCGC AGTTAGCCGA AAAAGTGGTA 
CGTTTGCAAA GTATGATGGC ACCGTTTTCT GACCTGGTTC CGGAAGTGTT TCGCTCGCCG
GTCAGTCATT ACCGGATGCG CGCGGAGTTC CGCATCTGGC ACGATGGCGA TGACCTGTAT
CACATCATTT TCGATCAACA AACCAAAAGC CGCATCCGCG TGGATAGCTT CCCCGCCGCC
AGTGAACTTA TCAACCAGTT GATGACGGCG ATGATTGCGG GTGTGCGTAA TAATCCCGTT
CTGCGCCACA AGTTGTTCCA GATTGATTAC CTCACTACGC TGAGTAATCA GGCGGTGGTT
TCCCTGCTAT ACCATAAGAA GCTGGATGAT GAGTGGCGTC AGGAAGCGGA GGCCCTGCGC
GATGCACTGC GCGCGCAGAA TCTGAATGTG CATCTGATTG GTCGGGCAAC GAAAACCAAA
ATCGAGCTGG ATCAGGATTA CATCGATGAA CGTCTGCCGG TCGCAGGGAA AGAGATGATC
TACCGTCAGG TAGAAAACAG CTTTACCCAG CCGAACGCGG CGATGAATAT TCAGATGCTG
GAATGGGCGC TGGACGTAAC CAAAGGCTCA AAAGGCGATT TACTGGAGCT GTACTGCGGC
AACGGTAACT TTTCATTAGC GCTGGCGCGC AATTTTGATC GGGTATTAGC CACCGAAATC
GCTAAGCCGT CGGTTGCTGC TGCGCAATAC AACATCGCAG CTAACCATAT TGATAACGTA
CAAATTATTC GTATGGCGGC AGAAGAATTT ACTCAGGCGA TGAATGGTGT GCGCGAGTTT
AACCGCCTGC AAGGGATCGA CTTAAAGAGT TATCAGTGCG AAACCATTTT TGTCGACCCA
CCGCGCAGCG GTCTGGACAG TGAAACCGAG AAAATGGTGC AGGCGTATCC GCGTATTTTG
TACATCTCCT GTAACCCGGA AACGTTATGT AAGAATCTGG AAACATTAAG CCAGACGCAC
AAGGTCGAAC GTCTGGCTCT GTTTGATCAG TTCCCCTACA CGCACCATAT GGAGTGCGGC
GTATTACTGA CCGCGAAGTA A
 
Protein sequence
MTPEHLPTEQ YEAQLAEKVV RLQSMMAPFS DLVPEVFRSP VSHYRMRAEF RIWHDGDDLY 
HIIFDQQTKS RIRVDSFPAA SELINQLMTA MIAGVRNNPV LRHKLFQIDY LTTLSNQAVV
SLLYHKKLDD EWRQEAEALR DALRAQNLNV HLIGRATKTK IELDQDYIDE RLPVAGKEMI
YRQVENSFTQ PNAAMNIQML EWALDVTKGS KGDLLELYCG NGNFSLALAR NFDRVLATEI
AKPSVAAAQY NIAANHIDNV QIIRMAAEEF TQAMNGVREF NRLQGIDLKS YQCETIFVDP
PRSGLDSETE KMVQAYPRIL YISCNPETLC KNLETLSQTH KVERLALFDQ FPYTHHMECG
VLLTAK