Gene ECD_03850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03850 
SymboltrmA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4070471 
End bp4071571 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content51% 
IMG OID 
ProducttRNA (uracil-5-)-methyltransferase 
Protein accessionACT45643 
Protein GI253979973 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00010868 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCCG AACACCTTCC AACAGAACAG TATGAAGCGC AGTTAGCCGA AAAAGTGGTA 
CGTTTGCAAA GTATGATGGC ACCGTTTTCT GACCTGGTTC CGGAAGTGTT TCGCTCGCCG
GTCAGTCATT ACCGGATGCG CGCGGAGTTC CGCATCTGGC ACGATGGCGA TGACCTGTAT
CACATCATTT TCGATCAACA AACCAAAAGC CGCATCCGCG TGGATAGCTT CCCCGCCGCC
AGTGAACTTA TCAACCAGTT GATGACGGCG ATGATTGCGG GTGTGCGTAA TAATCCCGTT
CTGCGCCACA AGTTGTTCCA GATTGATTAC CTCACTACGC TGAGTAATCA GGCGGTGGTT
TCCCTGCTAT ACCATAAGAA GCTGGATGAT GAGTGGCGTC AGGAAGCGGA GGCCCTGCGC
GATGCACTGC GCGCGCAGAA TCTGAATGTG CATCTGATTG GTCGGGCAAC GAAAACCAAA
ATCGAGCTGG ATCAGGATTA CATCGATGAA CGTCTGCCGG TCGCAGGGAA AGAGATGATC
TACCGTCAGG TAGAAAACAG CTTTACCCAG CCGAACGCGG CGATGAATAT TCAGATGCTG
GAATGGGCGC TGGACGTAAC CAAAGGCTCA AAAGGCGATT TACTGGAGCT GTACTGCGGC
AACGGTAACT TTTCATTAGC GCTGGCGCGT AATTTTGATC GGGTATTAGC CACCGAAATC
GCTAAGCCGT CGGTTGCTGC TGCGCAATAC AACATCGCAG CTAACCATAT TGATAACGTA
CAAATTATTC GTATGGCGGC AGAAGAATTT ACTCAGGCGA TGAATGGTGT GCGCGAGTTT
AACCGCCTGC AAGGGATCGA CTTAAAGAGT TATCAGTGCG AAACCATTTT TGTCGACCCA
CCGCGCAGCG GTCTGGACAG TGAAACCGAG AAAATGGTGC AGGCGTATCC GCGTATTTTG
TACATCTCCT GTAACCCGGA AACGTTATGC AAGAATCTGG AAACATTAAG CCAGACGCAC
AAGGTCGAAC GTCTGGCTCT GTTTGATCAG TTCCCCTACA CGCACCATAT GGAGTGCGGC
GTATTACTGA CCGCGAAGTA A
 
Protein sequence
MTPEHLPTEQ YEAQLAEKVV RLQSMMAPFS DLVPEVFRSP VSHYRMRAEF RIWHDGDDLY 
HIIFDQQTKS RIRVDSFPAA SELINQLMTA MIAGVRNNPV LRHKLFQIDY LTTLSNQAVV
SLLYHKKLDD EWRQEAEALR DALRAQNLNV HLIGRATKTK IELDQDYIDE RLPVAGKEMI
YRQVENSFTQ PNAAMNIQML EWALDVTKGS KGDLLELYCG NGNFSLALAR NFDRVLATEI
AKPSVAAAQY NIAANHIDNV QIIRMAAEEF TQAMNGVREF NRLQGIDLKS YQCETIFVDP
PRSGLDSETE KMVQAYPRIL YISCNPETLC KNLETLSQTH KVERLALFDQ FPYTHHMECG
VLLTAK