Gene EcDH1_4021 details

Gene Information

Locus tag: EcDH1_4021
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4351200
End bp: 4352300
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: tRNA (uracil-5-)-methyltransferase
Protein accession: ACX41621
Protein GI: 260451199
COG category:
COG ID:
TIGRFAM ID:
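
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(4351200, 4352300)
print(length)                     # 1101
print(protein_length_aa(length))  # 366
```

Under these assumptions the computed values agree with the listed gene length (1101 bp) and protein length (366 aa).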


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.989315
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCCG AACACCTTCC AACAGAACAG TATGAAGCGC AGTTAGCCGA AAAAGTGGTA 
CGTTTGCAAA GTATGATGGC ACCGTTTTCT GACCTGGTTC CGGAAGTGTT TCGCTCGCCG
GTCAGTCATT ACCGGATGCG CGCGGAGTTC CGCATCTGGC ACGATGGCGA TGACCTGTAT
CACATCATTT TCGATCAACA AACCAAAAGC CGCATCCGCG TGGATAGCTT CCCCGCCGCC
AGTGAACTTA TCAACCAGTT GATGACGGCG ATGATTGCGG GTGTGCGTAA TAATCCCGTT
CTGCGCCACA AGTTGTTCCA GATTGATTAC CTCACTACAC TGAGTAATCA GGCGGTGGTT
TCCCTGCTAT ACCATAAGAA GCTGGATGAT GAGTGGCGTC AGGAAGCGGA GGCCCTGCGC
GATGCACTGC GCGCGCAGAA TCTGAATGTG CATCTGATTG GTCGGGCAAC GAAAACCAAA
ATCGAGCTGG ATCAGGATTA CATCGATGAA CGTCTGCCGG TCGCAGGGAA AGAGATGATC
TACCGTCAGG TAGAAAACAG CTTTACCCAG CCGAACGCGG CGATGAATAT TCAGATGCTG
GAATGGGCGC TGGACGTAAC CAAAGGCTCA AAAGGCGATT TACTGGAGCT GTACTGCGGC
AACGGTAACT TTTCATTAGC GCTGGCGCGT AATTTTGATC GGGTATTAGC CACCGAAATC
GCTAAGCCGT CGGTTGCTGC TGCGCAATAC AACATCGCAG CTAACCATAT TGATAACGTA
CAAATTATTC GTATGGCGGC AGAAGAATTT ACTCAGGCGA TGAATGGTGT GCGCGAGTTT
AACCGCCTGC AAGGGATCGA CTTAAAGAGT TATCAGTGCG AAACCATTTT TGTCGACCCT
CCGCGCAGCG GTCTGGACAG TGAAACCGAG AAAATGGTGC AGGCGTATCC GCGTATTTTG
TACATCTCCT GTAACCCGGA AACGTTATGC AAGAATCTGG AAACATTAAG CCAGACGCAC
AAGGTCGAAC GTCTGGCTCT GTTTGATCAG TTCCCCTACA CGCACCATAT GGAGTGCGGC
GTATTACTGA CCGCGAAGTA A
 
Protein sequence
MTPEHLPTEQ YEAQLAEKVV RLQSMMAPFS DLVPEVFRSP VSHYRMRAEF RIWHDGDDLY 
HIIFDQQTKS RIRVDSFPAA SELINQLMTA MIAGVRNNPV LRHKLFQIDY LTTLSNQAVV
SLLYHKKLDD EWRQEAEALR DALRAQNLNV HLIGRATKTK IELDQDYIDE RLPVAGKEMI
YRQVENSFTQ PNAAMNIQML EWALDVTKGS KGDLLELYCG NGNFSLALAR NFDRVLATEI
AKPSVAAAQY NIAANHIDNV QIIRMAAEEF TQAMNGVREF NRLQGIDLKS YQCETIFVDP
PRSGLDSETE KMVQAYPRIL YISCNPETLC KNLETLSQTH KVERLALFDQ FPYTHHMECG
VLLTAK
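
The gene and protein sequences above are related by translation under translation table 11. The following is a minimal sketch of that relationship, not the database's own method. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). The `translate` helper and its stop-at-first-stop-codon behaviour are illustrative choices.

```python
from itertools import product

# Codon table built in T, C, A, G order (third base varies fastest).
# Amino-acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence listed above.
print(translate("ATGACCCCCG AACACCTTCC AACAGAACAG"))  # MTPEHLPTEQ
print(translate("GTATTACTGA CCGCGAAGTA A"))           # VLLTAK
```

The two printed fragments match the first ten and last six residues of the protein sequence shown above.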