Gene EcSMS35_2667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2667 
SymbolispG 
ID6146301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2736647 
End bp2737765 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content54% 
IMG OID641617538 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_001744703 
Protein GI170682819 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.260982 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAACC AGGCTCCAAT TCAACGTAGA AAATCAACAC GTATTTACGT TGGGAATGTG 
CCGATTGGCG ATGGTGCTCC CATCGCCGTA CAGTCCATGA CCAATACGCG TACGACAGAC
GTCGAAGCAA CGGTCAATCA AATCAAGGCG CTGGAACGTG TTGGCGCTGA TATCGTCCGT
GTCTCCGTAC CGACGATGGA CGCGGCAGAA GCGTTCAAAC TCATCAAACA GCAGGTTAAC
GTGCCGCTGG TGGCTGACAT CCACTTCGAC TATCGCATTG CGCTGAAAGT AGCGGAATAC
GGCGTCGATT GTCTGCGTAT TAACCCTGGC AATATCGGTA ATGAAGAGCG TATTCGCATG
GTGGTTGACT GTGCGCGCGA TAAAAACATT CCGATCCGTA TTGGCGTTAA CGCCGGATCG
CTGGAAAAAG ATCTGCAAGA AAAGTATGGC GAACCGACGC CGCAGGCGTT GCTGGAATCT
GCCATGCGTC ATGTTGATCA TCTCGATCGC CTGAACTTCG AACAGTTCAA AGTCAGCGTG
AAAGCGTCAG ACGTCTTCCT CGCTGTTGAG TCTTATCGTT TGCTGGCAAA ACAGATCGAT
CAGCCGCTGC ATCTGGGGAT CACCGAAGCG GGTGGTGCGC GCAGCGGGGC GGTTAAATCC
GCCATTGGTT TAGGTCTGCT GCTGTCTGAA GGCATCGGCG ACACGCTGCG CGTATCGCTG
GCGGCCGATC CGGTCGAAGA GATCAAAGTC GGTTTCGATA TTTTGAAATC GCTGCGTATC
CGTTCGCGCG GGATCAACTT CATCGCCTGC CCGACCTGTT CGCGTCAGGA ATTTGATGTT
ATCGGTACGG TTAACGCGCT GGAGCAACGC CTGGAAGATA TCATCACTCC GATGGACGTT
TCGATTATCG GCTGCGTGGT GAATGGCCCA GGTGAGGCGC TGGTTTCTAC ACTCGGCGTC
ACCGGCGGCA ACAAGAAAAG CGGCCTCTAT GAAGATGGCG TGCGCAAAGA CCGTCTGGAC
AACAACGATA TGATCGATCA GCTGGAAGCG CGCATTCGTG CGAAAGCCAG TCAGCTGGAC
GAAGCGCGTC GAATTGACGT TCAGCAGGTT GAAAAATAA
 
Protein sequence
MHNQAPIQRR KSTRIYVGNV PIGDGAPIAV QSMTNTRTTD VEATVNQIKA LERVGADIVR 
VSVPTMDAAE AFKLIKQQVN VPLVADIHFD YRIALKVAEY GVDCLRINPG NIGNEERIRM
VVDCARDKNI PIRIGVNAGS LEKDLQEKYG EPTPQALLES AMRHVDHLDR LNFEQFKVSV
KASDVFLAVE SYRLLAKQID QPLHLGITEA GGARSGAVKS AIGLGLLLSE GIGDTLRVSL
AADPVEEIKV GFDILKSLRI RSRGINFIAC PTCSRQEFDV IGTVNALEQR LEDIITPMDV
SIIGCVVNGP GEALVSTLGV TGGNKKSGLY EDGVRKDRLD NNDMIDQLEA RIRAKASQLD
EARRIDVQQV EK