Gene EcHS_A2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2666 
SymbolispG 
ID5591669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2678325 
End bp2679443 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content54% 
IMG OID640921782 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_001459308 
Protein GI157161990 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value0.253817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAACC AGGCTCCAAT TCAACGTAGA AAATCAACAC GTATTTACGT TGGGAATGTG 
CCGATTGGCG ATGGTGCTCC CATCGCCGTA CAGTCCATGA CCAATACGCG TACGACAGAT
GTCGAAGCAA CGGTCAATCA AATCAAGGCG CTGGAACGCG TTGGCGCTGA TATCGTCCGT
GTCTCCGTAC CGACGATGGA CGCGGCAGAA GCGTTCAAAC TCATCAAACA GCGGGTTAAC
GTGCCGCTGG TGGCTGACAT CCACTTCGAC TATCGCATTG CGCTGAAAGT AGCGGAATAC
GGCGTCGATT GTCTGCGTAT TAACCCTGGC AATATCGGTA ATGAAGAGCG TATTCGCATG
GTAGTTGACT GTGCGCGCGA TAAAAACATT CCGATCCGCA TTGGCGTTAA CGCCGGATCG
CTGGAAAAAG ATCTGCAAGA AAAGTATGGC GAACCGACGC CGCAGGCGTT GCTGGAATCC
GCCATGCGCC ATGTTGATCA TCTCGATCGC CTGAACTTCG ATCAGTTCAA AGTCAGCGTG
AAAGCGTCTG ACGTCTTCCT CGCTGTTGAG TCTTATCGTT TGCTGGCAAA ACAAATCGAT
CAGCCGCTGC ATCTGGGGAT CACCGAAGCG GGTGGCGCGC GCAGCGGGGC AGTAAAATCC
GCCATTGGTT TAGGTCTGCT GCTGTCTGAA GGCATCGGCG ACACGCTGCG CGTATCACTG
GCGGCCGATC CGGTCGAAGA GATCAAAGTC GGTTTCGATA TTTTGAAATC GCTGCGTATC
CGTTCGCGAG GTATCAACTT CATCGCCTGT CCGACCTGTT CGCGTCAGGA GTTTGATGTT
ATCGGTACGG TTAACGCGCT GGAGCAACGC CTGGAAGATA TCATCACTCC GATGGACGTT
TCGATTATCG GCTGCGTGGT GAATGGCCCA GGTGAGGCGC TGGTTTCTAC ACTCGGCGTC
ACCGGCGGCA ACAAGAAAAG CGGCCTCTAT GAAGATGGCG TGCGCAAAGA CCGTCTGGAC
AACAACGATA TGATCGACCA GCTGGAAGCA CGCATTCGTG CGAAAGCCAG TCAGCTGGAC
GAAGCGCGTC GAATTGACGT TCAGCAGGTC GAAAAATAA
 
Protein sequence
MHNQAPIQRR KSTRIYVGNV PIGDGAPIAV QSMTNTRTTD VEATVNQIKA LERVGADIVR 
VSVPTMDAAE AFKLIKQRVN VPLVADIHFD YRIALKVAEY GVDCLRINPG NIGNEERIRM
VVDCARDKNI PIRIGVNAGS LEKDLQEKYG EPTPQALLES AMRHVDHLDR LNFDQFKVSV
KASDVFLAVE SYRLLAKQID QPLHLGITEA GGARSGAVKS AIGLGLLLSE GIGDTLRVSL
AADPVEEIKV GFDILKSLRI RSRGINFIAC PTCSRQEFDV IGTVNALEQR LEDIITPMDV
SIIGCVVNGP GEALVSTLGV TGGNKKSGLY EDGVRKDRLD NNDMIDQLEA RIRAKASQLD
EARRIDVQQV EK