Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1162 |
Symbol | ispG |
ID | 6068100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1270946 |
End bp | 1272064 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641600578 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001724156 |
Protein GI | 170019202 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.010209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAACC AGGCTCCAAT TCAACGTAGA AAATCAACAC GTATTTACGT TGGGAATGTG CCGATTGGCG ATGGTGCTCC CATCGCCGTA CAGTCCATGA CCAATACGCG TACGACAGAC GTCGAAGCAA CGGTCAATCA AATCAAGGCG CTGGAACGCG TTGGCGCTGA TATCGTCCGT GTATCCGTAC CGACGATGGA CGCGGCAGAA GCGTTCAAAC TCATCAAACA GCAGGTTAAC GTGCCGCTGG TGGCTGACAT CCACTTCGAC TATCGCATTG CACTGAAAGT AGCGGAATAC GGCGTCGATT GTCTGCGTAT TAACCCTGGC AATATCGGTA ATGAAGAGCG TATTCGCATG GTGGTTGACT GTGCGCGCGA TAAAAACATT CCGATCCGTA TTGGCGTTAA CGCCGGATCG CTGGAAAAAG ATCTGCAAGA AAAGTATGGC GAACCGACGC CGCAGGCGTT GCTGGAATCC GCCATGCGCC ATGTTGATCA TCTCGATCGC CTGAACTTCG ATCAGTTCAA AGTCAGCGTG AAAGCGTCTG ACGTCTTCCT CGCTGTTGAG TCTTATCGTT TGCTGGCAAA ACAGATCGAT CAGCCGCTGC ATCTGGGGAT CACCGAAGCC GGTGGTGCAC GCAGCGGGGC AGTAAAATCC GCCATTGGTT TAGGTCTGCT GCTGTCTGAA GGCATCGGCG ACACGCTGCG CGTATCACTG GCGGCCGATC CGGTCGAAGA GATCAAAGTC GGTTTCGATA TTTTGAAATC GCTGCGTATC CGTTCGCGCG GGATCAACTT CATCGCCTGC CCGACCTGTT CGCGTCAGGA ATTTGATGTT ATCGGTACGG TTAACGCGCT GGAGCAACGC CTGGAAGATA TCATCACTCC GATGGACGTT TCGATTATCG GCTGCGTGGT GAATGGCCCA GGTGAGGCGC TGGTTTCTAC ACTCGGCGTC ACCGGCGGCA ACAAGAAAAG CGGCCTCTAT GAAGATGGCG TGCGCAAAGA CCGTCTGGAC AACAACGATA TGATCGACCA GCTGGAAGCA CGCATTCGTG CGAAAGCCAG TCAGCTGGAC GAAGCGCGTC GAATTGACGT TCAGCAGGTT GAAAAATAA
|
Protein sequence | MHNQAPIQRR KSTRIYVGNV PIGDGAPIAV QSMTNTRTTD VEATVNQIKA LERVGADIVR VSVPTMDAAE AFKLIKQQVN VPLVADIHFD YRIALKVAEY GVDCLRINPG NIGNEERIRM VVDCARDKNI PIRIGVNAGS LEKDLQEKYG EPTPQALLES AMRHVDHLDR LNFDQFKVSV KASDVFLAVE SYRLLAKQID QPLHLGITEA GGARSGAVKS AIGLGLLLSE GIGDTLRVSL AADPVEEIKV GFDILKSLRI RSRGINFIAC PTCSRQEFDV IGTVNALEQR LEDIITPMDV SIIGCVVNGP GEALVSTLGV TGGNKKSGLY EDGVRKDRLD NNDMIDQLEA RIRAKASQLD EARRIDVQQV EK
|
| |