Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2667 |
Symbol | ispG |
ID | 6146301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2736647 |
End bp | 2737765 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617538 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001744703 |
Protein GI | 170682819 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.260982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAACC AGGCTCCAAT TCAACGTAGA AAATCAACAC GTATTTACGT TGGGAATGTG CCGATTGGCG ATGGTGCTCC CATCGCCGTA CAGTCCATGA CCAATACGCG TACGACAGAC GTCGAAGCAA CGGTCAATCA AATCAAGGCG CTGGAACGTG TTGGCGCTGA TATCGTCCGT GTCTCCGTAC CGACGATGGA CGCGGCAGAA GCGTTCAAAC TCATCAAACA GCAGGTTAAC GTGCCGCTGG TGGCTGACAT CCACTTCGAC TATCGCATTG CGCTGAAAGT AGCGGAATAC GGCGTCGATT GTCTGCGTAT TAACCCTGGC AATATCGGTA ATGAAGAGCG TATTCGCATG GTGGTTGACT GTGCGCGCGA TAAAAACATT CCGATCCGTA TTGGCGTTAA CGCCGGATCG CTGGAAAAAG ATCTGCAAGA AAAGTATGGC GAACCGACGC CGCAGGCGTT GCTGGAATCT GCCATGCGTC ATGTTGATCA TCTCGATCGC CTGAACTTCG AACAGTTCAA AGTCAGCGTG AAAGCGTCAG ACGTCTTCCT CGCTGTTGAG TCTTATCGTT TGCTGGCAAA ACAGATCGAT CAGCCGCTGC ATCTGGGGAT CACCGAAGCG GGTGGTGCGC GCAGCGGGGC GGTTAAATCC GCCATTGGTT TAGGTCTGCT GCTGTCTGAA GGCATCGGCG ACACGCTGCG CGTATCGCTG GCGGCCGATC CGGTCGAAGA GATCAAAGTC GGTTTCGATA TTTTGAAATC GCTGCGTATC CGTTCGCGCG GGATCAACTT CATCGCCTGC CCGACCTGTT CGCGTCAGGA ATTTGATGTT ATCGGTACGG TTAACGCGCT GGAGCAACGC CTGGAAGATA TCATCACTCC GATGGACGTT TCGATTATCG GCTGCGTGGT GAATGGCCCA GGTGAGGCGC TGGTTTCTAC ACTCGGCGTC ACCGGCGGCA ACAAGAAAAG CGGCCTCTAT GAAGATGGCG TGCGCAAAGA CCGTCTGGAC AACAACGATA TGATCGATCA GCTGGAAGCG CGCATTCGTG CGAAAGCCAG TCAGCTGGAC GAAGCGCGTC GAATTGACGT TCAGCAGGTT GAAAAATAA
|
Protein sequence | MHNQAPIQRR KSTRIYVGNV PIGDGAPIAV QSMTNTRTTD VEATVNQIKA LERVGADIVR VSVPTMDAAE AFKLIKQQVN VPLVADIHFD YRIALKVAEY GVDCLRINPG NIGNEERIRM VVDCARDKNI PIRIGVNAGS LEKDLQEKYG EPTPQALLES AMRHVDHLDR LNFEQFKVSV KASDVFLAVE SYRLLAKQID QPLHLGITEA GGARSGAVKS AIGLGLLLSE GIGDTLRVSL AADPVEEIKV GFDILKSLRI RSRGINFIAC PTCSRQEFDV IGTVNALEQR LEDIITPMDV SIIGCVVNGP GEALVSTLGV TGGNKKSGLY EDGVRKDRLD NNDMIDQLEA RIRAKASQLD EARRIDVQQV EK
|
| |