Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2666 |
Symbol | ispG |
ID | 5591669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2678325 |
End bp | 2679443 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921782 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001459308 |
Protein GI | 157161990 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 0.253817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAACC AGGCTCCAAT TCAACGTAGA AAATCAACAC GTATTTACGT TGGGAATGTG CCGATTGGCG ATGGTGCTCC CATCGCCGTA CAGTCCATGA CCAATACGCG TACGACAGAT GTCGAAGCAA CGGTCAATCA AATCAAGGCG CTGGAACGCG TTGGCGCTGA TATCGTCCGT GTCTCCGTAC CGACGATGGA CGCGGCAGAA GCGTTCAAAC TCATCAAACA GCGGGTTAAC GTGCCGCTGG TGGCTGACAT CCACTTCGAC TATCGCATTG CGCTGAAAGT AGCGGAATAC GGCGTCGATT GTCTGCGTAT TAACCCTGGC AATATCGGTA ATGAAGAGCG TATTCGCATG GTAGTTGACT GTGCGCGCGA TAAAAACATT CCGATCCGCA TTGGCGTTAA CGCCGGATCG CTGGAAAAAG ATCTGCAAGA AAAGTATGGC GAACCGACGC CGCAGGCGTT GCTGGAATCC GCCATGCGCC ATGTTGATCA TCTCGATCGC CTGAACTTCG ATCAGTTCAA AGTCAGCGTG AAAGCGTCTG ACGTCTTCCT CGCTGTTGAG TCTTATCGTT TGCTGGCAAA ACAAATCGAT CAGCCGCTGC ATCTGGGGAT CACCGAAGCG GGTGGCGCGC GCAGCGGGGC AGTAAAATCC GCCATTGGTT TAGGTCTGCT GCTGTCTGAA GGCATCGGCG ACACGCTGCG CGTATCACTG GCGGCCGATC CGGTCGAAGA GATCAAAGTC GGTTTCGATA TTTTGAAATC GCTGCGTATC CGTTCGCGAG GTATCAACTT CATCGCCTGT CCGACCTGTT CGCGTCAGGA GTTTGATGTT ATCGGTACGG TTAACGCGCT GGAGCAACGC CTGGAAGATA TCATCACTCC GATGGACGTT TCGATTATCG GCTGCGTGGT GAATGGCCCA GGTGAGGCGC TGGTTTCTAC ACTCGGCGTC ACCGGCGGCA ACAAGAAAAG CGGCCTCTAT GAAGATGGCG TGCGCAAAGA CCGTCTGGAC AACAACGATA TGATCGACCA GCTGGAAGCA CGCATTCGTG CGAAAGCCAG TCAGCTGGAC GAAGCGCGTC GAATTGACGT TCAGCAGGTC GAAAAATAA
|
Protein sequence | MHNQAPIQRR KSTRIYVGNV PIGDGAPIAV QSMTNTRTTD VEATVNQIKA LERVGADIVR VSVPTMDAAE AFKLIKQRVN VPLVADIHFD YRIALKVAEY GVDCLRINPG NIGNEERIRM VVDCARDKNI PIRIGVNAGS LEKDLQEKYG EPTPQALLES AMRHVDHLDR LNFDQFKVSV KASDVFLAVE SYRLLAKQID QPLHLGITEA GGARSGAVKS AIGLGLLLSE GIGDTLRVSL AADPVEEIKV GFDILKSLRI RSRGINFIAC PTCSRQEFDV IGTVNALEQR LEDIITPMDV SIIGCVVNGP GEALVSTLGV TGGNKKSGLY EDGVRKDRLD NNDMIDQLEA RIRAKASQLD EARRIDVQQV EK
|
| |