Gene Information |
Locus tag | Tmz1t_3390 |
Symbol | ispG |
ID | 7873881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3709080 |
End bp | 3710339 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700329 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_002890361 |
Protein GI | 237654047 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCCC GACACGAACT CCAGCCGATC GAAGCCCGCC CGCTCGCCCG TCATCGCACC CATCAGGTGC GCGTCGGCAA GGTGAGGATC GGCGGCGAAG CCCCGGTCGT CGTGCAGTCG ATGACCAATA CCGACACGGC CGACGTGCTC GCCACCGCGA TGCAGGTCGC CGAGCTCGCC CGCGCCGGCT CCGAGATCGT GCGCATCACG GTCAACAACG AGGCCGCGGC GGCGGCGGTG CCGAAGATCC GCGACCGCCT GCTCGCGCTC AACATGGACG TGCCGCTGGT CGGCGACTTC CACTACAACG GCCACAAGCT GCTCACCGAC TTCCCGGCGT GCGCCGAGGC GCTCGCCAAG CTGCGCATCA ACCCGGGCAA CGTCGGCGCC GGCCGCAAGC GCGACCCGCA GTTCGCCGCG ATCGTCGAGC TTGCCTGCCG CTACGACAAG CCGGTGCGCA TCGGCGTGAA CTGGGGCAGC CTCGACCAGT CGGTCCTTGC CCGCATCATG GACGCCAACG CCAAGCGTGC CGAGCCGCGC GACGCCGGCG CGGTGATGCG CGAGGCGCTC GTCGTCTCGG CGCTCGAATC CGCGGCCAAG GCCGAGGAAT ACGGCCTCGG CCGCGAGCGC ATCATCCTGT CGGCCAAGGT TTCCAGCGTG CAGGACCTGA TCGCGGTGTA CCGCGATCTC GCCCGGCGCA GCGACTACGC GCTGCATCTG GGCCTCACCG AGGCCGGCAT GGGCAGCAAG GGCATCGTCG GCTCCACCGC CGCGCTCGCC GTGCTGCTGC AGGAAGGCAT CGGCGACACC ATCCGCATCT CGCTCACCCC CGAGCCGGGC GGCAGCCGCA CCCAGGAGGT CGTGGTCGCG CAGGAGATCC TGCAGACCAT GGGCCTGCGC GCCTTCACCC CCATGGTGAC TGCCTGCCCG GGCTGCGGCC GCACCACCAG CACCTTCTTC CAGGAGCTCG CCTCCGGTAT CCAGGACTAC GTGCGCGCGC AGATGCCGGT GTGGCGCGAA CAGTACGACG GCGTCGAGAA CATGACGCTG GCAGTGATGG GCTGCGTGGT CAACGGCCCG GGCGAGAGCA AGCACGCCAA CATCGGCATC TCGCTGCCGG GCACCGGCGA AACCCCGGCG GCGCCGGTGT TTGTCGACGG CGAGAAGGTC GTCACCCTGC GCGGCGACAA CATCGCTGCA GAGTTCAAGG CGCTCGTCGA CGATTACGTC GCCACCCGCT ACGTGAAGAA GGGCGCCTGA
|
Protein sequence | MNPRHELQPI EARPLARHRT HQVRVGKVRI GGEAPVVVQS MTNTDTADVL ATAMQVAELA RAGSEIVRIT VNNEAAAAAV PKIRDRLLAL NMDVPLVGDF HYNGHKLLTD FPACAEALAK LRINPGNVGA GRKRDPQFAA IVELACRYDK PVRIGVNWGS LDQSVLARIM DANAKRAEPR DAGAVMREAL VVSALESAAK AEEYGLGRER IILSAKVSSV QDLIAVYRDL ARRSDYALHL GLTEAGMGSK GIVGSTAALA VLLQEGIGDT IRISLTPEPG GSRTQEVVVA QEILQTMGLR AFTPMVTACP GCGRTTSTFF QELASGIQDY VRAQMPVWRE QYDGVENMTL AVMGCVVNGP GESKHANIGI SLPGTGETPA APVFVDGEKV VTLRGDNIAA EFKALVDDYV ATRYVKKGA
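The length fields in this record can be cross-checked against the coordinates and the sequences. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions agree with the values listed above. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
print(gene_length(3709080, 3710339))  # 1260, matching "Gene Length"
print(protein_length(1260))           # 419, matching "Protein Length"
```

Passing the full "Gene sequence" string to `gc_percent` would give a value to compare with the listed GC content. The gene is on the minus strand, but the sequence shown starts with ATG and ends with TGA, so it appears to be given in coding orientation already and no reverse complement is applied here.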