Gene Information |
Locus tag | Mlg_1461 |
Symbol | ispG |
ID | 4270242 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1667469 |
End bp | 1668716 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126217 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_742300 |
Protein GI | 114320617 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
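The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1667469, 1668716)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1248 bp) and Protein Length (415 aa) fields.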
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0743791 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCTGAGA CCAGCAGCCA GCACCCGAGA CGCCGCAGCG TGCCCGTGCC CGTAGGCCCG GTCACCATTG GCGGCAACCA CCCCATCGTG GTCCAGTCGA TGACCAACAC CGATACCGCC GATGACATCC GCACGGTGGT GCAGGTGGCC GAGCTGGCAC GGGCGGGCTC GGAAATTGTC CGGCTCACCG TCAACAACGA CGAAGCGGCG GCGGCGGTTC CGCACATCCG CGAGCGGCTG GATGCCATGG GCCTGGAGGT GCCCCTGGTC GGTGACTTCC ACTTTAACGG CCACAAGCTG CTGGCGAAAC ACCCCGCCTG TGCCGAGGCA TTGGCCAAGT TCCGCATCAA CCCCGGCAAT GTCGGCAAGG GCCGTCGCCG GGACCCGCAG TTTGCCGAGA TGATCGAGTT CGCCTGTCGC TACGACAAGC CGGTACGCAT CGGCGTCAAC TGGGGCAGCC TGGACCAGGA TCTGCTGGCC GCGATGATGG ATGAGAATGC GGCTCTCCCC CGCCCGCTCC CACCGGAGCA GGTGATGAAG CAGGCGGTGA TCGCCTCCGC ACTGCAGAGC GCGGAGAAGG CCGAGGCCCT CGGCCTGCCA CGGGAGCGGA TCGTGCTCTC CTGCAAGATG TCCGGCGTCC AGGACTTGAT CGAGGTCTAC CGCGATCTCG CGGCCCGCTG CGACTACGCT CTGCACCTGG GCCTCACCGA GGCCGGTATG GGCTCCAAGG GCATTGTCGC CTCCACTGCC GCGCTGGCGG TCCTGCTGCA GGAGGGGATC GGCGACACCA TCCGGGTCTC CCTCACCCCG GAGCCGGACC AGCCCCGCAC CGACGAGGTG GTGGTTGCCC AGCAGATCCT GCAGACCATG GGCCTGCGCG CCTTCACCCC CATGGTCACC GCCTGCCCCG GCTGTGGCCG GACCACCAGC ACCTACTTCC AGGCGCTGGC CCGCGACATC CAGGCTCACG TCCAGCGGCG TATGCCGGAG TGGCGTCGTA CCTATCCGGG GGTCGAGAAC CTGACCCTGG CGGTGATGGG TTGCGTGGTC AACGGTCCGG GCGAGAGCCG GAACGCCGAT ATCGGCATCA GCCTGCCCGG TACCGGAGAA CGGCCCGTGG CCCCGGTCTA CGTGGATGGC GAGAAAACAG TGACCCTGAA GGGCGAGCGT ATCGCCGAAG AGTTCCAGGC CATCGTCGAA GACTATATCG AGGACCGCTT CGGGCAGCAG CGGGCCGGCG ACCGCTGA
Protein sequence | MPETSSQHPR RRSVPVPVGP VTIGGNHPIV VQSMTNTDTA DDIRTVVQVA ELARAGSEIV RLTVNNDEAA AAVPHIRERL DAMGLEVPLV GDFHFNGHKL LAKHPACAEA LAKFRINPGN VGKGRRRDPQ FAEMIEFACR YDKPVRIGVN WGSLDQDLLA AMMDENAALP RPLPPEQVMK QAVIASALQS AEKAEALGLP RERIVLSCKM SGVQDLIEVY RDLAARCDYA LHLGLTEAGM GSKGIVASTA ALAVLLQEGI GDTIRVSLTP EPDQPRTDEV VVAQQILQTM GLRAFTPMVT ACPGCGRTTS TYFQALARDI QAHVQRRMPE WRRTYPGVEN LTLAVMGCVV NGPGESRNAD IGISLPGTGE RPVAPVYVDG EKTVTLKGER IAEEFQAIVE DYIEDRFGQQ RAGDR