Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_2256 |
Symbol | ispG |
ID | 4285364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 2458840 |
End bp | 2460000 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638141758 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_757486 |
Protein GI | 114570806 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.239777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACAAT CCAGCTCCGA TCCGCGCAGT GTTCGCCCGT GGCGGATGAT TGACCGTCGC AAGAGCCGCA AGATCAAGGT TGGCCCGCTT GAGGTCGGCG GTGATGCGCC GATCAGCGTG CAGACGATGA CCAATACCCC CACCTCCGAT GCCGGTGCCA CGATCGACCA GATCCGGCGC TGCGAGGAGG CCGGTGTCGA TCTGGTGCGC GTGTCCTGCC CGGACGAGGA CTCAACCGCA GCCTTCAAGA CGATTGCCAA GGCGGCCAAG GTGCCGCTGA TCGCCGACAT CCATTTCCAC TACAAGCGTG GCATCGAGGC TGCCGAGGCC GGCGCCGCTT GCCTGCGTAT CAATCCGGGC AATATCGGCT CGATGGACCG GGTCAGGGAG GTCGTCCAGG CCGCCCGCGA TCATGGCTGT GCGATCCGGA TCGGGGTCAA TGCCGGCTCG CTCGAGCGTC ACCTGCTGGA GAAATATGGC GAGCCCTGTC CCGAGGCGAT GGTCGAGAGC GCGCTGGACC ATGCCCGCAT TCTCGATGAT CTCGATTTCC GAGATTACAA GATTTCGGTG AAGGCCTCCG ATCCCTTCCT CACGGTTGCG GCCTATCAAT CCCTGTCCGA GGCCACTGAC GCGCCCTTGC ATCTGGGGGT CACCGAGGCC GGGGGCACGC GGATCGGCAC GGTGAAATCC TCGATCGGCA TCGGCTCGAT GTTGTGGGCC GGGATCGGCG ACACCATTCG GGTCTCACTG TCGGCGGAGC CGGAAGAAGA AGTCCGGGTC GGCTTCGACA TTCTCAAATC GCTGGGGCTG CGAACCCGCG GCGTCAATAT CATCGCCTGC CCGTCCTGCG CCCGCCAGGG CTTTGACGTG ATCCGTACGG TGGAGACGCT GGAAGCCCGG CTGGCCCATA TTTCAGAGCC GATCTCGCTG TCCATCATCG GCTGTGTGGT CAACGGGCCG GGCGAAGCCC TGATGACCGA TCTGGGCTTT ACTGGCGGCG GTGCCGGGCG CGGCAAGATG TATGTGTCCG GACGCCCGGA CCACAATGTC TCCAATGAGG AGATGGTCGA TCATATTGTC GAGATGGTTG AAGACCGGGC TGCCGAGATT CGGGCTTCGG AGTTGTCCCC GGAGGGTGAC TCGGTCGAGG CTGCCGAATA G
|
Protein sequence | MSQSSSDPRS VRPWRMIDRR KSRKIKVGPL EVGGDAPISV QTMTNTPTSD AGATIDQIRR CEEAGVDLVR VSCPDEDSTA AFKTIAKAAK VPLIADIHFH YKRGIEAAEA GAACLRINPG NIGSMDRVRE VVQAARDHGC AIRIGVNAGS LERHLLEKYG EPCPEAMVES ALDHARILDD LDFRDYKISV KASDPFLTVA AYQSLSEATD APLHLGVTEA GGTRIGTVKS SIGIGSMLWA GIGDTIRVSL SAEPEEEVRV GFDILKSLGL RTRGVNIIAC PSCARQGFDV IRTVETLEAR LAHISEPISL SIIGCVVNGP GEALMTDLGF TGGGAGRGKM YVSGRPDHNV SNEEMVDHIV EMVEDRAAEI RASELSPEGD SVEAAE
|
| |