Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1333 |
Symbol | ispG |
ID | 8136660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1566033 |
End bp | 1567088 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644868947 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_003021151 |
Protein GI | 253699962 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.00000208857 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA CGACCCGGCA GATCATGATA GGGAATATCC CCATCGGCGG AGGAGCTCCC TGCTCCGTCC AGTCCATGTG CTCGACCGAC ACCCGTGATG TCGCCGCGAC CCTTGGGCAG ATCGGCCGGC TTGCCGCCGC CGGATGCGAG ATCGTGCGCT GCGCAGTACC CGATATGGAC GCCGCCCTGG CCCTTGCCGC CATAAAATCC GGCTCCCCCA TGCCGCTCAT AGCTGATATC CACTTCGACT ACAAGCTCGC CTTGAAGGCC CTGGAGTCCG GTGTTGGCGG GCTTCGTCTC AATCCCGGCA ACATAGGCGA AAAGTGGAAG GTCGCCGAGG TGGTGAAAGC CGCGGCCGAG CGCAACGTTC CCATCCGCAT CGGCGTCAAC GGCGGTTCGC TGGAAAAGGA ACTGCTGGTG AAGTACGGGC ACCCGACTCC CGAGGCCATG GTCGAGTCGG CGCTGGGGCA CGTGCGGATC CTGGAGGAGC TGGGCTATCA GCAGATAAAG ATATCGATCA AGGTCTCCGA CGTGTTGCGG ACCCTGGAAG CGTACCGGCT CCTTTCCGAC GCCGTGGACT ACCCGCTGCA CATAGGCGTA ACCGAAGCCG GAACCATCTT CGCCGGAACC GTCAAGTCCT CCGTTGGTCT CGGGATCCTT CTGCACCAGG GGATCGGCGA CACCATGCGC GTCTCCCTCA CCGGCGACCC GGTTGACGAG GTGCGGGTGG CGTACGACAT CCTCAAATCG CTCGGTTTGC GGACGCGCGG CATCAACTTC GTCTCCTGCC CCACCTGCGG GCGCTGCCAG GTAAACCTGA TACCCGTAGC CGAGGAAGTA GAGCGGCGCC TGGCGCATCT GGATACGACG ATCACCGTTG CCGTCATGGG ATGTTCCGTC AACGGTCCAG GCGAGGCCCG CGAGGCGGAC TTCGGCATAG CCGGGGGTAG GGGAGAAGGG CTTCTCTTCA GGCACGGCGA GATCCTGCGC AAGGTCCCCG AAGCCGAGCT AGCCGACGCC CTGGTGGAAG AAGTTTTGAA GAACAGTCAG ACCTGA
|
Protein sequence | MKKTTRQIMI GNIPIGGGAP CSVQSMCSTD TRDVAATLGQ IGRLAAAGCE IVRCAVPDMD AALALAAIKS GSPMPLIADI HFDYKLALKA LESGVGGLRL NPGNIGEKWK VAEVVKAAAE RNVPIRIGVN GGSLEKELLV KYGHPTPEAM VESALGHVRI LEELGYQQIK ISIKVSDVLR TLEAYRLLSD AVDYPLHIGV TEAGTIFAGT VKSSVGLGIL LHQGIGDTMR VSLTGDPVDE VRVAYDILKS LGLRTRGINF VSCPTCGRCQ VNLIPVAEEV ERRLAHLDTT ITVAVMGCSV NGPGEAREAD FGIAGGRGEG LLFRHGEILR KVPEAELADA LVEEVLKNSQ T
|
| |