Gene Information |
Locus tag | PCC8801_2798 |
Symbol | ispG |
ID | 7104284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2885570 |
End bp | 2886790 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643475835 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_002372954 |
Protein GI | 218247583 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
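
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2885570
END_BP = 2886790


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1221, matches Gene Length
print(protein_length(length_bp))  # 406, matches Protein Length
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.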
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAAACCC TGGAAACGCC TAAACCTTTA ACAAACACGA GCCCTGAATT TGATACTACC ATTCATCGCC GCCAAACCCG TCCCGTCAAA GTCGGAGATA TCACCATTGG GGGAGGCTAT CCTGTGGTCG TCCAATCGAT GATCAATGAG GATACCCTAG ATATTGAGGG TTCGGTGGCT GGTATTCGTC GTTTACACGA AATTGGCTGC GAAATCGTCC GTGTTACCGT TCCTAGCATG GCCCACGCTA AAGCCTTAGC CGAAATTAAC CAAAAACTAG CGGAAGTTTA TCGACGGGTT CCCCTAGTGG CGGATGTACA CCATAATGGA CTGAAAATCG CCCTAGAAGT AGCTAAGCAC GTCGATAAAG TCCGTATTAA CCCTGGATTA TACGTCTTTG AGAAACCTAG TACTACCCGC AGTGAATACA CCCAAGCTGA ATTTGATGAA ATTGGCGAAA AAATCAGTGA AACCCTGAAA CCCTTGGTCG TTTCGCTACG AGATCAAGGT AAAGCGATGC GAATTGGCGT TAACCACGGT TCTCTCGCTG AACGGATGCT ATTTACCTAC GGCGATACTC CTGAAGGGAT GGTAGAATCA GCCTTAGAAT TTATCCGTAT TTGCGAATCC CTCGATTTTC GCAACTTAGT CATCTCTCTT AAAGCCTCTC GTGTTCCTGT GATGTTAGCC GCCTATCGCT TGATGGTCAA ACGGATGGAT GAGTTAGGAA TGGATTATCC CTTGCATTTA GGGGTCACAG AAGCCGGAGA TGGGGAATAC GGACGGATTA AGTCTACGGC GGGTATTGCT ACCCTTTTGG CCGAAGGCAT TGGGGATACC ATTCGGGTTT CGTTAACCGA GGCCCCAGAA AAAGAAATTC CCGTTTGCTA CAGTATTTTG CAAGCCTTGG GACTGCGGAA AACGATGGTG GAATATGTGG CCTGTCCCTC CTGTGGCCGA ACCCTATTTA ACTTAGAAGA GGTACTCCAT AAGGTTCGAG AAGCCACCAA ACATTTAACG GGGTTAGATA TTGCGGTCAT GGGTTGCATT GTCAATGGAC CGGGAGAAAT GGCTGACGCT GACTACGGTT ATGTCGGGAA ACAAGCGGGT TATATTTCTC TGTATCGCGG ACGGGAAGAA ATTAAGCGGG TTCCTGAAGA CCAAGGAGTT CAGGAATTGA TTGAATTAAT TAAAGCCGAT GGCCGTTGGG TTGAACCATA A
Protein sequence | MQTLETPKPL TNTSPEFDTT IHRRQTRPVK VGDITIGGGY PVVVQSMINE DTLDIEGSVA GIRRLHEIGC EIVRVTVPSM AHAKALAEIN QKLAEVYRRV PLVADVHHNG LKIALEVAKH VDKVRINPGL YVFEKPSTTR SEYTQAEFDE IGEKISETLK PLVVSLRDQG KAMRIGVNHG SLAERMLFTY GDTPEGMVES ALEFIRICES LDFRNLVISL KASRVPVMLA AYRLMVKRMD ELGMDYPLHL GVTEAGDGEY GRIKSTAGIA TLLAEGIGDT IRVSLTEAPE KEIPVCYSIL QALGLRKTMV EYVACPSCGR TLFNLEEVLH KVREATKHLT GLDIAVMGCI VNGPGEMADA DYGYVGKQAG YISLYRGREE IKRVPEDQGV QELIELIKAD GRWVEP
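
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might strip the spacing and compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
example = "ATGCAAACCC TGGAAACGCC"
print(len(clean_sequence(example)))   # 20
print(round(gc_percent(example), 1))  # 55.0
```

Passing the complete Gene sequence field to `gc_percent` should give a value that can be compared with the reported GC content of 47%.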