Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0041 |
Symbol | ipk |
ID | 7978481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 51905 |
End bp | 52774 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644797001 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_002948249 |
Protein GI | 239825625 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000358804 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGGTTGT TAGTCAAAGC GCCAGCAAAA ATCAACTTAT CATTGGACGT GTTACATAAG CGGCCGGATG GGTATCATGA AGTGAAAATG GTCATGACAA CGATTGATTT GGCAGATCGG ATCGAATTAA TTCCACAAAT GGATGATACC ATACAAATTA TTTCAAAAAA CCGATTCGTT CCCGATGACC ACCGCAACTT GGCGTATCAG GCTGCAAAGT TATTGAAAGA TACATTTGCG ATTAAACAAG GCATAGCGAT CTCTATTACA AAAAATATTC CAGTAGCGGC CGGGCTGGCG GGAGGAAGCA GTGACGCCGC CGCGACGCTC CGCGGTTTAA ATAAGCTTTG GAATTTAGGC CTTACACTGG ATGAGTTAGC AGAACTAGGA GCAAAAATCG GTTCTGACGT ATCATTTTGC GTTTACGGTG GAACCGCGAT TGCAACAGGA CGGGGCGAAA AAATTACACC GATTCCTGCT CCACCGCCAT GCTGGGTTAT TTTGGCCAAA CCATCGATCG GTGTTTCTAC TGCTGAAGTG TATCGAAATT TAAAGGTTGA TGAGATTCCA CATCCGGATG TGGACGGAAT GGTAGAGGCG ATTTATCGCC AAGATTATGC GGCTATTTGT AAACTGGTTG GAAACGTATT AGAGGAAGTA ACATTAAAAA AATATCCAGA AGTAGCGCAT ATTAAAGAGC AAATGAAGCG GTTTGGAGCA GACGCGGTAT TGATGAGCGG CAGCGGGCCG ACGGTGTTCG GGTTAGTCCA GCACGATTCA AGATTGCAGC GAATTTATAA CGGACTCCGC GGTTTTTGTG ATCAAGTGTT TGCTGTCCGT ATATTAGGCG AACGCCATTC ACTTGATTAA
|
Protein sequence | MRLLVKAPAK INLSLDVLHK RPDGYHEVKM VMTTIDLADR IELIPQMDDT IQIISKNRFV PDDHRNLAYQ AAKLLKDTFA IKQGIAISIT KNIPVAAGLA GGSSDAAATL RGLNKLWNLG LTLDELAELG AKIGSDVSFC VYGGTAIATG RGEKITPIPA PPPCWVILAK PSIGVSTAEV YRNLKVDEIP HPDVDGMVEA IYRQDYAAIC KLVGNVLEEV TLKKYPEVAH IKEQMKRFGA DAVLMSGSGP TVFGLVQHDS RLQRIYNGLR GFCDQVFAVR ILGERHSLD
|
| |