Gene Information |
Locus tag | Moth_0072 |
Symbol | ipk |
ID | 3832681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 72508 |
End bp | 73365 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828004 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_428954 |
Protein GI | 83588945 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
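The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; both assumptions agree with the values listed in this record. The function name `feature_lengths` is illustrative and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(72508, 73365)
print(gene_len, protein_len)  # 858 285
```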
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGGATGTCC TCACCTTGCC GGCCTATGGT AAAATAAATC TGACTTTAAA GGTCCTCGGG CGGCGTTCCG ACGGTTATCA CAACCTGAGC ACCATCTTCC AGTCAATTGC CCTGGCGGAT AGACTCACTT TCAGCCGTTG CCGGGAAGGA ATACGCCTGG AAACCTCAGG GCTGCCGGTA CCGCAAGGAC CGGAAAACCT GGCCTACCGG GCAGCAGCCC GGTTGCAGTC CCGTTACGGT TTTCCCGGGG TGCGGATAAC CCTGAAGAAG CAAATCCCCC TGGCAGCCGG CCTGGCCGGG GGCAGCGCTG ATGCTGCGGC TACCCTGATA GGTGTGAATG CCCTTTTTAA CCTGGGCCTG ACCCCCGGCC AGCTGGCCCG GGAAGGGGCG GCCCTGGGGT CGGACGTCCC TTTTTGCGTT ATTGGTGGTA CGGCCTTAGG ACGGGGGCGG GGGGAAGAAC TCTCCCTTTT ACCTCCCCTC CCGACACTAT GGCTGGTACT GGTAAAACCG TCCTTCGGGG TCAGTACGGC AGCCGTATAC CGGGGTTGGG ATGCCAGCCC CGGCCAAACC CCCATGGAGG CCCCCGACGA GGAAAGGGCC CTGGCGGCCA TTAGGCGGGG CGACCGGGCA GGAATTATGG CATCCCTGGG TAATGACCTG GAAGCAGTCA CCTGCCGCCT GTACCCGGAA GTTATGGCCA TTAAGATGCG TCTCCTGGCC GAAGGGGCAG AGCGGGCGGT GATGTGCGGC AGCGGCCCGG CGGTCTTCGG GGTGGCTGCC GATGGAGAAA CCGCCAGGCG CATCGCCTCC CGGTTACAGG AGACTTACCC CGAAACTATA GTCACCCGGA CCTTATGA
Protein sequence | MDVLTLPAYG KINLTLKVLG RRSDGYHNLS TIFQSIALAD RLTFSRCREG IRLETSGLPV PQGPENLAYR AAARLQSRYG FPGVRITLKK QIPLAAGLAG GSADAAATLI GVNALFNLGL TPGQLAREGA ALGSDVPFCV IGGTALGRGR GEELSLLPPL PTLWLVLVKP SFGVSTAAVY RGWDASPGQT PMEAPDEERA LAAIRRGDRA GIMASLGNDL EAVTCRLYPE VMAIKMRLLA EGAERAVMCG SGPAVFGVAA DGETARRIAS RLQETYPETI VTRTL
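As a rough illustration of how the protein sequence follows from the gene sequence under translation table 11, the sketch below translates the first three 10-base blocks of the gene. It assumes that table 11 uses the standard amino acid assignments and that an alternative initiator codon such as TTG is read as methionine when it is the first codon. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are illustrative helpers, not part of any existing library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares; '*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS given as plain or space-separated bases."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


print(translate_cds("TTGGATGTCC TCACCTTGCC GGCCTATGGT"))  # MDVLTLPAYG
```

The result matches the first block of the protein sequence listed above. The gene starts with TTG rather than ATG, which is why the initiator-codon rule is needed to obtain the leading M.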