Gene Information |
Locus tag | Cthe_2403 |
Symbol | ipk |
ID | 4811055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2870084 |
End bp | 2870935 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107816 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001038798 |
Protein GI | 125974888 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
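
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2870084
end_bp = 2870935

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 852 283
```

Under these assumptions the result agrees with the reported gene length (852 bp) and protein length (283 aa).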
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCAA TAAGTTTGAA AGCCCATGCA AAAATTAATT TGTCCCTTGA TGTAATTGGA AAGAGACAGG ATGGTTATCA TGAGGTCAGA ATGATTATGC AGTCCATAGC TTTGCATGAC GAAGTTGTAA TCGAAAAAAG GGCTGCAGGA ATCAAAGTGG AATGCGACAA GCCGTGGGTG CCCGAAGGAA GCGGCAATAT TGCTTATAAA GCGGCAAACC TCATGATGGA AAGGTATAAG ATTGAAAGCG GCGTTGGCAT TAAAATCCTA AAGAGAATTC CCGTGGCGGC AGGCCTTGCA GGGGGAAGTG CCGATGCGGC GGCGGTAATT AAAGGGATGA ATGAACTTTT TAATTTGAAT GCTGATGAAG CCGAACTTAT GGACATCGGA AAGCAGGTGG GAGCGGATGT GCCTTTCTGC ATCAAGGGAG GAACCATGCT CTCCGAAGGA ATCGGGGAGA AGCTTACAAA AATCCCTTCT TTTGAGGGAG TGAATATTGT ATTGGTAAAA CCCAAAGTCG GCGTGTCGAC AGCCTGGGTA TACAGCAATT TAAAGCTGAA TGAAATATCC TCAAGACCTG ATACCGAACT TTTGATAAAA GCAATTTATG AGAAAAATAT TGGCTGTCTG GCGCAAAACA TGACAAATGT GCTGGAGACA GTTACAATAA AAAAGTATGG AGTTATAAAT GATATTAAAA ACGAGCTCTT GAGGCTCGGA GCTTTGGGAA GCATGATGAG CGGAAGCGGT CCTTCCGTGT TTGGCATATT TGAAAATGAA AAACAGGCTT GTCTTGCTTA TGAAGGCCTT AAAAACAGTG AGTGGGAATG CTTTGTTACA CAAACAATAT AA
|
Protein sequence | MESISLKAHA KINLSLDVIG KRQDGYHEVR MIMQSIALHD EVVIEKRAAG IKVECDKPWV PEGSGNIAYK AANLMMERYK IESGVGIKIL KRIPVAAGLA GGSADAAAVI KGMNELFNLN ADEAELMDIG KQVGADVPFC IKGGTMLSEG IGEKLTKIPS FEGVNIVLVK PKVGVSTAWV YSNLKLNEIS SRPDTELLIK AIYEKNIGCL AQNMTNVLET VTIKKYGVIN DIKNELLRLG ALGSMMSGSG PSVFGIFENE KQACLAYEGL KNSEWECFVT QTI
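
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal sketch of both calculations, not the tool used to produce this record. It assumes the sequence is pasted in the 10-base blocks shown above (whitespace is stripped) and that the gene starts with ATG, so the alternative start codons of table 11 do not need special handling. The function names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon assignments shared by the standard code and translation table 11,
# laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGAATCAA TAAGTTTGAA AGCCCATGCA"))  # MESISLKAHA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the reported GC content.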