Gene Information |
Locus tag | Athe_1538 |
Symbol | ipk |
ID | 7409045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1624007 |
End bp | 1624867 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643715909 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_002573409 |
Protein GI | 222529527 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
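The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length includes the stop codon. Neither assumption is stated in the record, and the variable names are only illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 1624007
end_bp = 1624867

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, which is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the result agrees with the listed Gene Length (861 bp) and Protein Length (286 aa), and the remainder of zero indicates a whole number of codons.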
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00561271 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAACTCA AGGCTTATGC AAAAATCAAC TTAGCATTGG ATGTGCTTTC GAAAAGAGAA GATGGCTATC ATGAAATAAG AACTATAATG CAAACAGTGG ATTTGTATGA TATAATCAAT ATTGAAAAGA TAGAAGAAGA CAACATAATT GTGACAACTT CAAGTGAAAA TATTCCAACT GACAATAAAA ACCATGCATA CATTGCAGCT TCACTTTTAA AAGAGCGTTT TGGCGTAAAG CAAGGTGTGA GAATACATAT TGAAAAGAAC ATTCCGGTCT CTGCGGGTTT AGCTGGTGGA AGCACTGACG CAGCAGCAGT TTTAAAAGGT CTGAATGAAA TATTTGAGCT AAATCTTTCT GAGCAGCAGC TTATGGAAAT CGGAAGAGAG ATTGGTGCTG ATGTTCCATT TTGTTTGGTA GGCGGCACAG CCCTTTGTGA GGGAATTGGC GAAAAGGTGA TAAAGCTAAA ATCAGCTCCT CAGATGAATA TCCTCATTGC AAAGCCAGAG GTATATGTTT CTACGCAGGC TGTGTATGAG GCATTGGATC TTAGCAAGAT AAAAAAGAGA CCAAACATTG AAGCTATGAT TTCGGCAATT GAAGAAGGTA ATGTAAAAGA GATAGCAAAG AATCTTTGCA ATGTTTTAGA GGTGGTTACA GTAAATCAGT ATCCAGTCAT AAACAGAGTC AAGGACATTA TGAGAAATAA CAATGCTCTT GGGACAGTTA TGACAGGAAG CGGACCAGCT GTATTTGGGA TTTTTGGCAA CAAGTATAAT GCTTTAAAAG CTGCAGAGAG GCTCAAGGTG TTTATAAAAG AAATTATCTT GACTACAACA TGTGAAGGTA GCGGATTTTA G
|
Protein sequence | MKLKAYAKIN LALDVLSKRE DGYHEIRTIM QTVDLYDIIN IEKIEEDNII VTTSSENIPT DNKNHAYIAA SLLKERFGVK QGVRIHIEKN IPVSAGLAGG STDAAAVLKG LNEIFELNLS EQQLMEIGRE IGADVPFCLV GGTALCEGIG EKVIKLKSAP QMNILIAKPE VYVSTQAVYE ALDLSKIKKR PNIEAMISAI EEGNVKEIAK NLCNVLEVVT VNQYPVINRV KDIMRNNNAL GTVMTGSGPA VFGIFGNKYN ALKAAERLKV FIKEIILTTT CEGSGF
|
| |
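The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a record might be processed, the helpers below strip the spacing, compute GC content, and translate a coding sequence. The function names are hypothetical and not part of any database API. The codon assignments are those of the standard genetic code, which translation table 11 shares; treating ATG, GTG and TTG as start codons that yield methionine is a simplifying assumption, since table 11 lists further initiators.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("TTGAAACTCA AGGCTTATGC AAAAATCAAC"))
# Last blocks of the gene sequence, which are not at the start of the CDS.
print(translate("TGTGAAGGTA GCGGATTTTA G", first_is_start=False))
```

The two fragments translate to MKLKAYAKIN and CEGSGF, matching the first and last residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.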