Gene Information |
Locus tag | BTH_I0476 |
Symbol | ipk |
ID | 3846883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 528512 |
End bp | 529393 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637840149 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_441033 |
Protein GI | 83719126 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
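
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 528512, End bp 529393.
length = gene_length_bp(528512, 529393)
print(length, protein_length_aa(length))  # 882 293
```

Because the gene is not spliced, no intron lengths need to be subtracted before dividing by three.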
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.299781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATA CGACCCGCTC GCTGCGCGAC TGCCTCGCCC CGGCGAAACT GAACCTGTTC CTGCACATCA CGGGCCGTCG TCCGGACGGC TATCACGAGC TGCAAAGCGT GTTCCAGCTG CTCGACTGGG GCGACCGGCT GCACTTCACG CTGCGCGACG ACGGCAAGGT GTCGCGCAAG ACCGACGTGC CGGGCGTACC CGAGGAAACC GACCTCATCG TGCGCGCGGC GTCGCTGCTG AAAGCGCACA CGGGCACGGC GGCGGGCGTC GACATCGAGA TCGACAAGCG ACTGCCGATG GGCGCGGGCC TCGGCGGAGG CAGCTCGGAT GCGGCGACGA CGTTGCTCGC GCTCAACCGC CTCTGGAAGC TCGACTTGCC GCGCGCCACG CTGCAATCGC TCGCGGTGAA GCTCGGCGCC GACGTGCCGT TCTTCGTCTT CGGAAAAAAT GCGTTCGCAG AGGGTATCGG AGAAGCGCTG CAAGCTGTAG AATTGCCGAC TCGCTGGTTT CTGGTTGTGA CACCGCGGGT TCACGTTCCG ACCGCAGCGA TTTTTTCCGA AAAATCGTTG ACAAGAGATT CGAAACCCAT CACAATTACG GACTTTCTTG CACAGCAAGA CTGCAACACG GGATGGCCTG ACAGTTTCGG TCGGAATGAC ATGCAGCCGG TTGTGACAAG CAAGTACGCG GAAGTTGCAA AGGTGGTCGG ATGGTTTTAT AATCTGACCC CCGCGCGGAT GACCGGCTCC GGAGCTAGCG TGTTTGCAGC GTTCAAGAGC AAGGCGGAGG CAGGAGCGGC GCAAGCCCAA CTGCCGGCCG GCTGGGACAG CGCAGTTGCC GAGAGCTTGG GTGAGCATCC ACTCTTCGCT TTCGCGTCAT AA
Protein sequence | MTDTTRSLRD CLAPAKLNLF LHITGRRPDG YHELQSVFQL LDWGDRLHFT LRDDGKVSRK TDVPGVPEET DLIVRAASLL KAHTGTAAGV DIEIDKRLPM GAGLGGGSSD AATTLLALNR LWKLDLPRAT LQSLAVKLGA DVPFFVFGKN AFAEGIGEAL QAVELPTRWF LVVTPRVHVP TAAIFSEKSL TRDSKPITIT DFLAQQDCNT GWPDSFGRND MQPVVTSKYA EVAKVVGWFY NLTPARMTGS GASVFAAFKS KAEAGAAQAQ LPAGWDSAVA ESLGEHPLFA FAS
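
The sequences above are displayed in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch of how the GC content and the terminal stop codon could be checked. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). The example only runs on the first and last blocks of the gene sequence; the full sequence from the record would be pasted in to compare against the listed GC content.

```python
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS_TABLE_11


first_block = "ATGACCGATA"      # first block of the gene sequence
last_blocks = "TTCGCGTCAT AA"   # last two blocks of the gene sequence

print(gc_percent(first_block))      # 40.0
print(ends_with_stop(last_blocks))  # True
```

The gene sequence is shown in coding orientation (it begins with ATG and ends with TAA) even though the gene lies on the minus strand, so no reverse complement is needed before these checks.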