Gene Information |
Locus tag | BURPS668_0571 |
Symbol | ipk |
ID | 4885306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 540693 |
End bp | 541574 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640126499 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001057623 |
Protein GI | 126442103 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
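The coordinate and length fields in the record above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed values, not something the record states) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(540693, 541574)
print(length)                  # 882
print(protein_length(length))  # 293
```

Both results agree with the Gene Length and Protein Length fields listed for BURPS668_0571.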
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000369391 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATA CGACCCGCTC GCTGCGCGAC TGCCTCGCCC CGGCGAAACT GAACCTGTTC CTGCACATCA CGGGCCGCCG CCCGGACGGC TATCACGCGC TGCAAAGCGT GTTCCAGCTG CTCGACTGGG GCGACCGGCT GCACTTCACG CTGCGCGACG ACGGCAAGGT CTCGCGCGTG ACGGACGTGC CGGGCGTGCC CGAGGAATCC GACCTCGTCG TGCGCGCCGC GTCGCTGCTG AAGGCGCACG CCGGCGCGAC GCTGGGCGTT GACATCGAGA TCGACAAGCG GCTGCCGATG GGCGCGGGCC TGGGCGGCGG CAGCTCGGAC GCGGCGACGA CGTTGCTCGC GCTCAACCGG CTATGGCGGC TCGACCTGCC GCGCACCACG CTGCAATCGC TCGCGGTGAA GCTCGGCGCC GACGTGCCGT TCTTCGTCTT CGGAAAAAAT GCGTTCGCGG AGGGTATCGG AGAAGCGCTA CAAGCTGTAG AATTGCCGGC TCGCTGGTTC CTGGTTGTGA CACCGCGGGT TCACGTGCCG ACGGCAGCGA TTTTTTCCGA AAAATCGTTG ACAAGAGATT CGAAACCCAT CACAATTACG GACTTTCTTG CACAGTGCGG CATCGACGCA GGATGGCCAG ACAGCTTCGG CCGGAATGAC ATGCAGCCGG TTGTGACAAG CAAGTACGCG GAAGTTGCAA AGGTGGTCGA ATGGTTTTAT AATCTGACCC CCGCGCGGAT GACCGGCTCT GGAGCGAGCG TGTTTGCAGC GTTCAAGAGC AAGGCTGATG CAGAAGCGGC GCAAGCCAAA CTGCCTGCCG GCTGGAACAG CGCAGTTGCC GAGAGCATGA GTGAGCATCC ACTCTTCGCT TTCGCGTCAT AA
|
Protein sequence | MTDTTRSLRD CLAPAKLNLF LHITGRRPDG YHALQSVFQL LDWGDRLHFT LRDDGKVSRV TDVPGVPEES DLVVRAASLL KAHAGATLGV DIEIDKRLPM GAGLGGGSSD AATTLLALNR LWRLDLPRTT LQSLAVKLGA DVPFFVFGKN AFAEGIGEAL QAVELPARWF LVVTPRVHVP TAAIFSEKSL TRDSKPITIT DFLAQCGIDA GWPDSFGRND MQPVVTSKYA EVAKVVEWFY NLTPARMTGS GASVFAAFKS KADAEAAQAK LPAGWNSAVA ESMSEHPLFA FAS
|
| |
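The gene sequence is shown in blocks of ten bases, so it needs to be stripped of whitespace before any computation. The sketch below shows one way to compute GC content and to translate the coding sequence. It assumes the sequence is given in the coding (sense) orientation, which is consistent with it starting with ATG and ending with TAA even though the gene lies on the minus strand. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in permitted start codons, so a standard codon map is used. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, and only the first 30 bases of the record are used as sample input.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G
# (the conventional NCBI layout; identical for tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODONS[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGACCGATA CGACCCGCTC GCTGCGCGAC"
print(translate(prefix))             # MTDTTRSLRD
print(f"{gc_fraction(prefix):.1%}")  # 66.7%
```

The translated prefix matches the first ten residues of the listed protein sequence. The GC value printed here applies only to the 30-base prefix, not to the whole gene.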