Gene BURPS1106A_0587 details

Gene Information

Locus tag: BURPS1106A_0587
Symbol: ipk
ID: 4903182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 555449
End bp: 556330
Gene Length: 882 bp
Protein Length: 293 aa
Translation table: 11
GC content: 63%
IMG OID: 640133817
Product: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase
Protein accession: YP_001064869
Protein GI: 126452519
COG category: [I] Lipid transport and metabolism
COG ID: [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase
TIGRFAM ID: [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
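
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 555449
end_bp = 556330
reported_gene_length_bp = 882
reported_protein_length_aa = 293


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length_bp)
print(computed_protein_length == reported_protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed in this record.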


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0416429
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACCGATA CGACCCGCTC GCTGCGCGAC TGCCTCGCCC CGGCGAAACT GAACCTGTTC 
CTGCACATCA CGGGCCGCCG CCCGGACGGC TATCACGCGC TGCAAAGCGT GTTCCAGCTG
CTCGACTGGG GCGACCGGCT GCACTTCACG CTGCGCGACG ACGGCAAGGT CTCGCGCGTG
ACGGACGTGC CGGGCGTGCC CGAGGAATCC GACCTCGTCG TGCGCGCCGC GTCGCTGCTG
AAGGCGCACG CCGGCGCGAC GCTGGGCGTC GACATCGAGA TCGACAAGCG GCTGCCGATG
GGCGCGGGCC TGGGCGGCGG CAGCTCGGAC GCGGCGACGA CGTTGCTCGC GCTCAACCGG
CTATGGCGGC TCGACCTGCC GCGCACCACG CTGCAATCGC TCGCGGTGAA GCTCGGCGCC
GACGTGCCGT TCTTCGTCTT CGGAAAAAAT GCGTTCGCGG AGGGTATCGG AGAAGCGCTA
CAAGCTGTAG AATTGCCGGC TCGCTGGTTC CTGGTTGTGA CACCGCGGGT TCACGTGCCG
ACGGCAGCGA TTTTTTCCGA AAAATCGTTG ACAAGAGATT CGAAACCCAT CACAATTACG
GACTTTCTTG CACAGCGCGG CATCGACGCA GGATGGCCAG ACAGCTTCGG CCGGAATGAC
ATGCAGCCGG TTGTGACAAG CAAGTACGCG GAAGTTGCAA AGGTGGTCGA ATGGTTTTAT
AATCTGACCC CCGCGCGGAT GACCGGCTCT GGAGCGAGCG TGTTTGCAGC GTTCAAGAGC
AAGGCTGATG CAGAAGCGGC GCAAGCCAAA CTGCCTGCCG GCTGGAACAG CGCAGTTGCC
GAGAGCATGA GTGAGCATCC ACTCTTCGCT TTTGCGTCAT AA
 
Protein sequence
MTDTTRSLRD CLAPAKLNLF LHITGRRPDG YHALQSVFQL LDWGDRLHFT LRDDGKVSRV 
TDVPGVPEES DLVVRAASLL KAHAGATLGV DIEIDKRLPM GAGLGGGSSD AATTLLALNR
LWRLDLPRTT LQSLAVKLGA DVPFFVFGKN AFAEGIGEAL QAVELPARWF LVVTPRVHVP
TAAIFSEKSL TRDSKPITIT DFLAQRGIDA GWPDSFGRND MQPVVTSKYA EVAKVVEWFY
NLTPARMTGS GASVFAAFKS KADAEAAQAK LPAGWNSAVA ESMSEHPLFA FAS