Gene BURPS1106A_A0365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0365 
SymboldhaK 
ID4904338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp347636 
End bp348640 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content69% 
IMG OID640143472 
Productdihydroxyacetone kinase, subunit I 
Protein accessionYP_001074408 
Protein GI126458269 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2376] Dihydroxyacetone kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTG TCATCAATCA TCCCGACTAC GTCGTCGAGG ACATGCTGCG CGGGATCGTC 
GCCGCGCATC CGGCGCTCGC GCTCGACGCG GACAACCCGC GCGTGATCGG CGTCGCGCAT
CCGGTGCCGG GCAAGGTCGG CGTCGTCACG GGCGGCGGCT CGGGCCACGA GCCGGCGTTT
GTCGGCTACA CGGGGCCGGG GCTCGTCGAC GCGGTCGCGA TCGGCGAGAT CTTCTCGTCG
CCCACCGCGA AGAGCTTTCT CGACGCGTTC AGGCGCGCCG ATCGCGGCGC GGGCGTCGCC
TGCCTGTACG GCAACTACGC GGGCGACAAC ATGAACGTGA AGATGGCGAT CAAGATGGCC
GCCGCGCAGG GCATCGACGT GAAGACCGTC GTCGCGAACG ACGACGTCGC GTCCGCGCCG
CGCGAGGAGC GCGCGAAGCG CCGCGGCGTC GCGGGCGAGA TCCTGATGTG GAAGGCGGGC
GGCGCGCGGG CGGCCGCGGG CGGCGATCTC GACGCCGTGA TCGCGAGCGC GAAGAAGGCG
ATCGACAACA CGCGCTCGGT CGGCATCGGC CTGTCCGCGT GCACGATCCC GGCGAACGGC
AAGGCGAACT TCCACATCGC CGACGGCGAG ATGGAGGTCG GCATCGGCCA TCACGGCGAG
CATGGCGTGC GCGTGATGCG CACGGTGAGC GCGAAGGACA TGGCGGCGAT GATGCTCGAC
ATCGTGCTGC CGGATTTCCC GCTCGAACGC GGCGAGGAAG TCGCGGTGCT GGTGTCGGGG
CTCGGCGCGA CGCCGCTGAT GGAGCAGTAC ATTCTGTATG CCGAAGTATC GCAGCGGCTC
GCGGCCGCTG GGCTGAAGAT CGGCTTTCGC CTCGTCGGCA ATCTGTTCAC GTCGCTCGAG
ATGATGGGCG TCACGCTGAC CGTCACCCGG CTCGACGACG AGTTGAAGCA ACTGTTCGCC
GCGCCGTGCA GCAGCATCGG CCTCACCGTG GGAGAACGCG CATGA
 
Protein sequence
MNRVINHPDY VVEDMLRGIV AAHPALALDA DNPRVIGVAH PVPGKVGVVT GGGSGHEPAF 
VGYTGPGLVD AVAIGEIFSS PTAKSFLDAF RRADRGAGVA CLYGNYAGDN MNVKMAIKMA
AAQGIDVKTV VANDDVASAP REERAKRRGV AGEILMWKAG GARAAAGGDL DAVIASAKKA
IDNTRSVGIG LSACTIPANG KANFHIADGE MEVGIGHHGE HGVRVMRTVS AKDMAAMMLD
IVLPDFPLER GEEVAVLVSG LGATPLMEQY ILYAEVSQRL AAAGLKIGFR LVGNLFTSLE
MMGVTLTVTR LDDELKQLFA APCSSIGLTV GERA