Gene Namu_4393 details


Gene Information

Locus tag: Namu_4393
Symbol:
ID: 8450019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4875114
End bp: 4876109
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 66%
IMG OID: 645043440
Product: dihydroxyacetone kinase, DhaK subunit
Protein accession: YP_003203669
Protein GI: 258654513
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2376] Dihydroxyacetone kinase
TIGRFAM ID: [TIGR02363] dihydroxyacetone kinase, DhaK subunit
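
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4875114, 4876109)
print(length)                     # 996
print(protein_length_aa(length))  # 331
```

Both results agree with the Gene Length and Protein Length fields of the record.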


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.838317
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGT TCGTCAACGA CCCGAAGTCA TTCGTTCCGG AGATGTTGGA GGGACTGGCC 
CTGGCCAACC CCAACACGCT GCGGTACGAG CCCGAGTGGA ACCTGATCAT GCGGGCCGAC
GCGCCGCGCG CGGACAAGGT CTCCATCGTC CAGGGGTCCG GATCCGGTCA CGAACCCGCA
CACGTCATGG TGGTGGGCAA GGGCATGCTG GACGCCGCCT GTCCGGGGGA TGTCTTCGCC
GCCCCGCCGA TGGACTACGT GTACGAGACG GCCAGACGAC TCGCCTCGCC CAAGGGCGTG
CTGCTGCTGG TCAACAACTA CACCGGCGAC CGGATGGCCT TCGAGATGGG TAAGGAGATG
GCCGAGTCCG ACGGGGTCAA GGTCGAGATC CTGATGATCA ACGACGACGT CGCGGTCAAG
GACTCGCTGT ACACCATCGG CCGGCGCGGG GTGGCCGGGA ACTTCTTCGT CATCAAGGCG
GTCGGTGCGG CCAGCGAACG CGGCGACAGC CTGGAGGAGG TCATCCGGAT CGGCAAGAAG
GTCAACGACG TCACCCGGAC CATGGGCGTG GCGTTGACCG CCTGCACCCC GCCGGCCAAG
GGCGAGCCGC TGTTCGAGAT GGCCGAGGAC GAGATGGAGG TCGGCGTCGG CATCCACGGC
GAACCCGGCC GGGAACGGGT CAAGATCAAG ACGGCCGACG AGATCGTCGA CCTGCTGCTG
GACGCCACCG TCAACGACCT GCCCTACCGG TCCGGCGACC GGGTGGCGCT GATGATCAAC
GGGCTCGGCG GTACGCCGAT CAGCGAGCTG TACATCCTGT TCCGGCGGGC CCATCAGCAA
CTCGCGGCCA AGGGCATCAC GGTCGCCCGC AGCTACGTCA ACGAGTACTG CACCTCCCTG
GACATGGCCG GGGCGTCACT GACCCTGGTC CGGCTCGACG ACGAGATCGA GGAGTTGCTG
GAGGCGCCGG CGGAGATCCC CAACCGGGTC TTCTGA
 
Protein sequence
MKKFVNDPKS FVPEMLEGLA LANPNTLRYE PEWNLIMRAD APRADKVSIV QGSGSGHEPA 
HVMVVGKGML DAACPGDVFA APPMDYVYET ARRLASPKGV LLLVNNYTGD RMAFEMGKEM
AESDGVKVEI LMINDDVAVK DSLYTIGRRG VAGNFFVIKA VGAASERGDS LEEVIRIGKK
VNDVTRTMGV ALTACTPPAK GEPLFEMAED EMEVGVGIHG EPGRERVKIK TADEIVDLLL
DATVNDLPYR SGDRVALMIN GLGGTPISEL YILFRRAHQQ LAAKGITVAR SYVNEYCTSL
DMAGASLTLV RLDDEIEELL EAPAEIPNRV F
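
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The sketch below is one possible way to do so in plain Python, not the method used to generate this record. It assumes the sequences are pasted as shown, in space-separated blocks of ten, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard table. The helper names are hypothetical.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(text)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that simplification does not matter.
    """
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAGAAGT TCGTCAACGA CCCGAAGTCA"))  # MKKFVNDPKS
```

Applied to the full gene sequence, `translate` and `gc_content` give values that can be compared with the protein sequence and the GC content field listed in this record.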