Gene Arth_0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0172 
Symbol 
ID4447365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp177435 
End bp178436 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content65% 
IMG OID639687967 
Productdihydroxyacetone kinase subunit DhaK 
Protein accessionYP_829673 
Protein GI116668740 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2376] Dihydroxyacetone kinase 
TIGRFAM ID[TIGR02363] dihydroxyacetone kinase, DhaK subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGC TGATCAACGA TCCCCGCGCT GTAGTAGACG AGTCCGTGGA AGGCTTCGGC 
CTTGCGCATG CGGACATTGT GACCGTCAGC GCCGAGCCGA AGTTCATTAC CCGCAAGGAC
GCCCCCGTGG CCGGGAAAGT GGGCCTTGTC AGCGGCGGCG GCAGCGGCCA TGAACCGCTT
CACGGCGGCT TCGTCGGGCT GGGAATGCTC GACGCCGCCG TGCCGGGGGC CGTCTTCACC
TCGCCCACCC CTGATCAGAT CATTCCTGCG ACCCTCGCCG TAAACTCGGG TGCCGGCGTC
GTCCACATCG TCAAGAACTA CACCGGCGAC GTCCTGAATT TCGAAACGGC CGCCGAAATG
GCGGAAGCCG AAGGCGTGCA GGTCCGCACC GTACTGGTCA ACGACGACGT CGCCGTGGAG
GACTCGCTGT ACACGGCGGG CCGGCGCGGC GTAGGCGGAA CTGTCCTGGT GGAGAAGATC
GCCGGTGCGG CAGCGGAACG CGGGGATGAC CTGGATGCCG TCGCCGCCAT TGGGGACCGG
GTCAACCAAA ACGTCCGCAG CATGGGCGTC GCGCTATCCG CCTGCACGGT CCCGCACGCA
GGGGTGCCCA GCTTTGACCT GGAAGAGAAC GAAATCGAAA TCGGCATCGG GATCCACGGC
GAGCCCGGAC GGCACCGGAT CCCCATGGAA AATGCCGACG GCATCACCGA CCGCCTCCTG
GAGCCCATCC TGTCCGACCT GGGCATTGCC TCCGGCGAGA AAGTGCTCCT GTTCGTGAAC
GGCATGGGCG GGACGCCGCA AAGCGAGCTC TACATCGTGT ACCGCCGTGC AGCGCAGGTT
CTCGCGGAGA AAGGCGTCAC GGTGGAGCGC TCCCTGGTGG GCAACTACAT CACCTCACTG
GAGATGCAGG GCTGCTCCAT CACTGTTCTT CGGCTCGACG ACGAACTGAC CAGCCTCTGG
GACGCCCCGG TCCACACTGC CGCGCTGCGC TGGGGCATCT GA
 
Protein sequence
MKKLINDPRA VVDESVEGFG LAHADIVTVS AEPKFITRKD APVAGKVGLV SGGGSGHEPL 
HGGFVGLGML DAAVPGAVFT SPTPDQIIPA TLAVNSGAGV VHIVKNYTGD VLNFETAAEM
AEAEGVQVRT VLVNDDVAVE DSLYTAGRRG VGGTVLVEKI AGAAAERGDD LDAVAAIGDR
VNQNVRSMGV ALSACTVPHA GVPSFDLEEN EIEIGIGIHG EPGRHRIPME NADGITDRLL
EPILSDLGIA SGEKVLLFVN GMGGTPQSEL YIVYRRAAQV LAEKGVTVER SLVGNYITSL
EMQGCSITVL RLDDELTSLW DAPVHTAALR WGI