Gene Arth_3953 details


Gene Information

Locus tag: Arth_3953
Symbol:
ID: 4447771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4466305
End bp: 4468068
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 70%
IMG OID: 639691784
Product: dihydroxyacetone kinase
Protein accession: YP_833428
Protein GI: 116672495
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2376] Dihydroxyacetone kinase
TIGRFAM ID:
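
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4466305
end_bp = 4468068
gene_length_bp = 1764
protein_length_aa = 587


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for the values listed in this record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```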


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.389024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGAA TTTTCAACGA TCCGTCCGAT TTCGCCGAGG AAGCCCTCGC CGGTTTCTGC 
GACGTCCACG CAGGCCTGGT TCGCCAGGTT CCCGGCGGGG CCGTGCGCCG CCAACGCCCT
GCCAAACCCA AGGTGGCCGT CCTTGCCGGC GGCGGCTCCG GCCACTACCC GGCCTTCGCC
GGCCTGATCG GTACCGGACT GGCCGACGGC GCCGTGGTGG GCAACATCTT CACCTCGCCG
TCCGCGCAGC AGGTCTACGC CGTGGCGAAG GCGGCGGACT CCGGCGCGGG TGTGGTGTTC
ACCTACGGCA ACTACGCCGG GGACGTCCTG AACTTCGGCA TGGCCAGCGA ACGGCTCGCC
GCAGAGGGAA TCCAGGTGGA AAACGTCTTG GTGACGGACG ACGTCGCCAG TGCCCCGCCG
TCGGAAGCGG AGAAACGCCG CGGCATCGCG GGCGACTTCA CCGTCTTCAA AATCATGGGC
GCCGCCGCGG AGGCGGGAGC CGGCCTGGCC GACGTCGTGC GCCTGGGCCG GAAGGCCAAC
AGCCTGACCC GAACCATCGG CAGTGCCTTC AACGGCTGCA CGTTCCCTGG CGCCGAGGCT
CCGCTGTTCA CCCTCAAGGA CCGTCAGATG GGTCTCGGCC TGGGCATCCA CGGCGAACCC
GGCCTGTTCG ACACCGAACT GCCGCCCGCC AAGGAGCTGG GCCAGGAATT CGTCTCCCGG
CTGTTGGCGG AGACACCGCA CGGTGCGGGC GACCGCATCG CCGTCATCCT CAACGGCCTG
GGCTCCACCA AGCACGAGGA ACTCTTTGTC CTCTGGCTCA CCGTTGCCCC GCTGCTCCGT
GCCGCCGGCT ACACGCTGGT CATGCCGGAG GTGGGAGAAC TGGTCACCAG CCTGGACATG
TCCGGTGTCT CCCTCACGAT CACCTGGCTG GACGAGGAAC TTGAGCCGCT GTGGACTGCG
CCGGCTGAAA CGCCGGCCTA CCGGCGGGGC AACGCCGCCC TGCAGTCCGG CGACCTGATG
GCGGAACATG CGTCAGACGG CACCGCCGCG CCCGCTGCCT TCGAAGCCAC CGAAGAGTCG
CGCGGCTACG CGGCCAGCTG CCTGGCGGCG CTCGCTGCCG CCCGGGACTC ACTGCACGCG
GCTGAAGCGA GGCTGGGGGA CATGGACGCT GTCGCGGGCG ATGGAGACCA CGGGCGCGGG
ATGGTCCGCG GTATCGACGC GGCCGTGGCT GCAGCCGGGA ACGCCTTGCC CCGCGGTGCC
GGAGCAGGCG CGGTTCTTTC GGCCGCCGGG GACGCCTGGG CTGACAAGGC AGGCGGAACC
TCCGGCGTGC TGTGGGGTGC AGGCCTGCGG TCCTTCGGCG AAGCCCTCGG CAACCAGCTG
GCTCCGGGAC CTTCGGAACT GGCCGCGGCT GTCGCGGCAT TCTCGGCACG GATCACCGGC
CTCGGCAAGG CGGACATCGG GGACAAGACC ATGGTGGACG CCCTGCTGCC CTTCACAGAG
ACGTTCAGCC GGCTGGTGGC CGACGGCGGC AGCCACGCGG CGGCAGCCGA AGCGGCATGG
GCCGAAGCCG CTGCGGTGTC CACGGCGGCA GCGGAAGCCA CCGCCTCCCT GCGGCCGCTC
AAAGGCCGCG CCCGGCCCCT CGCGGAAAAG AGCGTTGGCA CGGCGGACCC CGGTGCCACA
TCACTGGCCA TGATCTTCAC CGTGATGGGG GCGCACCTTG CAGCGCCGGT TCCACCCCGC
CAGCCCGCCG GAACACCGTC ATGA
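
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. Only the first line of the sequence is used as a sample, so the sample value is not expected to match the whole-gene figure; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in groups of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence in this record, used only as a short sample.
first_line = "ATGACCAGAA TTTTCAACGA TCCGTCCGAT TTCGCCGAGG AAGCCCTCGC CGGTTTCTGC"

sample = clean_sequence(first_line)
sample_gc = gc_fraction(sample)
print(len(sample), round(sample_gc, 3))
```

To compare against the GC content field, pass the full 1764 bp sequence to `clean_sequence` instead of the single sample line.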
 
Protein sequence
MTRIFNDPSD FAEEALAGFC DVHAGLVRQV PGGAVRRQRP AKPKVAVLAG GGSGHYPAFA 
GLIGTGLADG AVVGNIFTSP SAQQVYAVAK AADSGAGVVF TYGNYAGDVL NFGMASERLA
AEGIQVENVL VTDDVASAPP SEAEKRRGIA GDFTVFKIMG AAAEAGAGLA DVVRLGRKAN
SLTRTIGSAF NGCTFPGAEA PLFTLKDRQM GLGLGIHGEP GLFDTELPPA KELGQEFVSR
LLAETPHGAG DRIAVILNGL GSTKHEELFV LWLTVAPLLR AAGYTLVMPE VGELVTSLDM
SGVSLTITWL DEELEPLWTA PAETPAYRRG NAALQSGDLM AEHASDGTAA PAAFEATEES
RGYAASCLAA LAAARDSLHA AEARLGDMDA VAGDGDHGRG MVRGIDAAVA AAGNALPRGA
GAGAVLSAAG DAWADKAGGT SGVLWGAGLR SFGEALGNQL APGPSELAAA VAAFSARITG
LGKADIGDKT MVDALLPFTE TFSRLVADGG SHAAAAEAAW AEAAAVSTAA AEATASLRPL
KGRARPLAEK SVGTADPGAT SLAMIFTVMG AHLAAPVPPR QPAGTPS
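
The protein sequence can be checked against the gene sequence by translating codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), so a standard codon table is used in the sketch below. It translates only the first and last lines of the gene sequence as a spot check; the function and variable names are illustrative.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record. The last line is
# in frame because the 1740 bases before it are a multiple of 3.
first_60 = "ATGACCAGAA TTTTCAACGA TCCGTCCGAT TTCGCCGAGG AAGCCCTCGC CGGTTTCTGC"
last_24 = "CAGCCCGCCG GAACACCGTC ATGA"

n_terminus = translate(first_60)
c_terminus = translate(last_24)
print(n_terminus)
print(c_terminus)
```

The two results correspond to the first 20 and last 7 residues of the protein sequence listed above.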