Gene Cagg_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0074 
Symbol 
ID7269071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp102542 
End bp103540 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content58% 
IMG OID643564947 
Productdihydroxyacetone kinase subunit DhaK 
Protein accessionYP_002461463 
Protein GI219847030 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2376] Dihydroxyacetone kinase 
TIGRFAM ID[TIGR02363] dihydroxyacetone kinase, DhaK subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAAAT TGATCAACGC ACCGGAGAAT GTGGTCAAAG AGGCATTGGC CGGAATGGCT 
ATTGCCCACG CCGATCTGAT CGATGTCCAC TTTGACCCCG ACTATATCGT GCGCAAGGGT
GCCCCCCACA ACAAAGTCGG TGTGATCTCC GGCGGTGGTT CCGGCCACGA GCCAATGCAC
GGTGGCTTTG TCGGCTACGG GATGCTCGAT GCTGCCTGCC CCGGCGCCGT CTTTACATCG
CCGGTGCCGG ATCAGATGCT GGCTGCTACT AAGGCGGTCA ACGGCGGCAA AGGGGTACTG
CACATTGTCA AAAACTATAC CGGCGACGTG ATGAACTTCG AGATGGCTGC CGAATTGGCC
GCTGCTGAAG GTATCGAGGT TGCCAGTGTG GTGACAAACG ACGATGTGGC AGTGGAAAAT
AGCACGTGGA CGGCCGGCCG TCGTGGTGTT GGTGTGACGG TGCTGCTTGA GAAAATCGTG
GGTGCGGCTG CTGAGGCTGG CGCCGATCTG GCTACCTGCA AGGCGATTGC CGAGCGGGTC
AATGCCAACG GGCGCAGCAT GGGCATGGCG TTGACCCCCT GCACGGTGCC ACAGGCAGGT
AAACCGGGCT TTGAGCTGGC TGAAGATGAG ATGGAGGTCG GTATCGGTAT CCACGGTGAA
CCGGGCCGCC GACGTGAGAA GTTGGCTCCG GCCCGCGATA TTGCTGAGAT GCTGGCCGGC
CCTATCCTCG AAGACCTGCC GTTCAAGAGT GGTGATGCCG TGCTGGCATT CGTTAACGGC
ATGGGTGGCA CGCCGCTGAT CGAGCTGTAT GTGATGTACA ATGAATTAGC ACGCATCCTA
AAGGATCGTA ACATCACCAT TGCCCGCTCA CTGGTAGGAA GCTATATCAC CTCGCTTGAG
ATGGCCGGTG TCAGCTTTAC ACTCCTCCGC CTCGATGACG AGATGATCAA GCTCTGGGAT
GCGCCGGTGC ATACGCCCGC ATTACGTTGG GGTATGTAA
 
Protein sequence
MKKLINAPEN VVKEALAGMA IAHADLIDVH FDPDYIVRKG APHNKVGVIS GGGSGHEPMH 
GGFVGYGMLD AACPGAVFTS PVPDQMLAAT KAVNGGKGVL HIVKNYTGDV MNFEMAAELA
AAEGIEVASV VTNDDVAVEN STWTAGRRGV GVTVLLEKIV GAAAEAGADL ATCKAIAERV
NANGRSMGMA LTPCTVPQAG KPGFELAEDE MEVGIGIHGE PGRRREKLAP ARDIAEMLAG
PILEDLPFKS GDAVLAFVNG MGGTPLIELY VMYNELARIL KDRNITIARS LVGSYITSLE
MAGVSFTLLR LDDEMIKLWD APVHTPALRW GM