Gene Dtox_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1008 
Symbol 
ID8427947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1027708 
End bp1028760 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content45% 
IMG OID645033343 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003190517 
Protein GI258514295 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000707148 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTATTG AATTTAACCG CTTTGCACCG GAGAAAGATA TAGATTTAGT ATTGGGCAGG 
CTTAATGATG AAGGTGTATT GGCGTTTAGG ACCAGAAGCA ATATGAATCC GGTGATAGTT
TCCACATTAA AGGTTGATGG TTTGAAAAAA CTTAATATTG AAGGGGTTGC TTCTGTTCAG
CGGGTAGTTG AAGTTTCTAC GCCGTTTAAA CTGGCCAGCA GGGCTTTTAA GAAAACCGAT
ACAGTAGTTA AGGTAGATAA TTTGGAAATA GGTGGAGACA AAATTCATGT TATGGCAGGG
CCGTGTGCGG TTGAAAGCCG GGAGCAGCTT TTGAAAACAG CTCATACAGT TAAAGCTTGC
GGTGCTACTT TTTTGCGTGG TGGAGCTTTT AAGCCTCGCA CCTCTCCCTA CTCATTTCAG
GGTCTCAATG AAGACGGCCT AAAATTTTTA GCTGAGGCCA GGACATTAAC CGGACTGAAA
ATTGTAACAG AGGTTATGGA TACAAGATCT GTGCAGCTAG TGGCTAAATA CGCCGATGTG
CTGCAGATTG GCGCCAGGAA TATGCAAAAC TTTGACTTGT TAAAGGAAGT TGGACGGGTT
AGCAAACCGG TACTTTTAAA GAGGGGAGCC AATGCTACTA TTGAGGAGTT TTTAGCCGCG
GCAGAATATG TTTTAGCCGG AGGTAATGAA GATGTTATCC TTTGCGAGCG TGGTGTTAGA
GGTGTAAACG AGTTTACGCG TTATACTCTG GATATTGCGG CTGTTCCTGT ACTTAAAAAA
CTTACCCACC TGCCGGTAGT AGTGGATCCC AGTCATGCAA CGGGTCACTG GAATTATGTT
GAGCCAATGG CTATGGCTTC TCTCGCAGCA GGAGCAGATG GCCTGATCAT AGAAGTTCAT
CCCGAACCGG AAAAGGCTCT TTGTGACGGG GCACAGTCAC TGACACCGAA AAATTTTACT
GTATTGATGC TTAGATTGGC CGGCCTGCAG AATTCTGTTT GCCGAGATTT GGCCGTCCCT
GATATGCATT TGCAGCAGCT TGACATCAGT TAG
 
Protein sequence
MLIEFNRFAP EKDIDLVLGR LNDEGVLAFR TRSNMNPVIV STLKVDGLKK LNIEGVASVQ 
RVVEVSTPFK LASRAFKKTD TVVKVDNLEI GGDKIHVMAG PCAVESREQL LKTAHTVKAC
GATFLRGGAF KPRTSPYSFQ GLNEDGLKFL AEARTLTGLK IVTEVMDTRS VQLVAKYADV
LQIGARNMQN FDLLKEVGRV SKPVLLKRGA NATIEEFLAA AEYVLAGGNE DVILCERGVR
GVNEFTRYTL DIAAVPVLKK LTHLPVVVDP SHATGHWNYV EPMAMASLAA GADGLIIEVH
PEPEKALCDG AQSLTPKNFT VLMLRLAGLQ NSVCRDLAVP DMHLQQLDIS