Gene Dtox_2298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2298 
Symbol 
ID8429282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2465705 
End bp2466904 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content47% 
IMG OID645034605 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_003191734 
Protein GI258515512 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0690274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000304265 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATTTA GCACTATCGA AGAAGCTATT GAAGACATCG CGCAAGGGAA GATCTTGGTT 
GTGGTAGATG ATGAGGATCG GGAAAATGAG GGTGACCTGA TCATGGCTGC AGATAAAGTA
ACCCCTGAAG CGGTTAACTT TATGGCAACA TACGGACGAG GCTTAATCTG CATGCCAATA
CTGGGTGAAA GACTGGATGA ACTGGATCTC CCTGCTATGG TCAACAATAA CACCGACCCT
AACGGAACAG CCTTTACTGT TTCTATCGAT CATAAAGACA CCACTACAGG AATCTCGGCG
TTCGAGCGCG CTGCTACCAT TAAAGCTGTG CTTGATCCGG CAACCAAGTC GGAGGATTTG
CGGCGTCCGG GGCATATATT TCCCCTGCGA GCCCAAGAAG GCGGGGTACT GCGGCGTGCC
GGGCATACAG AGGCGTCAGT AGACCTGGCT AAAATGGCCG GTCTTCATCC TTCAGCCGTA
ATATGTGAAA TCATGAAAGA AGACGGAACT ATGGCCCGAA TACCGGAACT TATGGATTTC
GTTAAAGAAC ACGGGTTGAA AATAATTACT ATTGCCGATT TGATTGAATA CCGCCGTAGA
ACAGAAAGAT TAATCCGTCG GGTTGAAGTT GTGAAACTGC CGACTAGATT CGGGGAGTTT
ACAGCCGTTG CTTATGAAAG CCTGTTGGAC GGTAAGGGGC ATATAGCTCT GGTTAAAGGT
GAACCGGATA AAAGCCAGGC GCCTCTGGTG AGAGTACACT CGGAATGCCT GACCGGTGAT
GTTTTTGGTT CCTCTCGCTG CGATTGCGGT GATCAATTAG CGCAGGCAAT GCGCATGATT
GAAAAAGAGG GAACCGGTGT ATTGCTATAT ATGCGGCAGG AAGGAAGAGG CATAGGTTTG
CTCAATAAAA TTCGCGCCTA TAAGCTGCAG GATGAAGGCA AAGATACAGT TGAAGCTAAT
GAAGCGCTTG GTTTTCCGGC CGATTTGAGA GATTACGGCC TGGGTGCTCA GATTCTTGCC
GATCTGGGAT TGAGCAAAAT TCGTTTAATT ACCAATAATC CTCGTAAAAT AGCCGGCCTG
GAAGGCCACG GCCTGGAAGT TATAAAGAGA GTTCCTATAG AGATTTGTCC GGGAGAGTAT
AATAATTATT ACCTTTCCAC CAAAAAAGCA AAACTAGGAC ATATGTTAAA TATTAACTGA
 
Protein sequence
MKFSTIEEAI EDIAQGKILV VVDDEDRENE GDLIMAADKV TPEAVNFMAT YGRGLICMPI 
LGERLDELDL PAMVNNNTDP NGTAFTVSID HKDTTTGISA FERAATIKAV LDPATKSEDL
RRPGHIFPLR AQEGGVLRRA GHTEASVDLA KMAGLHPSAV ICEIMKEDGT MARIPELMDF
VKEHGLKIIT IADLIEYRRR TERLIRRVEV VKLPTRFGEF TAVAYESLLD GKGHIALVKG
EPDKSQAPLV RVHSECLTGD VFGSSRCDCG DQLAQAMRMI EKEGTGVLLY MRQEGRGIGL
LNKIRAYKLQ DEGKDTVEAN EALGFPADLR DYGLGAQILA DLGLSKIRLI TNNPRKIAGL
EGHGLEVIKR VPIEICPGEY NNYYLSTKKA KLGHMLNIN