Gene Dtox_0904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0904 
Symbol 
ID8427843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp914912 
End bp915931 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content48% 
IMG OID645033247 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003190421 
Protein GI258514199 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000034525 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000051427 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTATAG TTATGTCTAT TGAATCCACA GACGCACAGA TAGAGGCAGT TATAGAGAAA 
TTAGATGGTC TGGGTTTTAA GACACAGGTC ATTCGCGGGG TAAAGAGAAT AGTTATAGGT
GCGGTAGGTG ACCGGCAGGC GCTCGACAGC GTTAGTCTGA AACAAATGCC AGGTGTGGAG
GATATTGTTA AAATTATGAA GCCTTTCAAA ATGGTCAGCA GAGAAGCCAG GGAAGAAAAC
ACGGTTATTA ATATACGCGG TATCAGCATA GGTGGAGAGG GCGTTGTTGT TATGGCCGGT
CCCTGTGCGG TGGAAAGCAG GGAACAGCTT TTGACCGCCG CCCGGCAGGT TAAAGCTGCG
GGTGGTCATG TCCTGCGTGG CGGTGCTTTC AAACCACGAA CATCTCCTTA CAGTTTCCAG
GGTATGGAGG AGGAGGGTTT AAAGCTTTTA AAAGAAGCTT CGGAGGAAAC AGGTTTGCCA
ACGGTGACGG AAGTTATTGA TGAACACAGT TTGCAGCTGG CGCATGATTA TGTAGACATA
ATACAAATCG GTGCCAGAAA TATGCAAAAT TTCCGTTTGC TCAGGGCGGC CGGACAGACA
GATAAAATTA TTTTACTGAA AAGAGGATTG TCCGCTACCA TAGAGGAATG GTTAATGTCA
GCCGAGTATA TTATGTCAGA GGGCAATGGC AAGATTATTC TCTGCGAGAG AGGTATCCGT
ACTTTTGAAA CCTATACCCG CAATACTCTC GATCTCAGTG CCGTTCCGCT GGTTAAGAGG
CTCAGTCACC TGCCGGTAAT TGTTGATCCC AGCCATGCCA CAGGTGACAG GCAGCTGGTT
GTGCCTATGT CTCTGGCGGC TGCGGCGGGG GGTGCGGACG GTCTGATAGT GGAAATGCAT
CCGGAACCGA GTAAAGCTCT TTGTGACGGT GCACAGTCTC TGCACCCGGG GGAACTGGTT
GGCTTGATAG CTAAATTAAC AAAAATGATG CCTGCGATAG ATCGCACTAT GTCAGTTTAA
 
Protein sequence
MIIVMSIEST DAQIEAVIEK LDGLGFKTQV IRGVKRIVIG AVGDRQALDS VSLKQMPGVE 
DIVKIMKPFK MVSREAREEN TVINIRGISI GGEGVVVMAG PCAVESREQL LTAARQVKAA
GGHVLRGGAF KPRTSPYSFQ GMEEEGLKLL KEASEETGLP TVTEVIDEHS LQLAHDYVDI
IQIGARNMQN FRLLRAAGQT DKIILLKRGL SATIEEWLMS AEYIMSEGNG KIILCERGIR
TFETYTRNTL DLSAVPLVKR LSHLPVIVDP SHATGDRQLV VPMSLAAAAG GADGLIVEMH
PEPSKALCDG AQSLHPGELV GLIAKLTKMM PAIDRTMSV