Gene Dtox_0589 details


Gene Information

Locus tag: Dtox_0589
Symbol:
ID: 8427524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 610036
End bp: 611700
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 49%
IMG OID: 645032954
Product: dihydroxy-acid dehydratase
Protein accession: YP_003190132
Protein GI: 258513910
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
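
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length(610036, 611700)
print(length, "bp")                  # 1665 bp
print(protein_length(length), "aa")  # 554 aa
```

Both values agree with the Gene Length and Protein Length fields listed for Dtox_0589.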


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000577484
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTAGTG ATGCTATGAA ATTAGGTCTG GAAAAAGCTC CTCACAGATC CCTGTTTAAA 
GCATTGGGTT ATACTGACCA GGAATTGGCC AGACCTTTAA TAGGTGTAGT GAATGCACAA
AATGAAATAG TTCCCGGTCA TCTCCATCTG GACGATATTG CTGAGGCAGT TAAGGCCGGA
ATCAGGATGG CCGGCGGTAC TCCGATAGAA TTTCCCGCTA TTGCAGTTTG TGACGGTATC
GCCATGAATC ACACAGGCAT GAAGTACTCA TTGGCCAGCC GTGAGTTGAT TGCTGATTCC
ATTGAAGTTA TGTCTATCGC GCATCCTTTT GACGGCTTAG TTTTAATTCC CAGCTGTGAC
AAGATAGTTC CGGGCATGTT GATGGCGGCG GCCCGCTTGA ATATTCCGGC TATAGTTGTC
AGTGGCGGTC CGATGCTGGC AGGTAAAATT AAGGGCCAGC ATAAATCCCT GACAAATGTT
TTTGAGGCTG TGGGTTCTGT AAGAGCCGGC AAGATGTCGG AAGAAGAGTT GGCCGATTTG
GAAGAGGCCG CTTGTCCCGG TTGCGGTTCC TGTTCGGGTA TGTTTACGGC TAATTCAATG
AATTGCTTAA CTGAAGTGCT GGGTATGGCC CTGCCGGGCA ACGGAACCAT TCCCGCTGTT
TCGGCAGCAC GCAGGCGTTT GGCTAAACAG ACAGGCATGC AGATAATGTA TCTGGTGAAA
GAAAACATTT GTCCCTCGGA TATTCTAACC ATGGATGCTT TCAATAACGG CTTGACCGTG
GATATGGCGC TTGGTTGTTC AACCAATACG ATTTTACACC TGCCTGCCAT TGCCAGTGAG
GCGGGAGTGA TCATTGATCT GGAGCTGGTT AATAAAACCA GCGAGCGCAC ACCGAATCTG
TGCAAGTTAA GCCCGGCCGG GCCGCATTTT ATTGAAGAAC TGGATGAGGC AGGCGGCATA
CCGGCTGTAA TGGCAGAACT CTCAAAGCAC GATTTGTTGA ATTTGAACAG CAGAACAGTA
TCCGGAGTTA CTGTAGGGGA AAACATCAAC GGCAGTAGAG TATTGCGCCG GGATATTATC
CGCAATATTG AAGATCCTTA TAGTCCCAGC GGCGGTATCA CTGTTATGAG AGGCAATCTC
GCTCCGGACG GCGCTGTGGT GAAGAAATCC GCAGTAGCAC CTGAGATGCT GGTGCACCGG
GGTCCGGCCC GCGTGTTCAA CTCGGAAGAG GAATCAATGG ATGCCATTAT GAACCAGACT
ATACAAAAAG GTGATGTAGT GGTTATCCGT TATGAAGGCC CCAGGGGCGG GCCCGGTATG
AGAGAGATGC TTACCCCGAC GGCTACCTTG GCCGGTCTGG GTTTGGATAA AGAAGTAGCT
TTGTTAACTG ACGGCCGCTT CTCCGGAGCA ACCAGGGGAG CCGCTATTGG CCACGTTTCG
CCGGAAGCAG CTCTGGGCGG TGTTATAGCT GTAATTCAAG ATGGAGATAT GATAGATATT
GATATTCCCA ACTGCCGCCT AAACGTAGAT TTGACAGAAG CTGAAATAGA CGAAAGAATG
AAAAAGCTGG TAATACCGGA GCCAAAGATT ACAAGAGGAT ATCTGGCCCG TTACGCAAAA
ATGGTTACTT CTGCAAGTAC AGGAGCAGTA TTGGCAAAAG ATTGA
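
The GC content field reported for this gene (49%) can be recomputed from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted as plain text in 10-base groups as shown; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between 10-base groups."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First group of the gene sequence, used here only as a small example.
print(gc_percent("TTGCGTAGTG"))  # 50.0
```

Passing the full gene sequence as one multi-line string gives the overall value to compare with the reported figure.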
 
Protein sequence
MRSDAMKLGL EKAPHRSLFK ALGYTDQELA RPLIGVVNAQ NEIVPGHLHL DDIAEAVKAG 
IRMAGGTPIE FPAIAVCDGI AMNHTGMKYS LASRELIADS IEVMSIAHPF DGLVLIPSCD
KIVPGMLMAA ARLNIPAIVV SGGPMLAGKI KGQHKSLTNV FEAVGSVRAG KMSEEELADL
EEAACPGCGS CSGMFTANSM NCLTEVLGMA LPGNGTIPAV SAARRRLAKQ TGMQIMYLVK
ENICPSDILT MDAFNNGLTV DMALGCSTNT ILHLPAIASE AGVIIDLELV NKTSERTPNL
CKLSPAGPHF IEELDEAGGI PAVMAELSKH DLLNLNSRTV SGVTVGENIN GSRVLRRDII
RNIEDPYSPS GGITVMRGNL APDGAVVKKS AVAPEMLVHR GPARVFNSEE ESMDAIMNQT
IQKGDVVVIR YEGPRGGPGM REMLTPTATL AGLGLDKEVA LLTDGRFSGA TRGAAIGHVS
PEAALGGVIA VIQDGDMIDI DIPNCRLNVD LTEAEIDERM KKLVIPEPKI TRGYLARYAK
MVTSASTGAV LAKD
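
The protein sequence follows from the gene sequence under translation table 11, where the initiator codon TTG is read as methionine. The sketch below illustrates this, assuming the standard codon assignments (which table 11 shares) and a simplified set of start codons (ATG, GTG, TTG); `translate_cds` and `ASSUMED_STARTS` are hypothetical names, and the start set is an assumption rather than the complete table 11 definition.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a simplified set of initiator codons, each read as Met at position 0.
ASSUMED_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate an unspliced CDS, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if pos == 0 and codon in ASSUMED_STARTS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGCGTAGTG ATGCTATGAA ATTAGGTCTG"))  # MRSDAMKLGL
```

The output matches the first ten residues of the protein sequence listed above.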