Gene Dtox_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1914 
Symbol 
ID8428893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2026780 
End bp2028441 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content43% 
IMG OID645034244 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_003191378 
Protein GI258515156 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.33952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.193822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACGA AGCATGATTT TGTGCAGCAT AACTGTGATC ATTACGACGG TAACAGGTAT 
GACCATTTGG ATAAACGAAA ATATGGTGAA GAAGCTGATA TATGTATTGT CGGTGCAGGT
GCTGCCGGAG GAGTGTTGGC TTACGAACTA AGTCAAGCCG GCTTTAAAGT GGTTGTCATT
GAGGCAGGAC CTTTTTGGAA CCCGCAAACT GATTTTGCCA GTGATGAGTT GTCCATGCAC
AGCTTGGCCT GGAATGATAC TAGATTAGTT GCAGGCAATA ACCCACTGGC AATGGGACAT
AACAATTCGG GACGCGGGGT AGGCGGAGGT ACAGTTCATT TTACCGGTGT ATTTTACCGT
TTTCACGAGA GTGATTTTCA ACTAAGAACT ATGGATGGTG TAGCCGACGA TTGGCCAATT
ACATATAAGG ATCTCGAACC ATATTATGAA AAAATTGAAA AAGATATAGC GGTATCAGGT
CCCAAGCATT TCCCCTGGGG TCCTTTTCAA GGACCATACC CCTACCCGGA ACGGGAACCT
ATCAGTGCTA ACTCTGAACT GTTTCGAAGA GGTTGTGAAA AGCTGGGCAT AAGAAGTGCA
GTGGCTCCTT TGGCCATACT ATCAGCCCCT TTTGACGGAC GTCCACCTTG CATTAACAGA
GGATTTTGTA ATCAAGGTTG TTTGCCCAAC GCTAAGTTCA GTACTTTGAT TCATTATATA
CCTAAAGCCA TAGGACACGG GGCTGAGGTG CTTAGTGATT GTATGGTTAC ACAAGTTGTT
GTAGATAAAA GCGGTAGAGT AACGGGAGTT ACCTTTATCC ATGATGATAA GGAATATTTC
CAAAAAGCCA AGATAACTAT AATCTCGGCC TTTTGTATTG AAACACCAAG ACTATTATTA
CATTCAGCCT GCCCCATGTT TCCAAACGGA CTTGCTAACA GCAGCGGTAT GGTAGGTAAA
GCGTTGATGA CCCATACCGG GCATGATATA TATGCAAAAT TCCACGATGA GGTACGTATC
TATAAGGGAA CGCCTGTGAT GGCAGTATCC CAGGAATTTT ATGAAACTGA TAAATCAAGG
GGCTTTGTTA AAGGATATAC CCTTAATGCC CATGGCTCCA GGCCTCTGGG TCTGGCTAAA
AATCTAGTCT CTAAGGCTGA TATTTGGGGC GAGAAACTAT ACGACATCAT GCGAGACTAT
AATTTTTACG CTCAGATTAC CATGGTAGGA GAAGTGCTTC CTGATAACAA TAATTCTGTA
ACATTAAGCA ACGAAAAAGA TGAGTATGGG ATACCTAGAC CTATAATTAC TTTTAGTTAC
GGTGAGAATG ACAATAAATT AATCGTTCAC GGGGTGCAAA AAGCTAATGA AATATTGGAG
GCTACCGGTG GTAAACCTGC CTTTGTGATT CCTGATACCG GTCACCTAAT GGGCACCTGT
CGAATGGGTA ATAACTCATC TACTTCTGTG GTAGACGGAT TTTGCCGCAG CCATGATATT
CCTAACCTTT ATATTTGCAG TGCTGCTGTT TTTGTGACCT CCGGTGGGTG TAATCCTACA
GAAACGGTTA TGGCTATCGC CGCTAGGACA GCGGATTATA TCATTGAAGA GGCAAAAAAG
GACGGTCAGT ATGGAATGAG AAAAACTTCC TCCATGACAT AA
 
Protein sequence
MPTKHDFVQH NCDHYDGNRY DHLDKRKYGE EADICIVGAG AAGGVLAYEL SQAGFKVVVI 
EAGPFWNPQT DFASDELSMH SLAWNDTRLV AGNNPLAMGH NNSGRGVGGG TVHFTGVFYR
FHESDFQLRT MDGVADDWPI TYKDLEPYYE KIEKDIAVSG PKHFPWGPFQ GPYPYPEREP
ISANSELFRR GCEKLGIRSA VAPLAILSAP FDGRPPCINR GFCNQGCLPN AKFSTLIHYI
PKAIGHGAEV LSDCMVTQVV VDKSGRVTGV TFIHDDKEYF QKAKITIISA FCIETPRLLL
HSACPMFPNG LANSSGMVGK ALMTHTGHDI YAKFHDEVRI YKGTPVMAVS QEFYETDKSR
GFVKGYTLNA HGSRPLGLAK NLVSKADIWG EKLYDIMRDY NFYAQITMVG EVLPDNNNSV
TLSNEKDEYG IPRPIITFSY GENDNKLIVH GVQKANEILE ATGGKPAFVI PDTGHLMGTC
RMGNNSSTSV VDGFCRSHDI PNLYICSAAV FVTSGGCNPT ETVMAIAART ADYIIEEAKK
DGQYGMRKTS SMT