Gene Dtox_0934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0934 
Symbol 
ID8427873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp954254 
End bp955882 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content50% 
IMG OID645033276 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_003190450 
Protein GI258514228 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00530941 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.265769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAGC CTTTAAAAGT CGGGCGGTCA ATACCAAGGT TGGATGCGGA AGACAAGGCA 
GCGGGAAGAG AAAAATATGC GGCAGACTAT TATCCGGAAG ATTTCCTTGT CATAGGCATC
AAGAGATCAC CCTATCCACA TGCGCGGGTT TTACAGATTG ATTCCTCGAA GGCAAAAAGA
ATACCGGGTG TTGTTGCTGT GTTGACTCAC CGGGATATAG CGGGTTCCAA TCAACTGGGC
ATTATTGTGA AGGACCAGCC TGTACTGGCC CGAAATGTTG TGCGTTTCAT CGGGGATGCC
GTAGTGTTGG CTGTTGCGGA AAACAAAGAA GTTCTGGAGG AAGCGCTTGC TCAAATTGAG
GTGGAGTATG AGCCGCTGAC TCCTTTATTT TGTCCCCAAG CTGCTCTATT AGAAAACAGT
GTAAAAGTCC ATGCCGACTG GCAAAACGGA AATATCCTTT TGGCAGGGAA GCTTGAAACA
GGCAATGCCA AAGAGGCACT TGGGGACTGT GCGCACAAGG TGAGGGTTGA ATTGCAACTG
GGATGCCAGG AGCATGCCTG CCTGGAAACC GAATGCGGTG TGGCCTGGAT TGAGGATGAC
GGTAATATGG TTATCACTGC ATCCACCCAG AGTCCTTTCC GGGACCGGCT AGAACTGTCA
CATGCATTGG GGATACCACC GGATCGCATA CGTGTTATAG CACCCTTTCT GGGTGGTGGC
TTTGGCCGCA AGGACGGAGT ATCTGTACAA GCCTATTTGG CACTGGCAGC GTTAAACTCC
AACGGAAGAC CGGTAAAAAT ACAGTTATCA AGGGAAGAGA GCATTGCAAC GGGGACAAAA
AGACACGCAG CTGAGATTTG CGTAGAACTG GGTTGTGATA CGCAAGGCAA GCTTTCGGCT
CTTTGTTGCG ATGTTTTGAT GGACACCGGA GCCTATGCGT CACTGGGAGG GGAAGTATTG
ACATTGGGGA TGGAACATGC CGGAGGTCCT TACCGTATTC CCAATGTTAT TATTGAGGGC
AAGGCGGTAT ATACAAATAA CGTTCCGGCC GGCGCTTTCC GTGGTTTTGG CGTTCCACAA
ACCACAGCGG GAATTGAGCA GGCGATGGAC GAATTGGCTA AAGTCGCCGG GTTTGACCCG
CTTATATTCC GGTTGGTAAA TGCAGTTAAA CAGGGGGAGA GAAATTCAGC CGGGGTGATT
ATGACCCAAT CCGTCGGATT AACTGCCTGT CTGGAAACAG TAGCTGCCTG CCCTGAGTGG
AAAGATCGCC AAAATTGGAT AAACAGTGCG CCGCCTTTTA CGCGGCGTGG TGTCGGTTTG
TCAGCTATGC ACCATGCTCA GGGATTTGGG CCTGTAATTC CTGACAATGC CAATGCCAAA
ATCGAGCTTG ACCCAGAGGG CTGTTTTATC ATTTATGTGG GTGTGGCGGA TATGGGACAG
GGCAATGCCA CAACTTATCT GCAAATAGCA GGGGATATTT TAGGCCAGGG CTTTGACCGG
CTGAAAATGG TTTTGCCGGA TACCCAAAAA GCTTTGCCTT CCGGCTCATC ATCTGCCAGC
CGCACAACAT TTACCTTTGG GAATGCAGTT ATCGGTGCTG CCAGACTTCT GTCGGGACGT
ATTATATAG
 
Protein sequence
MNEPLKVGRS IPRLDAEDKA AGREKYAADY YPEDFLVIGI KRSPYPHARV LQIDSSKAKR 
IPGVVAVLTH RDIAGSNQLG IIVKDQPVLA RNVVRFIGDA VVLAVAENKE VLEEALAQIE
VEYEPLTPLF CPQAALLENS VKVHADWQNG NILLAGKLET GNAKEALGDC AHKVRVELQL
GCQEHACLET ECGVAWIEDD GNMVITASTQ SPFRDRLELS HALGIPPDRI RVIAPFLGGG
FGRKDGVSVQ AYLALAALNS NGRPVKIQLS REESIATGTK RHAAEICVEL GCDTQGKLSA
LCCDVLMDTG AYASLGGEVL TLGMEHAGGP YRIPNVIIEG KAVYTNNVPA GAFRGFGVPQ
TTAGIEQAMD ELAKVAGFDP LIFRLVNAVK QGERNSAGVI MTQSVGLTAC LETVAACPEW
KDRQNWINSA PPFTRRGVGL SAMHHAQGFG PVIPDNANAK IELDPEGCFI IYVGVADMGQ
GNATTYLQIA GDILGQGFDR LKMVLPDTQK ALPSGSSSAS RTTFTFGNAV IGAARLLSGR
II