Gene Dtox_2933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2933 
Symbol 
ID8429923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3113957 
End bp3115828 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content47% 
IMG OID645035189 
Productcarbon-monoxide dehydrogenase, catalytic subunit 
Protein accessionYP_003192312 
Protein GI258516090 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000257353 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGT GTCCTTCAGC AGATAGTGTT TTGAGCGCTT TTATGTCCGC CAAATCAGAT 
GTGGAAACAT CCTTTAACCG TGTGCCGGAG CAGTCTTTAA AATGCGGTTT CGGTATGCAA
GGCGTTTGCT GCCGTCTTTG TTCCAACGGT CCCTGCCGGA TTACCCCGAC TTCCCCGAAA
GGAGTTTGCG GTGCCGACGC TGATACCATA GTTGCCAGGA ACTTTTTACG TGCAGTAGCC
GCAGGTGCGG CCTGCTATCT TCATATAGTC GAAAATACGG CCAATAACTT GCGTGAGACC
GGTTTGGGCA ACACCCTGGT GACCCTTAAG GGGCTTGATA TTCTGCAAGA AACAGCAGAA
CTAATAGGTA TTGAGGAAAG TAATCCCAAC CTGCAAGCTG TAAAAATAGC CGATAAAGTA
CTTGAGGATC TTTACCGCCC CCGCAACCAG ACTATGACTC TAACAGAAAA AATGGCATAT
GGCCCCAGGT ACCAACGCTG GCAGCAATTA AACATTTTGC CCGGCGGAGC CAAATCCGAA
GTATTTGACG CACTGGTAAA AACCAGCACT AACCTTTCCA GCGATCCTGT GGATATGCTC
CTGCACGCCT TAAGACTGGG TATCGCTACG GGGCTCTATG GTTTAACATT AACTAACCAT
TTAAATGATA TCATGCTGGG AGAACCGCAA ATAACTCCCG CCAGAGTAGG CTTTTCAGTT
ATTAACGATG CCTATATCAA TATTATGGTT ACGGGACACC AGCATTCTAT TATATCTGTA
CTGCAAGAAA AACTGGTCAG TCCGGAAGCA AAAAAAATGG CCCTAGAGGC AGGTGCAAAA
GGATTTAAAC TTGTAGGCTG CACCTGTGTC GGCCAGGATC TCCAACTGCG CGGCGTACAC
TGCAAGGAAG TCTTTGCCGG TCATGCAGGA AACAATTTTA CCAGCGAAGC TTTAATTTCC
ACCGGTGCAA TTGATCTTGT GCTAAGTGAG TTTAACTGTA CTCTGCCCGG TATTGAGCCT
ATCTGTGACA GCTTCCTGGT AAAACAAATC TGCCTGGACG ATGTTTGCAA GAAAGCTAAT
GCTGAGTATA TACCTTTCGA TATTAAAAAC GCTGCTGCAA CAAGCAATCA AATTATGCTG
GCGGCTGTTT CCAGCTATAA GGAGCGCCGG GGTAAAGTAC AGATAGACAT TCCTGAACAT
GGTTATAACG ACGTTATCAC CGGGATCAGT GAAAAATCAC TGAAAAAATT CCTGGGCGGT
ACTTTTCAAC CTCTCATCGA TCTAATCGCT GCCGGCACCA TACAGGGTGT TGCTGCAGTA
GTTGGCTGCT CCAATCTTAC AGCCAAAGGA CACGATGTTT TCAGTGTTGA ACTGACTAAA
GAATTAATTA AAAGAGACAT TATAGTGCTG TCAGCAGGCT GCACCAGCGG TGGTTTGGAA
AATTGCGGGT TAATGTCTCC GTCAGCTGCT GAACTGGCGG GAGAAAACCT CAAAGCTGTA
TGTAAACAAC TGGGCATTCC ACCTGTATTA AACTTCGGTC CCTGCCTGTC CATAGGCAGA
TTGGAAATAG TAGCCACAGA ACTGGCCAGA GCTTTGAATA TTGATATACC ACAACTGCCC
CTGGTTCTCT CAGCCCCCCA GTGGTTGGAA GAACAGGCAC TGGCTGACGG CGCCTTTGGC
CTTGCACTTG GTTTGCCGCT GCACCTGGCT ATACCTCCGT TCGTAACCGG CAGCAAGCTG
GTGAGCAAGG TTTTAACAGA GGATCTGAAA GAACTGACCG GTGGCAAGGT AATTTTAGAA
GGGGAAATTA TCCCGGCGGC TGATCAACTG GAAACTATTA TCAAACAAAA AAGAAAAGCT
CTGGGCTTAT AG
 
Protein sequence
MKMCPSADSV LSAFMSAKSD VETSFNRVPE QSLKCGFGMQ GVCCRLCSNG PCRITPTSPK 
GVCGADADTI VARNFLRAVA AGAACYLHIV ENTANNLRET GLGNTLVTLK GLDILQETAE
LIGIEESNPN LQAVKIADKV LEDLYRPRNQ TMTLTEKMAY GPRYQRWQQL NILPGGAKSE
VFDALVKTST NLSSDPVDML LHALRLGIAT GLYGLTLTNH LNDIMLGEPQ ITPARVGFSV
INDAYINIMV TGHQHSIISV LQEKLVSPEA KKMALEAGAK GFKLVGCTCV GQDLQLRGVH
CKEVFAGHAG NNFTSEALIS TGAIDLVLSE FNCTLPGIEP ICDSFLVKQI CLDDVCKKAN
AEYIPFDIKN AAATSNQIML AAVSSYKERR GKVQIDIPEH GYNDVITGIS EKSLKKFLGG
TFQPLIDLIA AGTIQGVAAV VGCSNLTAKG HDVFSVELTK ELIKRDIIVL SAGCTSGGLE
NCGLMSPSAA ELAGENLKAV CKQLGIPPVL NFGPCLSIGR LEIVATELAR ALNIDIPQLP
LVLSAPQWLE EQALADGAFG LALGLPLHLA IPPFVTGSKL VSKVLTEDLK ELTGGKVILE
GEIIPAADQL ETIIKQKRKA LGL