Gene Information |
Locus tag | Dtox_2933 |
Symbol | |
ID | 8429923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 3113957 |
End bp | 3115828 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645035189 |
Product | carbon-monoxide dehydrogenase, catalytic subunit |
Protein accession | YP_003192312 |
Protein GI | 258516090 |
COG category | [C] Energy production and conversion |
COG ID | [COG1151] 6Fe-6S prismane cluster-containing protein |
TIGRFAM ID | [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit |
|
|
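The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminating stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3113957
END_BP = 3115828


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1872 623
```

Both results agree with the Gene Length (1872 bp) and Protein Length (623 aa) fields of the record.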
Plasmid Coverage Information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000257353 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGT GTCCTTCAGC AGATAGTGTT TTGAGCGCTT TTATGTCCGC CAAATCAGAT GTGGAAACAT CCTTTAACCG TGTGCCGGAG CAGTCTTTAA AATGCGGTTT CGGTATGCAA GGCGTTTGCT GCCGTCTTTG TTCCAACGGT CCCTGCCGGA TTACCCCGAC TTCCCCGAAA GGAGTTTGCG GTGCCGACGC TGATACCATA GTTGCCAGGA ACTTTTTACG TGCAGTAGCC GCAGGTGCGG CCTGCTATCT TCATATAGTC GAAAATACGG CCAATAACTT GCGTGAGACC GGTTTGGGCA ACACCCTGGT GACCCTTAAG GGGCTTGATA TTCTGCAAGA AACAGCAGAA CTAATAGGTA TTGAGGAAAG TAATCCCAAC CTGCAAGCTG TAAAAATAGC CGATAAAGTA CTTGAGGATC TTTACCGCCC CCGCAACCAG ACTATGACTC TAACAGAAAA AATGGCATAT GGCCCCAGGT ACCAACGCTG GCAGCAATTA AACATTTTGC CCGGCGGAGC CAAATCCGAA GTATTTGACG CACTGGTAAA AACCAGCACT AACCTTTCCA GCGATCCTGT GGATATGCTC CTGCACGCCT TAAGACTGGG TATCGCTACG GGGCTCTATG GTTTAACATT AACTAACCAT TTAAATGATA TCATGCTGGG AGAACCGCAA ATAACTCCCG CCAGAGTAGG CTTTTCAGTT ATTAACGATG CCTATATCAA TATTATGGTT ACGGGACACC AGCATTCTAT TATATCTGTA CTGCAAGAAA AACTGGTCAG TCCGGAAGCA AAAAAAATGG CCCTAGAGGC AGGTGCAAAA GGATTTAAAC TTGTAGGCTG CACCTGTGTC GGCCAGGATC TCCAACTGCG CGGCGTACAC TGCAAGGAAG TCTTTGCCGG TCATGCAGGA AACAATTTTA CCAGCGAAGC TTTAATTTCC ACCGGTGCAA TTGATCTTGT GCTAAGTGAG TTTAACTGTA CTCTGCCCGG TATTGAGCCT ATCTGTGACA GCTTCCTGGT AAAACAAATC TGCCTGGACG ATGTTTGCAA GAAAGCTAAT GCTGAGTATA TACCTTTCGA TATTAAAAAC GCTGCTGCAA CAAGCAATCA AATTATGCTG GCGGCTGTTT CCAGCTATAA GGAGCGCCGG GGTAAAGTAC AGATAGACAT TCCTGAACAT GGTTATAACG ACGTTATCAC CGGGATCAGT GAAAAATCAC TGAAAAAATT CCTGGGCGGT ACTTTTCAAC CTCTCATCGA TCTAATCGCT GCCGGCACCA TACAGGGTGT TGCTGCAGTA GTTGGCTGCT CCAATCTTAC AGCCAAAGGA CACGATGTTT TCAGTGTTGA ACTGACTAAA GAATTAATTA AAAGAGACAT TATAGTGCTG TCAGCAGGCT GCACCAGCGG TGGTTTGGAA AATTGCGGGT TAATGTCTCC GTCAGCTGCT GAACTGGCGG GAGAAAACCT CAAAGCTGTA TGTAAACAAC TGGGCATTCC ACCTGTATTA AACTTCGGTC CCTGCCTGTC CATAGGCAGA TTGGAAATAG TAGCCACAGA ACTGGCCAGA GCTTTGAATA TTGATATACC ACAACTGCCC CTGGTTCTCT CAGCCCCCCA GTGGTTGGAA GAACAGGCAC TGGCTGACGG CGCCTTTGGC CTTGCACTTG GTTTGCCGCT GCACCTGGCT ATACCTCCGT TCGTAACCGG CAGCAAGCTG GTGAGCAAGG TTTTAACAGA GGATCTGAAA GAACTGACCG GTGGCAAGGT AATTTTAGAA 
GGGGAAATTA TCCCGGCGGC TGATCAACTG GAAACTATTA TCAAACAAAA AAGAAAAGCT CTGGGCTTAT AG
|
Protein sequence | MKMCPSADSV LSAFMSAKSD VETSFNRVPE QSLKCGFGMQ GVCCRLCSNG PCRITPTSPK GVCGADADTI VARNFLRAVA AGAACYLHIV ENTANNLRET GLGNTLVTLK GLDILQETAE LIGIEESNPN LQAVKIADKV LEDLYRPRNQ TMTLTEKMAY GPRYQRWQQL NILPGGAKSE VFDALVKTST NLSSDPVDML LHALRLGIAT GLYGLTLTNH LNDIMLGEPQ ITPARVGFSV INDAYINIMV TGHQHSIISV LQEKLVSPEA KKMALEAGAK GFKLVGCTCV GQDLQLRGVH CKEVFAGHAG NNFTSEALIS TGAIDLVLSE FNCTLPGIEP ICDSFLVKQI CLDDVCKKAN AEYIPFDIKN AAATSNQIML AAVSSYKERR GKVQIDIPEH GYNDVITGIS EKSLKKFLGG TFQPLIDLIA AGTIQGVAAV VGCSNLTAKG HDVFSVELTK ELIKRDIIVL SAGCTSGGLE NCGLMSPSAA ELAGENLKAV CKQLGIPPVL NFGPCLSIGR LEIVATELAR ALNIDIPQLP LVLSAPQWLE EQALADGAFG LALGLPLHLA IPPFVTGSKL VSKVLTEDLK ELTGGKVILE GEIIPAADQL ETIIKQKRKA LGL
|
| |
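The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is only an illustration: it builds the codon table by hand rather than using a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 and last 12 bases of the record are used.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "ATGAAAATGT GTCCTTCAGC AGATAGTGTT"  # start of the gene sequence
last_12 = "CTGGGCTTAT AG"                     # end of the gene sequence

print(translate(first_30))  # MKMCPSADSV
print(translate(last_12))   # LGL*
```

The two translated fragments match the first ten and last three residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field; the 30-base fragment on its own is too short to be representative.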