Gene Information |
Locus tag | Dred_0652 |
Symbol | |
ID | 4955852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum reducens MI-1 |
Kingdom | Bacteria |
Replicon accession | NC_009253 |
Strand | + |
Start bp | 703454 |
End bp | 705577 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640179826 |
Product | carbon-monoxide dehydrogenase, catalytic subunit |
Protein accession | YP_001112016 |
Protein GI | 134298520 |
COG category | [C] Energy production and conversion |
COG ID | [COG1151] 6Fe-6S prismane cluster-containing protein |
TIGRFAM ID | [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00052858 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGACA AGAATAATGA AAATTACGAT ACGCTAATCC ATAATAGTGA TCCTACCCAC CGGGCCGATA CCAATGACCC CAGTGCCCAT CAGGATCAAA ACAAAACAGA GGTACCCTAT GATTACCGCA AATTTGAACC CTCTCCCACG GGGGAAGAAT TACACCGTTG GCAACGGGAA CATATTAAAA AGGATGATCA ATCCAAAGAG GGCTATCCCT TAAATGTTAT TGTGGACCCT GCCATGCGAG AAATGTATCA AGTGGTGCAT AATCAGGGTC TGACAAACGT TTTTGACCGT TTTTCGGAAC AGCAACCCCA GTGTAATTTC TGTGCCCCGG GTCTCTCTTG CCAGCTTTGT GCCAATGGAC CCTGCCGAAT TACGAAGAAG GCCCAGCGGG GTGTTTGTGG TGTGGATGCC CATGTAATGG TGGCCCGGAA CTTTACCTAT CGTCACACTA CCATAGGCAC CTCCGCCAAC TGTTATCATG CTTTACAGGC TGCCAGAACC CTGCGGGCAG CGGGCTCCAA CCCGGAAAGT GGGCTAAAGA TCAGGGAACC GGAAAAATTA AAAAAATATG CAGAGTTTTT GGGTATGGAT AAAAATAAGC CCATTGAACA ATTGGCTGTG GAGTTTGCGG ATTTCTTTAT CGAGGACCTT CACCGGCCCA ACTTTATGGA ATCCAAACTG GTGGAAGCCT TTGCGCCGCC CCGTCGTAAA GAACTCTGGC GTAAATTGGG CCTCTACCCC GGCGGCGCCT TCTCTGAAGT GGGTTTTGCC CAAACTAAGT GCATGACCAA CCTGGGTGCA GATCCTGTGG ACTTTTTATT AACCTGCGTA CGCTTGGGGG TGGCCAATGA ATTCCAGGGA TTGTGGCCCC TGGACTTACT ACAAGAAATT TTAATTGGCA CCCAGGAGAT TACGCAAAGG AAGCAAAATA TGGGCTTGTT GGATCCCAAT AAAGTAAATA TTATTACCAA TGGACACATG CCCTTGTTAG CCCATGTGGT CATTGATTTG GCCTCCACCG AAGAATGGCA GAGTAAAGCC AAAAAAGCAG GGGCCACTGG TTTACAAATC ATGGGTCATG TTTGTGAGGG CCAGCAACTG ATAAACTACT CCGGCACCCA CGAAATGTCA GCCCTGGCCG GTCAAGAGGG AGAATGGCTG TCTGAAGAAT ATTTATTGGC CACAGGCTGT GTGGATCTGT TTATGTTTGA CTACAACTGT ACCGTTCCTA CTCTTCCTTT GTATGCTGAG CGTTTTGGTA CCAAATTGAT GAGTGTGGAC CCGGTTATCC GCTTCCCCGA TACCGAGGCT CTGGATTTCA AACCTGAGCA AATGATAAAG CAAGCAGAGG AGTGTCTGGA ACGGGCAATT GAATTTTTTA AGAAACGCAA GGAAGAAAAC CGTAACGTTT ATGTACCACC CCACGTCAGT GATTGCATGG TAGGTTTTTC CACTGAATCA GTTAAGCAAG CCCTTGGTGG CAGTTGGCAG CCTTTAATTG ATCAAATTGC CAACGGAAAC ATCCGTGGTA TTGCAACTTT GGTAGGTTGT ACCACCGCCC GGTACGGTCA GGGCGGCAGC AATGCCTTTA AACTGGCAAA GGCCTTGATT GAGCGTGACG TCCTAGTACT GTCCGGTGGT TGTGTATCAG CCGTCTTTGA ATATACTGGC CTGTGTAAGC CAGAAGCAGC CAATGAGGCA GGGGACGGTC TTAAGCAAGT TTGCCAAACC CTAGGGATAC CGCCGGTATT GTCCTATGGT GCCTGTGTGG ATGTTGGTAA GATGACCCAC 
ACCGCCATGG AACTGGCAGA CGCTCTGGAT GTGGATACCA ATGCCCTTCC TCTGGTCATC GGTGCACCTG AGTATTTGGA GCAAAAGGCC GTGGCAGACG CCTGCACCGC AGTGGCAATG GGCTGGCTGG TCCACGTGGC ACCGGTTCCA TCGGTTACCG GCAGTGAAGT GGTAGTAAAG ACCCTTACTG AAACCACTGA AACCTTGGGA CTTGGTAAAA TGATGATTGA GCTGGATGCC GAAAAGGCAG CGGATATTTA CATTGAGCAC ATAGAGAAGA AACGAGCTGG ATTGGGCTTA TCCGCTAACC TGCCGAACCC ATAG
|
Protein sequence | MSDKNNENYD TLIHNSDPTH RADTNDPSAH QDQNKTEVPY DYRKFEPSPT GEELHRWQRE HIKKDDQSKE GYPLNVIVDP AMREMYQVVH NQGLTNVFDR FSEQQPQCNF CAPGLSCQLC ANGPCRITKK AQRGVCGVDA HVMVARNFTY RHTTIGTSAN CYHALQAART LRAAGSNPES GLKIREPEKL KKYAEFLGMD KNKPIEQLAV EFADFFIEDL HRPNFMESKL VEAFAPPRRK ELWRKLGLYP GGAFSEVGFA QTKCMTNLGA DPVDFLLTCV RLGVANEFQG LWPLDLLQEI LIGTQEITQR KQNMGLLDPN KVNIITNGHM PLLAHVVIDL ASTEEWQSKA KKAGATGLQI MGHVCEGQQL INYSGTHEMS ALAGQEGEWL SEEYLLATGC VDLFMFDYNC TVPTLPLYAE RFGTKLMSVD PVIRFPDTEA LDFKPEQMIK QAEECLERAI EFFKKRKEEN RNVYVPPHVS DCMVGFSTES VKQALGGSWQ PLIDQIANGN IRGIATLVGC TTARYGQGGS NAFKLAKALI ERDVLVLSGG CVSAVFEYTG LCKPEAANEA GDGLKQVCQT LGIPPVLSYG ACVDVGKMTH TAMELADALD VDTNALPLVI GAPEYLEQKA VADACTAVAM GWLVHVAPVP SVTGSEVVVK TLTETTETLG LGKMMIELDA EKAADIYIEH IEKKRAGLGL SANLPNP