Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_1234 |
Symbol | |
ID | 8419062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 1446576 |
End bp | 1448465 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645037809 |
Product | carbon-monoxide dehydrogenase, catalytic subunit |
Protein accession | YP_003198100 |
Protein GI | 258405358 |
COG category | [C] Energy production and conversion |
COG ID | [COG1151] 6Fe-6S prismane cluster-containing protein |
TIGRFAM ID | [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0405701 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.377093 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAGG AAAAGCGGAG TATCGAAGAG TTGAGTCTCT GGGAAGACGC GCAACGAATG ATCGCCAAAG CCCAGCGAGA AGGAGTGGAA ACGGTTTGGG ATCGCCTGGA GGAGCAGACC CCGCATTGCA CTTTTTGCGA ACAAGGGTTG ACCTGTCAGA AATGCGTGAT GGGGCCTTGC CGTATCAACC CCAAAGACAA CGGCAAAAAA CAACGGGGGG TCTGTGGCGC AGGTGCGGAT CTGACCGTGG CCCGCAATTT CGGCCGCTTC ATTGCCGCAG GGGCGGCCTC GCACTCTGAC CACGGGCGGG ATCTCGTCGA GGTACTCGAA GCTGTGGGGC AGGGAAAGAC CACTGATTAT GCCGTACGCG ACGAAGACAA ACTCCGTCGC TTGGCCGATG AAGTCGGCGT AGAGCACGGG GACAAATCAG TCCAGGAGAC CGCAGCCGCA CTGGCCGAGG TCTTTATGGA CGACTACAGT TTTCGCCGCA ACGGGGTCAG TTTCGCGGCC CGGGCCCCGG AAAAACGGCG CCGTGTCTGG GAGGAAACCG GAATCACCCC ACGCGGCGTG GATCGGGACG TGGTGGAGAT GATGCACCGC ACCCACATGG GGGTGGACAG CGACGCCACA AGCATTTGCC TGCACTCGGC CCGCGTGGCT TTGACCGACG GCTGGGGCGG CTCCATGATC GCCACCGAAC TGTCAGACAC CTTGTTCGGC ACCCCGCAAC CCCGCCAATC CACAGCCAAT CTCAGCGTTC TCAAAGAAGA CCAGATCAAT ATTCTGGTCC ACGGCCACAG CCCCATTGTT TCTGAAATGC TCTTGACCGC AGTCCAGGAC CCGGATCTTC TTCAGGAGGC CCGGGATGCG GGCGCTGCAG GGATTAACCT CGCCGGACTC TGTTGCACGG GCAACGAACT GCTCATGCGT CAGGGCGTGC CCATGGCTGG CAATCACCTC ATGACCGAAC TCGCGCTCAT CACCGGGGCG GTGGAACTCA TGGTCGTGGA TTATCAATGC ATCATGCCCA GTCTGGTGAC GATCGCCGGA TGTTATCACA CCCAATTCGT CTCCACCTCG GAAAAGGCTC ATTTTACGGG CGCCACCCAT GTCGAATTCA CGTACACCAA TGCGATAGAG CAAGCCAGAA ACGTGGTCCG CATGGCCATT GAGGCCTACC GCAATCGGGA TCCGCAGCGG GTGGAGATCC CGGAGGGACC GATGCAGCTG ACCACCGGTT TTTCCAACGA GGCCATCCTG GAGGCCCTGG GCGGCACTCC TGACCCGCTT CTCGATGCGC TCAAGAACGG AAGTGTCCGC GGCGTCGTGG GAATCGTCGG TTGCAACAAT CCCAAACTCA AGCACGACCA TTGCCACGTC AATCTGGCCC GGGAGCTGAT CAAGAAAGAT GTCCTGGTTT TGGCCACCGG CTGCGCCACT GTCGCTTTGG GCAAGGCTGG TCTGCTCATG CCGGATGCCG CCGGAGAAGC CGGCTCCGGG TTGCAGTCCG TCTGCCAATC CCTCGGCATC CCCCCGGTAC TGCATGTCGG CAGTTGTGTC GACAACGCCC GCATCCTGCA TCTGTGCGGT GTGCTGGCCA ACGCCCTTGG CGTGGACATC AGCGATCTGC CTGTGGCCGC CTCGGCCCCG GAATGGTATT CGGAAAAAGC CGCGGCTATC GGCCTCTATG CCGTGGCCAG CGGGATCTAC ACCCATCTAG GGCTCCCACC GAACATCCAG GGGAGTCAGA TGGTTACCGA TCTGGCCCTG AACGGACTCA ACGATGTGGT CGGTGCCGCG TTTGGTGTTT CTCCAGATCC GTTTGAGGCG GCGGACATGA TCGACGCCCG GATACGGGAG AAGCGAAAAG GGCTGGGATT GTCCGAGTGA
|
Protein sequence | MSKEKRSIEE LSLWEDAQRM IAKAQREGVE TVWDRLEEQT PHCTFCEQGL TCQKCVMGPC RINPKDNGKK QRGVCGAGAD LTVARNFGRF IAAGAASHSD HGRDLVEVLE AVGQGKTTDY AVRDEDKLRR LADEVGVEHG DKSVQETAAA LAEVFMDDYS FRRNGVSFAA RAPEKRRRVW EETGITPRGV DRDVVEMMHR THMGVDSDAT SICLHSARVA LTDGWGGSMI ATELSDTLFG TPQPRQSTAN LSVLKEDQIN ILVHGHSPIV SEMLLTAVQD PDLLQEARDA GAAGINLAGL CCTGNELLMR QGVPMAGNHL MTELALITGA VELMVVDYQC IMPSLVTIAG CYHTQFVSTS EKAHFTGATH VEFTYTNAIE QARNVVRMAI EAYRNRDPQR VEIPEGPMQL TTGFSNEAIL EALGGTPDPL LDALKNGSVR GVVGIVGCNN PKLKHDHCHV NLARELIKKD VLVLATGCAT VALGKAGLLM PDAAGEAGSG LQSVCQSLGI PPVLHVGSCV DNARILHLCG VLANALGVDI SDLPVAASAP EWYSEKAAAI GLYAVASGIY THLGLPPNIQ GSQMVTDLAL NGLNDVVGAA FGVSPDPFEA ADMIDARIRE KRKGLGLSE
|
| |