Gene Dret_1234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1234 
Symbol 
ID8419062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1446576 
End bp1448465 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content60% 
IMG OID645037809 
Productcarbon-monoxide dehydrogenase, catalytic subunit 
Protein accessionYP_003198100 
Protein GI258405358 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0405701 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.377093 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGG AAAAGCGGAG TATCGAAGAG TTGAGTCTCT GGGAAGACGC GCAACGAATG 
ATCGCCAAAG CCCAGCGAGA AGGAGTGGAA ACGGTTTGGG ATCGCCTGGA GGAGCAGACC
CCGCATTGCA CTTTTTGCGA ACAAGGGTTG ACCTGTCAGA AATGCGTGAT GGGGCCTTGC
CGTATCAACC CCAAAGACAA CGGCAAAAAA CAACGGGGGG TCTGTGGCGC AGGTGCGGAT
CTGACCGTGG CCCGCAATTT CGGCCGCTTC ATTGCCGCAG GGGCGGCCTC GCACTCTGAC
CACGGGCGGG ATCTCGTCGA GGTACTCGAA GCTGTGGGGC AGGGAAAGAC CACTGATTAT
GCCGTACGCG ACGAAGACAA ACTCCGTCGC TTGGCCGATG AAGTCGGCGT AGAGCACGGG
GACAAATCAG TCCAGGAGAC CGCAGCCGCA CTGGCCGAGG TCTTTATGGA CGACTACAGT
TTTCGCCGCA ACGGGGTCAG TTTCGCGGCC CGGGCCCCGG AAAAACGGCG CCGTGTCTGG
GAGGAAACCG GAATCACCCC ACGCGGCGTG GATCGGGACG TGGTGGAGAT GATGCACCGC
ACCCACATGG GGGTGGACAG CGACGCCACA AGCATTTGCC TGCACTCGGC CCGCGTGGCT
TTGACCGACG GCTGGGGCGG CTCCATGATC GCCACCGAAC TGTCAGACAC CTTGTTCGGC
ACCCCGCAAC CCCGCCAATC CACAGCCAAT CTCAGCGTTC TCAAAGAAGA CCAGATCAAT
ATTCTGGTCC ACGGCCACAG CCCCATTGTT TCTGAAATGC TCTTGACCGC AGTCCAGGAC
CCGGATCTTC TTCAGGAGGC CCGGGATGCG GGCGCTGCAG GGATTAACCT CGCCGGACTC
TGTTGCACGG GCAACGAACT GCTCATGCGT CAGGGCGTGC CCATGGCTGG CAATCACCTC
ATGACCGAAC TCGCGCTCAT CACCGGGGCG GTGGAACTCA TGGTCGTGGA TTATCAATGC
ATCATGCCCA GTCTGGTGAC GATCGCCGGA TGTTATCACA CCCAATTCGT CTCCACCTCG
GAAAAGGCTC ATTTTACGGG CGCCACCCAT GTCGAATTCA CGTACACCAA TGCGATAGAG
CAAGCCAGAA ACGTGGTCCG CATGGCCATT GAGGCCTACC GCAATCGGGA TCCGCAGCGG
GTGGAGATCC CGGAGGGACC GATGCAGCTG ACCACCGGTT TTTCCAACGA GGCCATCCTG
GAGGCCCTGG GCGGCACTCC TGACCCGCTT CTCGATGCGC TCAAGAACGG AAGTGTCCGC
GGCGTCGTGG GAATCGTCGG TTGCAACAAT CCCAAACTCA AGCACGACCA TTGCCACGTC
AATCTGGCCC GGGAGCTGAT CAAGAAAGAT GTCCTGGTTT TGGCCACCGG CTGCGCCACT
GTCGCTTTGG GCAAGGCTGG TCTGCTCATG CCGGATGCCG CCGGAGAAGC CGGCTCCGGG
TTGCAGTCCG TCTGCCAATC CCTCGGCATC CCCCCGGTAC TGCATGTCGG CAGTTGTGTC
GACAACGCCC GCATCCTGCA TCTGTGCGGT GTGCTGGCCA ACGCCCTTGG CGTGGACATC
AGCGATCTGC CTGTGGCCGC CTCGGCCCCG GAATGGTATT CGGAAAAAGC CGCGGCTATC
GGCCTCTATG CCGTGGCCAG CGGGATCTAC ACCCATCTAG GGCTCCCACC GAACATCCAG
GGGAGTCAGA TGGTTACCGA TCTGGCCCTG AACGGACTCA ACGATGTGGT CGGTGCCGCG
TTTGGTGTTT CTCCAGATCC GTTTGAGGCG GCGGACATGA TCGACGCCCG GATACGGGAG
AAGCGAAAAG GGCTGGGATT GTCCGAGTGA
 
Protein sequence
MSKEKRSIEE LSLWEDAQRM IAKAQREGVE TVWDRLEEQT PHCTFCEQGL TCQKCVMGPC 
RINPKDNGKK QRGVCGAGAD LTVARNFGRF IAAGAASHSD HGRDLVEVLE AVGQGKTTDY
AVRDEDKLRR LADEVGVEHG DKSVQETAAA LAEVFMDDYS FRRNGVSFAA RAPEKRRRVW
EETGITPRGV DRDVVEMMHR THMGVDSDAT SICLHSARVA LTDGWGGSMI ATELSDTLFG
TPQPRQSTAN LSVLKEDQIN ILVHGHSPIV SEMLLTAVQD PDLLQEARDA GAAGINLAGL
CCTGNELLMR QGVPMAGNHL MTELALITGA VELMVVDYQC IMPSLVTIAG CYHTQFVSTS
EKAHFTGATH VEFTYTNAIE QARNVVRMAI EAYRNRDPQR VEIPEGPMQL TTGFSNEAIL
EALGGTPDPL LDALKNGSVR GVVGIVGCNN PKLKHDHCHV NLARELIKKD VLVLATGCAT
VALGKAGLLM PDAAGEAGSG LQSVCQSLGI PPVLHVGSCV DNARILHLCG VLANALGVDI
SDLPVAASAP EWYSEKAAAI GLYAVASGIY THLGLPPNIQ GSQMVTDLAL NGLNDVVGAA
FGVSPDPFEA ADMIDARIRE KRKGLGLSE