Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_0884 |
Symbol | |
ID | 5410765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 863253 |
End bp | 865214 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640868110 |
Product | carbon-monoxide dehydrogenase, catalytic subunit |
Protein accession | YP_001404045 |
Protein GI | 154150427 |
COG category | [C] Energy production and conversion |
COG ID | [COG1151] 6Fe-6S prismane cluster-containing protein |
TIGRFAM ID | [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.17634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.977834 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAAC AGGAACCCAA ACCCGTACTG GTCCGGGAAC GGATCGATGT CTGTGAGCTG GACAGGGCCC GGATGTCTCT GATGAACCCT GCGCTCATCG AGCAGAAAAA AAAGGAGCGC ACGATCGATG AGAACGCCAA GCCCCTGATC GAAATGGCGC TGCGGGAAGG TTTCGAGACG GTCTGGGACC GGTACGAGAT GCAGCAGCCG TCCTGCAAGT ACTGTGAGGC CGGGCTCTCC TGCGCCCGGT GCACTATGGG ACCGTGCCGG ATCATCCCGC CCCACCGGAT CCGGGGCGTC TGCGGGGCCG ATGCGGACCT GATCGTATCC CGCAACCTCC TGGATATGAT CGCCACCGGT TCTGCAGCCC ATTCCGACCA CGGTCGCGAC ATCGTCGAAA CACTCTATCT CGTGGGAATG GGAACGGCCA CGGATTACGG CATCGCTGAC GAAGATAAGC TGCGCCTCCT TGCACAGGAG TTCGGTATCG CTAATGATAA GAAATCCGCA AAAGAGATCG CAGCCCTGCT CGGGCGTGCC ATGCTCGAAG AGTACGGCAT GACCAGGAAC ACACTCCAGT GCCTCAGCCG GGCCCCAAAA GCCACGCAGG AGATCTGGGA TGCAGCCGGC ATCACACCCC GGGGCATCGA CCGCGAGGTC GTGGACAGCA TGCACCGGGT CCAGATGGGG GTGGGTGCGG ATTATACCAA TATCCTGTTA CAGGGCCTGC GCTGCAGCCT CTCTGACGGC TGGGGCGGCT CGATGCTGGG GACCGATGTC TCGGATGTTC TCTTTGGGAC CCCGTCCATC CGGGAATCGA AGGTCAACCT TGCAGTGCTC AAAAAGGATC ATGTCAACAT CGCCGTCCAC GGCCATAACC CGGTCCTCTC CGAGATGGTG GTAAAGGCTG TCACCGATCC CGAACTTGTG GCGCTTGCAA AGAAAAACGG CGCGGCGGGC ATCAACCTTG TGGGGCTCTG CTGCACGGGA AGCGAGCTCC TGATGCGCAA GGGCGTTCCC ATGTCCGGCA ACCATCTCAA CCAGGAACTG GTTATCATGA CCGGGGCGCT CGAAGCAATG ATCGTAGATT ACCAGTGCAT CTTTCCCTCC CTGCCACGGA CGGCAAGCTG CTTCCACACG CTTATTGTCT CCACCAGTCC CAAGGCAAAA ATCCCGGGCT CGTATTTCTT TGACTTTTCC CCGGACAAGG GCCTGGTGAC CGCAAAGGCG ATTGTCCGCA TGGCCGTTGA GAATTTCAAA AACCGGAACC CGGCACGGGT CAACATCCCG GGAAGCCCCG TACCGCTCAT GACCGGTTTC TCCAACGAGG CGATAAAAAA AGCCCTGGGC GGCTCTTTCA AGCCCCTCAT CAATCTCATT GCGGCCGGGA AGATCCGGGG GGCAGTCGGG ATCGTGGGCT GCAACAACCC GCATGTGAAG CATGATTACG GCCACGTGGA GCTCACAAAA GCGCTGATCA AAAAGAACAT CCTCTGTGTC GAGACCGGCT GTGCTGCGAT CGCTTCAGGA AAGGCCGGCC TCCTCATGCC CTCTGCTGCC GCCATGGCCG GGGATGACCT GAGGGCAGTT TGTGAGTCCC TGAAAATCCC GCCCGTGCTC CACATGGGTT CCTGTGTGGA CAACTCGCGT ATCCTCGTGC TTGCCGCAGA GCTGGGTAAT GCCCTCAATG TCCCGATCCA CAAGCTGCCG ATTGCCGGGG CGGCACCGGA GTGGTACTCG CAGAAAGCCG TCTCCATCGG CGCATACTTT GTTGCCTCCG GGGTCTATAC CGTGCTTGGC GTCATGCCGC ATATCTCGGG AAGTCCGGCG GTGGTGTCGC TCCTGACCGA TGGCCTTAAG GGTGCGGTAA ACGCAAACTT TGCCGTGGAG CCAGACCCGG TAAAGGCAGC AGACCTCATC GCGGACCATA TCGAGAAGAA ACGCACCGGG CTGGGGATCT GA
|
Protein sequence | MAEQEPKPVL VRERIDVCEL DRARMSLMNP ALIEQKKKER TIDENAKPLI EMALREGFET VWDRYEMQQP SCKYCEAGLS CARCTMGPCR IIPPHRIRGV CGADADLIVS RNLLDMIATG SAAHSDHGRD IVETLYLVGM GTATDYGIAD EDKLRLLAQE FGIANDKKSA KEIAALLGRA MLEEYGMTRN TLQCLSRAPK ATQEIWDAAG ITPRGIDREV VDSMHRVQMG VGADYTNILL QGLRCSLSDG WGGSMLGTDV SDVLFGTPSI RESKVNLAVL KKDHVNIAVH GHNPVLSEMV VKAVTDPELV ALAKKNGAAG INLVGLCCTG SELLMRKGVP MSGNHLNQEL VIMTGALEAM IVDYQCIFPS LPRTASCFHT LIVSTSPKAK IPGSYFFDFS PDKGLVTAKA IVRMAVENFK NRNPARVNIP GSPVPLMTGF SNEAIKKALG GSFKPLINLI AAGKIRGAVG IVGCNNPHVK HDYGHVELTK ALIKKNILCV ETGCAAIASG KAGLLMPSAA AMAGDDLRAV CESLKIPPVL HMGSCVDNSR ILVLAAELGN ALNVPIHKLP IAGAAPEWYS QKAVSIGAYF VASGVYTVLG VMPHISGSPA VVSLLTDGLK GAVNANFAVE PDPVKAADLI ADHIEKKRTG LGI
|
| |