Gene Mpal_0842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0842 
Symbol 
ID7272332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp868560 
End bp870449 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content65% 
IMG OID643569491 
Productcarbon-monoxide dehydrogenase, catalytic subunit 
Protein accessionYP_002465927 
Protein GI219851495 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.183197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.106762 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATC CAATTGCGAT AAAGAAGGTG GCAGAGAAGC GGGCGCTGGA CAGGACCAGC 
CAGATGATGA TCGAGAAGAC GATCGAAGAC GGTGTCGAGA CCGCCTGGGA CCGGCTGCAG
GCGCAGCAGC CGCAGTGCGG GTTCGGGCAG CTCGGGCTCT GCTGCAACAA CTGCTCGATG
GGACCCTGTC GGATCGATCC CTTCGGCGGA ACGCCGACGC GGGGTGTCTG CGGGGCGACG
GCGGACACGA TCGTGGCGAG GAACCTGCTC GACGACCTCG CCGTCGGTGC GGCCTCCCAC
TCGGATCACG GCCGGGAGGT CGTCGAGACG CTCCTCCATA CCGCAGAGGG GAAGGCCCAG
GGTTACCAGA TCACGGATGC GGTGAAGTTG CACCTGATCG CCGAGGAGTA CGGGATTCCG
ACCGAGGGCC GGGATGACGC AGCGATCGCC GGCGACCTCG CCCGGGGGAT GCTCGAGGAG
TTCGGATCGA TCAAGAACCG GATCCAGCTG GTTGACCGGG CCCCGGAGAA GACCCGGAAG
GTCTGGCAGG ATCTGAACAT CACTCCCCGG GGCGTGGACC GGGAGGTCGT GGAGGCGATG
CACCGGGTGC ATATGGGGGT CGACGCCGAT TATCTGAACA TCCTGCTGCA TGCGATGCGG
ACCTCGCTCT CTGACGGCTG GGGCGGGTCG ATGATGGCGA CCGACTGCTC CGATATCCTC
TTCGGGACCC CGACGCCGGT CACTTCCTTG GCGAACCTGG GCACGCTCAG CAGGGATAAA
GTCAATATCG TCCTCCACGG ACACAACCCG GTCCTCTCCG AGATGATCCT CAAGGCGGTG
GAGGAGCCGG CGGCCAAGGA GGCGGCCCGG GCGAAGGGTG CGGCCGGCAT CAACCTGGTG
GGGATGTGCT GCACCGGGAA CGAGGTGCTG ATGAGGCACG GGATCCCGAT CGCCGGCACG
ATCCTGGACC AGGAACTGGC CATCGCCACC GGAGCCGTGG AGGTGATGGT GATCGATTAC
CAGTGCATCT TCCCGTCCAT CACGGCGACG GCCAGCTGTT ATCACACGAA GGTCGTGGCG
ACCAGTGAGA AATCCAAGGT GCCGGGTTCG ATCTACAAGG AGTTCAGGCC CGAGACGGCG
CTGGACACGG CCAGGGAGAT CGTGGGGATG GCGATCGAGA ACTTCGCCGC CAGGGACCCG
AACCGGGTCA GGATCCCGGA CCGGCCGGTC CAGATGATGG CCGGGTTCTC TGAGGAGGCG
ATCAGAAAGG CCCTCGGAGG TACCTATAAA CCTTTGATCG ATGCGATCGT CGCCGGCACG
ATCAAGGGGG TTGTCGGGGT TGTCGGCTGC AACAACCCGA AGATCAAACA GGACTCAGGA
CATATCGCCC TCGGCAGGGA ACTGATCAGG CGGAACATTC TCGTCGTCGA GACCGGGTGT
GCGGCCATCG CGAGCGGAAA GGCCGGCCTG CTCGTCCCCG AAGCGGCGGA CCTGGCCGGG
GACGGGCTCA AGGCGGTCTG CAAGGCCCTG GGGATCCCGC CGGTGCTGCA CATGGGCTCA
TGTGTCGACT GCTCCCGGAT CCTGGTGATG GCCTCGCATG TGGCGGACGA ACTGGGCGTT
GGGATCGGCG ATCTGCCGCT CGGGGGGGCC GCCCCCGAAT GGTACTCGCA GAAGGCGATC
GCGATCGGGA CGTACTTCGT CTCGTCGGGT GTCTACACGG TGCTCGGGAT CCCCCCGAAG
ATCTTCGGCA GCCAGAACGT CCTCTCCCTC CTCGCGAGTC AACTCACAGG CGTGGTGAAC
GCCTCCTTCG CAGTCGAACC GGACCCCGTC AAGGCGGCCG ACCTGCTCGA GGCGGAGATC
GATCGGAAGC GACAGGCGCT CGGGATCTGA
 
Protein sequence
MYDPIAIKKV AEKRALDRTS QMMIEKTIED GVETAWDRLQ AQQPQCGFGQ LGLCCNNCSM 
GPCRIDPFGG TPTRGVCGAT ADTIVARNLL DDLAVGAASH SDHGREVVET LLHTAEGKAQ
GYQITDAVKL HLIAEEYGIP TEGRDDAAIA GDLARGMLEE FGSIKNRIQL VDRAPEKTRK
VWQDLNITPR GVDREVVEAM HRVHMGVDAD YLNILLHAMR TSLSDGWGGS MMATDCSDIL
FGTPTPVTSL ANLGTLSRDK VNIVLHGHNP VLSEMILKAV EEPAAKEAAR AKGAAGINLV
GMCCTGNEVL MRHGIPIAGT ILDQELAIAT GAVEVMVIDY QCIFPSITAT ASCYHTKVVA
TSEKSKVPGS IYKEFRPETA LDTAREIVGM AIENFAARDP NRVRIPDRPV QMMAGFSEEA
IRKALGGTYK PLIDAIVAGT IKGVVGVVGC NNPKIKQDSG HIALGRELIR RNILVVETGC
AAIASGKAGL LVPEAADLAG DGLKAVCKAL GIPPVLHMGS CVDCSRILVM ASHVADELGV
GIGDLPLGGA APEWYSQKAI AIGTYFVSSG VYTVLGIPPK IFGSQNVLSL LASQLTGVVN
ASFAVEPDPV KAADLLEAEI DRKRQALGI