Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0842 |
Symbol | |
ID | 7272332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 868560 |
End bp | 870449 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643569491 |
Product | carbon-monoxide dehydrogenase, catalytic subunit |
Protein accession | YP_002465927 |
Protein GI | 219851495 |
COG category | [C] Energy production and conversion |
COG ID | [COG1151] 6Fe-6S prismane cluster-containing protein |
TIGRFAM ID | [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.183197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.106762 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGATC CAATTGCGAT AAAGAAGGTG GCAGAGAAGC GGGCGCTGGA CAGGACCAGC CAGATGATGA TCGAGAAGAC GATCGAAGAC GGTGTCGAGA CCGCCTGGGA CCGGCTGCAG GCGCAGCAGC CGCAGTGCGG GTTCGGGCAG CTCGGGCTCT GCTGCAACAA CTGCTCGATG GGACCCTGTC GGATCGATCC CTTCGGCGGA ACGCCGACGC GGGGTGTCTG CGGGGCGACG GCGGACACGA TCGTGGCGAG GAACCTGCTC GACGACCTCG CCGTCGGTGC GGCCTCCCAC TCGGATCACG GCCGGGAGGT CGTCGAGACG CTCCTCCATA CCGCAGAGGG GAAGGCCCAG GGTTACCAGA TCACGGATGC GGTGAAGTTG CACCTGATCG CCGAGGAGTA CGGGATTCCG ACCGAGGGCC GGGATGACGC AGCGATCGCC GGCGACCTCG CCCGGGGGAT GCTCGAGGAG TTCGGATCGA TCAAGAACCG GATCCAGCTG GTTGACCGGG CCCCGGAGAA GACCCGGAAG GTCTGGCAGG ATCTGAACAT CACTCCCCGG GGCGTGGACC GGGAGGTCGT GGAGGCGATG CACCGGGTGC ATATGGGGGT CGACGCCGAT TATCTGAACA TCCTGCTGCA TGCGATGCGG ACCTCGCTCT CTGACGGCTG GGGCGGGTCG ATGATGGCGA CCGACTGCTC CGATATCCTC TTCGGGACCC CGACGCCGGT CACTTCCTTG GCGAACCTGG GCACGCTCAG CAGGGATAAA GTCAATATCG TCCTCCACGG ACACAACCCG GTCCTCTCCG AGATGATCCT CAAGGCGGTG GAGGAGCCGG CGGCCAAGGA GGCGGCCCGG GCGAAGGGTG CGGCCGGCAT CAACCTGGTG GGGATGTGCT GCACCGGGAA CGAGGTGCTG ATGAGGCACG GGATCCCGAT CGCCGGCACG ATCCTGGACC AGGAACTGGC CATCGCCACC GGAGCCGTGG AGGTGATGGT GATCGATTAC CAGTGCATCT TCCCGTCCAT CACGGCGACG GCCAGCTGTT ATCACACGAA GGTCGTGGCG ACCAGTGAGA AATCCAAGGT GCCGGGTTCG ATCTACAAGG AGTTCAGGCC CGAGACGGCG CTGGACACGG CCAGGGAGAT CGTGGGGATG GCGATCGAGA ACTTCGCCGC CAGGGACCCG AACCGGGTCA GGATCCCGGA CCGGCCGGTC CAGATGATGG CCGGGTTCTC TGAGGAGGCG ATCAGAAAGG CCCTCGGAGG TACCTATAAA CCTTTGATCG ATGCGATCGT CGCCGGCACG ATCAAGGGGG TTGTCGGGGT TGTCGGCTGC AACAACCCGA AGATCAAACA GGACTCAGGA CATATCGCCC TCGGCAGGGA ACTGATCAGG CGGAACATTC TCGTCGTCGA GACCGGGTGT GCGGCCATCG CGAGCGGAAA GGCCGGCCTG CTCGTCCCCG AAGCGGCGGA CCTGGCCGGG GACGGGCTCA AGGCGGTCTG CAAGGCCCTG GGGATCCCGC CGGTGCTGCA CATGGGCTCA TGTGTCGACT GCTCCCGGAT CCTGGTGATG GCCTCGCATG TGGCGGACGA ACTGGGCGTT GGGATCGGCG ATCTGCCGCT CGGGGGGGCC GCCCCCGAAT GGTACTCGCA GAAGGCGATC GCGATCGGGA CGTACTTCGT CTCGTCGGGT GTCTACACGG TGCTCGGGAT CCCCCCGAAG ATCTTCGGCA GCCAGAACGT CCTCTCCCTC CTCGCGAGTC AACTCACAGG CGTGGTGAAC GCCTCCTTCG CAGTCGAACC GGACCCCGTC AAGGCGGCCG ACCTGCTCGA GGCGGAGATC GATCGGAAGC GACAGGCGCT CGGGATCTGA
|
Protein sequence | MYDPIAIKKV AEKRALDRTS QMMIEKTIED GVETAWDRLQ AQQPQCGFGQ LGLCCNNCSM GPCRIDPFGG TPTRGVCGAT ADTIVARNLL DDLAVGAASH SDHGREVVET LLHTAEGKAQ GYQITDAVKL HLIAEEYGIP TEGRDDAAIA GDLARGMLEE FGSIKNRIQL VDRAPEKTRK VWQDLNITPR GVDREVVEAM HRVHMGVDAD YLNILLHAMR TSLSDGWGGS MMATDCSDIL FGTPTPVTSL ANLGTLSRDK VNIVLHGHNP VLSEMILKAV EEPAAKEAAR AKGAAGINLV GMCCTGNEVL MRHGIPIAGT ILDQELAIAT GAVEVMVIDY QCIFPSITAT ASCYHTKVVA TSEKSKVPGS IYKEFRPETA LDTAREIVGM AIENFAARDP NRVRIPDRPV QMMAGFSEEA IRKALGGTYK PLIDAIVAGT IKGVVGVVGC NNPKIKQDSG HIALGRELIR RNILVVETGC AAIASGKAGL LVPEAADLAG DGLKAVCKAL GIPPVLHMGS CVDCSRILVM ASHVADELGV GIGDLPLGGA APEWYSQKAI AIGTYFVSSG VYTVLGIPPK IFGSQNVLSL LASQLTGVVN ASFAVEPDPV KAADLLEAEI DRKRQALGI
|
| |