Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1341 |
Symbol | |
ID | 4462146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1444813 |
End bp | 1446684 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639700357 |
Product | carbon-monoxide dehydrogenase, catalytic subunit |
Protein accession | YP_843756 |
Protein GI | 116754638 |
COG category | [C] Energy production and conversion |
COG ID | [COG1151] 6Fe-6S prismane cluster-containing protein |
TIGRFAM ID | [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAACA AAACTGCAAC GGAAAGCATC GATCCGGCCA CCACGAAGAT GCTGCTCGAG GCACGCAGGG CTGGGCTGGA GACTGTTTGG GATCGTTTCA ATAAACAGCA ACCGCAGTGT GGATTCGGAC AGCTCGGCAT TTGCTGCCGC AACTGCAACA TGGGTCCATG CAGAATAGAT CCCTTTGGGG ATGGACCTGA TAAGGGCGTA TGTGGCGCGA CAGCTGACAT CATTGTTGCG CGCAACCTCC TGAGGATGAT AGCTGCAGGC GCAGCAGCAC ATGCTGACCA TGCAAGGGAT GCAGTCATAG TGTTCAAAGA GGCATTAGAG GGAAGGGCAA GGAGTTATCA GATCAGGGAT GAGGCGAAGC TCAGAGAGCT TGCAGCCGAG TACAACATCT CGAGCTGGGG GGAGCAGGGA GGTGCAGCGG ATCTAGCAAA CGCCCTTCTC TCAGACTTCG GAAGGCAGGA GGGATATGTG ACGCTCACCC GCAGAGCCCC GGAGAAGCGC CGGAAGATCT GGGAGAGTCT GGGGATAAGC CCGAGAGGCA TCGACAGGGA GATCGTGGAG TGCATGCACC GCACCCATAT GGGTGTGGAC AACGACCCGC TTCACATCCT GCAGCATGGG CTGAGAACCA GCATCGCCGA CGGCTGGGGC TCGTCGATGA TTGCGACAGA GGTGCAGGAT ATTCTCTTTG GAACGCCATC TCCCAGGAGA TCAGAGGCGA ACCTGGGAGT GCTGAAGGAG GATGAAGTTA ACATAATCGT CCACGGCCAC GAGCCCATCC TCTCGGAGAT GATTGTTGAA GCTGCATCAG ATCCGGAGAT GATCAAGCTC GCAAAGAGTG TCGGAGCTAA CGGAATCAAC GTCGCTGGGA TATGCTGCAC CGGAAATGAG GTGCTGATGC GCCATGGAAT TCCAGTTGCA GGAAACTTCC TCCAGCAGGA GCTCGCTGTA ATCACGGGCG CTGTTGAGGC GATGGTGGTT GATGTGCAGT GCATCATGCC GTCGCTAGGC GGCCTCGCCG CGTGCTATCA CACGAAGTTC ATCTCGACCT CTCCAAAGGC AGAGTTCCCC GGGGCGCTCA GGATGGAGTT CAGCGAGGAG AGGGCAGCAG AGATCGCAAG GGAGATCGTG AGAACCGCGG TCGAGAACTA CCCGAAGCGG GACAGGGGCA GAGTTCTCAT ACCGAGGGAG AGGAGCGAGT GCATGGTTGG GTTCAGCGTC GAGGCCATCC TGAAGGCGCT CGGCGGAACA CCGCAGCCGC TGATAGATGC GATAGTAAAC GGATCCATAA AGGGCATCGC GGCGGTCGTC GGGTGCAACA ACCCAAAGGT TCCGCACGAC CATGGTCACG TCAACCTCGT CAGGGAGCTG ATCAGGAACA ACGTGCTCGT AGTAACCACA GGATGCAACG CGATCGCCTG CGCAAAAGCA GGTCTGCTCA GGCCTGCGGC TGCAAAGGAG GCCGGAGACG GTCTCAGAGG TGTCTGCGAG TCCCTCGGCG TCCCGCCGGT TTTGCACATG GGCTCATGCG TTGACATAAG CAGAATTCTT GTAGTGGCAG CAGCGATCGC AAACAAGCTT GGCGTGGACA TAAGCGATCT TCCAGTGGCA GGCGCGGCTC CAGAGTGGAT GAGCGAGAAG GCGGTTAGCA TCGGGGCATA TGTTGTTGCA TCCGGAGTGT TTACAGTGCT GGGCACGGTT CCGCCTGTTC TCGGAAGCCC TGTGGTTACG AGAATCCTGA CGAAGGACCT CGGCGATGCT GTTGGGGCGA CGTTTGCAGT AGAGCCGGAT CCGTTCAAGG CATCCAAACT CATTATAGAA CACATTGAGA GCAAGAGAAG GGCGCTGGGT CTGAAGGTGT GA
|
Protein sequence | MANKTATESI DPATTKMLLE ARRAGLETVW DRFNKQQPQC GFGQLGICCR NCNMGPCRID PFGDGPDKGV CGATADIIVA RNLLRMIAAG AAAHADHARD AVIVFKEALE GRARSYQIRD EAKLRELAAE YNISSWGEQG GAADLANALL SDFGRQEGYV TLTRRAPEKR RKIWESLGIS PRGIDREIVE CMHRTHMGVD NDPLHILQHG LRTSIADGWG SSMIATEVQD ILFGTPSPRR SEANLGVLKE DEVNIIVHGH EPILSEMIVE AASDPEMIKL AKSVGANGIN VAGICCTGNE VLMRHGIPVA GNFLQQELAV ITGAVEAMVV DVQCIMPSLG GLAACYHTKF ISTSPKAEFP GALRMEFSEE RAAEIAREIV RTAVENYPKR DRGRVLIPRE RSECMVGFSV EAILKALGGT PQPLIDAIVN GSIKGIAAVV GCNNPKVPHD HGHVNLVREL IRNNVLVVTT GCNAIACAKA GLLRPAAAKE AGDGLRGVCE SLGVPPVLHM GSCVDISRIL VVAAAIANKL GVDISDLPVA GAAPEWMSEK AVSIGAYVVA SGVFTVLGTV PPVLGSPVVT RILTKDLGDA VGATFAVEPD PFKASKLIIE HIESKRRALG LKV
|
| |