Gene Information |
Locus tag | Mbar_A3525 |
Symbol | |
ID | 3625413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 4520175 |
End bp | 4522049 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637702351 |
Product | carbon monoxide dehydrogenase |
Protein accession | YP_306974 |
Protein GI | 73670959 |
COG category | [C] Energy production and conversion |
COG ID | [COG1151] 6Fe-6S prismane cluster-containing protein |
TIGRFAM ID | [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit |
|
|
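The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminating stop codon.
    return length_bp // 3 - 1


# Values taken from the record for Mbar_A3525.
length = gene_length_bp(4520175, 4522049)
print(length)                     # 1875
print(protein_length_aa(length))  # 624
```

Both results match the Gene Length (1875 bp) and Protein Length (624 aa) fields in the record.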
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGG AAATATATGA GAAAAGTATT GACCTTGCAA GTCAGAAAAT GCTTCAGAAG GCCGAAAAAG AAGGCATTGA AACCGCCTGG GACAGGTATG AAAAACAACT GCCTCAATGC AGTTTCGGGC AGCTAGGAGT TTGCTGCCGG AATTGTAATA TGGGTCCATG CAGGATAGAT CCCTTCGGAG AGGGAGCTCA AAAGGGAATT TGCGGCGCTA CTGCAGACAT CATAGTTGCA AGGAATTTCC TTAGAATGAT TGCCGCAGGT GCAGCAGCTC ATTCCGACCA TGCAAGAGAC GCTGTTCTGA CCTTTAAGAA AATGAGTGAA GGAGAAGCTG GAAGTTACGG GATAAAAGAT GAAGCAAAAC TGCTATCTCT TGCCTCGGAA TATGGGATTT CCTCAGAGGG CAAAAACCTG GAGGAAGTCG CAGCCAAACT TGCTAACACA CTGCTTCAGG AATTTGGAAA ACAGGATGGA CATCTCCAGT GTACCAGAAG AGTTCCGGAA TCAAGGCTCA AGCTCTGGGC CGAGCTTGGA ATAGAGCCCA GGGGGATTGA CCGGGAAATC GTGGAATGCA TGCACAGGAC TCATATAGGT GTGGACAATG ATGCGGTTCA TATCCTGCTG CAAGGACTGC GCACAGGGCT TTCCGATGGT TGGGGAGGCT CGATGATAGC CACCGACGTC CAGGACATAC TCTTTGGAGT TCCACAGCCT AGAAGAAGCA CCGTTAATCT GGGGGTGCTG TCCAGAGACA AAGTAAATGT AATTGTCCAC GGGCACGAAC CTATTCTCTC AGAAATGATC GTGGAAGCTA CAGAAAACCC CGAGCTCCTT AAACTTGCAG AGGAAAAGGG TGCAGCAGGC ATCAACGTTG TAGGAATATG CTGTACTGGT AACGAGACCC TAATGCGTCA TGGGATTCCC ATAGCCGGAA ATTTCCTTCA GCAGGAGCTG GCAGTAATTA CAGGCGCTGT GGAAGCTATG GTTGTTGATG TCCAGTGTAT CATGCCCTCG CTTGGAGAAC TTACAGGCTG CTACCACACA AAGTTTATAT CTACATCTCC AAAGGCCGAT TTTCCCAACA CCGTAAGAAT GGAATTCCAT GAGGACAGAG CTTATGAAAC CGCAACAGAA ATCGTAAGGA CTGCCGTTGA AAACTTCCCT AACAGGGTTC CGGAAAAGGT TACCATACCT GAAGAAAAGC AGGAGTGCAT GGCAGGCTTC AGTGCCGAGG CTATACTCAG TGCCCTTGGA GGAAGCCCTG ATCCTCTCAT TGAGGCGATA AAAGGCGGGG CTGTTCGGGG AATAGGGGCT GTTGTCGGCT GCAATAACAT AAAAGTAAAG CACAACTACG GCCATGTCAA CCTCGTTAAA GAACTGATCG AAAACAATGT GCTTGTGGTC ACAACCGGCT GCAATGCAAT TGCATGTGCC GAAGCAGGGC TTCTTCTGCC GGAAGCTTCC GAACTTGCAG GCGATGGTTT AAAAGCCGTC TGTAAGGCTT TGGGCATACC CCCTGTTCTG CACATGGGTT CCTGTGTTGA TATCAGCCGT ATCCTTGTGC TTTCAGCTGC GATTGCAAAC CGCCTGGGTG TGGACATAAG TGATCTTCCA GCAGCAGGAG CAGCCCCTGA ATGGATGAGC GAAAAGGCGG TTAGCATCGG GGCTTATGTG GTTTCATCCG GCGTATTTAC CGTGCTCGGG ACTATTCCGG CTGTTCTTGG AAGTCAGGCT GTTACGTCCC TTCTGACAGG AGGTTTGAAC GACGTGGTCG GAGCAAGCTT TGCAGTTGAG 
CCTGATCCCT TTAAAGCTGC AGACCTGATG CTTGAGCACA TTGACGAAAA AAGAAAGGCA CTGGGCTTGA ACTAA
|
Protein sequence | MKTEIYEKSI DLASQKMLQK AEKEGIETAW DRYEKQLPQC SFGQLGVCCR NCNMGPCRID PFGEGAQKGI CGATADIIVA RNFLRMIAAG AAAHSDHARD AVLTFKKMSE GEAGSYGIKD EAKLLSLASE YGISSEGKNL EEVAAKLANT LLQEFGKQDG HLQCTRRVPE SRLKLWAELG IEPRGIDREI VECMHRTHIG VDNDAVHILL QGLRTGLSDG WGGSMIATDV QDILFGVPQP RRSTVNLGVL SRDKVNVIVH GHEPILSEMI VEATENPELL KLAEEKGAAG INVVGICCTG NETLMRHGIP IAGNFLQQEL AVITGAVEAM VVDVQCIMPS LGELTGCYHT KFISTSPKAD FPNTVRMEFH EDRAYETATE IVRTAVENFP NRVPEKVTIP EEKQECMAGF SAEAILSALG GSPDPLIEAI KGGAVRGIGA VVGCNNIKVK HNYGHVNLVK ELIENNVLVV TTGCNAIACA EAGLLLPEAS ELAGDGLKAV CKALGIPPVL HMGSCVDISR ILVLSAAIAN RLGVDISDLP AAGAAPEWMS EKAVSIGAYV VSSGVFTVLG TIPAVLGSQA VTSLLTGGLN DVVGASFAVE PDPFKAADLM LEHIDEKRKA LGLN