Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA2858 |
Symbol | |
ID | 3103082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 3050903 |
End bp | 3051799 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637171986 |
Product | formylmethanofuran--tetrahydromethanopterin formyltransferase |
Protein accession | YP_115251 |
Protein GI | 53803053 |
COG category | [C] Energy production and conversion |
COG ID | [COG2037] Formylmethanofuran:tetrahydromethanopterin formyltransferase |
TIGRFAM ID | [TIGR03119] formylmethanofuran--tetrahydromethanopterin N-formyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0626869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCATCA ACGGCGTACA CATCGACGAG ACGTTCGCGG AGGCGTTCCC GATGCGCGCC ACGCGCGTGA TCGTCACGGC CCAGAACCTC AAATGGGCCC ATCACGCTGC CCAGGCCATG ACAGGCTTCG CGACATCGGT CATCGCCTGC GGCTGCGAAG CCGGTATCGA GCGGGAACTC GATCCCGCCG AAACGCCGGA TGGACGTCCC GGCGTATCGG CGCTGCTGTT CGCCATGGGC GGCAAGGGGC TGGCGAAGCA ACTGGAGACA CGGGCCGGGC AATGCGTGCT GACCTCGCCT ACCTCCGCGC TGTTCGCAGG GATCGTCGAA GGCGAGCAAA TCCCCCTGGG GAAGAATCTG CGCTATTTCG GTGACGGCTT CCAGATTTCC AAGCGGATCG GCGGCAAGCG CTATTGGCGC ATACCGGTCA TGGACGGAGA GTTCCTCTGC CAGGAGACCA CCGGGATGAT CAAGGCGGTC GGCGGCGGCA ATTTTCTCAT CCTGGCCGAA TCGCAGCCCC AGGCGCTGGC CGCGTGCGAG GCGGCGATCG AGGCGATGCG CAGGATTCCC AACGTCATCA TGCCGTTCCC CGGCGGCGTC GTCCGTTCGG GTTCCAAGGT CGGCTCGAAG TACAAGACCC TGCCGGCGTC TACCAATGAC GCATTCTGTC CGACCTTGAA AGGGCAAACG CGGACCGAGC TTTCGCCGGA AATCGAGTCG GTGATGGAAA TCGTGATCGA TGGCTTGAGC GATGCCGACA TCGCCAAGGC GATGCGCGCA GGCATCGAGG CGGCTTGCGG ACTGGGCGCG GCCAACGGCA TCCGGCGCAT CAGCGCCGGC AACTACGGAG GCAAGCTCGG CCCATTCCTC TTCCACCTCC GCGAAATCAT GGCTTGA
|
Protein sequence | MIINGVHIDE TFAEAFPMRA TRVIVTAQNL KWAHHAAQAM TGFATSVIAC GCEAGIEREL DPAETPDGRP GVSALLFAMG GKGLAKQLET RAGQCVLTSP TSALFAGIVE GEQIPLGKNL RYFGDGFQIS KRIGGKRYWR IPVMDGEFLC QETTGMIKAV GGGNFLILAE SQPQALAACE AAIEAMRRIP NVIMPFPGGV VRSGSKVGSK YKTLPASTND AFCPTLKGQT RTELSPEIES VMEIVIDGLS DADIAKAMRA GIEAACGLGA ANGIRRISAG NYGGKLGPFL FHLREIMA
|
| |