Gene Information |
Locus tag | Mpal_2244 |
Symbol | |
ID | 7272541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2393188 |
End bp | 2394075 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643570856 |
Product | formylmethanofuran--tetrahydromethanopterin formyltransferase |
Protein accession | YP_002467260 |
Protein GI | 219852828 |
COG category | [C] Energy production and conversion |
COG ID | [COG2037] Formylmethanofuran:tetrahydromethanopterin formyltransferase |
TIGRFAM ID | [TIGR03119] formylmethanofuran--tetrahydromethanopterin N-formyltransferase |
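The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    last codon is a stop codon that does not encode an amino acid.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Values taken from the record above.
length_bp = gene_length(2393188, 2394075)
print(length_bp)                  # 888
print(protein_length(length_bp))  # 295
```

With the values in this record, 2394075 - 2393188 + 1 gives 888 bp, and 888 / 3 = 296 codons, one of which is the stop codon, leaving 295 amino acids.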
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00428201 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0387564 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTTA ATGGAGTACC AATCGATGAC ACCTATGCCG AGGCTTTCCC AATCTGGATC TCGCGGGTGC TCATCACTGC TGCGACGATG GACCTCGCCT ACACCGCTGC TGTTGAGGCA AGCGGGTTCG CAACATCATG TATAGGATGT ACTGCTGAGT GCGGTGTCGA CACCTTCGTC CCCGCCGAGG AGACACCTGA TGGAAGACCT GGTTACTCGA TGCTGATCTG CAACCCATCC AAGAAGAAAC TGAAAGAACA GTTGATCGAG CGAATCGGCG AGTGCATCCT GACCGCTCCG ACCACAGCAG CCTTTAACGG TCTGCCGGGC GCTGAGGAGA AGATCCCAGT GAAGCTCCAC TTCTTCGGAG ATGGCTATGA ATACCAGCAG AAGGTCGGAG ACCGGGACTG CTGGGCCATC CCGCTGATGG GTGGCGAGTT CATCATCGAG GAGGAGTACG GTGCAGTCAA GGGAGTCGCA GGCGGTAACT TCTTTGTGAT GGGCGAGAAC CAGATGGCTG CACTGGTCGG GGCCCAGGCC GCCAGCGATG CGATCAGCGG CGTCGAGGGT GCCATCACGT CATTCCCAGG TGGTATCGTT GCGAGTGGCT CGAAGGTCGG CTCGAAGAAG TACAAGTTCA TGAATGCCTC GACCAACGAG GCCTACTGCC CGTCCCTGAA GGGAAAGGTC GAGGACTCTA AGATCCCTGA CGGCGTGAAC TCGGTCTTTG AGATCGTCAT AGATGGTGTC GACGCCGAGA CCGTCGCCGA TGCAATGGGG GCGGGGATCA GAGCGGCATG CATGATCCCT GGCGTGAAGT TCATCAGCGC AGGGAACTAC GGCGGCAGCC TCGGCCCGCA CCAGTTCCAG TTAAAGGACC TCTTCTGA
|
Protein sequence | MEFNGVPIDD TYAEAFPIWI SRVLITAATM DLAYTAAVEA SGFATSCIGC TAECGVDTFV PAEETPDGRP GYSMLICNPS KKKLKEQLIE RIGECILTAP TTAAFNGLPG AEEKIPVKLH FFGDGYEYQQ KVGDRDCWAI PLMGGEFIIE EEYGAVKGVA GGNFFVMGEN QMAALVGAQA ASDAISGVEG AITSFPGGIV ASGSKVGSKK YKFMNASTNE AYCPSLKGKV EDSKIPDGVN SVFEIVIDGV DAETVADAMG AGIRAACMIP GVKFISAGNY GGSLGPHQFQ LKDLF
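The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, estimate GC content, and translate the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in amino acid assignments). All function and variable names are illustrative.

```python
# Compact encoding of the standard codon assignments, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGAATTTA ATGGAGTACC AATCGATGAC"))  # MEFNGVPIDD
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record once its block spacing is removed, and `gc_fraction` can be compared against the reported GC content.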