Gene Information |
Locus tag | Mthe_1643 |
Symbol | |
ID | 4462515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1788136 |
End bp | 1789026 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639700662 |
Product | formylmethanofuran--tetrahydromethanopterin formyltransferase |
Protein accession | YP_844050 |
Protein GI | 116754932 |
COG category | [C] Energy production and conversion |
COG ID | [COG2037] Formylmethanofuran:tetrahydromethanopterin formyltransferase |
TIGRFAM ID | [TIGR03119] formylmethanofuran--tetrahydromethanopterin N-formyltransferase |
|
|
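The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1788136
END_BP = 1789026


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 891 296
```

Under those assumptions the result agrees with the reported Gene Length (891 bp) and Protein Length (296 aa).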
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGATTA ATGGTGTGGA GATAGATGAT ACGTTCGCGG AGGCATTCCC GATAAAGATA GGGAGGGTGC TTATAACCGC GATAAGTGAG CGCTGGGCGC TTGAGGCGGC GCGTGAGGCC ACGGGCTTCG GGACCTCTGT GATAATGTGC CCCGCCGAGG CTGGTATCGA ATGCATCGTT CCGAGCACCG AGACGCCCGA CGGGAGGCCG GGAGTGTACA TACAGATATG TAACATGAGC TGGAAGAGCC TGGAGACGTC GCTCCTGGCA CGCATAGGCC AGTGTGTCCT CACAGCTCCC ACGACAGCAG TATTCAACGG TCTGCCAGAG GCAGAGAAGC AGTTCGACAC GGGAAAGAAG CTCGGCTACT TCGGAGATGG ATACCAGTGC GAGATCAAGT ACTGCAACAG GAATTTCTGC AAGATCCCCA TCATGGAGGG AGATTTTCTG GTGGAAGAAA CGATCGGGGC TGTGGATGGG ATAGCCGGTG GCAACTTCTA CATTCTCGGG CAGAACCAGC CTGCTGCGCT CATGGCCGCT GAGGCCGCGG TCGATGCGAT AAGCAAGCTC AGAGGGACGA TAACGCCGTT CCCCGGAGGC GTGGTAGCGA GCGGCTCCAA GGTCGGGAGC AGGTACAAAT TCCTGAAGGC CTCTACGAAT GTGGCGTTCT GTCCCAGCCT GAAGGAGCAG GGTGTCTGCG CGCTTCCTGA GAACGTCACC GCAGGGTACG AGATCGTGAT CAACGGGATA AGCAGGGAGG CGATCGAGGA AGCGATGCGC GTCGGGATCA GAGCTGCATG CACCGTGCCG GGCGTGCTCA GGATATCCGC GGGGAACTTC GGTGGGAAGC TGGGGCCGCA TCAGTTCCAT CTGCACAGGA TACTTGATTG A
|
Protein sequence | MEINGVEIDD TFAEAFPIKI GRVLITAISE RWALEAAREA TGFGTSVIMC PAEAGIECIV PSTETPDGRP GVYIQICNMS WKSLETSLLA RIGQCVLTAP TTAVFNGLPE AEKQFDTGKK LGYFGDGYQC EIKYCNRNFC KIPIMEGDFL VEETIGAVDG IAGGNFYILG QNQPAALMAA EAAVDAISKL RGTITPFPGG VVASGSKVGS RYKFLKASTN VAFCPSLKEQ GVCALPENVT AGYEIVINGI SREAIEEAMR VGIRAACTVP GVLRISAGNF GGKLGPHQFH LHRILD
|
| |
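The sequence fields can be checked against the Gene Information table in a similar way. The sketch below is an illustration under stated assumptions, not the database's own method. It assumes the gene sequence is shown in the coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied even though the strand is listed as "-"), and it relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
# Codon table in the conventional TCAG order (same assignments as the
# standard code, which translation table 11 shares for sense codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGAGATTA ATGGTGTGGA GATAGATGAT"))  # MEINGVEIDD
```

Passing the full Gene sequence value to `translate` should give a string that can be compared with the Protein sequence field once its spaces are removed with `clean`, and `gc_fraction` on the same input can be compared with the reported GC content.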