Gene Information |
Locus tag | Mthe_1097 |
Symbol | |
ID | 4463129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1185945 |
End bp | 1187645 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639700114 |
Product | amidohydrolase 3 |
Protein accession | YP_843520 |
Protein GI | 116754402 |
COG category | [C] Energy production and conversion |
COG ID | [COG1229] Formylmethanofuran dehydrogenase subunit A |
TIGRFAM ID | [TIGR03121] formylmethanofuran dehydrogenase subunit A |
|
|
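The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions match the values listed in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1185945, 1187645)
print(length)                     # 1701
print(protein_length_aa(length))  # 566
```

Both printed values agree with the Gene Length and Protein Length fields.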
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAATTA AAGGGGGCAT CGTCTACGAT CCTGCCAACG GGATCTTCGG CGAGGAGATG GATATATGCA TAGAGAACGG CCGGATAGCT GAGGACGCCG GCGGGGAGGT GATTGATGCC AGAGGTCTTC TGGTTATGCC TGGCGGTGTG GATGCGCACT CGCATATCGC AGGGCCGAAG CTGAACACTG GAAGGATAAT GCGCCCCGAC GACTCCAGGA TGGGAACTGA GCCGAGGACG AGGGTCTGCA GGCCGTCGAC GGGATATACG GTGCCTAACT GCTACGCCAT AGGCTACAGG TACGCCAGGC TTGGATACAC CACCGCGTTC GAGGCCGCGA CGCCGATAAT AGAGGCCCGC CACACGCATG AGGAGCTGGA GGAGATCCCA ATCGTTGACA AGGGCGCCCT TACGCTCTTC GGAAGCAACT GGACCGTGAT GGAGTGCGTG CGCGAGAACG ATATGGATAT GCTCGCTGCA TACGTTGCGT GGGGCTTGAG GGCCGCGCGC GGATACGGCG TTAAGATCGT GAATCCGGGC GGCGGCGAGG CATGGGGTTT CGGATCGAAC GTGAAGAGCG TGCACGATCC GGTGCCGCAC TTCGATGTAA CTCCAGCACA GATTATACGA GCGCTTGCAG AGGTCAACGA GCGTCTCAGA CTTCCACACT CGATACACCT GCACTTCAAC AATCTGGGCA GGCCTGGGAA TTACACCACA GCGATCGAGA CCCTTGAGCT GCTGAAGGAC ATAAAGCCGA GCAGGATGCG GCAGGTTGTG CATGTCGCGC ACATGCAGTT CTCAGCGTAT GGCGGCACAG GCTGGAAGGA CTTCGAGTCG AAGGCATCTG CGATAGCGGA GTACTTCAAC CAAACAAATC ATGCTACGAT GGATCTCGGC CAGATAATAT TCGGCCCTGC CACAACAATG ACCGCTGATG CGCCGCTGGA GTATGCGAAC GCCAGGATCG GGCACCAGAA ATGGTCGAAC CACGATATAG AGCTAGAGGA GTCCAGCGGT GTGGTGCCGT GGGTTTACAC GAGAAAGATG CCTGTCAACG CGGTCCAGTG GGCGATAGGG CTGGAGCTAG CGCTTCTCAC AAAAGATCCT TGGAAGGTGG TCATGACAAC AGACCATCCG AACGGCGGGC CGTTCGTCAA CTACCCTGAG ATAATATCGC TGCTGATGAG CAGGGAGAAG CGCGAGGAGG AGATGAAGAC GCTGCACGAG GTGGTGAGAT CAAGAAGCAC CCTACCATCG ATTGAGAGAG AGATGGATTG GTCAGAGATC GTGATAATGA CGAGAGCTGC ACCTGCGAGA ATTCTTGGCC TGGAGGACAA GGGGCATCTT GGCATCGGCG CTGATGGAGA TGTATCGATA TACAACATCA GACCAGATGA GATCGACCCG TCGAAGGATC ATGCAGTGGT GAAGGCAGGC ATGTCCAGGG CGAAGTACAC GATAAAGGGC GGTGCTGTTG TAGTAAGGGA CGGCGAGATA GTCGCAGCGC CACAGGGAAG GACATACTGG GTGGATGCCG CTGTTCCTGA GAGCGATATG GATCGCATGC TGTCTTCTCT CAAAGAGAAG TTCGAGAGGT ACTACAGCAT AAGGATGTCG AACTACATGG TGCAGGACGC GTATGTACCG AATCCGGTTG TGGTGAATGC TGGAATGCAG CCTCTGAAAG AGGTGATCTG A
|
Protein sequence | MIIKGGIVYD PANGIFGEEM DICIENGRIA EDAGGEVIDA RGLLVMPGGV DAHSHIAGPK LNTGRIMRPD DSRMGTEPRT RVCRPSTGYT VPNCYAIGYR YARLGYTTAF EAATPIIEAR HTHEELEEIP IVDKGALTLF GSNWTVMECV RENDMDMLAA YVAWGLRAAR GYGVKIVNPG GGEAWGFGSN VKSVHDPVPH FDVTPAQIIR ALAEVNERLR LPHSIHLHFN NLGRPGNYTT AIETLELLKD IKPSRMRQVV HVAHMQFSAY GGTGWKDFES KASAIAEYFN QTNHATMDLG QIIFGPATTM TADAPLEYAN ARIGHQKWSN HDIELEESSG VVPWVYTRKM PVNAVQWAIG LELALLTKDP WKVVMTTDHP NGGPFVNYPE IISLLMSREK REEEMKTLHE VVRSRSTLPS IEREMDWSEI VIMTRAAPAR ILGLEDKGHL GIGADGDVSI YNIRPDEIDP SKDHAVVKAG MSRAKYTIKG GAVVVRDGEI VAAPQGRTYW VDAAVPESDM DRMLSSLKEK FERYYSIRMS NYMVQDAYVP NPVVVNAGMQ PLKEVI
|
| |
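The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling, of how the blocks could be joined, how a GC percentage could be computed, and how the gene sequence could be translated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; the sketch does not special-case alternative start codons, which is harmless here because this gene begins with ATG. All function and variable names are hypothetical.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(raw: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGATAATTA AAGGGGGCAT CGTCTACGAT"
print(translate(prefix))   # MIIKGGIVYD
print(gc_percent(prefix))  # 40.0
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives the figure to compare against the GC content field.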