Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0644 |
Symbol | |
ID | 4462284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 678262 |
End bp | 679476 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639699652 |
Product | hypothetical protein |
Protein accession | YP_843074 |
Protein GI | 116753956 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00156729 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCGA TGCTCTGTCT TATCAGCGAG CAGCATGTTC CGAACCTGCT GGGCGTTCAC GAGCTGCGGC CGGATCTCCT CGTGCTGCTT GAGACCGAGG GGATGAAAAG GAGGGAGGCT GCAAACAGAT TCCTGAAAGC CCTTGCGATC GGAGGTCAGG ATTACCTAAC AAGAAATGAG ATCGTGCCGC TGGAGGATGG TGACTCAATA GAGGAGACTG AGAGGGCGCT GAAAGGGGTC TATGAGAGAT ACAGAGATGC GGAGTGGATC GTGAACATCA CAGGCGGCAC GAAGCCGATG AGCATAGGAG CATACGGGTT TTTCAGGCAA AAGAAGAATG CCAGGATAAT CTATGTCTCC GCGTCTGACC AGTCGAGGGC GCTGGACTTC TCGGGTGGAG CGGACATACC TCTGAGCCAC AGGATATCTG TGGCTGAGTT CCTCGCAGGC TATGGGTTTG ATGTGCTCCA TTACGACAAG GTCCAGGAGA ACGAGGAGCG GAGCAGGAGG TGGCTTGGTC TTGCAGCAGA GATCGCGGCG AGGAGCCAGA ATGGCGCCAT TCTCGGGCTT CTCGCGAATT TATCGAGGAT ATCGAAAGAG CGGAGGGGCA GGTACAGGGG ACTCAAGATC TCAGAATCAG ATGGTCTATT TCTGAACGAT GGTCATCTGC GTGAGATGAT CGCTTCGAGC TTTGGTCTGG CATGTGATGG TGGGCACTTC ACAGGCGCCC TGGATAAATA CGCTGTCAGG TTCCTCACAG GCGGCTGGCT TGAGGTCTTC ACATGGGGGT TGCTGAGGGG GCTTGATCGT GTCTGGGATG TGCATCTCGG TTTGCAGATT GGAATGAAGA ACGAGAAGCT CCAGAACGAT CTGGATGTTG TGTTCATGAC AGATCAGTCC CTCAGGATCG TGGAGTGCAA GAGCGGCGGG CAGGAGCACG ACAGGGAGGG GAGTGATACG CTGTACAAGA TTGAGGCGAT ACGGAAGCAG CTCGGAGCAC TTCGTGTTCG ATCCTATCTT GTCACGACCT CTGATAACGT GATCGATTCC GAGACCGGTA ATATCAAGGA GCATCTGGAG GACAGATCGA GGCTCTATGA GTGCAACATT GTGAAGCCTG AAGATGTTCG CAGTCTTGCG CAGATGTACC TCGCAGGTGA CGTGCGGCTG AACGCGAGGG TTGCGCAGGT CTTCAACATA CGGCAGGCGG TTTGA
|
Protein sequence | MKAMLCLISE QHVPNLLGVH ELRPDLLVLL ETEGMKRREA ANRFLKALAI GGQDYLTRNE IVPLEDGDSI EETERALKGV YERYRDAEWI VNITGGTKPM SIGAYGFFRQ KKNARIIYVS ASDQSRALDF SGGADIPLSH RISVAEFLAG YGFDVLHYDK VQENEERSRR WLGLAAEIAA RSQNGAILGL LANLSRISKE RRGRYRGLKI SESDGLFLND GHLREMIASS FGLACDGGHF TGALDKYAVR FLTGGWLEVF TWGLLRGLDR VWDVHLGLQI GMKNEKLQND LDVVFMTDQS LRIVECKSGG QEHDREGSDT LYKIEAIRKQ LGALRVRSYL VTTSDNVIDS ETGNIKEHLE DRSRLYECNI VKPEDVRSLA QMYLAGDVRL NARVAQVFNI RQAV
|
| |