Gene Information |
Locus tag | Mthe_0019 |
Symbol | |
ID | 4462257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 15964 |
End bp | 17475 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639699026 |
Product | hypothetical protein |
Protein accession | YP_842462 |
Protein GI | 116753344 |
COG category | [K] Transcription [S] Function unknown |
COG ID | [COG1900] Uncharacterized conserved protein [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | |
|
|
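The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(15964, 17475)
print(length)                  # 1512
print(protein_length(length))  # 503
```

Under these assumptions the values reproduce the record's "Gene Length" of 1512 bp and "Protein Length" of 503 aa.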
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAGCAA AAAAATCGCT ATCCGAGATC AACGAGCGGA TACGTGACGG AAGCGTCAGG GTTGTGACCG CGGAGGAGAT GCCCTCCATC GTCGAGGAGC TGGGGCCGGA TGGCGCTGTG CGCGAGGTTG ATGTCGTGAC CACCGGCACC TTCGGGGCGA TGTGCTCCTC TGGAGTCTTC CTGAACCTCG GTCACAGCGA TCCGCCGATA AAGATCTCAA GGGCCTGGCT TAACCAGGTT GAGGCATACG GTGGAGTTGC TGCAGTCGAT CTATTTCTCG GAGCGACCCA GCCATCTGAG GACAGGGGCA TAGAGTATGG TGGGGCTCAT GTCATAGAGG ATCTCGTCTC CGGAAGAGCT GTGGACGTGA TGGGCGAGGG CGTGGGCACA GACTGTTATC CCAGGATGGA GATCGAGACG ACCCTGCATC TGGAGGACCT AAACCAGGCG CTCATGGTCA ATCCAAGAAA TGCTTATCAG AGGTACAACG CTGCCACAAA CTCCTCCGAC AGGACCCTTC ACACGTATAT GGGAACGCTT CTTCCACACT TCGGGAATGT CCATTACAGC GGCGCTGGCG TTCTCAACCC GATCTCCAAC GATCCGGGAT TCGAGTACAT AGGGACTGGG GTCAGGATAT TCCTCGGCGG CGCGCAGGGT TACATCGTTG GTCCAGGCAC GCAGCACAGC CCCGGGACAG GCTTCGCCAC GCTGATGGTC TCGGGAGATC TCAAGAGGAT GAGCAGCGAG TTCCTGAGGG CGGCCACGTT CACAAAATAT GGTCCGACGC TTTACGTTGG CGTTGGCGTC CCGATACCGA TACTGAACGA GAGGCTCGCG CTGAGCACTG CTGTCAGGGA CAGCGACATC ACCGTTCCTG TTGTCGATTA TGGTGTGCAG CGAAGGGACA GGCCTGTGCT GAAGAGCACG AGCTATGCGG AACTGAGGTC AGGTTTCGTG GAGATTAATG GAAAAGAGGT TCCAACCGCA TCGCTATCAA GCTTCCACAT GGCAAGAATG GTCGCGGGAA CCCTGAAGAA GTGGATCGAA AGGGGGGAGT TCCTGCTGAC GGAGCAGGCA GAGCCTCTCT CAAAGAGTGG CGTCAGCAGG CCGATGAAGC AGACGAAGGA GCTTCCCTAC GTCGGGGATG TCATGAACAG GGATGTTGTT ACAGTGGGCG AGAACATCAG CGTGCCTGAG GCAGCGCGCG TCATTGTCGG AAGTCGGTTC GATCATCTCC CCGTCGTCTC TGATGATGGA AAGCTCATGG GAATAATAAC GACATGGGAC ATATCCAAGG CTGTTGCAAA TGGCAACATC TCAAGAGTCT CGGAGATAAT GACCAGGCGG GTGTATACGG CGACGCCGGA TGAGCCGATA GAGCTGGCTG CGAGGACGAT GGATATCCAC AGCATCTCCG CGCTGCCGGT TGTGGATAAG GATAACCGCG TCATAGGGAT GATAACAAGC AACGACTTGA GCAGGCTGTT TGCCGGGAGA CGGTCGATAT GA
|
Protein sequence | MEAKKSLSEI NERIRDGSVR VVTAEEMPSI VEELGPDGAV REVDVVTTGT FGAMCSSGVF LNLGHSDPPI KISRAWLNQV EAYGGVAAVD LFLGATQPSE DRGIEYGGAH VIEDLVSGRA VDVMGEGVGT DCYPRMEIET TLHLEDLNQA LMVNPRNAYQ RYNAATNSSD RTLHTYMGTL LPHFGNVHYS GAGVLNPISN DPGFEYIGTG VRIFLGGAQG YIVGPGTQHS PGTGFATLMV SGDLKRMSSE FLRAATFTKY GPTLYVGVGV PIPILNERLA LSTAVRDSDI TVPVVDYGVQ RRDRPVLKST SYAELRSGFV EINGKEVPTA SLSSFHMARM VAGTLKKWIE RGEFLLTEQA EPLSKSGVSR PMKQTKELPY VGDVMNRDVV TVGENISVPE AARVIVGSRF DHLPVVSDDG KLMGIITTWD ISKAVANGNI SRVSEIMTRR VYTATPDEPI ELAARTMDIH SISALPVVDK DNRVIGMITS NDLSRLFAGR RSI
|
| |
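The relationship between the gene sequence and the protein sequence above can be checked with a short sketch. It is an illustration under stated assumptions rather than the database's own method: the codon-to-amino-acid mapping is the standard one (which translation table 11 shares), the first codon is assumed to be a valid initiator and is therefore emitted as M (the record's gene starts with GTG while the protein starts with M), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 30 bases of the record are used in the example.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the blocks-of-ten spacing used in the record and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator, so it is written as M
    even when it is an alternative start such as GTG.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("GTGGAAGCAA AAAAATCGCT ATCCGAGATC"))  # MEAKKSLSEI
```

The output matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the record's "GC content" field.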