Gene Information |
Locus tag | Mthe_0898 |
Symbol | |
ID | 4462362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 974555 |
End bp | 976093 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639699917 |
Product | ELP3 family histone acetyltransferase |
Protein accession | YP_843326 |
Protein GI | 116754208 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG1243] Histone acetyltransferase |
TIGRFAM ID | [TIGR01211] histone acetyltransferase, ELP3 family |
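The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed 1539 bp) and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(974555, 976093)
print(length)                           # 1539
print(expected_protein_length(length))  # 512
```

Both values match the Gene Length and Protein Length rows of the record.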
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.589047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAGGG ATCTCAGGGA TATCGTGGAT GCGATCGCAT CCGGTATTAT CAGAAGCGAG GAGGATCTCG AAAAAGCGAA ACGGGCATTT GCAGCCACTT TGAATCTCTC AGAGATACCG GGCAACTCTG AGATACTCGC TGCAGCCAGG CCGGAGGAGA GGGCCCGGTT AAAGCTGTTG GTCAAAAAGC CGACCCGCAC GCTCTCAGGA GTAGCTGTGA TAGCTGTCAT GACGAGCCCG GCGCGCTGCC CTCATGGGAT CTGCATCCCA TGCCCCGGAG GCGTTCTGGG CGAGAGATGT TCACCGCAGA GCTACACGGG AAGAGAGCCT GCGGCGCTGA GAGCGGTACA GCATAACTTT GATCCCTACG CACAGGTGGC TGCGAGGCTC AAACAGCTCT CAGAGATAGG TCACCCTGTC GACAAGGCAG AGCTCATTCT CATGGGAGGG ACCATCACAT CCAGACCCCT CGGGTATCAG TGTTGGTTTG TGAGGAGATG CCTGGAGGCG ATGAACGATT ATCCTGACAC GAGGAGAAGC ACACGCTGGA GATCCTTCAG AGAGGTGGCA GATGCAAACA CCAGTGCAGC GGTGAGAAAC GTCGGCATAA CCTTCGAGAC CAGGCCTGAC TGGTGCCGTG AGAATCACAT CAAAAACATG CTTCTCCTCG GGGCGACGAA GGTCGAGCTG GGAGTGCAGA GCATCTACGA TGATGTCCTC AGTGCGATCA GGAGAGGCCA TTCTGTGGAG GAGACGATAA GAGCAAACCG TTTGTTGAGA GAGGCCGGGC TGAAGGTCGG GTTCCACATG ATGCCCGGGC TTCCGGGATC TGACCCTGAT AGAGATCTTA AGATGTTCAG GGAGCTTTTC GAGAGCAGCA ATTACCGGCC GGATTACCTC AAGATATACC CCACGCTTGT GATCGAGGGG ACGGAGCTCC ACAGGATGTG GATACGGGGA GATTATGAGC CGCTTTCGGA TGATGAGGCT GCTGAGTTGA TATCGCGCAT CAAGGAGATC CTCCCGAGGT ACACAAGGCT CCAGCGCGTG CAGAGAGATA TACCCGCGCA TCTCATAACT GCTGGCGTCA GGAAGAGCAA CCTCAGGCAG CTCGCCAGAA AGAGACTTGA GGAACGCGGT TTGAGGTGCA GTTGCATAAG ATGCAGAGAG GCCGGGCTTC GTGGTGTATC TGAGGGGGAT CTCTCGATGA ACATTGAGAG CTATGATGCA TGTGGAGCAA AGGAGCACTT CATATCGTTT GATACCGTGG ACGACACCCT CGTCGGATTC CTCAGACTCA GGCTGGGCGC TGAGGCCAGG ATCAGGGAGC TGCACGTCTA CGGCCCTCTC GTTCCTCTCG GAAGAAGGGG CGGATGGCAG CATCGCGGCA TCGGCGCGAG GCTCATAGAG AGGGCGGAGG AGATGGCGAG GGATCAGGGA TACGAGAGGA TCTCGGTCAC GAGTGGTATA GGCGTCAGGG GCTACTATGC ATCTCTGGGC TACAGGCTGA ACGCGCCGTA CATGGAGAAG ACGCTCTGA
|
Protein sequence | MYRDLRDIVD AIASGIIRSE EDLEKAKRAF AATLNLSEIP GNSEILAAAR PEERARLKLL VKKPTRTLSG VAVIAVMTSP ARCPHGICIP CPGGVLGERC SPQSYTGREP AALRAVQHNF DPYAQVAARL KQLSEIGHPV DKAELILMGG TITSRPLGYQ CWFVRRCLEA MNDYPDTRRS TRWRSFREVA DANTSAAVRN VGITFETRPD WCRENHIKNM LLLGATKVEL GVQSIYDDVL SAIRRGHSVE ETIRANRLLR EAGLKVGFHM MPGLPGSDPD RDLKMFRELF ESSNYRPDYL KIYPTLVIEG TELHRMWIRG DYEPLSDDEA AELISRIKEI LPRYTRLQRV QRDIPAHLIT AGVRKSNLRQ LARKRLEERG LRCSCIRCRE AGLRGVSEGD LSMNIESYDA CGAKEHFISF DTVDDTLVGF LRLRLGAEAR IRELHVYGPL VPLGRRGGWQ HRGIGARLIE RAEEMARDQG YERISVTSGI GVRGYYASLG YRLNAPYMEK TL
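The sequences above are printed in space-separated blocks of ten characters, so any tool reading them has to strip whitespace first. The sketch below shows one way to do that, then compute the GC fraction and translate a nucleotide string with the amino acid assignments of translation table 11. It is an illustration under stated assumptions, not the pipeline that produced this record: the helper names are hypothetical, only the uppercase bases A, C, G and T are handled (anything else becomes `X`), and the alternative start codons that table 11 allows are not treated specially, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11 in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGTACAGGG ATCTCAGGGA T"))  # MYRDLRD
print(translate("ACGCTCTGA"))                # TL
```

Passing the full gene sequence to `gc_fraction` and `translate` gives a way to compare against the GC content and protein sequence rows of the record.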