Gene Mthe_0898 details

Gene Information

Locus tag: Mthe_0898
Symbol:
ID: 4462362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 974555
End bp: 976093
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 57%
IMG OID: 639699917
Product: ELP3 family histone acetyltransferase
Protein accession: YP_843326
Protein GI: 116754208
COG category: [B] Chromatin structure and dynamics; [K] Transcription
COG ID: [COG1243] Histone acetyltransferase
TIGRFAM ID: [TIGR01211] histone acetyltransferase, ELP3 family
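
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(974555, 976093)
print(length, protein_length_aa(length))  # prints "1539 512"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.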


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.589047
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACAGGG ATCTCAGGGA TATCGTGGAT GCGATCGCAT CCGGTATTAT CAGAAGCGAG 
GAGGATCTCG AAAAAGCGAA ACGGGCATTT GCAGCCACTT TGAATCTCTC AGAGATACCG
GGCAACTCTG AGATACTCGC TGCAGCCAGG CCGGAGGAGA GGGCCCGGTT AAAGCTGTTG
GTCAAAAAGC CGACCCGCAC GCTCTCAGGA GTAGCTGTGA TAGCTGTCAT GACGAGCCCG
GCGCGCTGCC CTCATGGGAT CTGCATCCCA TGCCCCGGAG GCGTTCTGGG CGAGAGATGT
TCACCGCAGA GCTACACGGG AAGAGAGCCT GCGGCGCTGA GAGCGGTACA GCATAACTTT
GATCCCTACG CACAGGTGGC TGCGAGGCTC AAACAGCTCT CAGAGATAGG TCACCCTGTC
GACAAGGCAG AGCTCATTCT CATGGGAGGG ACCATCACAT CCAGACCCCT CGGGTATCAG
TGTTGGTTTG TGAGGAGATG CCTGGAGGCG ATGAACGATT ATCCTGACAC GAGGAGAAGC
ACACGCTGGA GATCCTTCAG AGAGGTGGCA GATGCAAACA CCAGTGCAGC GGTGAGAAAC
GTCGGCATAA CCTTCGAGAC CAGGCCTGAC TGGTGCCGTG AGAATCACAT CAAAAACATG
CTTCTCCTCG GGGCGACGAA GGTCGAGCTG GGAGTGCAGA GCATCTACGA TGATGTCCTC
AGTGCGATCA GGAGAGGCCA TTCTGTGGAG GAGACGATAA GAGCAAACCG TTTGTTGAGA
GAGGCCGGGC TGAAGGTCGG GTTCCACATG ATGCCCGGGC TTCCGGGATC TGACCCTGAT
AGAGATCTTA AGATGTTCAG GGAGCTTTTC GAGAGCAGCA ATTACCGGCC GGATTACCTC
AAGATATACC CCACGCTTGT GATCGAGGGG ACGGAGCTCC ACAGGATGTG GATACGGGGA
GATTATGAGC CGCTTTCGGA TGATGAGGCT GCTGAGTTGA TATCGCGCAT CAAGGAGATC
CTCCCGAGGT ACACAAGGCT CCAGCGCGTG CAGAGAGATA TACCCGCGCA TCTCATAACT
GCTGGCGTCA GGAAGAGCAA CCTCAGGCAG CTCGCCAGAA AGAGACTTGA GGAACGCGGT
TTGAGGTGCA GTTGCATAAG ATGCAGAGAG GCCGGGCTTC GTGGTGTATC TGAGGGGGAT
CTCTCGATGA ACATTGAGAG CTATGATGCA TGTGGAGCAA AGGAGCACTT CATATCGTTT
GATACCGTGG ACGACACCCT CGTCGGATTC CTCAGACTCA GGCTGGGCGC TGAGGCCAGG
ATCAGGGAGC TGCACGTCTA CGGCCCTCTC GTTCCTCTCG GAAGAAGGGG CGGATGGCAG
CATCGCGGCA TCGGCGCGAG GCTCATAGAG AGGGCGGAGG AGATGGCGAG GGATCAGGGA
TACGAGAGGA TCTCGGTCAC GAGTGGTATA GGCGTCAGGG GCTACTATGC ATCTCTGGGC
TACAGGCTGA ACGCGCCGTA CATGGAGAAG ACGCTCTGA
 
Protein sequence
MYRDLRDIVD AIASGIIRSE EDLEKAKRAF AATLNLSEIP GNSEILAAAR PEERARLKLL 
VKKPTRTLSG VAVIAVMTSP ARCPHGICIP CPGGVLGERC SPQSYTGREP AALRAVQHNF
DPYAQVAARL KQLSEIGHPV DKAELILMGG TITSRPLGYQ CWFVRRCLEA MNDYPDTRRS
TRWRSFREVA DANTSAAVRN VGITFETRPD WCRENHIKNM LLLGATKVEL GVQSIYDDVL
SAIRRGHSVE ETIRANRLLR EAGLKVGFHM MPGLPGSDPD RDLKMFRELF ESSNYRPDYL
KIYPTLVIEG TELHRMWIRG DYEPLSDDEA AELISRIKEI LPRYTRLQRV QRDIPAHLIT
AGVRKSNLRQ LARKRLEERG LRCSCIRCRE AGLRGVSEGD LSMNIESYDA CGAKEHFISF
DTVDDTLVGF LRLRLGAEAR IRELHVYGPL VPLGRRGGWQ HRGIGARLIE RAEEMARDQG
YERISVTSGI GVRGYYASLG YRLNAPYMEK TL
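
The sequences above are displayed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names are illustrative assumptions, and the example input is just the first line of the gene sequence rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above, used as example input.
first_line = "ATGTACAGGG ATCTCAGGGA TATCGTGGAT GCGATCGCAT CCGGTATTAT CAGAAGCGAG"
dna = clean_sequence(first_line)
print(len(dna), dna[:3])  # prints "60 ATG"
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value that can be compared against the GC content field of the record.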