Gene Mpal_0952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0952 
Symbol 
ID7272445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp985863 
End bp987443 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content62% 
IMG OID643569591 
Producthistone acetyltransferase, ELP3 family 
Protein accessionYP_002466027 
Protein GI219851595 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG1243] Histone acetyltransferase 
TIGRFAM ID[TIGR01211] histone acetyltransferase, ELP3 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAA CTGAAATCTA TCGGGAGATC CTCTCCCTCA TCTCTTCTGA ATCTGCAGAT 
CCGTACCAAA TCCTGCAGAT CAAACGTGCA GTCTGTGGCC GCTATGAACT CCCGGAGGTG
CCGAAGAACT CAGCGTTGCT CGCCGCAGCC CCTCCTGAAG AGTATGAACA GTTTCGGCGC
CTGCTGCTGG TGAAACCGAC CCGGACCCTC TCCGGGGTCG CACCGATCGC GGTGATGACC
TCTCCCTCTC CCTGTCCGCA TGGGAAGTGT CTCCCCTGTC CGGGGGGTCC GGACCATCCG
TTCAAATCCC CGCAGAGTTA CACGGGTGAA GAGCCGGCCG CAAAGCGGGC ACGTGAGCAT
AACTACGACC CGTATAACCA GGTCTTGGCC CGGCTCGATC AGTTCAGGCT GCTTGGCCAC
CACGTCGACA AGGCAGAACT GATCGTGATG GGCGGTACGA TGACCGCCCG GACCCCAGAG
TACCAGGAGT CGTTCGTGGC AGCCTGTCTG CAGGCGATGA ACGAGTCCGG GACCGGGGTT
CGAAGAGAAC TGCAGCCGCT GGAGACGGTT ATGCAGGCGA ACGAACGATC GGATGTCCGG
TGCGTCGCCC TGACCTTTGA GACCCGGCCG GACTGGTCCA GACGGGAACA TATCGACAGC
ATGCTCCGCC TCGGTGTCAC CAAGGTTGAG CTTGGGGTTC AGCACCTGGA CGACGAGATC
CTCACCTTCA ACCGTCGCGG CTGCACCGTC GCCGACATCG TGGAGGCGAA CACGTCCCTC
CGGGACGCCG GGATCAAGGT CGGGTTTCAC ATGATGCCGA ACCTGCCCGG TTCGACGATT
GAGAAGGACC GGGCGATGTT CCGGACCCTC TTCGACGATC CTCGATTCCG ACCGGACTTC
CTGAAACTGT ATCCCACGCT GGTCACACCC TATTCAGAGA TCGAAGACCT GCTGAACCGG
GGAGGGTATG CGCCGTATGC AGAGGACGAT CTGATCGACC TGATCGCCTA CGCCAAGGAA
CTCCTCCCTG AGTACGTCCG CCTCCAGCGG GTCCAGCGGG ACATTCCCGC CAAGCTGATC
GTTGCCGGCT CCCACCACAG CAACTTCCGG CAGCTGGCAG AGGGCCGGCT GCACGCCGCC
GGAAAACGGT GCCGGTGCAT CCGTTGTCGC GAGATAGGCC GCTACCCGCC GCCCAAAGGT
GCAGTGGCTG GGATCCAGAC CCTGGCCTAC GACTGTTGCG GAGGGCGGGA GTTCTTCATA
TCGGCGGTGG CTGGCGACTC GCTGATTGGG TTCCTCCGGC TGCGGTTTCC GGGGAATCCC
TGGCGTGAGG AACTGGCAGA TGCAGCTCTG GTTCGCGAGC TGCATGTCTA TGGAATGGTC
GTCCCGATCG GCGATGAGGC TGAGCCTGCC GAGTACCAGC ACCGACAGTT CGGGGTGCAA
CTGCTCGCCG AGGCGGAACG CCTGGCCGGC GACGCCGGTT TCTCTTCGCT CGCTATCATG
AGCGGGATCG GGGTGCGCCC CTACTACCAG AGACAGGGAT ATATGCGAAC CGGCCCCTAT
ATGGTAAAAC GACTTGTATG A
 
Protein sequence
MDKTEIYREI LSLISSESAD PYQILQIKRA VCGRYELPEV PKNSALLAAA PPEEYEQFRR 
LLLVKPTRTL SGVAPIAVMT SPSPCPHGKC LPCPGGPDHP FKSPQSYTGE EPAAKRAREH
NYDPYNQVLA RLDQFRLLGH HVDKAELIVM GGTMTARTPE YQESFVAACL QAMNESGTGV
RRELQPLETV MQANERSDVR CVALTFETRP DWSRREHIDS MLRLGVTKVE LGVQHLDDEI
LTFNRRGCTV ADIVEANTSL RDAGIKVGFH MMPNLPGSTI EKDRAMFRTL FDDPRFRPDF
LKLYPTLVTP YSEIEDLLNR GGYAPYAEDD LIDLIAYAKE LLPEYVRLQR VQRDIPAKLI
VAGSHHSNFR QLAEGRLHAA GKRCRCIRCR EIGRYPPPKG AVAGIQTLAY DCCGGREFFI
SAVAGDSLIG FLRLRFPGNP WREELADAAL VRELHVYGMV VPIGDEAEPA EYQHRQFGVQ
LLAEAERLAG DAGFSSLAIM SGIGVRPYYQ RQGYMRTGPY MVKRLV