Gene Mboo_1956 details


Gene Information

Locus tag: Mboo_1956
Symbol:
ID: 5409993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2020334
End bp: 2021914
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 60%
IMG OID: 640869197
Product: ELP3 family histone acetyltransferase
Protein accession: YP_001405114
Protein GI: 154151496
COG category: [B] Chromatin structure and dynamics; [K] Transcription
COG ID: [COG1243] Histone acetyltransferase
TIGRFAM ID: [TIGR01211] histone acetyltransferase, ELP3 family
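
The length fields in this record are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the CDS ends in a single stop codon. Both readings are assumptions about this database's conventions, not something the record states. A minimal sketch of that arithmetic follows; the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2020334, 2021914)
print(length, protein_length(length))  # 1581 526
```

With the record's coordinates this reproduces the listed Gene Length (1581 bp) and Protein Length (526 aa).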


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.724461
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGAGG CGCTGGCGAT TCGGGAGATC ATCTCCCTCA TCCTCTCCCA TCCTCAAGGT 
GAGAACGGGA TCGTTGCCGC AAAGATCGAG ACCTGCCGGA AGTACCGGCT GAATGCGGTT
CCCAAGAATT CGGCGATACT TGCCGCAGCC ACTCCTGATG AGCGGGAGAC TTTGCGGCGT
ATCCTGCTGG TCAAGCCCAC CAGGACGCTC TCGGGAGTTG CGCCGGTTGC GGTCATGACC
TCTCCGTATC CATGCCCGCA CGGGCGGTGC CTTCCCTGCC CGGGCGGTCC TTCCCATCCT
TTCAGCTCGC CCCAGAGTTA CACCGGGGAA GAACCCGCAG CAAAGCGGGC CCGGGAGAAT
GGCTACGATC CATTTGCCCA GGTCCATGCG CGGCTCAGCC AGTTCGAGAC GCTGGGGCAC
CGGGTGGAAA AGGTGGAATT GATTGTGATG GGCGGGACCA TGACCGCCCG CCCGATTGAG
TACCAGCATG AATTTGTTGC CCGGTGCATC GAAGCAATGA ATCTCTATCC TGAAAACACG
CCCGCAGCCT CCCCTCCGGC AGTTGAAGGA GTCCAGTCAG CAAATGAGAT ATCAGATGTC
CGGTGCGTGG CGATCACGTT CGAAACACGC CCCGACTGGT GCAGGAAAGA ACATATCAAC
CGGATGCTCG ATCTTGGCGT TACCAAAGTG GAGCTGGGCG TCCAGCATCT GGATGACGAG
ATACTTGCCT TCAACCGTCG GGGATGTACG GTTGCAGACA CCGCGGAAGC AAACCGGCTC
CTGCGGGACG CAGGGATCAA GGTCGGATTC CACATGATGC CAAATCTGCC GCATTCGACC
ATCGCGGCAG ATAAGGCAAT GTTCGAGACT CTCTTCTCCG ATCCCCGGTT TAAGCCGGAT
TTCCTGAAGA TCTACCCGAC CCTGGTCACC CCCGGCTCAG AGATCGAGGA GCTCTGGGAA
CTCAAGCGGT ATGCCCCGTA CGATGAGGAG ACGCTTATCG ATCTTATTGC ATATGCAAAA
ATGCTCATCC CGGAGTACAC ACGCCTATCG AGGGTGCAGC GCGATATCCC GGCCAAGCTG
ATCGTGGCCG GATCCCGGCA TTCCAATTTC CGGCAGCTGG CCCAGAACCG GCTTGCCGCA
CAGGGCCGGC GCTGCCGCTG CATCCGGTGC CGGGAGATCG GCCGTCTCCC CTCAGCAGAT
GAGGCGGAGA TCCGGGTGAT CCGGTACGAG TGCTGCGGGG GCATGGAACA TTTCATATCG
GCTGTAGCTG GTGATTCTCT GATTGGGTTT GCCCGTCTCC GGTTTCCCTC GGCCGAGTTC
CGTCCTGAGA TTGCCGGTGC CGCGCTCCTC CGCGAGCTCC ACGTATACGG GAGCCTTGTC
CCGGTGGGGA TCGATGCGGC AGAACAGGAG GAGTATCAGC ACCGGAATTT TGGGAAGATT
CTGCTCTCCC GGGCTGAGGA GATTGCACAG GCCGCAGGCT TCGGGAGTAT GGCTATTATG
AGCGGCATCG GAGTCCGGCC CTATTACCGG CGGCAGGGAT ATGAGCGCAA CGGGCCCTAT
ATGGTAAAGG AGATGCCATG A
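
The GC content field can be checked against the sequence above. The sketch below is one way to do that, assuming the sequence is pasted as shown, in space-separated blocks of ten bases. The helper name `gc_fraction` is illustrative and is not part of any database tooling.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (the 10-base block spacing and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# The first 20 bases of the gene sequence, as a small example.
print(round(gc_fraction("ATGGATGAGG CGCTGGCGAT"), 2))  # 0.6
```

Passing the full gene sequence to the same function gives a value that can be compared with the listed 60%.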
 
Protein sequence
MDEALAIREI ISLILSHPQG ENGIVAAKIE TCRKYRLNAV PKNSAILAAA TPDERETLRR 
ILLVKPTRTL SGVAPVAVMT SPYPCPHGRC LPCPGGPSHP FSSPQSYTGE EPAAKRAREN
GYDPFAQVHA RLSQFETLGH RVEKVELIVM GGTMTARPIE YQHEFVARCI EAMNLYPENT
PAASPPAVEG VQSANEISDV RCVAITFETR PDWCRKEHIN RMLDLGVTKV ELGVQHLDDE
ILAFNRRGCT VADTAEANRL LRDAGIKVGF HMMPNLPHST IAADKAMFET LFSDPRFKPD
FLKIYPTLVT PGSEIEELWE LKRYAPYDEE TLIDLIAYAK MLIPEYTRLS RVQRDIPAKL
IVAGSRHSNF RQLAQNRLAA QGRRCRCIRC REIGRLPSAD EAEIRVIRYE CCGGMEHFIS
AVAGDSLIGF ARLRFPSAEF RPEIAGAALL RELHVYGSLV PVGIDAAEQE EYQHRNFGKI
LLSRAEEIAQ AAGFGSMAIM SGIGVRPYYR RQGYERNGPY MVKEMP
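
The protein sequence can be re-derived from the gene sequence with the codon assignments of translation table 11. The sketch below builds the table from the usual TCAG-ordered amino acid string, relying on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits. It does not handle alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence, ignoring whitespace in the input.

    Codons containing characters other than A, C, G, T become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above.
print(translate("ATGGATGAGG CGCTGGCGAT"))    # MDEALA
print(translate("ATGGTAAAGG AGATGCCATG A"))  # MVKEMP
```

The two outputs match the start (MDEALA) and end (MVKEMP) of the protein sequence listed above.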