Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_1956 |
Symbol | |
ID | 5409993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | - |
Start bp | 2020334 |
End bp | 2021914 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640869197 |
Product | ELP3 family histone acetyltransferase |
Protein accession | YP_001405114 |
Protein GI | 154151496 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG1243] Histone acetyltransferase |
TIGRFAM ID | [TIGR01211] histone acetyltransferase, ELP3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.724461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAGG CGCTGGCGAT TCGGGAGATC ATCTCCCTCA TCCTCTCCCA TCCTCAAGGT GAGAACGGGA TCGTTGCCGC AAAGATCGAG ACCTGCCGGA AGTACCGGCT GAATGCGGTT CCCAAGAATT CGGCGATACT TGCCGCAGCC ACTCCTGATG AGCGGGAGAC TTTGCGGCGT ATCCTGCTGG TCAAGCCCAC CAGGACGCTC TCGGGAGTTG CGCCGGTTGC GGTCATGACC TCTCCGTATC CATGCCCGCA CGGGCGGTGC CTTCCCTGCC CGGGCGGTCC TTCCCATCCT TTCAGCTCGC CCCAGAGTTA CACCGGGGAA GAACCCGCAG CAAAGCGGGC CCGGGAGAAT GGCTACGATC CATTTGCCCA GGTCCATGCG CGGCTCAGCC AGTTCGAGAC GCTGGGGCAC CGGGTGGAAA AGGTGGAATT GATTGTGATG GGCGGGACCA TGACCGCCCG CCCGATTGAG TACCAGCATG AATTTGTTGC CCGGTGCATC GAAGCAATGA ATCTCTATCC TGAAAACACG CCCGCAGCCT CCCCTCCGGC AGTTGAAGGA GTCCAGTCAG CAAATGAGAT ATCAGATGTC CGGTGCGTGG CGATCACGTT CGAAACACGC CCCGACTGGT GCAGGAAAGA ACATATCAAC CGGATGCTCG ATCTTGGCGT TACCAAAGTG GAGCTGGGCG TCCAGCATCT GGATGACGAG ATACTTGCCT TCAACCGTCG GGGATGTACG GTTGCAGACA CCGCGGAAGC AAACCGGCTC CTGCGGGACG CAGGGATCAA GGTCGGATTC CACATGATGC CAAATCTGCC GCATTCGACC ATCGCGGCAG ATAAGGCAAT GTTCGAGACT CTCTTCTCCG ATCCCCGGTT TAAGCCGGAT TTCCTGAAGA TCTACCCGAC CCTGGTCACC CCCGGCTCAG AGATCGAGGA GCTCTGGGAA CTCAAGCGGT ATGCCCCGTA CGATGAGGAG ACGCTTATCG ATCTTATTGC ATATGCAAAA ATGCTCATCC CGGAGTACAC ACGCCTATCG AGGGTGCAGC GCGATATCCC GGCCAAGCTG ATCGTGGCCG GATCCCGGCA TTCCAATTTC CGGCAGCTGG CCCAGAACCG GCTTGCCGCA CAGGGCCGGC GCTGCCGCTG CATCCGGTGC CGGGAGATCG GCCGTCTCCC CTCAGCAGAT GAGGCGGAGA TCCGGGTGAT CCGGTACGAG TGCTGCGGGG GCATGGAACA TTTCATATCG GCTGTAGCTG GTGATTCTCT GATTGGGTTT GCCCGTCTCC GGTTTCCCTC GGCCGAGTTC CGTCCTGAGA TTGCCGGTGC CGCGCTCCTC CGCGAGCTCC ACGTATACGG GAGCCTTGTC CCGGTGGGGA TCGATGCGGC AGAACAGGAG GAGTATCAGC ACCGGAATTT TGGGAAGATT CTGCTCTCCC GGGCTGAGGA GATTGCACAG GCCGCAGGCT TCGGGAGTAT GGCTATTATG AGCGGCATCG GAGTCCGGCC CTATTACCGG CGGCAGGGAT ATGAGCGCAA CGGGCCCTAT ATGGTAAAGG AGATGCCATG A
|
Protein sequence | MDEALAIREI ISLILSHPQG ENGIVAAKIE TCRKYRLNAV PKNSAILAAA TPDERETLRR ILLVKPTRTL SGVAPVAVMT SPYPCPHGRC LPCPGGPSHP FSSPQSYTGE EPAAKRAREN GYDPFAQVHA RLSQFETLGH RVEKVELIVM GGTMTARPIE YQHEFVARCI EAMNLYPENT PAASPPAVEG VQSANEISDV RCVAITFETR PDWCRKEHIN RMLDLGVTKV ELGVQHLDDE ILAFNRRGCT VADTAEANRL LRDAGIKVGF HMMPNLPHST IAADKAMFET LFSDPRFKPD FLKIYPTLVT PGSEIEELWE LKRYAPYDEE TLIDLIAYAK MLIPEYTRLS RVQRDIPAKL IVAGSRHSNF RQLAQNRLAA QGRRCRCIRC REIGRLPSAD EAEIRVIRYE CCGGMEHFIS AVAGDSLIGF ARLRFPSAEF RPEIAGAALL RELHVYGSLV PVGIDAAEQE EYQHRNFGKI LLSRAEEIAQ AAGFGSMAIM SGIGVRPYYR RQGYERNGPY MVKEMP
|
| |