Gene Information |
Locus tag | Mpal_0952 |
Symbol | |
ID | 7272445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 985863 |
End bp | 987443 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643569591 |
Product | histone acetyltransferase, ELP3 family |
Protein accession | YP_002466027 |
Protein GI | 219851595 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG1243] Histone acetyltransferase |
TIGRFAM ID | [TIGR01211] histone acetyltransferase, ELP3 family |
|
|
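The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that contributes no amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for Mpal_0952.
length = gene_length_bp(985863, 987443)
print(length)                     # 1581
print(protein_length_aa(length))  # 526
```

Both results match the reported Gene Length (1581 bp) and Protein Length (526 aa).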
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAAA CTGAAATCTA TCGGGAGATC CTCTCCCTCA TCTCTTCTGA ATCTGCAGAT CCGTACCAAA TCCTGCAGAT CAAACGTGCA GTCTGTGGCC GCTATGAACT CCCGGAGGTG CCGAAGAACT CAGCGTTGCT CGCCGCAGCC CCTCCTGAAG AGTATGAACA GTTTCGGCGC CTGCTGCTGG TGAAACCGAC CCGGACCCTC TCCGGGGTCG CACCGATCGC GGTGATGACC TCTCCCTCTC CCTGTCCGCA TGGGAAGTGT CTCCCCTGTC CGGGGGGTCC GGACCATCCG TTCAAATCCC CGCAGAGTTA CACGGGTGAA GAGCCGGCCG CAAAGCGGGC ACGTGAGCAT AACTACGACC CGTATAACCA GGTCTTGGCC CGGCTCGATC AGTTCAGGCT GCTTGGCCAC CACGTCGACA AGGCAGAACT GATCGTGATG GGCGGTACGA TGACCGCCCG GACCCCAGAG TACCAGGAGT CGTTCGTGGC AGCCTGTCTG CAGGCGATGA ACGAGTCCGG GACCGGGGTT CGAAGAGAAC TGCAGCCGCT GGAGACGGTT ATGCAGGCGA ACGAACGATC GGATGTCCGG TGCGTCGCCC TGACCTTTGA GACCCGGCCG GACTGGTCCA GACGGGAACA TATCGACAGC ATGCTCCGCC TCGGTGTCAC CAAGGTTGAG CTTGGGGTTC AGCACCTGGA CGACGAGATC CTCACCTTCA ACCGTCGCGG CTGCACCGTC GCCGACATCG TGGAGGCGAA CACGTCCCTC CGGGACGCCG GGATCAAGGT CGGGTTTCAC ATGATGCCGA ACCTGCCCGG TTCGACGATT GAGAAGGACC GGGCGATGTT CCGGACCCTC TTCGACGATC CTCGATTCCG ACCGGACTTC CTGAAACTGT ATCCCACGCT GGTCACACCC TATTCAGAGA TCGAAGACCT GCTGAACCGG GGAGGGTATG CGCCGTATGC AGAGGACGAT CTGATCGACC TGATCGCCTA CGCCAAGGAA CTCCTCCCTG AGTACGTCCG CCTCCAGCGG GTCCAGCGGG ACATTCCCGC CAAGCTGATC GTTGCCGGCT CCCACCACAG CAACTTCCGG CAGCTGGCAG AGGGCCGGCT GCACGCCGCC GGAAAACGGT GCCGGTGCAT CCGTTGTCGC GAGATAGGCC GCTACCCGCC GCCCAAAGGT GCAGTGGCTG GGATCCAGAC CCTGGCCTAC GACTGTTGCG GAGGGCGGGA GTTCTTCATA TCGGCGGTGG CTGGCGACTC GCTGATTGGG TTCCTCCGGC TGCGGTTTCC GGGGAATCCC TGGCGTGAGG AACTGGCAGA TGCAGCTCTG GTTCGCGAGC TGCATGTCTA TGGAATGGTC GTCCCGATCG GCGATGAGGC TGAGCCTGCC GAGTACCAGC ACCGACAGTT CGGGGTGCAA CTGCTCGCCG AGGCGGAACG CCTGGCCGGC GACGCCGGTT TCTCTTCGCT CGCTATCATG AGCGGGATCG GGGTGCGCCC CTACTACCAG AGACAGGGAT ATATGCGAAC CGGCCCCTAT ATGGTAAAAC GACTTGTATG A
|
Protein sequence | MDKTEIYREI LSLISSESAD PYQILQIKRA VCGRYELPEV PKNSALLAAA PPEEYEQFRR LLLVKPTRTL SGVAPIAVMT SPSPCPHGKC LPCPGGPDHP FKSPQSYTGE EPAAKRAREH NYDPYNQVLA RLDQFRLLGH HVDKAELIVM GGTMTARTPE YQESFVAACL QAMNESGTGV RRELQPLETV MQANERSDVR CVALTFETRP DWSRREHIDS MLRLGVTKVE LGVQHLDDEI LTFNRRGCTV ADIVEANTSL RDAGIKVGFH MMPNLPGSTI EKDRAMFRTL FDDPRFRPDF LKLYPTLVTP YSEIEDLLNR GGYAPYAEDD LIDLIAYAKE LLPEYVRLQR VQRDIPAKLI VAGSHHSNFR QLAEGRLHAA GKRCRCIRCR EIGRYPPPKG AVAGIQTLAY DCCGGREFFI SAVAGDSLIG FLRLRFPGNP WREELADAAL VRELHVYGMV VPIGDEAEPA EYQHRQFGVQ LLAEAERLAG DAGFSSLAIM SGIGVRPYYQ RQGYMRTGPY MVKRLV
|
| |
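The sequence fields can be related to the GC content and translation table entries with a short script. This is a minimal sketch, not the database's own method: it assumes the gene sequence is pasted in as shown (blocks of ten bases separated by spaces), and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI tables 1 and 11 agree here).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon.

    Any trailing one or two bases that do not form a full codon are ignored.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
prefix = clean_sequence("ATGGATAAAA CTGAAATCTA")
print(translate(prefix))  # MDKTEI
```

Running `gc_fraction` and `translate` on the full cleaned gene sequence should allow the reported GC content and protein sequence to be compared against the record.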