Gene Mlab_1304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1304 
Symbol 
ID4794380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1328725 
End bp1330287 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content54% 
IMG OID640099986 
Producthypothetical protein 
Protein accessionYP_001030739 
Protein GI124486123 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG1243] Histone acetyltransferase 
TIGRFAM ID[TIGR01211] histone acetyltransferase, ELP3 family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAGC ACACCATTTA TCGTGAGCTT ATCTCACTCA TTTTTTCCGA TCCGAACCCG 
GATATTCAGC ACATCAAACT TTCCGTGTGC CGTAAGTATT CTCTCGATGC GATGCCGAAA
AATTCCGCGA TTCTCGCCGC CGCAAAACCC GAGGAGTACG AGGCCCTTCG TCGCGTTCTG
ATGGTCAAAC CGACACGAAC ACTCTCAGGC GTCGCTCCGG TCGCGGTGAT GACGTCTCCT
TGCGCCTGTC CGCACGGAAA ATGTCTGCCG TGCCCCGGAG GACCTGATCA CATATTCAAA
TCACCCCAGA GTTACACCGG AGAAGAACCG GCGGCTCTGC GTGCCCGTCA GAATGAGTAT
GATCCGTACC GTCAGGTGAC CGCAAGACTT GGGCAGTTCA AACTTCTCGG ACACCATGTC
GATAAAGCCG AGCTGATCGT GATGGGGGGG ACGATGACCG CCCGGGACGC TGCATATCAG
GAATGGTTCG TTTCCGAATG TCTGCGGGCG ATGAACGAGT TCTCCGGACA AAAATCCACC
GCGGGATCGG TGGAAGAGCT GATGCTCGAG AACGAAAAAG CCGATGTCCG CTGTATCGCG
ACAACCTTCG AGACCCGTCC CGACTGGTGT CGGGAGGAAC ATATCAATAA GATGCTTGAA
CTGGGCGTGA CCAAAGTCGA ACTCGGGTTC CAACACACCG ATGATGAACT CCTGCTGTTA
AACAAACGCG GCCACACGGT TGCTGACAGC GTTTTGGCAA ACACGCTCCT TCGGGATGCC
GGCATCAAAG TCGGCTTTCA TGTTATGCCG AATCTGTACG GAAGCACGAT TCCGCGTGAC
CGGGAGATGT TCGATACGCT CTTCACCGAC CCAAGATTTT GTCCGGATTT TCTAAAGATC
TATCCAACAT TGGTCACCCC CGGCGCAGAA CTCGAAGAAC TCTGGCAAAA GGGAGAATAC
AAAACATATG ACGAGGATGA CCTTGTCGAT CTCCTCGCCT ACGCAAAAAG CAGGCTTCCT
CCCTATGTCA GACTTCAGCG TATCCAGCGG GATATTCCTG CAAAACTCAT CGTCTCCGGT
TCGATTCACG GGCACATACG TCAGATGGCT GCTGAAAGAC TCAAAGAACA GGGAGGGAGC
TGCCAGTGTA TCCGGTGTCG GGAGATCGGT CGCCGCCCGA GTTCTGCCGT GGATGAGGAG
AAGACCCTCG TGTATCCTTG CTGTGGGGGG ACAGAACATT TCCTTTCGAC CACTGCCGGA
GAATCACTGA TCGGTTTTGT TCGTCTGCGG TTTCCCGGAA CCGTATTCAG ACCGGAGCTC
GACGGTGCGG CTCTCGTTCG AGAACTCCAC GTGTACGGCG AAATCGTCCC TCTCGGTGTG
CATGGGTCAG GAGAGAAGCG TCAGCACAGA TCCTACGGTC AGCAGTTATT GTCGCGTGCC
GAAGAAACTG CGCGGGATGC CGGATATTCC ACGGTGGCCG TGATGAGCGG CATTGGGGTA
AGACCCTATT ATCATAGACA GGGATATCAG CGTATAGGTC CATATATGAT TAAGAATCTA
TGA
 
Protein sequence
MEEHTIYREL ISLIFSDPNP DIQHIKLSVC RKYSLDAMPK NSAILAAAKP EEYEALRRVL 
MVKPTRTLSG VAPVAVMTSP CACPHGKCLP CPGGPDHIFK SPQSYTGEEP AALRARQNEY
DPYRQVTARL GQFKLLGHHV DKAELIVMGG TMTARDAAYQ EWFVSECLRA MNEFSGQKST
AGSVEELMLE NEKADVRCIA TTFETRPDWC REEHINKMLE LGVTKVELGF QHTDDELLLL
NKRGHTVADS VLANTLLRDA GIKVGFHVMP NLYGSTIPRD REMFDTLFTD PRFCPDFLKI
YPTLVTPGAE LEELWQKGEY KTYDEDDLVD LLAYAKSRLP PYVRLQRIQR DIPAKLIVSG
SIHGHIRQMA AERLKEQGGS CQCIRCREIG RRPSSAVDEE KTLVYPCCGG TEHFLSTTAG
ESLIGFVRLR FPGTVFRPEL DGAALVRELH VYGEIVPLGV HGSGEKRQHR SYGQQLLSRA
EETARDAGYS TVAVMSGIGV RPYYHRQGYQ RIGPYMIKNL