Gene Nmar_0579 details

Gene Information

Locus tag: Nmar_0579
Symbol:
ID: 5773529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 514892
End bp: 516475
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 34%
IMG OID: 641316213
Product: ELP3 family histone acetyltransferase
Protein accession: YP_001581913
Protein GI: 161528087
COG category: [B] Chromatin structure and dynamics; [K] Transcription
COG ID: [COG1243] Histone acetyltransferase
TIGRFAM ID: [TIGR01211] histone acetyltransferase, ELP3 family
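
The coordinate and length fields in this record are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(514892, 516475)
print(length_bp)                           # 1584
print(expected_protein_length(length_bp))  # 527
```

Both results agree with the Gene Length (1584 bp) and Protein Length (527 aa) fields.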


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAGT TAGATTCAGT ATTCTCAAAG GCATGTAGTG AAATTACTCA GAATTTGCTT 
ACAATCAATG AACCAAGCAA AAAGCAAGTA AAAGAAGAGA TAAAGAGAAT TTGTGCCAAA
TACTCACTTG AAAGAATTCC AAGAAATCAC GAAATACTTT CAATGGCAAA AGAATCAGAA
TTTGATAAAC TCAGAAAAGT CTTGTTAAAG AAACCTGCAA AAACTGCATC AGGAGTTGCA
GTAGTTGCAT TAATGCCAAA ACCATATGCA TGTCCACATG GAAGATGTAC ATATTGCCCA
GGAGGAATTG AATTTAATTC ACCAAACAGT TACACAGGAA ATGAACCATC AACTCTAAAT
GCAATTGAAA ACGAGTATGA TCCAAAATTA CAAATCACAA CTAAAATTGA TAAGCTGATT
GCATTTGGAC ATGACCCATC AAAAATGGAG ATAGTAATTG TCGGAGGGAC ATTTCTCTTT
ATGCCCAGAG ATTACCAAGA AAATTTTATC AAATCATGTT ATGATGCGCT AAATGGTACA
GACTCTAAAA ATTTGGAAGA AGCAAAATCA AACAATGAAC ATGCATCAAT AAGAAATGTA
GGATTTACAA TTGAAACAAA GCCAGATTTT TGTAAAAAAG AACATGTTGA TTGGATGTTA
GATTATGGAG TGACAAGAAT AGAGATTGGG GTACAGTCAC TGCAAGAAAG AGTCTACAAT
ATTATCAACA GAGGTCACAA TTACAATGAC GTAGTCGAAT CATTTCAAAT TTCAAAAGAT
GCAGGTTACA AACTAGTTGC ACATATGATG CCAGGACTTC CTACAATGAC GCCAGAAGGA
GACATTGCAG ATTTTAAAAA ATTGTTTTCA GATTCACAGT TACGTCCAGA TATGCTCAAA
ATATATCCAT CATTAGTTAT CGAAAACACC CCAATGTATC AAGAATACAA GGACGGAAAA
TACACTCCAT ATTCAGATGA AGATATGATT CAGGTTCTAA CAGAAGTAAA GAAAGACATC
CCAAAATGGG TCAGAATCAT GCGTGTTCAA AGAGAAATAT CTCCAAATGA GATTATCGCA
GGTCCAAAAT CAGGCAATCT AAGACAGATT GTGCATCAAA ATTTGACAAA ACAAGGACTA
AAATGCAAAT GTATCAGATG CCGAGAGGCA GGATTAACCA AATCCAACCC AGAGGAAAGA
GACATCAAAC TAACACGAAT CAATTATGAT TCATCAGGAG GAAAAGAAGT GTTCTTGTCA
TTTGAGGATG AAGATGAATC AATTTATGGA TTTTTGAGAT TAAGAAAACC AAGTGATGAT
GCACATAGAG ATGAAGTCAA AGACACATGT ATTGTAAGAG AATTACATGT TTATGGAAAG
TCTCTAAAGA TTGGAGAAAA AGAAGAAAAT GAAATTCAGC ATTCAGGATT TGGTAAAAAA
TTGATGAAAG AAGCTGAGAA AATATCAAAA GAAGAGTTTG ATGCAAAGAA GATGTTGGTA
ATTAGTGCAG TTGGAACAAG AGAGTATTAC CAAAAATTAG GGTATTCGTT ATATGGGCCA
TACATGGCAA AAGAGTTGAG TTAG
 
Protein sequence
MSKLDSVFSK ACSEITQNLL TINEPSKKQV KEEIKRICAK YSLERIPRNH EILSMAKESE 
FDKLRKVLLK KPAKTASGVA VVALMPKPYA CPHGRCTYCP GGIEFNSPNS YTGNEPSTLN
AIENEYDPKL QITTKIDKLI AFGHDPSKME IVIVGGTFLF MPRDYQENFI KSCYDALNGT
DSKNLEEAKS NNEHASIRNV GFTIETKPDF CKKEHVDWML DYGVTRIEIG VQSLQERVYN
IINRGHNYND VVESFQISKD AGYKLVAHMM PGLPTMTPEG DIADFKKLFS DSQLRPDMLK
IYPSLVIENT PMYQEYKDGK YTPYSDEDMI QVLTEVKKDI PKWVRIMRVQ REISPNEIIA
GPKSGNLRQI VHQNLTKQGL KCKCIRCREA GLTKSNPEER DIKLTRINYD SSGGKEVFLS
FEDEDESIYG FLRLRKPSDD AHRDEVKDTC IVRELHVYGK SLKIGEKEEN EIQHSGFGKK
LMKEAEKISK EEFDAKKMLV ISAVGTREYY QKLGYSLYGP YMAKELS
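
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as examples here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGAGTAAGT TAGATTCAGT ATTCTCAAAG GCATGTAGTG AAATTACTCA GAATTTGCTT"
last_line = "TACATGGCAA AAGAGTTGAG TTAG"

print(translate(first_line))  # MSKLDSVFSKACSEITQNLL
print(translate(last_line))   # YMAKELS
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence extends the same check to the whole record; `gc_content` on the full sequence can likewise be compared with the GC content field.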