Gene Pars_0122 details

Gene Information

Locus tag: Pars_0122
Symbol:
ID: 5055267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 109276
End bp: 110727
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 63%
IMG OID: 640467701
Product: radical SAM domain-containing protein
Protein accession: YP_001152389
Protein GI: 145590387
COG category: [B] Chromatin structure and dynamics; [K] Transcription
COG ID: [COG1243] Histone acetyltransferase
TIGRFAM ID: [TIGR01211] histone acetyltransferase, ELP3 family
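
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(109276, 110727)
print(length)                           # 1452
print(expected_protein_length(length))  # 483
```

Both results match the Gene Length and Protein Length fields of the record.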


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAGGC CCCTCCGCTT TGTCAGAATG GCGAGCGGAG TCCACGTCGT TGCCCTCATG 
ACACAGCCCT TACCCTGCCC CGGCCGGTGC GCCTTCTGCC CGACCTCGGC CGACGCCCCC
AAGTCCTACA TGCCGGACAG CCCCGTGGTG CTCCGCGCCA AAAGGAATAG GTACGATCCG
TACTTACAGA CGGCGGGCCG GGTAAAGGTG TATCTCGAAA ACGGCCACAC GCCCTCTAAG
ATAGAGGCTG TTGTGATGGG CGGCACCTTC AGCGCGTTGC CGCGGAGCTA CCGGGAGTGG
TTCGTCGCCA ACGTGTACAA GGCGCTTAAC GACTACCCCC ACTGGGCCCC AGCCGCCGAC
CCTGCCCCCG ATCTTGAGGC CGAACAGCTG AGGAACGAGA CAGCGTCCCT CCGGCTGGTG
GCTCTCGCCG TGGAGACCAG GCCGGACTAC GTGGACGGGG CCGAGGCGGA CTTCCTCCTT
AGGCTCGGCG TCACCAGGAT GGAGCTCGGC GTTCAGTCCA TATACGACGA CGTTTTGCAG
AGGGTTAAGA GGGGCCACGG CGTGGCCGAA GTGGCCAGAG CCACCGCGCT TCTGAAAGAC
TCGGCGTATA AGGTGTGCTA CCACCTCATG CCTGGGCTCC CGGGGAGCGA CCCAGACCGC
GACTTGGAGA TGGTGAAGGA GGTGTTTTCA GACCCCCGCT TCGTGCCGGA CTGCGTCAAG
ATATACCCCA CCTACGTGGT GCCGGGGACT GAGCTCTACG ATGAGTGGCG CCGCGGTGCT
TATAGGAGCT ACGACGAGGA GACTTGGCTG GAGCTGTTGG CGAAGATATA CGCGTCAGTC
CCCCGCTGGG CCAGGGTGAT GAGGCTTGGC AGGGACATCC CCCTCCACCA CGTCGTGGAC
GGCCCCAGGT GGGGCAACAT GCGGCAGATG GTGCTGAGGC ACATGGAGAG GCTGGGCCTA
AAGTGCCTGG AGATTAGGTG CAGAGAGGTG GGCATAAAGC TGGCAAACAA CGTGCCGATC
CAGCCGGGCC CCATCGAGGT TAAGAAGACG GAGTACGAGG CCTCAGGGGG CGTGGAGATA
TTTCTAGAAG CCGTCGGCCC AGACGACACC CTATACGCAA TTCTGCGGCT GAGGATCCCC
GGCAAGCCGC ATAGGCCTGA GCTACGCAAG GCGGCGCTTG TGAGGGAGCT CCACGTCTAC
GGCCCCGCCG TACCCGTGGG GGAGCAGGGC ATTTGGTGGC AACACACAGG CCTCGGCAGA
GGCTTGATGA CCCGCGCCGA GGAGATCGCC AGGGAGTACG GCGTGTTGAA AATCGCGGTT
ATCTCCGGGG TCGGGGCGAG GGAGTACTAC CGCAAGCTGG GCTACAGCAG ATGCGGCCCC
TACATGTGCA AGCCCATAGG CGCGCTGCTC GGCTACGCAG ACGACTACCT CGCCGCAAGC
GAAAGTATTT AA
 
Protein sequence
MERPLRFVRM ASGVHVVALM TQPLPCPGRC AFCPTSADAP KSYMPDSPVV LRAKRNRYDP 
YLQTAGRVKV YLENGHTPSK IEAVVMGGTF SALPRSYREW FVANVYKALN DYPHWAPAAD
PAPDLEAEQL RNETASLRLV ALAVETRPDY VDGAEADFLL RLGVTRMELG VQSIYDDVLQ
RVKRGHGVAE VARATALLKD SAYKVCYHLM PGLPGSDPDR DLEMVKEVFS DPRFVPDCVK
IYPTYVVPGT ELYDEWRRGA YRSYDEETWL ELLAKIYASV PRWARVMRLG RDIPLHHVVD
GPRWGNMRQM VLRHMERLGL KCLEIRCREV GIKLANNVPI QPGPIEVKKT EYEASGGVEI
FLEAVGPDDT LYAILRLRIP GKPHRPELRK AALVRELHVY GPAVPVGEQG IWWQHTGLGR
GLMTRAEEIA REYGVLKIAV ISGVGAREYY RKLGYSRCGP YMCKPIGALL GYADDYLAAS
ESI
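
The gene and protein sequences can be spot-checked against each other by translating a few codons. The sketch below is an illustration rather than the database's own method. It builds the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons), and translates the first and last blocks of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, trailing partial codons dropped."""
    cleaned = "".join(dna.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3
    return "".join(CODON_TABLE[cleaned[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and final 12 bases of the gene sequence from this record.
print(translate("ATGGAGAGGC CCCTCCGCTT TGTCAGAATG"))  # MERPLRFVRM
print(translate("GAAAGTATTT AA"))                     # ESI*
```

The outputs match the first ten residues (MERPLRFVRM) and the last three residues (ESI) of the protein sequence, followed by the stop codon.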