Gene Apar_0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0906 
Symbol 
ID8413773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1013945 
End bp1015798 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content47% 
IMG OID645022490 
ProductHistone acetyltransferase 
Protein accessionYP_003179926 
Protein GI257784709 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG1243] Histone acetyltransferase 
TIGRFAM ID[TIGR01211] histone acetyltransferase, ELP3 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.301086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.526584 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAAC TCATTTTGGA CATATTGCAG CAGCTCAGAA ATGGCTCACA AGGTGCTCTT 
GATGCGCATC AGCTTGAGAT ACTTATTAAC TCGCATAACA GCGGAATTGA CTCCAGCGCC
CACAGTACAG AGCGCGAGAA ACTCATTCCT AAACGAGCAA TTCTCCCCTA TTTTTTGCAA
GTAAAGCAAA AAAACAATGA ACTGTGGCAA TCATGGAATG TAACACCTGA GCTAGAAGAA
AGGTTTATCC GCTCTGTTCG CATGAAGCCT CGTCGCACCG CTTCTGGTGT TGCCACCATT
ACCGTAATCA CCAGGCCGCA TACCTGCTCA AGCAATTGCA TTTATTGCCC ATGCGATCTG
CGTATGCCTA AGAGCTACCT TGCTAACGAA CCCGCCTGTC AGCGTGCTGA GCTTACGTTT
TTTGATCCAT ACGTACAAGT TGCTGCTCGT CTTCAGGCAC TCCACCAGAT GGGTCACTCA
ACCGATAAAG TGGAGCTTAT TGTGCTTGGT GGTACCTGGA GCGATTATCC AGAGAGTTAC
CAGTACTGGT TTATCAAAGA GCTCTTCCGT GCGCTCAATG AATGGCCCAA CTCTCCTAGC
CACATCCAGG AGCGCCTTGA TTGGTACTCC TCGTTTGGCT TGCAGAACTC TGAAGAGGCA
CTCTCTTCCT TTGTTGCTGA ACAGCAAGCG GCTGTCTTTG ATGATGCTGT CACGTATAAC
CAGGCTTTTC ATAAGCTCTA CGATTCCAGC CGTCCTCACC AAAGAACCTG GTCTCAGATG
CAAAGCACCT ATGATGAGTT GGTAGAGCAG CAACGCGTTA ATGAGACGGC CGCTGCCCGT
GTGGTAGGTC TTGTTATTGA AACCAGGCCC GATACCATCA CGCCAGGTAA CCTACGCATG
TTTAGACAGC TTGGATGCAC CAAAATTCAA ATTGGCATTC AGAGCACGCG TCAAGAAATT
CTTGACGCAA ACAAACGTCA GATGAGCGTT GCTCAGATTA AACGAGCTTT CTCACTCATT
CGCTTGTACG GATTTAAAAT CCACTCTCAC CTAATGGTGA ATCTTCTTGG CGCAACTCCT
GAAGCAGACA AACAGGACTT TAAAACGTTT GTCACGGATC CAGGATTTCT CCCCGATGAG
ATCAAACTGT ATCCGTGCGC TCTAGTATCT GGAACACAAC TGGTGCAGAA GTTTCATGAA
GGTACCTGGC AACCATACAC AAAAGACGCG CTGGTAGATG TACTTGTTCA AGATGTGCTT
AACACACCAC CGTATGTACG CATTTCTCGC ATGATTAGAG ACATTAGCGC AACCGACATC
TTAGTAGGAA ACAAACACAC CAACCTACGT CAAATGGTGG AACAAGAACT TGCTGCTGAA
GACGTTGCAA GCCGCGTACA GGAAATTCGC TTTAGAGAAA TTAACCAGCA GCAGGTACGC
GCCAATGAAC TCACACTGCA AGACTTTGTC TACACCACCT CAGTTAGTAA CGAGCACTTC
CTGCAATGGG TGACTACTGA CAATAAAATT GCAGGTTTCT GTAGGCTTTC ACTTCCCCAC
TGGGACAAAC TGATCTCTGG CACATGTGAT GTTAGTGCCG ATGAGTTGCT GGTACAGCCA
GGTCAAGCCA TGATTAGAGA GCTTCACGTA TACGGACAGG CATTATCCCT AGGCTCGGAA
GGAATGTCTG CCCAGCACCA GGGTCTAGGT CAAAAACTTC TTGCCAAAGC TTCCTCTATC
GCAGCTGATG AGGGATATAC CAGTCTTAAT GTTATCAGCT CAATTGGAAC ACGCGCCTAC
TATCGCACGC AGGGCTTTAC TGATGCTGGC CTCTACCAAC AAAAAGCCTT GTAA
 
Protein sequence
MNELILDILQ QLRNGSQGAL DAHQLEILIN SHNSGIDSSA HSTEREKLIP KRAILPYFLQ 
VKQKNNELWQ SWNVTPELEE RFIRSVRMKP RRTASGVATI TVITRPHTCS SNCIYCPCDL
RMPKSYLANE PACQRAELTF FDPYVQVAAR LQALHQMGHS TDKVELIVLG GTWSDYPESY
QYWFIKELFR ALNEWPNSPS HIQERLDWYS SFGLQNSEEA LSSFVAEQQA AVFDDAVTYN
QAFHKLYDSS RPHQRTWSQM QSTYDELVEQ QRVNETAAAR VVGLVIETRP DTITPGNLRM
FRQLGCTKIQ IGIQSTRQEI LDANKRQMSV AQIKRAFSLI RLYGFKIHSH LMVNLLGATP
EADKQDFKTF VTDPGFLPDE IKLYPCALVS GTQLVQKFHE GTWQPYTKDA LVDVLVQDVL
NTPPYVRISR MIRDISATDI LVGNKHTNLR QMVEQELAAE DVASRVQEIR FREINQQQVR
ANELTLQDFV YTTSVSNEHF LQWVTTDNKI AGFCRLSLPH WDKLISGTCD VSADELLVQP
GQAMIRELHV YGQALSLGSE GMSAQHQGLG QKLLAKASSI AADEGYTSLN VISSIGTRAY
YRTQGFTDAG LYQQKAL