Gene OSTLU_42025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42025 
SymbolHAM3501 
ID5006191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp226605 
End bp227981 
Gene Length1377 bp 
Protein Length458 aa 
Translation table 
GC content61% 
IMG OID640421612 
Productpredicted protein 
Protein accessionXP_001422236 
Protein GI145356011 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5027] Histone acetyltransferase (MYST family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.067727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGC CGAAGACGAA CGCGCCGGCG CGACCCGACG CGTTCGCGGC GATCGACGCG 
CGCGCGGCGT TCGCGGTCGG CGCGCGGGTG CGCGTGCGGG ACGGGGCGAA CGAGGCGGAC
GGCGCGGACG CGCGCGAGGG CGTCGTGGCG GCGAGGCGGG GCGACGGCGC GAGCGCGACG
GACGGCGAGG TCAAGTATTA CGTGCGGTAC GACGCGACGG GGGTCGCGGA CGAGTGGGTG
AGGGTCGAAA GGTTGACGAG CGCGGGTGGG GAGAGCGTGG GAGGGAGCGA GACCACGGGC
GGGGGGTCGA GATCGGACGC GGCGACGGCG GCGAGAGCGG CGGCGGCGGC GGCGGCGGCG
AGAAAGCGGA AGACGACGGG AGAGAAGGAC GGGGGCGAGC GCGGCGAGGA GGGGGGGCGG
AAGGTGCGGT GGATCGAGCT CGGGCGGTAC ATTTGCGATT GCTGGTTCGA CTCGCCGTTT
CCGGAGGAAT ACACCGACGA GCGAAAGCTA TTTGTATGCG ATTTTTGCTT AAAGTATCAT
CGGAAGCGGA GGGCGTACAT CGCGCATAAG AAGACGTGCG AACTGCGACA TCCGCCGGGG
AATGAGATTT ATAGGCATCC AGAGCGACAG GCGACGGAGA AATCGCCCGC GCGCGCGCAG
TTGCGCATGT GGGAAGTGGA CGGGTCGAAC GCGACGACGT ATTGTCAAAA CTTGTGCCTC
ATCGCGAAAC TCTTCTTGGA TCACAAGACG CTGTATTACG ACGTCGCCGC GTTTTACTTT
TATGTCTTGA CCGAACGGGA CTTTGACGAG AGTGGAAAGC CGCTCTATCG AATAATCGGG
TACTTCAGCA AGGAGAAGGG TCAAGTGGAG ACGAATCTGG CGTGCATTCT CACGTTGCCG
CCGTACCAGC GTCGCGGCTA CGGGGGGTTT TTGATTGAGT TTTCGTACGA GCTCGCCAAA
CGCGAGGGAC GCATCGGCAC CCCGGAGCGA CCTCTGAGCG ATTTGGGGTT CGCTTCCTAT
CGAACGTACT GGAGCCGAGT GATTTACGAA AGTCTCAGCA GCACCAGCGC CGGCGGGGTG
AACGTGGCGG AGCTGAGCAA AAAGACGAAC ATTCGCGTCG ACGACATCGT CTCAACGCTA
CAGCCGTTTA GTTCGATTCG ATTCTTCAAA GATCAAGGCT TCTTGAACAT CACCAAGGAA
GGTAAGCAGG ATTTCAAAGC CGCGCAAGTC AAGCTTCGCG AAAAATGGAG CGAACTGCGC
GTCATACCCG AACGACTACA GTTCGAGCCG AAAATCAACG CCGTCGTGGA AGTAGCTGAG
AAGCGCCGCC GCACGCGTTT GTTCCAACGA GACATTTTGG AAAATAATGA GAAATGA
 
Protein sequence
MRAPKTNAPA RPDAFAAIDA RAAFAVGARV RVRDGANEAD GADAREGVVA ARRGDGASAT 
DGEVKYYVRY DATGVADEWV RVERLTSAGG ESVGGSETTG GGSRSDAATA ARAAAAAAAA
RKRKTTGEKD GGERGEEGGR KVRWIELGRY ICDCWFDSPF PEEYTDERKL FVCDFCLKYH
RKRRAYIAHK KTCELRHPPG NEIYRHPERQ ATEKSPARAQ LRMWEVDGSN ATTYCQNLCL
IAKLFLDHKT LYYDVAAFYF YVLTERDFDE SGKPLYRIIG YFSKEKGQVE TNLACILTLP
PYQRRGYGGF LIEFSYELAK REGRIGTPER PLSDLGFASY RTYWSRVIYE SLSSTSAGGV
NVAELSKKTN IRVDDIVSTL QPFSSIRFFK DQGFLNITKE GKQDFKAAQV KLREKWSELR
VIPERLQFEP KINAVVEVAE KRRRTRLFQR DILENNEK