Gene OSTLU_34656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34656 
SymbolHAM3502 
ID5003645 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp222727 
End bp224070 
Gene Length1344 bp 
Protein Length417 aa 
Translation table 
GC content56% 
IMG OID640419066 
Productpredicted protein 
Protein accessionXP_001419529 
Protein GI145350256 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5027] Histone acetyltransferase (MYST family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0290177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGCG ATGACGCGGG GCGGCGCAAA AGCGAGTCCA CGTCCGATCG CGCGGGCGCG 
CTGGACGTCG GCGCGCGCGT GTGGGCGAAG GCGTCGTTCG ATGAAAAACG ACGGCTCGGG
GAAATCGTCG ACGTGCGTCG CGAGGACGGC GTCGCGGTGG CGTATTACGT GCACTATAGC
GAGTTGAATA AACGCCTGGA CGCGTGGGTG CCAGTGGAGG ACGTGGAGAC GCGTAAGGAG
GGCGAGGGCG AGGGTAAGGG CGAGGGCGGC GACGGTCGAG GCGGCGACGG CGACGGCGGC
GGCGCGGGGA GCGCCGACGA CGCGCTTGGG GCGCACGGTA GAGTGAACGG GAAGCAGCCG
AAGACGTTGA CGCGGAATTC TAAGCGGAGA TATAATGAGA TACACAACGT GGACGCGCCG
GTGGAGGATT TGCCGCCCAT GGATCAAATG TTTGAGCGCA TGCACGAGGA GAAAACAAAG
GTGAAGAACG TGCACTCGAT CGAGCTCGGG CGGCACGAGA TGGACACGTG GTATTATTCT
CCGTATCCGG ATGACTTCGG AAAGTGTTCC AAGCTGTACT TGTGTCAGTA TTGTTTCAAG
TATATGCGAA AGGCGAAGAC GTGCGTGCGG CACAAGGCGG AGTGCGAGAT GAAGCATCCG
CCGGGAAAGC GGGTTTATCG ACATCCACAG GGCGAGGGCG AACCGTTGTT GAGCTTTTGG
GAAATCGATG GCGCGCACTT CAAAATGTAC TGCCAGAATT TGTGTTTGAT GGCGAAACTA
TTCTTGGACC ATAAGACGCT GTACTTTGAT GTCGAACCGT TTATGTTTTA CGTGCTCACT
GAGTCAATGG ATGGAGACGA AACGCACGAT ATCGTGGGTT ACTTTAGCAA AGAAAAGGTT
TCAGTGGACG ATTACAACTT GGCGTGTATT TTGACGCTGC CGGCGTATCA ACGCAAGGGA
TATGGCTCAT TCTTAATATC GATGAGCTAC GAGCTGAGTC GACGCCAAGG CGTGTACGGC
ACCCCCGAAC GGCCGCTGTC TGATCTCGGA CAAGTGAGTT ATAGAAGCTA TTGGAGTCGA
GTCGTTTTGG ACGTGCTCCA CAAGCACAGA GGAAACCTCA GCGTGAAGGA TCTGAGCTCG
ATGTTGATGT TTCGCGAGGC AGACATTGTC AGCGCGTTGC AGTCGCTCAA TCTGGTCAAA
TACTGGAAAG GCCAGCACAT CATCTCTTCT TCGCCAAAAA TTGTGGAAGA ACACTTGAGT
AGTTTTGAAA AGAAATCCGC TATAGCGGGT ACGAGGCTCG AATTCAACCC GGAGTACTTG
AATTGGACGG CGCCGCCGAT GTAG
 
Protein sequence
MARDDAGRRK SESTSDRAGA LDVGARVWAK ASFDEKRRLG EIVDVRREDG VAVAYYVHYS 
ELNKRLDAWV PVEDVETRKE GEGEVNGKQP KTLTRNSKRR YNEIHNVDAP VEDLPPMDQM
FERMHEEKTK VKNVHSIELG RHEMDTWYYS PYPDDFGKCS KLYLCQYCFK YMRKAKTCVR
HKAECEMKHP PGKRVYRHPQ GEGEPLLSFW EIDGAHFKMY CQNLCLMAKL FLDHKTLYFD
VEPFMFYVLT ESMDGDETHD IVGYFSKEKV SVDDYNLACI LTLPAYQRKG YGSFLISMSY
ELSRRQGVYG TPERPLSDLG QVSYRSYWSR VVLDVLHKHR GNLSVKDLSS MLMFREADIV
SALQSLNLVK YWKGQHIISS SPKIVEEHLS SFEKKSAIAG TRLEFNPEYL NWTAPPM