Gene OSTLU_33631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33631 
SymbolHAM3503 
ID5003896 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp541842 
End bp543277 
Gene Length1436 bp 
Protein Length412 aa 
Translation table 
GC content56% 
IMG OID640419317 
Productpredicted protein 
Protein accessionXP_001419629 
Protein GI145350473 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5027] Histone acetyltransferase (MYST family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0827491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.813084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCTCGTGGTG AATCGTCACC CATGACGAAA CGCACGGTGC GTCTCCGCGC TCGAATTCGC 
TCGCGGCTGA ACGCGCGCGC GTCGCGCGGC GCACACACAC GTTCGAGCGT CGCGATCGAC
GCTCACGAGC GTCACCGCGC GCGAACGCTT CGACGCCGTG ACGGCGATCG ATCGATCGAC
TGACGACGAA TTCCCCGGGA AAACGCGCGC AGAAGGACGC CGATGTCGAA GATACCCCGC
GACGACGCGC GCGTAATCAC GTCGATATCC CTCCCGGCAC GGAACTCCCG CTCGAGCTCG
GGACGAAGGT GCACTGTCGG TGGCGAGGAA AGGATGAGTT TTACCAGGCA AAGGTCATCG
AGCGACGACC GGGACGAGGG AAATCGGACG AGGATTACGA GTATTACGTG CACTACGAAC
AGTTTAATCG ACGGATGGAC TCGTGGGTGA GCTTGGGAGA GATGGATCTG TCGACCTTGG
TGGTTGTGGA GAAGACAGAA GACGGGAAGA AGAAGAAGGA CGCGCACGAG CCAGACGCCG
AGCACGCGGA GTTTGATCCG AAGGCGTTGC AGGAGCACGA AGAGTTCACC AAGGTTCGTA
ACATCTTGCA GATCGAACTC GGGAAGCACG AGATGGACAC GTGGTATTTC TCACCGTTCC
CACCAGAGTA TAACGACTGT CAAAAGTTGT ACTTTTGCGA GTACACGTTG CAGTTTTTCA
AGCGTAAAGA GCAGCTACAG CGTCACTTGA AGAAGAATGA GATGCGACAC CCGCCCGGGG
ATGAGATTTA CCGCAAAGGC AAGCTGAGCT TTTTCGAGAT CGATGGGAAG AAGCACAAGT
TGTTTTGTCA AAACTTGTGT TACTTGGCGA AACTGTTTTT AGATCACAAG ACGCTGTACT
ACGACGTCGA CTTGTTTTTG TTCTACGTCT TGATGGAGTG CGACGAACGC GGGTATCACA
TCGTCGGTTA CTTTTCCAAG GAAAAGTGCT CAGAAGAAGG ATACAACTTG GCGTGTATCC
TCACGTTACC CCCGTACCAG CGAAAGGGTT ACGGGAAGCT GTTGATATCG TTCTCTTACG
AGCTGTCCAA GATCGAAGGC AAGGTCGGCA CCCCCGAGCG TCCGCTCAGC GATCTCGGTT
TGGTATCGTA CCGCGGCTAC TGGACGCGCG AGCTGTTAAA AATTCTCGGC GACGAGTCCA
AGCAGTTCCT CTCCATAAAA GACCTGAGCG AGATGACGAT GATCAAGACC GAGGACATCA
TCTCCACCCT CCAGCACCTC GGTTTGCTCG CCTACACCAA GGGCGCGTAC GTCATCTGCG
CTTCGCCCGA GCTCATCGAG AAGCATTTCA AAGCCGCCGG CAGCGGCGGC GTACCGTGCG
ATCCCGAAGC GATCATTTGG TCCCCGTACG ACCCCGAGCG CGCGAGGGAC TTGTAA
 
Protein sequence
MTKRTKDADV EDTPRRRARN HVDIPPGTEL PLELGTKVHC RWRGKDEFYQ AKVIERRPGR 
GKSDEDYEYY VHYEQFNRRM DSWVSLGEMD LSTLVVVEKT EDGKKKKDAH EPDAEHAEFD
PKALQEHEEF TKVRNILQIE LGKHEMDTWY FSPFPPEYND CQKLYFCEYT LQFFKRKEQL
QRHLKKNEMR HPPGDEIYRK GKLSFFEIDG KKHKLFCQNL CYLAKLFLDH KTLYYDVDLF
LFYVLMECDE RGYHIVGYFS KEKCSEEGYN LACILTLPPY QRKGYGKLLI SFSYELSKIE
GKVGTPERPL SDLGLVSYRG YWTRELLKIL GDESKQFLSI KDLSEMTMIK TEDIISTLQH
LGLLAYTKGA YVICASPELI EKHFKAAGSG GVPCDPEAII WSPYDPERAR DL