Gene OSTLU_33510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33510 
SymbolHDA3504 
ID5003617 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp333809 
End bp335927 
Gene Length2119 bp 
Protein Length624 aa 
Translation table 
GC content59% 
IMG OID640419038 
Productpredicted protein 
Protein accessionXP_001419561 
Protein GI145350325 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000509548 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGAC GAGCGACGGA GGGGGGCGCG AGACGAGAGC CGTCGTGGCC GCCGAGCAAA 
GCGGCGTTGA ATCGAATAGA TTCGCTCGGC GATTGCAGAA GCGCGCCGCG AGGAGTGGAC
GCGAAGAATG CGAACGGAGA CACCGCGCTG GCGGTGGCGT GCGCGCTGGA GGATGGGAAG
GCGTGCGAAG AGATGGTGAA GATTTTGGTG GACGCCGGCG CGGACGCGAG CGCGCTTTCA
AACGGAATGG CGCCGGTGCA CTGGGCGTGC GCGCTAGGGC ATCGCGACGC GCTCGCGTGC
ATGGTCGACG CGTCGTCGAC GTCAATCGTG AATCTTCGTG CCGAAGATGG GTCGACGCCG
TTGCTCGTGG CGGCGGAACA CGGTAAAATC GAAGTCATAC GATGGTTGCT CGAGCGCGGC
GACGTCGACG CGATGGCGAG AAACGCGCAT GGACGCGACG TACTCGGCGC GCTCGGCGCG
AAGATGCGTC GACAGAGTCA ATCGGCGAGG AGTGAGTTAC GTGCGGAGAT ATTTGAACTC
ATTCCTTCGC TGCGTTTGGC GTTTTTGTCC CATCCTGATT GCGAAGAACA CGTGTCGTTC
AAGCCACATC AAGAGAGCCC CGAGCGCATA GTCGCCATTC TCGCCGAGTT GCAGCGCGTG
ATCGACCGCG GTGAACTCGC AATGGGAGAG CTAGATAAGA CATGCACGTT CGGCGCCGCC
GAACCGTCGG ATATCCTCCG AGCGCACGAT GAGAATTACG TCCGCGTGTT GGCCGAACTC
AGCGATCGCG TCGGGCAAAC GCCCATCGCG TTCACGCCGT ACTGTCAAAA CCATAACGGC
GTGCCAGAAA AACTACAAAA ACCCGCGGAG AACAGTGACA CATTCTTTTC ACCGGGCACG
TTGCAAGCGG CGCTTAGAGC TGCGGGCGGG GTCATTCACG CCGTGGATAG AGTCTTGGAT
GGGAAGAATA GGAGCGCTTT CGTATGCTGT CGCCCGCCCG GTCATCACGC CGGGATCAAC
GGCGCCACGG AAGGCGCGCC GTCGAGCGGC TTTTCCATTT TGAACAACGC CATGATCGGT
ATGTGCGCAC TTTGCGCGAG CGACGCGAGC GATCGATTTT TCTTGAAACG GACGCACACG
CATAACCGCG GCGCTTCTAC GACACGCGTC GCGGACATGA TGCGCGTCCT ACATCGACGT
CCACTTTCAA TGACAGAGAG TTCTGAGCGT GGCTCTGCTG ACAGATACGG GTGGTACATT
TCGCGCGACT CGACTAACTG ACTGACGCTG ACGTTCACAA CGCAGGCGCT TTGCACGCCA
TCGACGTTCG CAAATGCATG CGGGTCGCCG TGGTTGATTT CGACGTGCAT CACGGGAACG
GCACCGAAGA AATCGCCAGG CATTGGCTCA CGAAGCAGCG CGCGAGAGAC GACTATCACC
GGCACAAATC TCCTGACTTG TTCTTCGCGT CGATTCACTT GGCGGATGAT GGAACGTCTT
CGGGCATGCT TCCATTCTAC CCGGGCAGCG GCGTGCAGGA TGATTTGATG AACAATATCG
TCAATGTCGT CGTCCCACCG ATGTGGCTCG CCAAGGGCGC AAGCGCCACG GCACACGCGG
AAACCGGCGA GCGCGCTCGG AAAAAAACAA AACGCTTCGC TGACTCTGAA GATGCCGCCG
CCGCGCCGCC GCCGAAGACC ATCGCCATTG AACCCGAGCA GCAACAAGGT GGGCGCCTAG
AGTGGATGAA GGCGTTTCGA GAGCGCTTGA TTCCCGCTCT CAGGGCGTTC GGTCCTGAGC
TCATAATCGT GTCCGCGGGA TTCGACGCGG CGGCTTCGGA CGTAGGAAAC TTGGGCGTCG
ATCCGCGTCG AAACACGAGG CACCAAGGCG CTAACCTTCG CGCTGAAGAC TACGAGGACA
TGACGAAGTT GCTCGTAAAC GTGTCGAACG TTTGCGATGG GCGAGTTGTA TCTATTTTAG
AAGGGGGCTA CGGACACTTG ATGAGCGTCG GAAAATCGAG CGACGGCGCG CAAAATGCGT
TGACGCTCGG TAGAGACGTC TTCGCCAAAT GCGTCAAAGC GCACGTCCAG GCGCTCATCT
GATGTTGTAC TTTTCACTC
 
Protein sequence
MVRRATEGGA RREPSWPPSK AALNRIDSLG DCRSAPRGVD AKNANGDTAL AVACALEDGK 
ACEEMVKILV DAGADASALS NGMAPVHWAC ALGHRDALAC MVDASSTSIV NLRAEDGSTP
LLVAAEHGKI EVIRWLLERG DVDAMARNAH GRDVLGALGA KMRRQSQSAR SELRAEIFEL
IPSLRLAFLS HPDCEEHVSF KPHQESPERI VAILAELQRV IDRGELAMGE LDKTCTFGAA
EPSDILRAHD ENYVRVLAEL SDRVGQTPIA FTPYCQNHNG VPEKLQKPAE NSDTFFSPGT
LQAALRAAGG VIHAVDRVLD GKNRSAFVCC RPPGHHAGIN GATEGAPSSG FSILNNAMIG
ALHAIDVRKC MRVAVVDFDV HHGNGTEEIA RHWLTKQRAR DDYHRHKSPD LFFASIHLAD
DGTSSGMLPF YPGSGVQDDL MNNIVNVVVP PMWLAKGASA TAHAETGERA RKKTKRFADS
EDAAAAPPPK TIAIEPEQQQ GGRLEWMKAF RERLIPALRA FGPELIIVSA GFDAAASDVG
NLGVDPRRNT RHQGANLRAE DYEDMTKLLV NVSNVCDGRV VSILEGGYGH LMSVGKSSDG
AQNALTLGRD VFAKCVKAHV QALI