Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33510 |
Symbol | HDA3504 |
ID | 5003617 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 333809 |
End bp | 335927 |
Gene Length | 2119 bp |
Protein Length | 624 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419038 |
Product | predicted protein |
Protein accession | XP_001419561 |
Protein GI | 145350325 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000509548 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGAC GAGCGACGGA GGGGGGCGCG AGACGAGAGC CGTCGTGGCC GCCGAGCAAA GCGGCGTTGA ATCGAATAGA TTCGCTCGGC GATTGCAGAA GCGCGCCGCG AGGAGTGGAC GCGAAGAATG CGAACGGAGA CACCGCGCTG GCGGTGGCGT GCGCGCTGGA GGATGGGAAG GCGTGCGAAG AGATGGTGAA GATTTTGGTG GACGCCGGCG CGGACGCGAG CGCGCTTTCA AACGGAATGG CGCCGGTGCA CTGGGCGTGC GCGCTAGGGC ATCGCGACGC GCTCGCGTGC ATGGTCGACG CGTCGTCGAC GTCAATCGTG AATCTTCGTG CCGAAGATGG GTCGACGCCG TTGCTCGTGG CGGCGGAACA CGGTAAAATC GAAGTCATAC GATGGTTGCT CGAGCGCGGC GACGTCGACG CGATGGCGAG AAACGCGCAT GGACGCGACG TACTCGGCGC GCTCGGCGCG AAGATGCGTC GACAGAGTCA ATCGGCGAGG AGTGAGTTAC GTGCGGAGAT ATTTGAACTC ATTCCTTCGC TGCGTTTGGC GTTTTTGTCC CATCCTGATT GCGAAGAACA CGTGTCGTTC AAGCCACATC AAGAGAGCCC CGAGCGCATA GTCGCCATTC TCGCCGAGTT GCAGCGCGTG ATCGACCGCG GTGAACTCGC AATGGGAGAG CTAGATAAGA CATGCACGTT CGGCGCCGCC GAACCGTCGG ATATCCTCCG AGCGCACGAT GAGAATTACG TCCGCGTGTT GGCCGAACTC AGCGATCGCG TCGGGCAAAC GCCCATCGCG TTCACGCCGT ACTGTCAAAA CCATAACGGC GTGCCAGAAA AACTACAAAA ACCCGCGGAG AACAGTGACA CATTCTTTTC ACCGGGCACG TTGCAAGCGG CGCTTAGAGC TGCGGGCGGG GTCATTCACG CCGTGGATAG AGTCTTGGAT GGGAAGAATA GGAGCGCTTT CGTATGCTGT CGCCCGCCCG GTCATCACGC CGGGATCAAC GGCGCCACGG AAGGCGCGCC GTCGAGCGGC TTTTCCATTT TGAACAACGC CATGATCGGT ATGTGCGCAC TTTGCGCGAG CGACGCGAGC GATCGATTTT TCTTGAAACG GACGCACACG CATAACCGCG GCGCTTCTAC GACACGCGTC GCGGACATGA TGCGCGTCCT ACATCGACGT CCACTTTCAA TGACAGAGAG TTCTGAGCGT GGCTCTGCTG ACAGATACGG GTGGTACATT TCGCGCGACT CGACTAACTG ACTGACGCTG ACGTTCACAA CGCAGGCGCT TTGCACGCCA TCGACGTTCG CAAATGCATG CGGGTCGCCG TGGTTGATTT CGACGTGCAT CACGGGAACG GCACCGAAGA AATCGCCAGG CATTGGCTCA CGAAGCAGCG CGCGAGAGAC GACTATCACC GGCACAAATC TCCTGACTTG TTCTTCGCGT CGATTCACTT GGCGGATGAT GGAACGTCTT CGGGCATGCT TCCATTCTAC CCGGGCAGCG GCGTGCAGGA TGATTTGATG AACAATATCG TCAATGTCGT CGTCCCACCG ATGTGGCTCG CCAAGGGCGC AAGCGCCACG GCACACGCGG AAACCGGCGA GCGCGCTCGG AAAAAAACAA AACGCTTCGC TGACTCTGAA GATGCCGCCG CCGCGCCGCC GCCGAAGACC ATCGCCATTG AACCCGAGCA GCAACAAGGT GGGCGCCTAG AGTGGATGAA GGCGTTTCGA GAGCGCTTGA TTCCCGCTCT CAGGGCGTTC GGTCCTGAGC TCATAATCGT GTCCGCGGGA TTCGACGCGG CGGCTTCGGA CGTAGGAAAC TTGGGCGTCG ATCCGCGTCG AAACACGAGG CACCAAGGCG CTAACCTTCG CGCTGAAGAC TACGAGGACA TGACGAAGTT GCTCGTAAAC GTGTCGAACG TTTGCGATGG GCGAGTTGTA TCTATTTTAG AAGGGGGCTA CGGACACTTG ATGAGCGTCG GAAAATCGAG CGACGGCGCG CAAAATGCGT TGACGCTCGG TAGAGACGTC TTCGCCAAAT GCGTCAAAGC GCACGTCCAG GCGCTCATCT GATGTTGTAC TTTTCACTC
|
Protein sequence | MVRRATEGGA RREPSWPPSK AALNRIDSLG DCRSAPRGVD AKNANGDTAL AVACALEDGK ACEEMVKILV DAGADASALS NGMAPVHWAC ALGHRDALAC MVDASSTSIV NLRAEDGSTP LLVAAEHGKI EVIRWLLERG DVDAMARNAH GRDVLGALGA KMRRQSQSAR SELRAEIFEL IPSLRLAFLS HPDCEEHVSF KPHQESPERI VAILAELQRV IDRGELAMGE LDKTCTFGAA EPSDILRAHD ENYVRVLAEL SDRVGQTPIA FTPYCQNHNG VPEKLQKPAE NSDTFFSPGT LQAALRAAGG VIHAVDRVLD GKNRSAFVCC RPPGHHAGIN GATEGAPSSG FSILNNAMIG ALHAIDVRKC MRVAVVDFDV HHGNGTEEIA RHWLTKQRAR DDYHRHKSPD LFFASIHLAD DGTSSGMLPF YPGSGVQDDL MNNIVNVVVP PMWLAKGASA TAHAETGERA RKKTKRFADS EDAAAAPPPK TIAIEPEQQQ GGRLEWMKAF RERLIPALRA FGPELIIVSA GFDAAASDVG NLGVDPRRNT RHQGANLRAE DYEDMTKLLV NVSNVCDGRV VSILEGGYGH LMSVGKSSDG AQNALTLGRD VFAKCVKAHV QALI
|
| |