Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34647 |
Symbol | HDA3509 |
ID | 5003712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 26139 |
End bp | 27338 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | |
GC content | 62% |
IMG OID | 640419133 |
Product | predicted protein |
Protein accession | XP_001419469 |
Protein GI | 145350128 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTC CTCGCGTCCA TCTCACTCGC GTCGCGCCCG CGTCACCGCG CGCGTCGCGT CGCGCCGAGC GTCACCCGTC TCGCGCGCGA CGAATCGCGT CCGCGTCCGC GTCCGCGTCC GCGTCCTCGT CCCCGTCGCT GCTCTTTGCC ACCTCCGCCG CGCTGGAGCA CGTGCAGTTG GGGCACCCGG AGAGCAACGC TCGCGTGCCC GCCATCTTAG ACGCCCTCGA GGCGGGCGCT CTGACCCCCG CCGCGAGACC GGGTGAGGTG TTGGAGATCA CAGACGTCGT GCCGGCGACG AAAAAGGCGC TCGAGCGTGT GCACGCGAAG AATTACTGCA ACGGGCTCGA GTTGTTGTGC GCCACGCGAG CGCCGACGAA TTTAGACACC GCACCGACGT ACTGCACGCC GTCGAGCTTT CAAGACGTGA TGCTAGGCGT CGGTGCGGCG ACGCGATTGG TGGATGAGGT GATCGATCGC GCGAAGGAGA CGAAAGAAAA GGCGCCGAGC GCGTTTGGTT TGATACGCCC ACCGGGACAC CACGCGGTGC CGCGAGGGGC GATGGGATTT TGTTTGGTCG GCACCGCGGC GGCGGCGGCG AGACACGCGC AGCTTAGAGG GCATAAAAAA GTGCTCATTT TCGATTACGA CGTGCACCAC GGTAACGGTA CGAATGATAT TTTCCGAGAC GACGATTCCG TGCTGTTCAT CTCCACGCAC GAGGACGGGA GCTATCCGGG GACGGGGAAG ATTACGGATA TGGGCGAAGG CGATGGTCTG GGTGCGACTA TTAATATACC TTTGCCTCCG GGAAGCGGTG ATAAGGCGGT GTTGAGCGCT TTGGAAGAAA TCGTCGTTCC GGCGGCGGCG CGGTTCCAAC CCGACTTCAT CATCGTCAGT GCCGGCTACG ACGCGCACTG GCGAGATCCT CTCGCTGGGT TGACGTTTCG CACGGGAACG TACCATCGTC TTTGCACAAA GTTGAAAGAA CTGGCGAATG AGATGTGCGG CGGGAAAATC GTGTTCTTAT TAGAGGGCGG GTATGATTTG GTGGGTTTGA GCGAAGGCGT CGCGGATTCT TTCCGCGCGC TCTTGGGCGA CGCCTCAACG GACGTCGGCG AGATTCCCGG GCTGAGAGAC GAGCCCGATG ATAAAGTGCG AAATGTTCTC ACCGAAGTCA AGGCGATGCA TCAAGTTTAG
|
Protein sequence | MARPRVHLTR VAPASPRASR RAERHPSRAR RIASASASAS ASSSPSLLFA TSAALEHVQL GHPESNARVP AILDALEAGA LTPAARPGEV LEITDVVPAT KKALERVHAK NYCNGLELLC ATRAPTNLDT APTYCTPSSF QDVMLGVGAA TRLVDEVIDR AKETKEKAPS AFGLIRPPGH HAVPRGAMGF CLVGTAAAAA RHAQLRGHKK VLIFDYDVHH GNGTNDIFRD DDSVLFISTH EDGSYPGTGK ITDMGEGDGL GATINIPLPP GSGDKAVLSA LEEIVVPAAA RFQPDFIIVS AGYDAHWRDP LAGLTFRTGT YHRLCTKLKE LANEMCGGKI VFLLEGGYDL VGLSEGVADS FRALLGDAST DVGEIPGLRD EPDDKVRNVL TEVKAMHQV
|
| |