Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40681 |
Symbol | HDA3502 |
ID | 5005865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | + |
Start bp | 155529 |
End bp | 156500 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | |
GC content | 63% |
IMG OID | 640421286 |
Product | predicted protein |
Protein accession | XP_001421586 |
Protein GI | 145354637 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.000252385 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.188986 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCATC ACTCCGCGAC GCGGCACGCG CCGGGGGCGA ACGGCGCGGT GCCCGTGGTG CATCACGCGA GCTACAGCAA ACCAGTCATG CCGCGCGGGC ACAGATTTCC GATGACTGTT TTCCAGCGCG TGCACGACAT CCTGCGCGAA GAGGGCGTCA TCGCGAGAGG GCAGACGAAT TGCTTCGTCC CGGGACGCGC GCCGAGCGTG GACGAGCTGT GTCGAGCGCA CGATGAAGAT TACGTGAGAG ACGTGCGCGC GAGCGCGCTG GACGCGAAGC GCGAGAGAGA AATCGGACTG CCGTGGAGCG ACGCGCTCGT GGAGAGAACG CTGATGGAGG TTTCGGGGAC GATGCTGACG GTGGATTTGG CGATGAAGGT GGGATTGTGC GTGAACACGG CGGGAGGGAC GCATCACGCG CATCGCGATA GGGGGAGCGG GTTTTGCATC GTCAACGATT TGGCGGTGTC GGCGCTGAGG GCGATCGATT CGGGGGCGGT GTCGAGGGTG ATGATTATAG ATTTAGACGT GCATCAAGGC GATGGGACGG CGGCGATTTT AGCGAACGAA CCGGGCGTGT TTACGTTTTC GGCGCACGCG AAGTCTAATT TTCCGGCGCG AAAGCAACAG AGCACGCGAG ACGTCGAGCT GCCGCGCGGG ATGACGGACG ACGCGTACAT GGCCGTCGTC GCGGCGGCGA TGCGAGAGTC TCTCGAAGAC TTTCGGCCAG AGTTGGTGAT TTACGACGCA GGCGTGGACG TGACGTCGAA CGACACACTC GGACATCTCG ATCTCACCGT CGAAGGGCTG TATCGACGCG AGCGCATGGT GATGGACACC GTGCTCGGCG CCGGCATTCC CCTCGCGGGG GTCGTCGGGG GTGGGTACTC ACCGGATATA GATGAGCTCG CGAGTCGTCA CGCAGTCTTG CATCGCGTGG CGCGCGAAAT GTTCGTCGAT CACGGGTTGT AA
|
Protein sequence | MVHHSATRHA PGANGAVPVV HHASYSKPVM PRGHRFPMTV FQRVHDILRE EGVIARGQTN CFVPGRAPSV DELCRAHDED YVRDVRASAL DAKREREIGL PWSDALVERT LMEVSGTMLT VDLAMKVGLC VNTAGGTHHA HRDRGSGFCI VNDLAVSALR AIDSGAVSRV MIIDLDVHQG DGTAAILANE PGVFTFSAHA KSNFPARKQQ STRDVELPRG MTDDAYMAVV AAAMRESLED FRPELVIYDA GVDVTSNDTL GHLDLTVEGL YRRERMVMDT VLGAGIPLAG VVGGGYSPDI DELASRHAVL HRVAREMFVD HGL
|
| |