Gene Information |
Locus tag | OSTLU_33631 |
Symbol | HAM3503 |
ID | 5003896 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 541842 |
End bp | 543277 |
Gene Length | 1436 bp |
Protein Length | 412 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419317 |
Product | predicted protein |
Protein accession | XP_001419629 |
Protein GI | 145350473 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5027] Histone acetyltransferase (MYST family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0827491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.813084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCTCGTGGTG AATCGTCACC CATGACGAAA CGCACGGTGC GTCTCCGCGC TCGAATTCGC TCGCGGCTGA ACGCGCGCGC GTCGCGCGGC GCACACACAC GTTCGAGCGT CGCGATCGAC GCTCACGAGC GTCACCGCGC GCGAACGCTT CGACGCCGTG ACGGCGATCG ATCGATCGAC TGACGACGAA TTCCCCGGGA AAACGCGCGC AGAAGGACGC CGATGTCGAA GATACCCCGC GACGACGCGC GCGTAATCAC GTCGATATCC CTCCCGGCAC GGAACTCCCG CTCGAGCTCG GGACGAAGGT GCACTGTCGG TGGCGAGGAA AGGATGAGTT TTACCAGGCA AAGGTCATCG AGCGACGACC GGGACGAGGG AAATCGGACG AGGATTACGA GTATTACGTG CACTACGAAC AGTTTAATCG ACGGATGGAC TCGTGGGTGA GCTTGGGAGA GATGGATCTG TCGACCTTGG TGGTTGTGGA GAAGACAGAA GACGGGAAGA AGAAGAAGGA CGCGCACGAG CCAGACGCCG AGCACGCGGA GTTTGATCCG AAGGCGTTGC AGGAGCACGA AGAGTTCACC AAGGTTCGTA ACATCTTGCA GATCGAACTC GGGAAGCACG AGATGGACAC GTGGTATTTC TCACCGTTCC CACCAGAGTA TAACGACTGT CAAAAGTTGT ACTTTTGCGA GTACACGTTG CAGTTTTTCA AGCGTAAAGA GCAGCTACAG CGTCACTTGA AGAAGAATGA GATGCGACAC CCGCCCGGGG ATGAGATTTA CCGCAAAGGC AAGCTGAGCT TTTTCGAGAT CGATGGGAAG AAGCACAAGT TGTTTTGTCA AAACTTGTGT TACTTGGCGA AACTGTTTTT AGATCACAAG ACGCTGTACT ACGACGTCGA CTTGTTTTTG TTCTACGTCT TGATGGAGTG CGACGAACGC GGGTATCACA TCGTCGGTTA CTTTTCCAAG GAAAAGTGCT CAGAAGAAGG ATACAACTTG GCGTGTATCC TCACGTTACC CCCGTACCAG CGAAAGGGTT ACGGGAAGCT GTTGATATCG TTCTCTTACG AGCTGTCCAA GATCGAAGGC AAGGTCGGCA CCCCCGAGCG TCCGCTCAGC GATCTCGGTT TGGTATCGTA CCGCGGCTAC TGGACGCGCG AGCTGTTAAA AATTCTCGGC GACGAGTCCA AGCAGTTCCT CTCCATAAAA GACCTGAGCG AGATGACGAT GATCAAGACC GAGGACATCA TCTCCACCCT CCAGCACCTC GGTTTGCTCG CCTACACCAA GGGCGCGTAC GTCATCTGCG CTTCGCCCGA GCTCATCGAG AAGCATTTCA AAGCCGCCGG CAGCGGCGGC GTACCGTGCG ATCCCGAAGC GATCATTTGG TCCCCGTACG ACCCCGAGCG CGCGAGGGAC TTGTAA
|
Protein sequence | MTKRTKDADV EDTPRRRARN HVDIPPGTEL PLELGTKVHC RWRGKDEFYQ AKVIERRPGR GKSDEDYEYY VHYEQFNRRM DSWVSLGEMD LSTLVVVEKT EDGKKKKDAH EPDAEHAEFD PKALQEHEEF TKVRNILQIE LGKHEMDTWY FSPFPPEYND CQKLYFCEYT LQFFKRKEQL QRHLKKNEMR HPPGDEIYRK GKLSFFEIDG KKHKLFCQNL CYLAKLFLDH KTLYYDVDLF LFYVLMECDE RGYHIVGYFS KEKCSEEGYN LACILTLPPY QRKGYGKLLI SFSYELSKIE GKVGTPERPL SDLGLVSYRG YWTRELLKIL GDESKQFLSI KDLSEMTMIK TEDIISTLQH LGLLAYTKGA YVICASPELI EKHFKAAGSG GVPCDPEAII WSPYDPERAR DL
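The Gene Length and GC content fields above can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not the database's own method: it assumes the Start bp and End bp values are inclusive, 1-based coordinates (an assumption that agrees with the reported 1436 bp), and that GC content is the share of G and C bases over the whole gene sequence. The function names `span_length` and `gc_percent` are hypothetical helpers introduced here for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percent of G and C bases in a sequence; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Coordinates taken from the Start bp and End bp fields of the record.
print(span_length(541842, 543277))  # 1436, matching the Gene Length field

# First three 10-base blocks of the gene sequence, pasted with their spaces.
print(gc_percent("CCTCGTGGTG AATCGTCACC CATGACGAAA"))
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the rounded GC content field. Because the gene is spliced, the gene length also covers non-coding sequence and is therefore longer than three times the protein length.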