Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42025 |
Symbol | HAM3501 |
ID | 5006191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | - |
Start bp | 226605 |
End bp | 227981 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | |
GC content | 61% |
IMG OID | 640421612 |
Product | predicted protein |
Protein accession | XP_001422236 |
Protein GI | 145356011 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5027] Histone acetyltransferase (MYST family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.067727 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGC CGAAGACGAA CGCGCCGGCG CGACCCGACG CGTTCGCGGC GATCGACGCG CGCGCGGCGT TCGCGGTCGG CGCGCGGGTG CGCGTGCGGG ACGGGGCGAA CGAGGCGGAC GGCGCGGACG CGCGCGAGGG CGTCGTGGCG GCGAGGCGGG GCGACGGCGC GAGCGCGACG GACGGCGAGG TCAAGTATTA CGTGCGGTAC GACGCGACGG GGGTCGCGGA CGAGTGGGTG AGGGTCGAAA GGTTGACGAG CGCGGGTGGG GAGAGCGTGG GAGGGAGCGA GACCACGGGC GGGGGGTCGA GATCGGACGC GGCGACGGCG GCGAGAGCGG CGGCGGCGGC GGCGGCGGCG AGAAAGCGGA AGACGACGGG AGAGAAGGAC GGGGGCGAGC GCGGCGAGGA GGGGGGGCGG AAGGTGCGGT GGATCGAGCT CGGGCGGTAC ATTTGCGATT GCTGGTTCGA CTCGCCGTTT CCGGAGGAAT ACACCGACGA GCGAAAGCTA TTTGTATGCG ATTTTTGCTT AAAGTATCAT CGGAAGCGGA GGGCGTACAT CGCGCATAAG AAGACGTGCG AACTGCGACA TCCGCCGGGG AATGAGATTT ATAGGCATCC AGAGCGACAG GCGACGGAGA AATCGCCCGC GCGCGCGCAG TTGCGCATGT GGGAAGTGGA CGGGTCGAAC GCGACGACGT ATTGTCAAAA CTTGTGCCTC ATCGCGAAAC TCTTCTTGGA TCACAAGACG CTGTATTACG ACGTCGCCGC GTTTTACTTT TATGTCTTGA CCGAACGGGA CTTTGACGAG AGTGGAAAGC CGCTCTATCG AATAATCGGG TACTTCAGCA AGGAGAAGGG TCAAGTGGAG ACGAATCTGG CGTGCATTCT CACGTTGCCG CCGTACCAGC GTCGCGGCTA CGGGGGGTTT TTGATTGAGT TTTCGTACGA GCTCGCCAAA CGCGAGGGAC GCATCGGCAC CCCGGAGCGA CCTCTGAGCG ATTTGGGGTT CGCTTCCTAT CGAACGTACT GGAGCCGAGT GATTTACGAA AGTCTCAGCA GCACCAGCGC CGGCGGGGTG AACGTGGCGG AGCTGAGCAA AAAGACGAAC ATTCGCGTCG ACGACATCGT CTCAACGCTA CAGCCGTTTA GTTCGATTCG ATTCTTCAAA GATCAAGGCT TCTTGAACAT CACCAAGGAA GGTAAGCAGG ATTTCAAAGC CGCGCAAGTC AAGCTTCGCG AAAAATGGAG CGAACTGCGC GTCATACCCG AACGACTACA GTTCGAGCCG AAAATCAACG CCGTCGTGGA AGTAGCTGAG AAGCGCCGCC GCACGCGTTT GTTCCAACGA GACATTTTGG AAAATAATGA GAAATGA
|
Protein sequence | MRAPKTNAPA RPDAFAAIDA RAAFAVGARV RVRDGANEAD GADAREGVVA ARRGDGASAT DGEVKYYVRY DATGVADEWV RVERLTSAGG ESVGGSETTG GGSRSDAATA ARAAAAAAAA RKRKTTGEKD GGERGEEGGR KVRWIELGRY ICDCWFDSPF PEEYTDERKL FVCDFCLKYH RKRRAYIAHK KTCELRHPPG NEIYRHPERQ ATEKSPARAQ LRMWEVDGSN ATTYCQNLCL IAKLFLDHKT LYYDVAAFYF YVLTERDFDE SGKPLYRIIG YFSKEKGQVE TNLACILTLP PYQRRGYGGF LIEFSYELAK REGRIGTPER PLSDLGFASY RTYWSRVIYE SLSSTSAGGV NVAELSKKTN IRVDDIVSTL QPFSSIRFFK DQGFLNITKE GKQDFKAAQV KLREKWSELR VIPERLQFEP KINAVVEVAE KRRRTRLFQR DILENNEK
|
| |