Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34656 |
Symbol | HAM3502 |
ID | 5003645 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 222727 |
End bp | 224070 |
Gene Length | 1344 bp |
Protein Length | 417 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419066 |
Product | predicted protein |
Protein accession | XP_001419529 |
Protein GI | 145350256 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5027] Histone acetyltransferase (MYST family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0290177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCG ATGACGCGGG GCGGCGCAAA AGCGAGTCCA CGTCCGATCG CGCGGGCGCG CTGGACGTCG GCGCGCGCGT GTGGGCGAAG GCGTCGTTCG ATGAAAAACG ACGGCTCGGG GAAATCGTCG ACGTGCGTCG CGAGGACGGC GTCGCGGTGG CGTATTACGT GCACTATAGC GAGTTGAATA AACGCCTGGA CGCGTGGGTG CCAGTGGAGG ACGTGGAGAC GCGTAAGGAG GGCGAGGGCG AGGGTAAGGG CGAGGGCGGC GACGGTCGAG GCGGCGACGG CGACGGCGGC GGCGCGGGGA GCGCCGACGA CGCGCTTGGG GCGCACGGTA GAGTGAACGG GAAGCAGCCG AAGACGTTGA CGCGGAATTC TAAGCGGAGA TATAATGAGA TACACAACGT GGACGCGCCG GTGGAGGATT TGCCGCCCAT GGATCAAATG TTTGAGCGCA TGCACGAGGA GAAAACAAAG GTGAAGAACG TGCACTCGAT CGAGCTCGGG CGGCACGAGA TGGACACGTG GTATTATTCT CCGTATCCGG ATGACTTCGG AAAGTGTTCC AAGCTGTACT TGTGTCAGTA TTGTTTCAAG TATATGCGAA AGGCGAAGAC GTGCGTGCGG CACAAGGCGG AGTGCGAGAT GAAGCATCCG CCGGGAAAGC GGGTTTATCG ACATCCACAG GGCGAGGGCG AACCGTTGTT GAGCTTTTGG GAAATCGATG GCGCGCACTT CAAAATGTAC TGCCAGAATT TGTGTTTGAT GGCGAAACTA TTCTTGGACC ATAAGACGCT GTACTTTGAT GTCGAACCGT TTATGTTTTA CGTGCTCACT GAGTCAATGG ATGGAGACGA AACGCACGAT ATCGTGGGTT ACTTTAGCAA AGAAAAGGTT TCAGTGGACG ATTACAACTT GGCGTGTATT TTGACGCTGC CGGCGTATCA ACGCAAGGGA TATGGCTCAT TCTTAATATC GATGAGCTAC GAGCTGAGTC GACGCCAAGG CGTGTACGGC ACCCCCGAAC GGCCGCTGTC TGATCTCGGA CAAGTGAGTT ATAGAAGCTA TTGGAGTCGA GTCGTTTTGG ACGTGCTCCA CAAGCACAGA GGAAACCTCA GCGTGAAGGA TCTGAGCTCG ATGTTGATGT TTCGCGAGGC AGACATTGTC AGCGCGTTGC AGTCGCTCAA TCTGGTCAAA TACTGGAAAG GCCAGCACAT CATCTCTTCT TCGCCAAAAA TTGTGGAAGA ACACTTGAGT AGTTTTGAAA AGAAATCCGC TATAGCGGGT ACGAGGCTCG AATTCAACCC GGAGTACTTG AATTGGACGG CGCCGCCGAT GTAG
|
Protein sequence | MARDDAGRRK SESTSDRAGA LDVGARVWAK ASFDEKRRLG EIVDVRREDG VAVAYYVHYS ELNKRLDAWV PVEDVETRKE GEGEVNGKQP KTLTRNSKRR YNEIHNVDAP VEDLPPMDQM FERMHEEKTK VKNVHSIELG RHEMDTWYYS PYPDDFGKCS KLYLCQYCFK YMRKAKTCVR HKAECEMKHP PGKRVYRHPQ GEGEPLLSFW EIDGAHFKMY CQNLCLMAKL FLDHKTLYFD VEPFMFYVLT ESMDGDETHD IVGYFSKEKV SVDDYNLACI LTLPAYQRKG YGSFLISMSY ELSRRQGVYG TPERPLSDLG QVSYRSYWSR VVLDVLHKHR GNLSVKDLSS MLMFREADIV SALQSLNLVK YWKGQHIISS SPKIVEEHLS SFEKKSAIAG TRLEFNPEYL NWTAPPM
|
| |