Gene Information |
Locus tag | OSTLU_31660 |
Symbol | |
ID | 5001916 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 303579 |
End bp | 305563 |
Gene Length | 1985 bp |
Protein Length | 440 aa |
Translation table | |
GC content | 63% |
IMG OID | 640417337 |
Product | predicted protein |
Protein accession | XP_001417733 |
Protein GI | 145346517 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5602] Histone deacetylase complex, SIN3 component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0526314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0219153 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTCG CGACGGCGCC GCCCGCGGGC AACGTCGCCG ACGCGCTCGC GTACGTCCGC GAGGTCCGCG ATCGGTTCGC GCGACAGACG GGAAAGTATC GCGAGTTCCT CGCCGCGATG CGCGACTTTA AGACTGGAAC GTGCGTCGCG ACGTCGCGCG ACGCGACGCG ACGCGACGCG ACCGACGCGC GATGACGCGC GAGCGACGAA CGACTGACGA CGACGACGAC GGCGACGCAG GCTCACGCCC GAGGGCGTGA TCGAGCGCGT GCGACGGTGC CTGCGAGGAC ACGACGATTT GCTCGACGGC TTTCGAGCGT TTCTGCCCGA GGTGCGGGAC GCGCGAGACG AGACGGACGC GAGACGAGAC GGAGACGGAG ACGACGCGAA AACGACGACG ATGGCGCGAG ACGACGAAGG CGCGGTGACT GACGGTGACG CGCGACGCGC GGGTCGGGGC GCAGGGATAC TCGACGGCGA CGGCGACGGC GAGCGGGACG CGCGCGCGCG GACGACCGGC GACGGCGAGC GGGCGCGGAA GACCGAGAAA GGCGCCGAGC GCGCGGGAGG TGGCGATGAA GCGCGAATGC GAGGTGCGCG GGCGACGGCG ACGCGCGCGC GACGCGCGCG CGGCGGCGAA CGCGACGCCG ATCGACGCGG GGAAAAGTCG GCTGGGGTCG GAGGGACTCT TTGACGAGTC ACGACCCGTC GATCGTTTGA AACCACAGCC GTCCGACCGA CCGACCTGAT CCGACGGCGC GAACCGGCGG TTGGGGTCGC GCGGGCGACG CGCGCGGGAC GGCGACGGTT TCGTTCGGTT CGCGAACGCG CGGAAGACGA TCGAGCGCGT GACTGACTGT GTGCGCGAAC GCGCGCGTGA TAAGGCGACG CAGGAAACGG ATGAGAATCG AAGCGCGACG TTACTGAGCG GACAAGATTT GCTCAAGCGC ATTCGCGCGC AGTACGCGTC GGACGACCGT CCGTATCAGG CGTTTTTACA GACGTTGATT AAGTTTAGGA ACAAGCAGTT CAGCGCGGAA GAGGTGGTGC AGAAGTGTGC GATATTGTTT TATGATCATC CGCAGTTATT GGAGGGTGAA AATGGGTTGG CCACCTTTGT GCCGAAGACG TGCAAGCCGC CGACGCGGTC GTCGTGGTTC GAGTGGAGTC CGGCGGTGCA TCATCGCTTC AGCAGCGAGA GCTTTCGATT GTTCGTGCGC ACGTTGTTAT TGTGTGAGTA TAAAAGTCGC TTAACGCCGC CGGAGCCGAA GATGCGGCGT TGGCGTCTGA CGGATGGCGA CAGTAGCGAG TCGGATAGCG ACGACTACAC GTCGTCGGAC GAGAGCGAAA GCGAGAGTTC GACGGTGATA GAAAAAGAGT CTGGCACGCT TCGACCGGAT CAAATTCCCG ATCGATGGGT GGGTAAAATA CACGACCGAA CGTTGAGCGC GTTCGAGCGC CTGACTTTAG AGCTCAAACC GAAAGACGAA GAGTATTGCT GGGCCGACAC CAAGGCGGCG CGCGAAGCCA AGGAGAGCAA GGACGCGATC GAACTTCGGG GCAAGCGCGG TCGCGGCGAA GAGCGAGATT TCACGACGAC GCCTCGAGAT TCACCATCAC CGCCGCCGTC GTTTAATAAA CGGCGCGCGA GCGCTCGCAT CGTGGCCAAG GTGAACCGAG GCATCGTGTA CAACCGCTAT CGCAGCTTGG CGTCCCCCGA CGGTCTCTCG CGCGGTCGCA TGCGCATGAC GCGATCGATG AGCACGGTGG ACGACACCTT GCCCGCCAAA CCAGATCGCA 
AGTTCTTGAT CAATCGTGAC GAAGGCAAGA TGTCCCCGCT GTGCGCCGAG CGCATCGGAT TACACTCCCT GCCCGCCGTC GCGCTCGAGA AAATCATCGC GTGCTACAGC GCGCAAGAGG GCGTCGCCGA GCTCAATCAA ATTCGCGAGT CTCTGATCAA ATCGCTCATA AAAAATAATA GGTAA
|
Protein sequence | MALATAPPAG NVADALAYVR EVRDRFARQT GKYREFLAAM RDFKTGTLTP EGVIERVRRC LRGHDDLLDG FRAFLPEETD ENRSATLLSG QDLLKRIRAQ YASDDRPYQA FLQTLIKFRN KQFSAEEVVQ KCAILFYDHP QLLEGENGLA TFVPKTCKPP TRSSWFEWSP AVHHRFSSES FRLFVRTLLL CEYKSRLTPP EPKMRRWRLT DGDSSESDSD DYTSSDESES ESSTVIEKES GTLRPDQIPD RWVGKIHDRT LSAFERLTLE LKPKDEEYCW ADTKAAREAK ESKDAIELRG KRGRGEERDF TTTPRDSPSP PPSFNKRRAS ARIVAKVNRG IVYNRYRSLA SPDGLSRGRM RMTRSMSTVD DTLPAKPDRK FLINRDEGKM SPLCAERIGL HSLPAVALEK IIACYSAQEG VAELNQIRES LIKSLIKNNR