Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25549 |
Symbol | |
ID | 5005479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 564788 |
End bp | 566483 |
Gene Length | 1696 bp |
Protein Length | 403 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420900 |
Product | predicted protein |
Protein accession | XP_001421504 |
Protein GI | 145354463 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.51719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.155521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCGCGATCG AGGCGCGTCG CATTTCCGAC GCCCACGCGT CAACGCACCG CGAGAATCGC GCGCACGCCG GGAAACGCGC GAAGAGCGCA CCGAGGAGGC GCAATTAACG CTCGCCGTTG ACTGCGAAAC GTCGCCGGCG AATCGAATCG AGCTCGCGTT CGGCGCGCGC GACGATCGCG GCGGGTCGAC GCGGTCGGCG ACGACGCGCG CGCGTGATCG AGGCGCGGTC TCAGCGTCGA CGGGACGCGT CGAAGCGTCG CGTCGATCGC ACGTTCGAGG GTGACGGCTG ACTGGCAAGA TTCGGGGCGT CATCGAACGC GCGATGGCTC GCACGCGAGG CGAGGAGCCG GCGGAGAAGT CGGCGACGAT CGCGTTGGCT GATATCGCCG CCGCGACTGA GACGATTAAG AAGGCTCACA TGAAGAAGGA ATCGAAGAAA CGTAAGGCGG AACTCGACGT CAGGGCGCCG TTGAGCAAGC GACAGCTGGA CGACGACGCG GCGTACGCGG CGTTGATGCA CGACAAGTCA AAGGATGCAG ACTTGCCGGT GGCGCAGGCT TCGGAGAAGG AATTGCAAAC TGTTTCTGAT AGTAGCGAGT GGGACGTCAA GTTTCCGGGG CAGAGATTCG GGCAGTGGAG CACGCTCGAG GTGGAACAGA TGAAGCGCTC GCTCGAGAAA TGGGCCAACG AGCACGGGCT CGCAGAAGAT TTCATGAATG GCAACTATGA GTTCTTGTTC AACCGTCGTC AAAAGCAAGG AGGTAAAGGT GCACACTTAC CGCTATCTGA GCGACGCGCG TTTATCGAAG TCGCTCGCGA GACGCCGACA AGAAACGCCA AGCAAATTTA CGGCTGGATT TTGAGAAACA TGGACAAAAA GTCGAAGTCG GGGAAGTGGC AGAAGGAGGA GACGGAAGCT TTGCTCGAGC AATACACGAA ACTGGGCCCG AAATGGTCCA AGATCGCGGA AATAGTCGGC AGGCCGGCGT CGGCGTGCCG TGACAAGTGG CGTCTCGCCA AGGGAGGTGA ACACAAAAAG TCGGGGCACT GGAGCCAGGA AGAAACCGAC AAGTTGTGTG AGCTCGTGAA GGAACACTTC CGCCAGCGAG GCGCGGAAGC TGGATGCGGG CCGGGAACGG GCAACGAACA CCTTTCACTT CGCGACAATA TCAACTGGGT CACCATCTCT GCCAAAATGG GCACTCGAAA CGAGCAGGCT TGTTTGCAAC GATGGTATCA AATCTCGCCT CCAATGACGA GTACGGGCGA GTGGGATGTC GAACAAGACT ACGAGATGTT GAATAACGTC ATCAAGTACA GATCGATGAC TGCCGAAGCT GTGCCATGGG CGTCGACTGT TCGGGGCCGT GATTTGTCTC GAATCATGCG ACGGTGGAAA TTGCTTTCGT CCAAAATCTC TGGACACGTC GACATGGCAT TCCGCGAACT CGTGCTCCAA GTTTGCAAGA GTAAGGACTA CAAAGACCTC GTTATCAAGG CGCAAGCTTT GGTCAAGTCT TCCTCGAGCG CGTGACTACG AGCTTAAGAT GCGATGATGT TTTCACCTAC CCACTGTACC ATCCACATCA CCTGCCCACT GTACCATCCA TATTCATGTC ATAATTTTTT CCGTTTCAGC GTGAGTTTTG AAACAGGCTC GTCATACAAT TTAGATAATT CCTTAGTCAT GCAAGA
|
Protein sequence | MARTRGEEPA EKSATIALAD IAAATETIKK AHMKKESKKR KAELDVRAPL SKRQLDDDAA YAALMHDKSK DADLPVAQAS EKELQTVSDS SEWDVKFPGQ RFGQWSTLEV EQMKRSLEKW ANEHGLAEDF MNGNYEFLFN RRQKQGGKGA HLPLSERRAF IEVARETPTR NAKQIYGWIL RNMDKKSKSG KWQKEETEAL LEQYTKLGPK WSKIAEIVGR PASACRDKWR LAKGGEHKKS GHWSQEETDK LCELVKEHFR QRGAEAGCGP GTGNEHLSLR DNINWVTISA KMGTRNEQAC LQRWYQISPP MTSTGEWDVE QDYEMLNNVI KYRSMTAEAV PWASTVRGRD LSRIMRRWKL LSSKISGHVD MAFRELVLQV CKSKDYKDLV IKAQALVKSS SSA
|
| |