Gene Information |
Locus tag | OSTLU_34891 |
Symbol | |
ID | 5003904 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 68400 |
End bp | 69240 |
Gene Length | 841 bp |
Protein Length | 180 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419325 |
Product | predicted protein |
Protein accession | XP_001419697 |
Protein GI | 145350614 |
COG category | [A] RNA processing and modification; [D] Cell cycle control, cell division, chromosome partitioning; [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CGCCGTGCGC GCGTCGCGAG CAGGAAGATC GGAGGACCGA CGCGGAGAAG CGCCAAAGGA GGATGGACAC CCGAGGAGGT GCGTTAAAAT TTGAAATTTG AATTTGAATT TGGATGCGGA TTTACGCGAC GCGAATGCGC CGTTTGCGGT GAAAGAGGGC GGCGAAAGAG ACTGACGTGG GGTGATTGCG TTCGTAGGAT GAATTGTTGC GCGGCGCGGT GGCGGTGTAC GGGGGGAGGA ATTGGAAGAA GATTGGTACG CGAGCGACCG CGAGATCCCT CGGCGAAGCG CGCGTGAACG AGCGTCGGGA CGCTCGAGCG CGGCGGTTGA TGTCGTGCTC TTTCGCACTC GTTCGGACGA CGAGGACGGT GTTTGTGTGA TTGCTCGCGT GTTTCACGGT TACTGACGAT GAGTTTGTGT TTTCTCGACG CGCGCAGCGG TGTACTTTAG CGATAGCCGC ACCGACGTGC AGTGCTTGCA CAGGTGGCAA AAAGTGCTCA ACCCCGAACT GGTGAAAGGG CCGTGGACCG CGGAGGAGGA CGCGCGAATC ATCGAGCTCG TCACCGAACT CGGGGCGAAG CGATGGTCCA AAATCGCGGG CGAGCTTCCT GGGAGAATCG GGAAGCAATG CCGCGAGAGA TGGTACAACC ACCTCGACCC GGAGATCAAG CGCGAGGAGT GGAGCGCAGA CGAGGACCGT CAGTTGATCA TCGCGCACGC ACAGTACGGC AATCGCTGGG CGGAAATCGC CAAGTCTTTC AAAGGACGCA CCGATAACGC GATTAAAAAT CACTGGAACT CCACGTTGAA GCGCAAGGTG GACCAAGCGT TGAATCAAGG G
Protein sequence | RRARVASRKI GGPTRRSAKG GWTPEEDELL RGAVAVYGGR NWKKIAVYFS DSRTDVQCLH RWQKVLNPEL VKGPWTAEED ARIIELVTEL GAKRWSKIAG ELPGRIGKQC RERWYNHLDP EIKREEWSAD EDRQLIIAHA QYGNRWAEIA KSFKGRTDNA IKNHWNSTLK RKVDQALNQG