Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34381 |
Symbol | |
ID | 5001112 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 392208 |
End bp | 393338 |
Gene Length | 1131 bp |
Protein Length | 319 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416533 |
Product | predicted protein |
Protein accession | XP_001416933 |
Protein GI | 145344842 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.693095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.168449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTCTG ATGGTCAAGG TTACGCGGCG ACGACGCCGT CGTCGTCGAG GAAACACGCG AATTTGTACG TGAAGAACAT CTCCGAACGC GTCGACGAGT TGACGCTTCG AAGGCTCTTC GAGGCGTGCG GGGAGGTGCA ATCGTGCTGC GTCATTCGCG ATGTGTCGAC GAATAAGAGT CGAGGATTTG GGTTCGTTAA GTTTGTCAGC ACGGCGCGCG CTGAGGACGC GATCGAGCGA TTCAACGGTA AGGAATACGC GGGAAAGATG CTCGAGGTGA AATTCGCGAA CACCGACGGC GAGAGCGACG GCGCGGGGGG CGCGGCAAAC GCGCCGCCGA GTGATAACGT CTACGTCAAA GGTCTGCCCC CTTCTTGGAC GCACGATGAT CTGAAGGCAT TTTTTACGCA TTTTGGGCAC ATTGTTGAGT GCCGTTTGCT CCACGCAAAC AGGAGCACGT CGAGCGGGGC GTTGATTCGA TTTTTGCGCG AGTCCGAAGC CACGGCGGCG GTCACGCGCG CCAACGGGCG CTTGCTCGTC CCCAATGGGC CGCCACTCGT CGTGCGCTAC GCCGAGGCGC AAGGAAAAAA TAATAAACGT TCAACGCAAG TTGTACCCGT GAATACTCAG CGCCTGTCGA ACAACACGGA CGACTCCGCG CACGACGGGG GGGAGTTGAT CGACGTCCTC GGCTCGAGTA TGAATCTCAA CCGCATATCT TCCCAGAGCG GTCTCACTGA ATTACTCAGT GTTGGCCCGC AAGACGGAGA CGACTTTGCC GCGCTTGCGG TGCTCGGTTC GTCCCCTTCG CATAAATTCG ATGCTCCTAT GCAATCCCAT CAAAAGTTCG CGTCGGCGAC TTCGATGGCG CAAGGTGGCG CTACGATGTG TATCCAAAAT CTTCCACCCG CTGCGGATGA ATTATTCCTG TACAAAACAT TTGCTCCGTT CGGAGCTATC AACTCCGTCC AAATTGTCCG CGACGATTGG ACTGGTCTTT GTTCTGGCGT CGCTGTGATA AACTTTCGAA GTTACTCCGA CGCTTGCGAC GCTCAAAGAG CGTCTCAAAA CGGAAAGAGC AGGCTGAGCA TTTCTGTTCA GCTTCAGACG GCGAATCTGG CGAACTTTTA A
|
Protein sequence | MYSDGQGYAA TTPSSSRKHA NLYVKNISER VDELTLRRLF EACGEVQSCC VIRDVSTNKS RGFGFVKFVS TARAEDAIER FNGKEYAGKM LEVKFANTDG ESDGAGGAAN APPSDNVYVK GLPPSWTHDD LKAFFTHFGH IVECRLLHAN RSTSSGALIR FLRESEATAA VTRANGRLLV PNGPPLVVRY AEAQGKNNKH DFAALAVLGS SPSHKFDAPM QSHQKFASAT SMAQGGATMC IQNLPPAADE LFLYKTFAPF GAINSVQIVR DDWTGLCSGV AVINFRSYSD ACDAQRASQN GKSRLSISVQ LQTANLANF
|
| |