Gene Information |
Locus tag | OSTLU_32864 |
Symbol | |
ID | 5002735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 769779 |
End bp | 771026 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | |
GC content | 56% |
IMG OID | 640418156 |
Product | predicted protein |
Protein accession | XP_001418799 |
Protein GI | 145348735 |
COG category | [R] General function prediction only |
COG ID | [COG1094] Predicted RNA-binding protein (contains KH domains) |
TIGRFAM ID | |
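The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(769779, 771026)
print(length, protein_length_aa(length))  # 1248 415
```

Under these assumptions the result agrees with the Gene Length (1248 bp) and Protein Length (415 aa) fields.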
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTCACC TCGTCGCCGC GTCCAACGCG CGTCCGTCGA CGCGAAATGT TTCGGCGGCG GCGGCGGACG AGCCGGACGC CGACGCCGAC GCGGCGGCGA CGACGTCTTC GGGCGCGAAA AAGGGCCGCT ATCGAAGAGA TAAACCCTGG GATCACGATG GCATCGATCA CTGGAGCGTG ACGCCGTTCA CGGCGGAGGA TAACCCGAAC GGCGTGCTCG AGGAGAGCTC GTTCGCGGTG CTGTTTCCAA AGTACCGAGA AAAATATTTA CGCGAGACGT GGCCGAGCGT GACGAAAGCG CTGAAGGAGC AAGGGGTGAG TTGTGAATTA AACCTCGTCG AGGGTTCGAT GACGGTGCGC ACGACGCGAA AGACTTTCGA TCCGTACATC ATCATGAAGG CGAGGGACTT GATAAAGCTG CTCAGTCGGT CGGTGCCGGC GCCGCAGGCG TTAAAGGTGC TCGAAGACGA GACGAATTGC GACGTGATTA AGATTGGTGG GATGGTGAGG AACAAAGAAC GATTCGTGAA GAGGCGGCAG CGATTGATTG GGCCCAACGG CTCGACGCTC AAGGCGATCG AGATGCTCAC AGGGTGCTAC GTGCTGGTTC AAGGGAACAC GGTGAGCGTC ATGGGCGGAT GGAAAGGTTT GAAGATGGTT CGCAAAATCG TCGAGGACGC GATGAAAAAC ACGCACCCGA TTTATCACAT TAAAGAACTC ATGATCAAAC GGGAACTGGA AAAAGATCCC GAGCTCGCCA CGCAAAGCTG GGACCGATTC TTGCCGAAAT TCAAGAAGAA GAATGTCCAA CGCAAGAAGC CCGCCAAAAT CGGCAAGAAG GAACGCGCGG TTTTCCCGCC GACCCAACCG ATGAGCAAGA TAGACAAACA AATCGAATCC GGGGAGTACT TTTTGTCCAA AGAAGCCAAG GAACGCAAGG CAGCGTACGA CAAGTTGCAA AAGCAAAAAG ACACGTCGAC GGACAACCAC AAGAAGCGAC AAGCCGCCTT CGTCGCGCCG AAGGAGGACG ACAAACCGGC TCGCTCAAAG TCGTCAAAGG CGAAGGAAGA GGACGTCGAC GCCATCACGG CATCACTCAA GGCGAAGGCG AAGGCGAAAA AGGAGGAAGA CAAGCGCTCG AAGGCGTCAG CTTCATCCTT CGTCATGGGT GGTGAAGCGA AATCGTCCAA GCGCGATCGA GAGGACAAGA CGGAGAAGAA GTCAAAGAAG TCGAAGAAGG ACAAGTGA
|
Protein sequence | MAHLVAASNA RPSTRNVSAA AADEPDADAD AAATTSSGAK KGRYRRDKPW DHDGIDHWSV TPFTAEDNPN GVLEESSFAV LFPKYREKYL RETWPSVTKA LKEQGVSCEL NLVEGSMTVR TTRKTFDPYI IMKARDLIKL LSRSVPAPQA LKVLEDETNC DVIKIGGMVR NKERFVKRRQ RLIGPNGSTL KAIEMLTGCY VLVQGNTVSV MGGWKGLKMV RKIVEDAMKN THPIYHIKEL MIKRELEKDP ELATQSWDRF LPKFKKKNVQ RKKPAKIGKK ERAVFPPTQP MSKIDKQIES GEYFLSKEAK ERKAAYDKLQ KQKDTSTDNH KKRQAAFVAP KEDDKPARSK SSKAKEEDVD AITASLKAKA KAKKEEDKRS KASASSFVMG GEAKSSKRDR EDKTEKKSKK SKKDK
|
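The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be cleaned and summarized before further use. The helper names are hypothetical, and the demo uses only the first two blocks of the gene sequence rather than the full field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the Gene sequence field, used only as a short demo.
demo = clean_sequence("ATGGCTCACC TCGTCGCCGC")
print(len(demo), gc_fraction(demo))  # 20 0.7
```

Applied to the complete Gene sequence field, the cleaned length can be compared with the Gene Length field and the GC fraction with the reported GC content.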