Gene Information |
Locus tag | OSTLU_29894 |
Symbol | |
ID | 5000533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 145623 |
End bp | 146720 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | |
GC content | 63% |
IMG OID | 640415954 |
Product | predicted protein |
Protein accession | XP_001416083 |
Protein GI | 145341994 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
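The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(145623, 146720)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```

Both results match the Gene Length (1098 bp) and Protein Length (365 aa) fields listed in the record.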
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0723329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTACG ACGGCGCGAG CGACGCGGAG GAAATCGCGC GCGCGGTGAA GACGTTCGAC GCGATCAACG CGCTGCGACG AGCGACGGAG GAGGACGCGG GGGAGGGGGA GGACGCGAGA CGCGCGAGAA AGCGCGCGGC GGCGCTGGAA CGCGCGGCGG AGCGCGCGAA GCGCGCGAAG GCGAAGCGAC AGGGCGAGGG ATTCGATGGG AATAAAAATG TGAAGTCGAC GGGGGCGTAC GTGACGGGGC TGCCGTCGGA CGCGACGGAG GAGGAGTTGG GGGAGGCGTT TAAAAAGTGC GGCGTGGTGA AGCTGGACGC GAAGACGGGA CGAGCGAGGG TGAAGGTGTA CAGGGATGCG GACGGGAAGG TGAAGGGAGA TGGGTTGGTG GTGTTTTTGA AGGCGCCGAG CGTGGATTTA GCGATCGCGC TGTTGGATCA GACGGAGTTG AGGCTTGGGG ACGCGACGAC GAGGATGACG GTGACGGCGG CCAAGTTTGA GGCCAAGGCG CGGGGGGACG ACGAAGGTGG GGGGGCGAAA GTCGCGGCGA AAGCGAGCGG TGGCGGCGCG CGAATGACGA AGGCCGATCG CAAACGCGCG GCCGCTCTTC TGAAAAGGCA AGAGGCGGAG GCGTTGGGAT GGGCGGGTTT CGACGACGAC GTCGACGCGA AGAAGCTCAT CGTCGTCTTG CGGCGGATGT TTACTTTAGA AGAGATGTAC GCCGACGCAA ATTTGCGTAA AGAGCTCGAA GAAGACGTTA TGGAGGAAGC GCAGCGTACG TGCGGGCCGG TGATGAGCGT GAAGACGTAC ACGACGTCGC AAGATGGAAC GATGACGATT CGCTTCAAAT CTCTCGAAGC CGTCGAAGCG TGCGTCAAGG CGTGGAACGG TCGCTGGTTT GACGGTAGAC AAATAGAAGC CTCGATGTGG GACGGAAAGA GTAAGTTTGT GAGCCAACGT GACGAGAGCG AGGCGGCGCA ACGTGCGCGG TTAGACGCGT ACGCCGCCGA ACTCGGCGGC GGCTCGGACG CCGAAGACGC CGAAGACGAC GACGACGACG TCGACGACGA CGACGAACAT TCCGACGACG AACAATAG
|
Protein sequence | MTYDGASDAE EIARAVKTFD AINALRRATE EDAGEGEDAR RARKRAAALE RAAERAKRAK AKRQGEGFDG NKNVKSTGAY VTGLPSDATE EELGEAFKKC GVVKLDAKTG RARVKVYRDA DGKVKGDGLV VFLKAPSVDL AIALLDQTEL RLGDATTRMT VTAAKFEAKA RGDDEGGGAK VAAKASGGGA RMTKADRKRA AALLKRQEAE ALGWAGFDDD VDAKKLIVVL RRMFTLEEMY ADANLRKELE EDVMEEAQRT CGPVMSVKTY TTSQDGTMTI RFKSLEAVEA CVKAWNGRWF DGRQIEASMW DGKSKFVSQR DESEAAQRAR LDAYAAELGG GSDAEDAEDD DDDVDDDDEH SDDEQ
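The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last codons. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is blank; `translate` is a hypothetical helper, not a library function.

```python
# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 18 nucleotides of the gene sequence in this record.
print(translate("ATGACGTACG ACGGCGCGAG CGACGCGGAG"))  # MTYDGASDAE
print(translate("TCCGACGACG AACAATAG"))               # SDDEQ*
```

The first ten residues and the final five residues agree with the protein sequence listed above; the trailing `*` marks the TAG stop codon, which is not part of the 365 aa product.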