Gene Information |
Locus tag | OSTLU_33299 |
Symbol | |
ID | 5003503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 2158 |
End bp | 3621 |
Gene Length | 1464 bp |
Protein Length | 486 aa |
Translation table | |
GC content | 61% |
IMG OID | 640418924 |
Product | predicted protein |
Protein accession | XP_001419670 |
Protein GI | 145350558 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GCGATGCCAC CGCTTGAGGA TATCAAAAAT GTGTTACTCG ACGCGCTCAC GGAGATCGAG AGCACGGCGC GCGACGTTCT CGCGAGTCTC GACGAACGTC GTCTCGTTCG GGTGTCGAAG AAGCTCCCCG GCGGCGACGT CGAATGCGGA GCCGCGAACG CGCTGTTTCA CGCCCTACGC GCGCAGAGCG GTCGTTATGC GTCCACCACG GTGACGTGCG TCTCGCCCAT GGGCGTCGAA CGTGAACACA CGGCCAAATC GCTCGCCACG GCGGCGACGG AGCTCATACC AAACGCGTTC GCGTCAAAGC ATTTCGCCGC CGTGCGCGTG CACGACGGCG GTGACAGCGG GATGATCACC ATATGCACGT TAGAGCGATA TGCCGAGCTG CGTGACGCGG GATCGGCGAT GTGCGATAAA TGCGGAAAAT TTATCTCCGG CGGCGAACGC GGTTTGTGGT GGCATCGAAA GACGAGGCAT AACGATTTAC ACCAGGAAGC CATGGACGCG GTGGAGAGAG AGCGAAACGC GCTCGTGGCG ATGTCGACCT CGGGATCGAG GTCGGACCTC ACGAACGGAG ACGCGGCGTA TTTGGATAAT AAGACGAAAA AAGCGTCGCG GGAGGACGAT TTGCGCGAGG CGATGGCGGC GGCCCGCCGC GGCGACGCCA CTGTCATGGA TGCCCTGATC GCGGCGAAGC GCGTCAAAGC ACTGCCTTTG CCCGGACTTG AAGCCGCGCG GCGAGGCGAT TTGAATCTCT TACGCTCGCT CGTTTCGCGC GATGGATGGG ATCCACGCTC GAAGGACGCC GTCGATAAGC ACGGTTCCAA CGCGTTGCTT TGGGCCGCGG GCGCCGGGCA CGTCGAGTGC GTCGAGTTTC TCGTCGAAAA ATGCTGTATG AATCCTCAAA CCTCCGTCCA GAGCGGACGG CGCTCGTACG CCGGTCGAAG CGCCTTGCAC TGGGCGGCGC GAAACGGCCA CGTCGAGGTG GTGGAATATC TGCTTTCGCG CGGCGTCGAT CCGAACAGCA CCACTGAAGA CGGATCCACC GCTTTCGCGT GGGCTTGTTG GCAAGGTCAT CTCGCCGTCA TGCGCCAGCT CGTTGAACGC GCCGAGTGCG ATTACAAGTC GTGCAACGAT TACGGTTGTA ACGTCGCGTG CTGGACCGCC ATGGGCGCCG GTGGCGTCGA GTGTTGCGAA TATCTCGCCT CACTCGGCGT GCGTTTCAAT TTGATCAACG CCAACGGTCA TAGCTGTTTA CACAAAGCCG CACAGCGTGG AAATCGAGAC GTGTGCGAGT GGCTCTTAGA TACGCCGAGT CTGGGTCTAA CGCGAGACCA CGCCCAACCC GACGCGGAGG GATACGATCC GGCGGGTTTA GCTCTCGTGG AAGGCTTCAA CGACGTCGCC GACTGGCTCA AGGCGCGCCA GCTGGAGCTC GAGTTCGCAA ACCATAAACC TTAG
|
Protein sequence | MPPLEDIKNV LLDALTEIES TARDVLASLD ERRLVRVSKK LPGGDVECGA ANALFHALRA QSGRYASTTV TCVSPMGVER EHTAKSLATA ATELIPNAFA SKHFAAVRVH DGGDSGMITI CTLERYAELR DAGSAMCDKC GKFISGGERG LWWHRKTRHN DLHQEAMDAV ERERNALVAM STSGSRSDLT NGDAAYLDNK TKKASREDDL REAMAAARRG DATVMDALIA AKRVKALPLP GLEAARRGDL NLLRSLVSRD GWDPRSKDAV DKHGSNALLW AAGAGHVECV EFLVEKCCMN PQTSVQSGRR SYAGRSALHW AARNGHVEVV EYLLSRGVDP NSTTEDGSTA FAWACWQGHL AVMRQLVERA ECDYKSCNDY GCNVACWTAM GAGGVECCEY LASLGVRFNL INANGHSCLH KAAQRGNRDV CEWLLDTPSL GLTRDHAQPD AEGYDPAGLA LVEGFNDVAD WLKARQLELE FANHKP
|
| |
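The derived fields in this record (Gene Length, GC content, Protein sequence) can be cross-checked against the coordinates and the gene sequence. The sketch below is illustrative only and is not part of the database record. The helper names `gene_length`, `gc_percent` and `translate` are hypothetical. It assumes 1-based inclusive coordinates and the standard genetic code (NCBI translation table 1); the record's Translation table field is empty, so that choice is an assumption. The listed gene sequence carries three bases (GCG) ahead of the first ATG, so translation here starts at the first ATG.

```python
# Minimal sketch for sanity-checking the record's derived fields.
# Assumptions: 1-based inclusive coordinates; standard genetic code
# (NCBI table 1), since the record leaves "Translation table" blank.

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Map each codon to its one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    gc = sum(1 for base in s if base in "GC")
    return 100 * gc / len(s)


def translate(seq: str) -> str:
    """Translate from the first ATG up to (not including) the first stop."""
    s = clean(seq)
    start = s.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(gene_length(2158, 3621))                        # Start bp, End bp
    print(translate("GCGATGCCAC CGCTTGAGGA TATCAAAAAT"))  # first 30 bases
```

Passing the full Gene sequence value to `gc_percent` and `translate` should allow comparison with the GC content and Protein sequence fields listed above. With 1464 bp, three leading bases and one stop codon, the arithmetic (1464 - 3) / 3 - 1 gives 486, which agrees with the Protein Length field.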