Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15788 |
Symbol | |
ID | 5002489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 399461 |
End bp | 400894 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | |
GC content | 63% |
IMG OID | 640417910 |
Product | predicted protein |
Protein accession | XP_001418470 |
Protein GI | 145348049 |
COG category | [S] Function unknown |
COG ID | [COG2433] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00010721 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.705917 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACGAC CGGCGTCGGA ATCGCTGACG ACGATCGCGC CGCTGCAGCT GCGGTTCGAG GACATATCCG TGCACTCGCC GGGGGTGTTC GCGAAGTCGG ACGCGATGAA GACGCTGCGG GAGGCGCCGA GCGGGGTTGG ATTTCGCGCG GCGCTCAAGG AGTTGAATCA GACGGAGGAT TCGTTCGCGA AGATTTTGTC GCAGGCGCGG GAGAAGGCGC TCGGGAACGG GGACGGGGAT TTGGAGCGAA TCAAGGCGCA CATTAAGCGG TTGAAGTTCG GGACGATCGA GTTGTCGACG GAACAGGCGT TTTTGAAGGC GGTTTTGGAA TCGCGACCGT TGCCCAAGGA GACGCCGACG GAGAGCGAAG GGTTCAAAGC TTTGCAACAG CGGTTGAAGA ATTTGACGAA CGAGAACGAA CGACGGGCGG AGGAGATGGA ACGCGAGATT GCGTCGCTCG GGGACGAGTA CGAGACGTTT CGCGTCGAGC ATAGGGCGTT GACGGAGGTT GTGGAAGAAT TAGAGCAGTT GGAGGCGGAA GCCGCGGCGG CGGCGTGGGA AGCGCAGGCG GGAGACGCCT CGGTGAGCGC CGCGGACCGG GCGAAGTCCA AGGAGGAGTT ACAGCGCGAG CTCGAAGCGC TCGATGCGGA ACTGCGCGAG GTGAATGTGA AGTTGAGCGA GAATCAAAGC GGCGCGAGCG AACTCAAGGC GGAGTTGGCG CCGGACGAGA GCGCGATTGA TCAAATCAAC GCCCAGGCCA AGTACTTGAT CGGTATCTCC AAGGCTGCGG CGCGAGAAGC GGAAATGTCG GCGCAAATCG CCGAAGCGCA AGAGGCGGTG CTCGAGCAAA CGAAATTGCT AGTCACTTTA CAAGGGGTGG AAATCATGGC CATCGAAGAA AACGCCCTCG TGCTCAAGAT TCACACGCAC TTGCCCGCGA CGCCCGAGTT CGCCATGGAA AGCGCCAAGC CGCGTGGCCC GAGCTCGACG ACGCACACGG TCACGCTGCA CTTGATGAAG GATAGCGCTC GTCTCGCCGG GGCGACGCTC GAACCGTCGG ACACGCCCAT CCTCGACATC ATCGAAGAGT CGGCGGGCAT GCCCGTCCTC GCTCGCGCGT TGGCGGAAAT CCGCCTTCGC ATCGCCGCCA CGGCGCAACG TGCCGAGGCC CTCGCGGCGG CGGCGGCGCG AACGGCGCTC AGATGGAGTT CCGGCGAGTC ATTGGTGCGC GCGGCGCTGC CAAACGGCGC CGTCGCCGTC TTGGACGTGC CTTTCGAATG GCCGACGCGC GGCGCGAACA TTTCGTTGAT CAATGTCGAG ATGGTGCCGC CGCAAGTGGC ACAACTCGCG GTGACCCGCC TGATGACGAA GAATCTGTTG TGCATCGGCG ACGCGCTCGA AGCCACGAGC GAGGCGCTGA AGAACCAGGC GTAA
|
Protein sequence | MERPASESLT TIAPLQLRFE DISVHSPGVF AKSDAMKTLR EAPSGVGFRA ALKELNQTED SFAKILSQAR EKALGNGDGD LERIKAHIKR LKFGTIELST EQAFLKAVLE SRPLPKETPT ESEGFKALQQ RLKNLTNENE RRAEEMEREI ASLGDEYETF RVEHRALTEV VEELEQLEAE AAAAAWEAQA GDASVSAADR AKSKEELQRE LEALDAELRE VNVKLSENQS GASELKAELA PDESAIDQIN AQAKYLIGIS KAAAREAEMS AQIAEAQEAV LEQTKLLVTL QGVEIMAIEE NALVLKIHTH LPATPEFAME SAKPRGPSST THTVTLHLMK DSARLAGATL EPSDTPILDI IEESAGMPVL ARALAEIRLR IAATAQRAEA LAAAAARTAL RWSSGESLVR AALPNGAVAV LDVPFEWPTR GANISLINVE MVPPQVAQLA VTRLMTKNLL CIGDALEATS EALKNQA
|
| |