Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31018 |
Symbol | |
ID | 5001419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 169596 |
End bp | 170930 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | |
GC content | 61% |
IMG OID | 640416840 |
Product | predicted protein |
Protein accession | XP_001417162 |
Protein GI | 145345320 |
COG category | [S] Function unknown |
COG ID | [COG0398] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.212376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCGC TTGAACTGGC GCGACGAGGA CCCGAACCGG GGGAGACGTG GACCGCGGAG CGAGAGGAGA TGCGAGCGCG GGCGCGGGCG TCGATTCCGA GCGCGGTGGG GACGTATTGG CGAGCGAGGA GCGAGGCGCT GATTGAAAGT CAAGCGGTGC GGGACGCGGC GCCGTTGATG TCGAGCGCGG TGAACGTGGC GATCGCGGGG GTGGTGCTGC GGTTGTTTTT GCCGCGCGTC GCGGCGCTGC AAGCGGTGGG TGGGTTCGAC GAGTTGACTG AATTTTTTGG GTTGCCGCCG AGGAGCGAGT TGAGCGGATA TTTGGATCAG TTGCGAGCGT TGCCCGTGGC GGCGGTGTTC GCGGTGTACG TTGGGTTGTT CGTGGCGGAA AAGTTGACGA TGACGGATGA GTTTTTGCCG ATCGGCTTCG TGCTGCCCGT GGTGTCGCCG GTGGTGTTCG GCGGCGTCTT CGGAGGGACG ATGGTGACGT CGCTGGCGAG CACGCTGGCG GCGAGCGTCA ATTTCTTCCT CGCGCGGTAC GTGTTGAAGG ATAAGATATT AGGCTTTAAG TGGGGCGAAA GTGATCCCGT GGGCGAGCAA AAGTGGTTCG CCGCGCTGAG TCGAAGATTC GACTCGTCGC AATTCCCCGA GTCCACGGTG CCCGAGGGGT TCAAGTCGGC GCTCTTGCTC AGGCTGTGCC CGATTTTACC GATTCCGATA AGTGGGAACT GGTACGTGTG CGGGCTGACG CCTCTCAAAT TCAAAGAGTT CTTCGCCGCG CACTTCATCG GAAGCTCGAA GACTGCGTTC ATCGACGCGT ATTTAGGTTC AATTTTGCTC ACCGCGGTGT TCGACGAGTC ATCCGTCAAG GACCAGGCGC AAGGCGCGCT CGTGTTCGAA ACCGTCGCCA TCATGGTTGT TTCCATCTTA GTCAGCACGT ACGCCACGGA GCTCTTCACG CAGATTCTCG ACGAAGAAGG CGTCGACGCG GGGGCGATGA TGGGATTCGG TTCGGAATCC AAAGACGAAG ACGAAGGCGA AGACGCCGTC GACGCCACCG TCGCCTTCAT CGCCGCCGCC GCCCTGCCCG TGGAACCAGC CGGCTCCACG GCGACGAGCG ATGATGAACC GAAAGCCGAC GACGACGAGA ACGACGACGA CGCCACGTCG AACGAACCTG AACTCATCCC AATCGAGCGC ATGCCCGAGG ATGAAAAAGT TTTAATCGCC GAAGGCGAAG CGCTCTGGCG ACGCGCCGCG CGCGTCGAAG CCGAGCGTCA AAAGCTCACC ATCGAAGAGA TGACCGATTA CGACTCCATG GGACCAGACA TGTGA
|
Protein sequence | MAALELARRG PEPGETWTAE REEMRARARA SIPSAVGTYW RARSEALIES QAVRDAAPLM SSAVNVAIAG VVLRLFLPRV AALQAVGGFD ELTEFFGLPP RSELSGYLDQ LRALPVAAVF AVYVGLFVAE KLTMTDEFLP IGFVLPVVSP VVFGGVFGGT MVTSLASTLA ASVNFFLARY VLKDKILGFK WGESDPVGEQ KWFAALSRRF DSSQFPESTV PEGFKSALLL RLCPILPIPI SGNWYVCGLT PLKFKEFFAA HFIGSSKTAF IDAYLGSILL TAVFDESSVK DQAQGALVFE TVAIMVVSIL VSTYATELFT QILDEEGVDA GAMMGFGSES KDEDEGEDAV DATVAFIAAA ALPVEPAGST ATSDDEPKAD DDENDDDATS NEPELIPIER MPEDEKVLIA EGEALWRRAA RVEAERQKLT IEEMTDYDSM GPDM
|
| |