Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28221 |
Symbol | |
ID | 5006136 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | - |
Start bp | 21230 |
End bp | 22233 |
Gene Length | 1004 bp |
Protein Length | 296 aa |
Translation table | |
GC content | 55% |
IMG OID | 640421557 |
Product | predicted protein |
Protein accession | XP_001422180 |
Protein GI | 145355891 |
COG category | [S] Function unknown |
COG ID | [COG5134] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0015286 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAACCT TAGCTGCGGT GCAGGCTGAT GGGTTCTATT ACCCTCCTGT ACGTATCCTC GCGCGATTCC GCGCAAGAGA GACCGCGAAG CGCGTCAATC GACGCTCGAT TCCAGCGCCG TCGTCTCGAC ACTGACCGCG GCGCGTCGTA TTCACTCGCA GGATTGGACG CCCGAGGGTG GGGCGAAGAA CAGCGCGTAT AAGGGCTCAA ATGGTTCGCT CGGCAAGCGC GCGAACAAAC TGTCTCAGGG TGTGCTGACG ATTCGTTTTG AGATGCCCTT CAACGTCAAG TGCGGGGCGT GCGGACACAT GATTGCGAAG GGTGTGCGGT TCAACGCAGA GAAGAGGAAG ATTGGGAAGT ATCATTCGAC GCCTATATGG AGCTTTACAA TGCACTCGGC GTGTTGTTCG CAGGAGATCG AAGTTCAGAC GGATCCTGCA AAGACGGAGT ACAATGTGAC CAAGGGCGCG GAGCGATGCG TGGGCTACGG CGGAGCGATG GACGACGAAG AGTCGCCTGA GGATGCGATG CAAATGCTGG AATATCAAAA CGAAGAAGAA CGCGCGAAGT TTTTAGCGAA TCCTATAGCG GTATTAGAAC GAGGCGTGGA GGACGAGAAA CGAGCGAGAG CGAGAGCAAG GACAACTTTT GAGCTCAACG CGTTGAGCAA GGCGCGGTGG AAGGAAGACT ACGACGTCAA CAAATCTTTA CGTCGAAGCA TGCGAGGGCG GCGTAAAGAA GAGCACGCGT TGAGAGATCA CGCCCGCGCG CTTGGTTTAC CCGAGCACGT CAAGTTAGAA CCCGAGCGAG ACGAGGACAA GGAGTATGCC CGCAGAGTGT TCAACGCGTC GGTTTTCGAA CGCAATCGGA AACAGAAAAG AAAGGATATT CTTTCGGAGT CCATTTTTGC GGATAAGCGA ACGAGGCCGA AACCAGCTGC AAAACGCACG AGTTCGGCGT CTTCTAATCA CTCAAGAGCG GCGAAATTAG CGAAGAGACG TTGA
|
Protein sequence | MSTLAAVQAD GFYYPPDWTP EGGAKNSAYK GSNGSLGKRA NKLSQGVLTI RFEMPFNVKC GACGHMIAKG VRFNAEKRKI GKYHSTPIWS FTMHSACCSQ EIEVQTDPAK TEYNVTKGAE RCVGYGGAMD DEESPEDAMQ MLEYQNEEER AKFLANPIAV LERGVEDEKR ARARARTTFE LNALSKARWK EDYDVNKSLR RSMRGRRKEE HALRDHARAL GLPEHVKLEP ERDEDKEYAR RVFNASVFER NRKQKRKDIL SESIFADKRT RPKPAAKRTS SASSNHSRAA KLAKRR
|
| |