Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15837 |
Symbol | |
ID | 5002595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 515570 |
End bp | 517174 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418016 |
Product | predicted protein |
Protein accession | XP_001418510 |
Protein GI | 145348132 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5260] DNA polymerase sigma |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.391811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAGA TGAACGGGAT CGAGACGTCC GCGAGTGCGG CGACGAGCGA AGAAAGCGCG AGCGATGCGC GTTGCGATGG TTCCGTGAGC GCCGAGGACA ACACGATCGA TGAAAGCGAT GCCGAAGACG ATCATTTCGA CGATTCATTC GACGAATACA CGGATGAGGA AGACGATCAT AACTTTTCGT TCAACGTCGA GGAGACTGAG CTCAACAACA TCGAGAGCGA TCGAGTCGTC GGCGATGATT CTTTGCATTC GCAAGTGATT GATTTCGTGA AGCATAGCGA GATGTCTACG CACGAAGCGA GTCGTCGACA ACAGTGTTTC GACGCCATTC AATCCGCCAT CGCTCGACAT TACGCTAATC ACAAAGATTG CTCTCTGCAC GTATTCGGGA GCGGCGCCAC GGGGCTGGCG CTCGCAGGAG CCGATCTAGA TTTAGTTTTA CTCGGCGTCG GTCCGCAGTC GCGCAAGGGC GGCGGTGGTG GGTTTACACG AAGCGAACGC GACGAAATCG TCGGCCACTT GCGAAAAATG GCTCGATTCT TGCGCAAGGT GAACGCTGTG TCGCGAGCGG AGATCATCGC CTCGGCGAAG GTTCCGATCA TTAAAATGAA AAGTGCTGTG CCGCCATATA TCGCCGTGGA TCTGTCCTTA GGAACTTCAA ACGGCCTCGA AGCCGTGTAC TGGATTCGCG AACAAGTCGA GACGTACACG GCGTTGAAAC CATTGGTCTT TTACTTGAAG CGCCTTCTTA GCACGCATCA CTTGAACGAC GCCGCGACTG GAGGGTGTGG GGGCTACTTG CTCGTGTCGC TTGTGGTTTC GCACTTGAAA CAAACGGGTT CAGTGGTAGC TGTAAATAAA GCCGGTTTGT TGGGCGAGCT CTTACTCGGT TTTCTCCGGC GATTCGGCTC TGTGTTTGAT TACAGAACAA ACGCCGTCGC CGCCGGTCGC GAATCGGGCG TGATGTCCGC CGCGGAACTC CCCGGTCCAC CGTTTGGGAC GCGGCCGTAC ATCATGGCAG AAGATCCGCA GGAGCGACTG CGTTGCTTTA CCGCGGCGGC GTATCGATTC AAAGAGGTTC AGAACCTGTT TAGACTCGCC GCGGAGCACA TCTCGGTGAG CGGCGAGCTC TCGTTACTCT CCGAGGTCGC CGCGCCACCG CCTAGAAACA GTTTCGGTGC GTTCCCAAAG AGCGGTCAAC TCATCAAGGT TCGCCAGAAC ACTATGGTCC GCCGCGAATC CTCGCCCTCG GGGAGCGGGC GGAACAACCA TCAGTATTTC CGCAAAGCGA ACGCGAAAAA CAGCTTCTCG CCGAACAAAT CCGATTGGAC CGACGATCCG CGCAACGGTA ACAATAAACG TCCTCGTGGT GCTAGCGCGT GGGCCTCCGA CGGCCACCGC GGCTACGGAG GCACCCCGGC GTCACCGAGC GCCAAGCGTC GCCGCGCCGC CGACGAGCAG CGCGCGTTTC GCGAGCGCGG CGCCGGCAAC GCGAAGAACG GCAAAAAGAC CGCGCGCGCC TCGAGTGCGA AGCCCAAGCA GCGCGCAAAG CTCGCGAGCA AGCAATCTCG CGTCACGAAG AAGAAGAAAA GGTAG
|
Protein sequence | MREMNGIETS ASAATSEESA SDARCDGSVS AEDNTIDESD AEDDHFDDSF DEYTDEEDDH NFSFNVEETE LNNIESDRVV GDDSLHSQVI DFVKHSEMST HEASRRQQCF DAIQSAIARH YANHKDCSLH VFGSGATGLA LAGADLDLVL LGVGPQSRKG GGGGFTRSER DEIVGHLRKM ARFLRKVNAV SRAEIIASAK VPIIKMKSAV PPYIAVDLSL GTSNGLEAVY WIREQVETYT ALKPLVFYLK RLLSTHHLND AATGGCGGYL LVSLVVSHLK QTGSVVAVNK AGLLGELLLG FLRRFGSVFD YRTNAVAAGR ESGVMSAAEL PGPPFGTRPY IMAEDPQERL RCFTAAAYRF KEVQNLFRLA AEHISVSGEL SLLSEVAAPP PRNSFGAFPK SGQLIKVRQN TMVRRESSPS GSGRNNHQYF RKANAKNSFS PNKSDWTDDP RNGNNKRPRG ASAWASDGHR GYGGTPASPS AKRRRAADEQ RAFRERGAGN AKNGKKTARA SSAKPKQRAK LASKQSRVTK KKKR
|
| |