Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33043 |
Symbol | |
ID | 5003083 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 299180 |
End bp | 300753 |
Gene Length | 1574 bp |
Protein Length | 484 aa |
Translation table | |
GC content | 61% |
IMG OID | 640418504 |
Product | predicted protein |
Protein accession | XP_001419337 |
Protein GI | 145349845 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.804843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.549748 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACGA GCGCCGAGGT GCGGCGCGCG CGCGCGTCGC GTCGTTCGTC GCGCGTGGAA GATGGCGCGC GTCGTCGCGA GGCGCGCTGT GGACTGACGC GGTCACGTCT CGAACAGGAA TTCGCCGCGC TCGTGCGCGT GGTCAAGGAT GCGCAGCGTC GATTCACCGC CGCGCCGAGC GAACACCCGG GGTGCGACTT CAAGGCGTGG CTCAAGGCGA ACGTGAAGCG ACCGGAGAAC CAACGAGGCG ATCCGAGCCG GCACACGGCG GAGACGCTGA GAAAGTTTGT GGAGACGCTC GGGACGGCGC TCGGGACGGC GCGCGTGACG GCGCTGACGA AACACGATCA ATGGCGGAAG AAACGCGCGG AGTGGGACGA GTGGGGCGAC GACGAGGTCG TCGATGCGAA GGGTGATCGA GCGTGGGCGC TGGTGCGACG AACGCGAAAG CATGAGGGGT TCGCGGAGGC GTATTATCGG TTGCCGTCGC ACGAACCGCC GTGGCGACGG ACAAAGTATC GACCCGTGCG TTGGTGCGTG GAGAAAGAGA TGAAGCCGAG GTTGTTGGGG GTGGATTGTG AGATGTGTGA GACGGATGAC GACACGCGAG CGCTCGTCGG GGTGTCCGTG GTGGATGATG AGGGAAATAT TTTGTTGAAG ACGCTCGTGA AGCCGCCGGG GAACATCGTC GACATGAGAA CGGAGATTAC TGGGTTGAAG GCGGAGAACG TCCTCGCGGC GCCGACGACG TTGAGCGACG TGCAGGATAG ACTCGTGGAG TTGTGTAAAC CGGGAACTGT GCTCGTGGGT CATTCGTTGA TGCATGATTT GAAGTCTTTA AAGATTGACC ATCAACCCGT CATCGACACT GGAATGTTGT TTCGTTACAA GAATCTCCCT CGGTCGACGC CGAGCTTGGC GATTTTGTGC GAAACTTTGC TCAAACGAAA GATGAGACAA ACTGAGGCGG GCTATCACGA TTCCGTCGAG GACGCCAAAG CGGCATTAGA CTTAGTGTTA TGGGCGGTTC GCGAGGCGAA ACCCATCTTT GAGGTGGACG CGCCGCCGCA CAAAGTGGAC GCCGAGGACC TGTGCAAGCT TTTTATCCAT CGCATTCCGC GTGGGACGAG CGCGGAGGCG TTGAAGATGG TGTTCGAAGA AACCGATCGA GCGCACATCG AGAGCGTGCA GGGAAGCTTT CTCGATGCGA CGACGACGGA CTCCGCGGCT CTCGGCGGCA AAAAAACCAC CTCCTGCCTC GTCACGTTTA CCGACACCAA ACGCGCCAAC GACGCGTTCG AACGTCTCGA CGGTGCGGTA ACGAAAGATG CAATCGGTCG CGCGCAGAAG TCTCGCGCCT TACCGCTCGA TTCCACCGAT CGATCCGTCA GCGTCGTCGT CCGTCGGATG ACGTCGAGCG GTAGCGTCGT CTCCGCCGGC GCCGCCGCAG GCGCCAAGCG CCCGGCGAAC AGCGTCCCGG CGACGGTTCC AAAGTCGAAA AAAGTTCGTC AGCGCAAGCC TAAATCGATC GCTCTTCCCG GCGATAAATC GTAGCCCGGA TTCGCGCGCG CGCG
|
Protein sequence | MPTSAEEFAA LVRVVKDAQR RFTAAPSEHP GCDFKAWLKA NVKRPENQRG DPSRHTAETL RKFVETLGTA LGTARVTALT KHDQWRKKRA EWDEWGDDEV VDAKGDRAWA LVRRTRKHEG FAEAYYRLPS HEPPWRRTKY RPVRWCVEKE MKPRLLGVDC EMCETDDDTR ALVGVSVVDD EGNILLKTLV KPPGNIVDMR TEITGLKAEN VLAAPTTLSD VQDRLVELCK PGTVLVGHSL MHDLKSLKID HQPVIDTGML FRYKNLPRST PSLAILCETL LKRKMRQTEA GYHDSVEDAK AALDLVLWAV REAKPIFEVD APPHKVDAED LCKLFIHRIP RGTSAEALKM VFEETDRAHI ESVQGSFLDA TTTDSAALGG KKTTSCLVTF TDTKRANDAF ERLDGAVTKD AIGRAQKSRA LPLDSTDRSV SVVVRRMTSS GSVVSAGAAA GAKRPANSVP ATVPKSKKVR QRKPKSIALP GDKS
|
| |