Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_44713 |
Symbol | |
ID | 5000123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 259705 |
End bp | 262882 |
Gene Length | 3178 bp |
Protein Length | 982 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415544 |
Product | predicted protein |
Protein accession | XP_001416112 |
Protein GI | 145342057 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0417] DNA polymerase elongation subunit (family B) |
TIGRFAM ID | [TIGR00592] DNA polymerase (pol2) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCGATGTGC GAGCGGCGGA TCGCGCGAAT TGGGTGCGAC GACCGCTGGC GCGGGCGGTG GACGCGACGC GGGACGCGCT GCTGTTTCAA CAGTTGGACA TCGATTACGC GGTGGAGAAG CCGCACGAGG GACTGGGGGC GCGACGGGCG AAAGGTGACG CCGCGGTGGT GCGGATGTTC GGGGTGACGA AAGAGGGACA CAGCGTGTGC GCGCACGTGC ACGGGTTCGA GCCGTATTTT TACGCGTCGG TGCCGGAAAA TTTCGGCGAG GCGGATTGCG CGGCGTTCAG GCGACGTTTG AACGAGGAGG TGAGCGCGGC GAGGAAGAAC GCGCCGGGAG TGCACGTCGT GGACGTGTCG CTCGAGAGGA AGCAGAGTTT GATGCATTAT AGCGACGTGA AGGATCGCTT GTTCGCCAAG ATCACGATGG GGCTGCCGAA TATGGTGAGC GCGGCGCGAG GGATTTTGGA AAAGGGATTC AGCGTGCCGG GGGTTCGGGA CGGGGCGTTC ACGACGTACC CGACGTTTGA GTCCAACATC GTGTACGCCT TGCGGTTTAT GGTGGACTGC GCCGTGGTCG GTGGGAACTG GATCGAGTTT CCGGTGAATT CGTACACGGT TCGTGCGAAG AAGGCGTCGC ACTGTCAAAT CGAAGTGGAC ATCATGTACG ACAAGCTCAT CTCGCACCCG GCGGAGGGTG AGTACTCCAA GCTCGCCCCG TTTCGCATCT TATCCGTGGA TATCGAGTGT GCGGGGCGTA AAGGCCACTT TCCCGACGCA CAATTGGATC CGGTGATTCA GATCGCGACT CTAGTCACCG AGCAAGGGAG CGATAAACCG ATTATTCGAG CCGTGTGGAC GCTCGATACG TGCGCTCCGA TCGTCGGCGC TGACGTCTTG AGTTTTAAGG ATGAGCGGGA ACTGCTTCGG AACTGGGGTA ATTTTCTTCG CGGGTGCGAC CCGGATTTGT TGATCGGCTA TAACATCGTC AATTTCGATT TCCCGTACTT GTTGGAGCGC GCCGAGAAGC TCGGCGTCGC CGACTTTCCG TACTGGGGAC GCTTGATCGG CACCAAGGTG CGCATGCGCG ACACGGTTTT CTCGTCCAAG GCGTACGGTA CGCACGAAAG TAAAGAAATA TCTTGCGAGG GACGCGTGCA ATTCGACATG CTTCGCGCTA TTCAGCGCGA TTACAAGTTG TCTTCATATT CGCTCAACGC AGTTTCTGCG CATTTTTTGG GCGAGCAAAA AGAAGACGTG CACTACAGCG CTATCTCGGA GCTGCAAAAC GGTACGCCCG AGACGAGACG GCGTTTAGCG GTGTACTGTT TGAAGGATGC GTATCTTCCG CAGAGATTGC TGGACAAGTT GATGTACATG TACAACTACA TCGAAATGGC TCGCGTCACC GGCGTGCCGC TGAATTACCT GTTGACTCGC GGTCAGTCCA TCAAGGTCAT GTCGCAGCTT TTGCGCAAGG CGAGGCAGCG AAACATGCTG ATTCCTCACC ATGTGAAGCA GGGCGGTGGC AACGTTCAAG CGGAGGGTGG TGTCGCGTAC GAAGGTGCGA CGGTCTTAGA CGCCAAGGCT GGTTACTACG AGATGCCCAT CGCGACGCTC GATTTCGCTT CGCTGTATCC ATCCATCATG ATGGCTCACA ATCTGTGCTA CTCGACTTTA GTGCCAAAGG ATAAGGTGGG TAAGCTCAAC CCGGATGATT ACGGGACATC GCCATCGGGA GATACGTTCG TGAAGGCATC CAAGGTGAAG GGGATTTTGC CAGAAATTTT AGAGGAACTC TTGGGGGCGC GTAAACGCGC CAAGGCTGAT CTTAAAAAAG CCACTGATCC GTTGCAAAAA GCGGTCTTAG ACGGTCGCCA GCTCGCGCTG AAAGTCTCAG CAAACTCTGT GTACGGTTTC ACCGGTGCCA TGGTCGGACA GCTGCCTTGC CTCGAAATTT CTTCTTCAAC GACGGCGTAC GGCCGTACGA TGATTGATCA CACGAAGAAA ATGGTGGAAG AGAAGTATCG AACCGTGAAT GGATACACTG GAAATGCCGA GGTCATTTAT GGTGATACCG ACTCCGTGAT GATCAAGTTC AATACGCCGG ATTTGGCGGA GTCTATGAAA CTCGGTGAAG AAGCCGCGGT GTACGTCTCT GAGACGTTCT TGAAGCCTAT CAAACTCGAA TTCGAAAAGG TGTACTGGCC ATATCTCTTG ATTAGCAAGA AGCGCTACGC TGGTTTACTC TGGACGAACA CTGAAAATTA CGACAAGATG GACACCAAAG GTATCGAGAC CGTGCGTCGC GATAACTGTT TGCTCGTGCG ACAAGTCATC GAGACGGTTT TGGAGAAGAT TTTGAAGCAG CGAGATGTGC AGGGCGCGGT GAAATACGTG CAGAGTACGA TCGCAGACTT GCTCATGAAT AGATTAGATT TCTCACAGCT CGTGATCACG AAGGGTTTCA CAAAAGAAGC AGACTCGTAT GGGGTCAAGA TGGCACACAT AGAGCTCGCG CAGCGCATGC GCCAGCGCGA TCCCGCCACC GCGCCGGTAG TCGGCGATCG TATCGCGTAC GTCATCATTA AAGCGGCGAA GAACGCAAAG GCGTATGAAA AGTCAGAGGA TCCGATTTTT GCTTTGGACA ACAACTTACC CATCGATACG AAGCACTATT TGGATCACTA CTTGACCAAG CCTCTGCTGC GTATTTTCGA GCCCATCTTG CACAACGCGC AATCCGTACT GTTGCACGGC GAGCACACGC GTCGCATCGC GCAGCCGACG CCGACTGCGA AAGCTGGCGG AATCATGCAG TTTGCCAAGA TTCGTTTAAG CTGTATCGGT TGCCGCGCAC CGATATCGGA TGAGAAAATG TCCAAGTCGC TGTGCAAGAA TTGCCTCAAC GACGAATCGC AGCACCTGAG AAAAGCGCTC GCCTCCGTGA ACAACCTCGA GGGAGACTTC AACCGGCTTT GGACGCAGTG TCAGCGGTGC CAAGGCTCAC TTCACCAGGA CGTCCTCTGC ACATCGCGGG ATTGTCCCAT TTTCTACCGT CGCAAAAAAG TCCAAAAAGA TTTAACCGAA GCCACGGCGC AGCTGCGACG TTTCGATTGG TGAGCGCGAC GAAGATTCCC GACGCGATAG TCGAGTCGAT TCCCGACGCG ATAGCCAA
|
Protein sequence | MFGVTKEGHS VCAHVHGFEP YFYASVPENF GEADCAAFRR RLNEEVSAAR KNAPGVHVVD VSLERKQSLM HYSDVKDRLF AKITMGLPNM VSAARGILEK GFSVPGVRDG AFTTYPTFES NIVYALRFMV DCAVVGGNWI EFPVNSYTVR AKKASHCQIE VDIMYDKLIS HPAEGEYSKL APFRILSVDI ECAGRKGHFP DAQLDPVIQI ATLVTEQGSD KPIIRAVWTL DTCAPIVGAD VLSFKDEREL LRNWGNFLRG CDPDLLIGYN IVNFDFPYLL ERAEKLGVAD FPYWGRLIGT KVRMRDTVFS SKAYGTHESK EISCEGRVQF DMLRAIQRDY KLSSYSLNAV SAHFLGEQKE DVHYSAISEL QNGTPETRRR LAVYCLKDAY LPQRLLDKLM YMYNYIEMAR VTGVPLNYLL TRGQSIKVMS QLLRKARQRN MLIPHHVKQG GGNVQAEGGV AYEGATVLDA KAGYYEMPIA TLDFASLYPS IMMAHNLCYS TLVPKDKVGK LNPDDYGTSP SGDTFVKASK VKGILPEILE ELLGARKRAK ADLKKATDPL QKAVLDGRQL ALKVSANSVY GFTGAMVGQL PCLEISSSTT AYGRTMIDHT KKMVEEKYRT VNGYTGNAEV IYGDTDSVMI KFNTPDLAES MKLGEEAAVY VSETFLKPIK LEFEKVYWPY LLISKKRYAG LLWTNTENYD KMDTKGIETV RRDNCLLVRQ VIETVLEKIL KQRDVQGAVK YVQSTIADLL MNRLDFSQLV ITKGFTKEAD SYGVKMAHIE LAQRMRQRDP ATAPVVGDRI AYVIIKAAKN AKAYEKSEDP IFALDNNLPI DTKHYLDHYL TKPLLRIFEP ILHNAQSVLL HGEHTRRIAQ PTPTAKAGGI MQFAKIRLSC IGCRAPISDE KMSKSLCKNC LNDESQHLRK ALASVNNLEG DFNRLWTQCQ RCQGSLHQDV LCTSRDCPIF YRRKKVQKDL TEATAQLRRF DW
|
| |