Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34669 |
Symbol | |
ID | 5003515 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 591667 |
End bp | 594522 |
Gene Length | 2856 bp |
Protein Length | 922 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418936 |
Product | predicted protein |
Protein accession | XP_001419848 |
Protein GI | 145350935 |
COG category | [K] Transcription |
COG ID | [COG5108] Mitochondrial DNA-directed RNA polymerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00224083 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.282079 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGA CGGCGTTCGG GGACGTGAAG GATCGTATTT TTGAAACTAC GGACGAGCGG AGGGCGAAGA TCAAGCTCGC GCTGCAGTCG CTGCACGCGC GAAGACGACA GCTCAGACGC GCGGGGGGGC ACGAGGCGCG AGAGCGCGCG GAGGACGCGG ATGAGGGGGA GAGACGGGCG CAAACGCGGA AGCAGCGCGA GGCGGAAAAT TTGGCGGTGG AAAGAGAGGT GATGCGGTAC AAGGAGTTGG CGAGGAAGAC GTTCGCGGCG GGGATTGGGG CGCAGTTACC CGTGGTGCAA AAGTTGCTGG CGTCGTTTTA CGTGCCTCTG GTGGAGGCGC TGACGGAAGA ACAAGAGAAG ATACGGGGCA ACGTGCCGGG CGTCGACAGA CGCGTGTACG GGCACTACTT GGGCTTGCTC GAACCGGATA AACTCGCGGT GTTGACGCTT CACGCGACGC TCAGCACGCT GATGAAGGGT GACGGTAGGA ATAATGGCGC GTGGTCGTTC ATAAGGGGCG ACCAGACCGC GGCGGGGAGC GCGAAGTTTA TCAGCGTCGC CGACCAAGTT GGAAGCGCGG TGCAAGCTGA GGTGAACTTG GAGCGCATGC GGTCGGCGGA AAAGGCGGCC AAGGAGGCAT TTAGACGATC TAGACTAGTG AACACCGAAG GCAACGTCAA GGAAATAGAC GACGCGGCGG AGCATAGGAT TAATTTGGCG TTGGATACGT CAAAGATGAA CACCATCAAG TCGGTGTCCA AGCACGCTCG GCAAGCGCTC GAGAACGCGG AGTGGGGTCG TGAGATTCGC CTTAAAATCG GAACAGTGCT GCTGACCGCG CTCATGAACA CGGCAAAGAT TGGCGTACCG GACGAAAAAG GTGAATTATT GACGCTTCCA GCCTTTTATC ACGACTACAA AGAGGCTGGG TACGGAATGC TGCACTGGCA CGATAGCATT TACAGATTCA TCAACACGGA GACGATGACT CGCGCAGCGC TCGTACCGGT GCGACACTTT CCCATGGTTA TCCCGCCGCG GTATTGGGAG AGGTACAATA AGGGAGGATA CTTGCGCGCT GATAATCTAT GCATGCGAGG GAAGTACTCG AACGAAGGGC CGAGTCGAGC GCAAATCGCG GCGTTGGAGG AGAAAGCGCG CGAGGCGGAC GCATCGGGGG AGCCCGTGCA ATACCAACCC GTGTTAGACG CTCTCAATGC TTTGGGGCAA ACCGCGTGGC AAATCAATAC GGATGTGTTA CCGATCGTGG AAGAAGTCTG GGCTCGAGGT GGTGGCGTGG CTGAGGTTCC GCTTCGCGCT GAGTTGCAAC TCCCACGCTG GCCCGGCGGG TCATATGCGC TTCGCAGCGA CAAAAAGCGT TTGCAGCTGC TCGCTAGTGG ACTTCCGGGC AAAGGTGAGG TTATCGACTT TTTGCAAAGC GTGCGAAAGA CGAAAAAGTC AAACATGGAG CTCCACTCTC AGCGCTGCGA CTTTCTCATT AAATTGCAAG TTGCGCGAGA GATGAAAAAC GAACCAAATA TTTATTTCCC GCACAATTTG GATTTTAGAG GGCGCGCGTA CACGATGCAC GTGCACTTGA ATCACATAGG AAGTGATTTG TGTCGCGGTT TGCTTCGATT TAACGAAAAG AAGCCGCTCG GCGAGCGCGG CCTGCGTTGG ATGCACATCC AGTGCGCGAC GCTGTTTGGT AACGGCGCCG ACAAGCTTCC GATGGATGAG AGAGTGCAGT TCATCAAAGA TCGCATAGAA GACGTGCGAG CGTCGGCTCA AGACCCGCTC GCGAAGGATG CGTGGTGGCA AGAAGCGGAA GAGCCCTGGC AGTGTTTGGC GACGTGCATC GAGCTCGACA AAGCGCTCGA GCTCTCGGAC CCGACGCAGT TCATGAGTAA TTTACCCGTG CATCAAGATG GGTCGTGCAA CGGGTTGCAA CACTACGCCG CGCTCGGTCG CGACTTACAC GGTGGTGAAG CCGTGAACTT GGTCCCCGCA GACAGAGGTG CGGACGTCTA CACCGGCATC GCGAATGTGT TGAAGCGCAT CGTTGCTGAG GATATTAAAC TCATCGACAG CGAAGACGAG GAAGACGTTA ACAACGCCAA GCTCGCGATG TCACTCGCTC AGCACATCGA CCGTAAGCTT GTGAAGCAGA CGGTGATGAC GTCCGTGTAC GGCGTTACTT TCATCGGGGC GCGAGCGCAA ATATATAGCC GTCTCCGTGA GCGCGAGGCG ATGGAGGACA ACGAACTTCT TCGCTATCGT GTGTCCAACT ACGCCGCGAA AAGGACGCTC GACGCGTTGA ATAATATGTT TTCAAACGCC CGAGATGTCA TGGGATGGCT CACGACCTGC GCCACGATCG CTACCTCAGC GGGCGAGCCC GTGCGTTGGA CCACGCCTCT GGGATTGCCC GTCGTGCAGC CGTATCACAG TCAGCGAACC AAGCGCGTGC GGACGATTTT GCAGTCATTC TCGCTAAAAG TTCACGACGA ACAACAGCCG GTTATGAAAG TGAAGCAAAG GAGCGCGTTC CCGCCGAATT ATATTCACAG TATCGACAGT TCTCATATGA TGAGGACGGC GATCGCGTGC GTGGACGCCG GATTGACGTT CGCCGGCGTT CACGATTCCT TTTGGACGCA CGCGACGGAC GTGGACACCA TGAATGTCAT CCTGCGTGAA AAGTTCATCG AGGTTCACAA AGAGCCTCTT CTCGAAAATC TTTATCACGA GTTCCGCGCG AATTACCCAG ACGTCGCGGA CGAGTTCCCT CAGCCGCCCG CACCTGGCGA TTTGGATTTA GACGTCGTTC AGGACTCGGT GTACTTTTTC AGCTAG
|
Protein sequence | MAATAFGDVK DRIFETTDER RAKIKLALQS LHARRRQLRR AGGHEARERA EDADEGERRA QTRKQREAEN LAVEREVMRY KELARKTFAA GIGAQLPVVQ KLLASFYVPL VEALTEEQEK IRGNVPGVDR RVYGHYLGLL EPDKLAVLTL HATLSTLMKG DGRNNGAAKF ISVADQVGSA VQAEVNLERM RSAEKAAKEA FRRSRLVNTE GNVKEIDDAA EHRINLALDT SKMNTIKSVS KHARQALENA EWGREIRLKI GTVLLTALMN TAKIGVPDEK GELLTLPAFY HDYKEAGYGM LHWHDSIYRF INTETMTRAA LVPVRHFPMV IPPRYWERYN KGGYLRADNL CMRGKYSNEG PSRAQIAALE EKAREADASG EPVQYQPVLD ALNALGQTAW QINTDVLPIV EEVWARGGGV AEVPLRAELQ LPRWPGGGLP GKGEVIDFLQ SVRKTKKSNM ELHSQRCDFL IKLQVAREMK NEPNIYFPHN LDFRGRAYTM HVHLNHIGSD LCRGLLRFNE KKPLGERGLR WMHIQCATLF GNGADKLPMD ERVQFIKDRI EDVRASAQDP LAKDAWWQEA EEPWQCLATC IELDKALELS DPTQFMSNLP VHQDGSCNGL QHYAALGRDL HGGEAVNLVP ADRGADVYTG IANVLKRIVA EDIKLIDSED EEDVNNAKLA MSLAQHIDRK LVKQTVMTSV YGVTFIGARA QIYSRLRERE AMEDNELLRY RVSNYAAKRT LDALNNMFSN ARDVMGWLTT CATIATSAGE PVRWTTPLGL PVVQPYHSQR TKRVRTILQS FSLKVHDEQQ PVMKVKQRSA FPPNYIHSID SSHMMRTAIA CVDAGLTFAG VHDSFWTHAT DVDTMNVILR EKFIEVHKEP LLENLYHEFR ANYPDVADEF PQPPAPGDLD LDVVQDSVYF FS
|
| |