Gene Information |
Locus tag | OSTLU_38035 |
Symbol | |
ID | 5004200 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 4122 |
End bp | 9980 |
Gene Length | 5859 bp |
Protein Length | 1869 aa |
Translation table | |
GC content | 53% |
IMG OID | 640419621 |
Product | predicted protein |
Protein accession | XP_001419873 |
Protein GI | 145350993 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | |
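The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive coordinates, and that the gene span runs from the start codon through the stop codon (the listed gene sequence begins with ATG and ends with TGA). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4122
END_BP = 9980
GENE_LENGTH_BP = 5859
PROTEIN_LENGTH_AA = 1869


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per residue),
    optionally counting the 3-nt stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


span = span_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)
# Bases inside the gene span that are not accounted for by the protein.
# Under the assumptions above, these would be intronic, which fits
# "Is gene spliced | Yes".
non_coding = span - cds

print("span matches Gene Length:", span == GENE_LENGTH_BP)
print("coding length incl. stop codon (nt):", cds)
print("non-coding bases within span (nt):", non_coding)
```

Under these assumptions the span reproduces the listed Gene Length, and the difference between the span and the coding length is what one would expect to be removed by splicing.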
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0179427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAAC GAACCAGAGA CAAGGCGAAA GCCGAGGTAG AGGACGACGA CGATGACGTC GACGCCTCGG CCTTCCCTCG AGGCGGCGCC GCGAGCGGTG GACGCGGCGA TGAAGATGCG TTTCCTCGAG GCGGCGGCGG CGGCGGCGAC GGTGACGACG CCGGACGGGG CAGGAAGCGT CGATCGAGCC AACGAGGCGG TGATGGCGAT GGAAGCAATG ACGATGACGA TGATCCGTTC TCGCGAATCT CACGCGCGGC GAAAGGGGCG AGTTCGAGAG CGGTGTCTTC GAGCGGCGGC GGGGCGAAGT ACGTCGAGAC TTTGAAGTAC AAGTCGCTAC GACCTGGAGC TAAACTTTTG GGCATCATCT CCGAAGTTAC CGCGCGGGGA TTAGTGATGA GTTTACCAGA CGGCTTGCGC GGCACCGTGG CGCGCGCGGA AGTTGCTGGC ACGTTCGGGA GTAGTCGACG CAACCGCACC GCCGCCGCCG ACGGCTCGGA ATCTTCGGAG GAGGAGTATA GCAGCGACGA GGACGACGAT GACGATGACG CAGAGGCGAG CTTGGAGTTG CTGTACGAGC CCGGGCAAGT GCTTCGATGC GCGGTGGTGA GTTTGGAGAA AGGTAAAACG GGTGGCAAGA GAATCGAGTT GTCTCTCAGA CTAGAAAAGG TGTGCGAGGG CCTCACAAAG GAAAGTCTCA CCGAAGGCTC GGTAGCGCCA GCCGTAGTTC AAAGCGTCGA AGATCACGGC TACATCTTGA GTTTTGGTAT CGCAGATACG AGCGGATTCT TACCGAAGAA AAATGTGGCG AGCGATTTGG GCGAGATTCG TAAAGGCAGA ATCATCGATG TAGTCATCAC CGGCGCACCA AAGGGCAATA AAGGTTATTT TACCGTGACG AGCGATCAGA AGCGAATCAA GACTTCGGTC GCCCACGAGA CGTCGGCGAC GAATGTCGAT ACATTGCTTC CGGGGATGCT CGTAAATTCT AGAATCAAGC AAATACTTTC TGACGGTGTC TCAGTTTCTT TCATGACTTA CTTCAGTGGC ACCGTGGACT GTTTTCACAC TGGGGCTCTT GCGACGTCGA AAGGGGTTTC GTCCGCGTTT AAAGTAGGGC AACGCATGCG TGCGCGAATC ATTTTTGTCG ACTCTGCGTC GAAGCGTGTT AGCCTAACCT TGCTGCCGCA TCTGCTTGAT TACGCGTCCA TCGAACTTCC AAAGCTTGGC AAAACTTTCC AAACTGCCAA GATTGAGCGC GTGGATGCGG GTCAAGGGGT CGCGCTGAGT ATTTCAGATG GTAAGAACGA TATCGCTGGA TATGCACACG TTTCACAGCT TTCTGACGAA CGCGTGGAAA AGGTGGAGAA GAAGTTCAAA ATCGGAAGAA GCGTAAGTGT TCGCGTCATC GGTCATCGTT TGCTCGATGG AGTGGTTAGC GTCAGTTTGA AGTCATCTGT CATGGCTCAA CCTTTCTTTT CATTGGATGA ACTCACACCA GGGATGCTTG TGAACGGCGA AGTTCTCGCC GTTGAGCATT ACGGAGCCAT AGTGAAACTT GCCGAGGGCA TTAAAGCGCT GTGTCCCCCG CTTCACATTT CTGACATTGT CGGCCGAACG ACTTCTGCAA AAGTCGCCCC TGGTGCCAAA TTGAAGTTCA GAGTATTGAA CGTGGATCGA AATAGCCGGA GAGCGACGGT ATCGCATAAA AGAACGCTCA TCAAGTCCGA GCTCCCAGTA ATTGGTCAGA TTGAAGACGC TGTGCCCGGA TCAATCACGC ACGGCGTGGT GACGGGCGTG 
AATGAGTACG GTGTGTTCGT CTCCTTGTAC GGTGATTTGA AGGGTCTGGC TGGTTTGAAT GACTTGGGTC TTCTGCGAGA TCAAAAACCG TCCGACGCGT TTGGTGTCGG ACAAGTTGTT CGAGTACAGG TTGTTTCAGC CGACACGTCT GGTCGGTTAC GTCTTTCGCT CGCGTCTGGC GACGCGGATG GAAACTCTGC GAGCATGATT ATTAATGCGT CCGCAGATGC CTTGAAGCCG GGTCATGTCG TTGAAAAAGC AGTGGTCACG CACGTGGCGT CGGGCACAGG TAATGTCGAG GTGGTTTTTT CTATGGAAGA AGGCAACATA CCAGGCGTCG TGCCGCTCGC CCATTTATCT GACCATCCGC TGACGGCGCA AGGATTGAGC GCTGTTCTCA ATCCCGGTGA CGAGATTGGT CCTTTAGTAG TTCTTGAAGG CAAATCAACT CGAGCAGTGA TGTCGCGCAA ACTTTCACTC GTGGAAAGCT CGCGAGAAGG GAAGCTTCCA GCGACGGCGA AGGAAGCGAC GCTCGGCGCA GTGTTCCCAG GCTACGTCGC ATCAGCCACC GCTGCGGGCG TTTTCGTTCG CTTTTTAGGT CGACTCACCG GTCTTGCACC GCCTTCACAG CTCACAGACG GTACTACGGG AGACGTGCAC GAAATGTTTC CGGTAGGTAA GACTGTCAAC GCATTAATAC TGTCTGTGGA TACGTCCACG CCGACGCCGA GGTTGTCACT CTCATTGAAA GTTTCTGCCA CTTCGTCGCC TCTCAGCGAT GCACCGTTGG TTCGCTCGTT TTTCCAGGAT ATTGAGTTTC TCGATGACAG AGATGTCGGA GCCGAAGACG TGGGTATATC ACCTGAAACC GCAAAGTCGC TCAAGCCCGG TACGTGGATG GATGTGTCAG TTAACGAAAC AAAGGATTAC GGCGTTTTGA TGGATGTTCC GATCGATTCC AACGTCGTCG GTCTGGTGAC GCCTCATCAG ATACCAGTAG ACACGACGTT CACAGCGGGG GATGAGGTAA AAGGTTACGT TTTAGACGTC AGCCGCCGAG AAGGCGTAGT TGACATCGGC ATGCGGGATG GATTGGGCAA ATTCAAGCGA AACAAGACGT CGTCTGGCAA AAGTTTGAAG AAGCTTAAGG TGGGAGATCA AGTCTCCGCT GAGGTCGAAC TCATAAAGGC TGAGTATGTG GCACTTTCTT TACCAGAGCA TAACGGCTTG ATAGGTTTCG CTCCTGTGCA TCATTTGAAC CTTCGTTACG AAGACGCGTC GGAACGCTTT ACGCCGACGC AATGCGTCAA GGCTGTCATC GCACAGCTTC CAGAGGGTGA AATGGGGCGT CTTCTCCTGA CGGTTCCCGT TACTAAAGGA ACCACAGCAA GCGGACGAAT TGCCGCCGGG ACGCTCGTCA AGGGTGTCGT GTCAGAGGTT CAAAATCTGC AGGCATTGGT CGCTTTGCCA AATAACGCTC GGGGACGACT TTACATTAGC GAGTTCAGCC CCGGTGAAGA TACCCCACTG GAGTCTATTT CAGTGGGTTC AACTGTTGAA GCCACTGTGA TGGGTCTTGC TGGAGACCGT GGAGGACTTC TGGACCTGTC GATGCATAGG AAATCCGCGT TTGTGCTTGA AGATGTCTCT GTTGGCGACG ACGTGAGCGC GTACGTCGTT TCCGTAACGG ACGATGGGAT CAAGGTGACT ATCGCTCCCG GAATCACATC CTTCATTCCG AAGATTGAAA CGTCGGACAA ATCATCTGAG CTCGCCATGA AGCTGAGCTC TCGCTTCACC GTGGGAGAGC 
GCGTGTCCGC GATTATCGTT GGAGTTAAGG CGACCAAGAA GCGAGTCGAC CTCAGCCTTC GAACGGACGG CGCATCCGGG TCGTCTCGCG TGTGCGTCGG GGCTAAAGTG CAAGGTATTA TTACGCGAGT CGTGGAAAAC GTCGGTCTCA TGGTTCAACT CGGATCGCAT TCCGTGGGAC GAGTACACTT GACAGACATG GCGGACGAGT ACGACGACGA TCCGTGCGCC AAGTACGAAG CGGGACAAGT CGTGCAGGTG CGCGTGTTAA ACGCTTCTTC AAACGGAGAA CTCGATTTAT CTATGCGCGC GTCTCGTTTG AGTAGCAAGC GAACCTCGCC GACGGATCCC GAGATCACGG ATATCAGCAA CCTCGTTCCT GGTCAACGCG TAAAGGGATA CGTCAAGGCG ACTTCAAAGA AAGGATGCTT CATCGCTCTT TCCCGCGGCA TCGACGCTAT GTGTAAGCTG TCAAACCTCG CGGACAGTTT CATCGCGGAT CCAGCGAAAA CGTTTCCTCC TGGAAAACTT GTCGAAGGAC GGATCGTGAG TGCCGATGCG GCTAAAGGAC GAGTTGAGCT CGCGTTCCGC GAGACGGACG CCACGCAAGG AAATGCAGAT GTTTCGACGG TGAAGGTGGG AGACGTGCTC ATTGGCACTG TTCGCCGCGT ACAACCGTAC GGAGTGTTCG TCAGTCTCGA TGGCACGAAG TTATCTGGAC TCTGTCACAT CTCTATGTTC GCAGACGCTC GAATTAGCGA CGATTTGGCG TCTCACGTGC GCCAAGGCGA AAGGGTGCGA ACGAAAGTGT TAGAAATCAA CACTGAGACG AACAAGATAT CGCTCGGTAT CAAGGCTTCG CTCTTTGAAG ACGACGACGG CGACGGAGAC GAAGAGATGG CCGACGTCAA CACAGCGCAC ACGTTTGATC CACTGATGGA TGTGGATGGT GAAAACGACG GAGAGGACGA CGACAATGAT GGTGAATCCA GCGACGACGA CGACGACGAC GACGACGACG GCTCCAGCGA CGAAGCAAAC GCCAGTGAAA GTACGGAGGC GAGTAGCGAA GAAGGAGAAG AAGAAGAAGA AGAAGAAGAA GAAGAAGAAG AAGAAGAAGA GTCCTCAGAA AGTGGCGAGT CAGACTCCGA CATTGACGAA GACGGACCGC TGCATGCAGA CGAGGGCGAA TCGACAGACG AGGAATCTGA TCCATCTGAT TCAGAGGACG CGCCGATCGG CAACGATTTA GGGTTCGATT GGGATGCCGA AAAAACGGAC GCCAGTATGA CCGACGTCGC TGATGAAAAG GCGGGTAAGA AGGGTGCCGA CAAAGCGCCG TCAAAACGCG AAAAAAAACG ATTGAAAGAG GCGAGGGAGC TCGAAATTTT ACAAAAAGAG CAAGAGATGA GAGATGGCGA TCATATTCCC GAATCTGCGA TGGAGTTTGA AAAGTTACTC ATCGCATCGC CTCGCTCGTC GTTTCTTTGG GTAAGATATA TGGCGTTTCA CGTCAGCTGT GGCGCGTACG ATGAGGCTAA AGAAGTCGCG GAACGAGCTC TCGGAGCGAT ACCCGCCTCG GAAGAGGCTG AGCGCATGAA TGTGTGGGCG GCGTATTTGA ACTTGGAAAA CAAATACGGC ACTCCGTCGC CGGAAGAAGC TGTGAAAAAG CTCTTTACGC GCGCGGTTCA AATCGCCGAT GCCAAGCACA TGCACTTGAC GCTCGTATCG ATGTATGAGC GAAACGCTCA AGAGGATGCG CTCGAAGAAA GCTTGAAGAA GGCGGCCAAA AAGTTTTCGT ACAGTGCGAA 
AATCTGGCTC GCATACATAC GCTCTGCTGT GTTGAAAAAT GATTCGGAAA AGGCGCGAAA ACTTTTGGAT CGCGCGACGC AGTCATTGCC GAAGCACAAA CATATAAAGA TTCTCACGCG TACGGCTCTC CTCGAGATGA AAGAGGGAAA TCCGGAGCGC GGCCGCACGA TGTTTGAGGG TATATTACGA AACTACCCGC GACGTACTGA TATTTGGTCA GTGTACATTG ATCAAGAAAT CAAGCAAGGT GACATTCAAC GCATCAGAGC ATTATTCGAG AGAGCGACGC ACCTCGATCT TAACGCAAAG AGCATGAAAT TTTTGTTCAA GCGTTACCTG GACTTTGAAA GATCGGAGGG TGATGACGAA CGCATAGCGC ACGTGAAGCA AAGAGCGATG GAATACGTTA GCAACAAGTT CGGCTCAGCC GCGGAATGA
Protein sequence | MGKRTRDKAK AEVEDDDDDV DASAFPRGGA ASGGRGDEDA FPRGGGGGGD GDDAGRGRKR RSSQRGGDGD GSNDDDDDPF SRISRAAKGA SSRAVSSSGG GAKYVETLKY KSLRPGAKLL GIISEVTARG LVMSLPDGLR GTVARAEVAG TSDEDDDDDD AEASLELLYE PGQVLRCAVV SLEKGKTGGK RIELSLRLEK VCEGLTKESL TEGSVAPAVV QSVEDHGYIL SFGIADTSGF LPKKNVASDL GEIRKGRIID VVITGAPKGN KGYFTVTSDQ KRIKTSVAHE TSATNVDTLL PGMLVNSRIK QILSDGVSVS FMTYFSGTVD CFHTGALATS KGVSSAFKVG QRMRARIIFV DSASKRVSLT LLPHLLDYAS IELPKLGKTF QTAKIERVDA GQGVALSISD GKNDIAGYAH VSQLSDERVE KVEKKFKIGR SVSVRVIGHR LLDGVVSVSL KSSVMAQPFF SLDELTPGML VNGEVLAVEH YGAIVKLAEG IKALCPPLHI SDIVGRTTSA KVAPGAKLKF RVLNVDRNSR RATVSHKRTL IKSELPVIGQ IEDAVPGSIT HGVVTGVNEY GVFVSLYGDL KGLAGLNDLG LLRDQKPSDA FGVGQVVRVQ VVSADTSGRL RLSLASGDAD GNSASMIINA SADALKPGHV VEKAVVTHVA SGTGNVEVVF SMEEGNIPGV VPLAHLSDHP LTAQGLSAVL NPGDEIGPLV VLEGKSTRAV MSRKLSLVES SREGKLPATA KEATLGAVFP GYVASATAAG VFVRFLGRLT GLAPPSQLTD GTTGDVHEMF PVGKTVNALI LSVDTSTPTP RLSLSLKVSA TSSPLSDAPL VRSFFQDIEF LDDRDVGAED VGISPETAKS LKPGTWMDVS VNETKDYGVL MDVPIDSNVV GLVTPHQIPV DTTFTAGDEV KGYVLDVSRR EGVVDIGMRD GLGKFKRNKT SSGKSLKKLK VGDQVSAEVE LIKAEYVALS LPEHNGLIGF APVHHLNLRY EDASERFTPT QCVKAVIAQL PEGEMGRLLL TVPVTKGTTA SGRIAAGTLV KGVVSEVQNL QALVALPNNA RGRLYISEFS PGEDTPLESI SVGSTVEATV MGLAGDRGGL LDLSMHRKSA FVLEDVSVGD DVSAYVVSVT DDGIKVTIAP GITSFIPKIE TSDKSSELAM KLSSRFTVGE RVSAIIVGVK ATKKRVDLSL RTDGASGSSR VCVGAKVQGI ITRVVENVGL MVQLGSHSVG RVHLTDMADE YDDDPCAKYE AGQVVQVRVL NASSNGELDL SMRASRLSSK RTSPTDPEIT DISNLVPGQR VKGYVKATSK KGCFIALSRG IDAMCKLSNL ADSFIADPAK TFPPGKLVEG RIVSADAAKG RVELAFRETD ATQGNADVST VKVGDVLIGT VRRVQPYGVF VSLDGTKLSG LCHISMFADA RISDDLASHV RQGERVRTKV LEINTETNKI SLGIKASLFE DDDGDGDEEM ADVNTAHTFD PLMDSSESGE SDSDIDEDGP LHADEGESTD EESDPSDSED APIGNDLGFD WDAEKTDASM TDVADEKAGK KGADKAPSKR EKKRLKEARE LEILQKEQEM RDGDHIPESA MEFEKLLIAS PRSSFLWVRY MAFHVSCGAY DEAKEVAERA LGAIPASEEA ERMNVWAAYL NLENKYGTPS PEEAVKKLFT RAVQIADAKH MHLTLVSMYE RNAQEDALEE SLKKAAKKFS YSAKIWLAYI RSAVLKNDSE KARKLLDRAT QSLPKHKHIK ILTRTALLEM KEGNPERGRT MFEGILRNYP RRTDIWSVYI 
DQEIKQGDIQ RIRALFERAT HLDLNAKSMK FLFKRYLDFE RSEGDDERIA HVKQRAMEYV SNKFGSAAE