Gene OSTLU_38035 details


Gene Information

Locus tag: OSTLU_38035
Symbol:
ID: 5004200
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 4122
End bp: 9980
Gene Length: 5859 bp
Protein Length: 1869 aa
Translation table:
GC content: 53%
IMG OID: 640419621
Product: predicted protein
Protein accession: XP_001419873
Protein GI: 145350993
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID:
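
The listed gene length is consistent with the start and end coordinates if they are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function name gene_length is illustrative only and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length(4122, 9980)
print(length)  # 5859, which matches the Gene Length field
```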


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0179427
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAAC GAACCAGAGA CAAGGCGAAA GCCGAGGTAG AGGACGACGA CGATGACGTC 
GACGCCTCGG CCTTCCCTCG AGGCGGCGCC GCGAGCGGTG GACGCGGCGA TGAAGATGCG
TTTCCTCGAG GCGGCGGCGG CGGCGGCGAC GGTGACGACG CCGGACGGGG CAGGAAGCGT
CGATCGAGCC AACGAGGCGG TGATGGCGAT GGAAGCAATG ACGATGACGA TGATCCGTTC
TCGCGAATCT CACGCGCGGC GAAAGGGGCG AGTTCGAGAG CGGTGTCTTC GAGCGGCGGC
GGGGCGAAGT ACGTCGAGAC TTTGAAGTAC AAGTCGCTAC GACCTGGAGC TAAACTTTTG
GGCATCATCT CCGAAGTTAC CGCGCGGGGA TTAGTGATGA GTTTACCAGA CGGCTTGCGC
GGCACCGTGG CGCGCGCGGA AGTTGCTGGC ACGTTCGGGA GTAGTCGACG CAACCGCACC
GCCGCCGCCG ACGGCTCGGA ATCTTCGGAG GAGGAGTATA GCAGCGACGA GGACGACGAT
GACGATGACG CAGAGGCGAG CTTGGAGTTG CTGTACGAGC CCGGGCAAGT GCTTCGATGC
GCGGTGGTGA GTTTGGAGAA AGGTAAAACG GGTGGCAAGA GAATCGAGTT GTCTCTCAGA
CTAGAAAAGG TGTGCGAGGG CCTCACAAAG GAAAGTCTCA CCGAAGGCTC GGTAGCGCCA
GCCGTAGTTC AAAGCGTCGA AGATCACGGC TACATCTTGA GTTTTGGTAT CGCAGATACG
AGCGGATTCT TACCGAAGAA AAATGTGGCG AGCGATTTGG GCGAGATTCG TAAAGGCAGA
ATCATCGATG TAGTCATCAC CGGCGCACCA AAGGGCAATA AAGGTTATTT TACCGTGACG
AGCGATCAGA AGCGAATCAA GACTTCGGTC GCCCACGAGA CGTCGGCGAC GAATGTCGAT
ACATTGCTTC CGGGGATGCT CGTAAATTCT AGAATCAAGC AAATACTTTC TGACGGTGTC
TCAGTTTCTT TCATGACTTA CTTCAGTGGC ACCGTGGACT GTTTTCACAC TGGGGCTCTT
GCGACGTCGA AAGGGGTTTC GTCCGCGTTT AAAGTAGGGC AACGCATGCG TGCGCGAATC
ATTTTTGTCG ACTCTGCGTC GAAGCGTGTT AGCCTAACCT TGCTGCCGCA TCTGCTTGAT
TACGCGTCCA TCGAACTTCC AAAGCTTGGC AAAACTTTCC AAACTGCCAA GATTGAGCGC
GTGGATGCGG GTCAAGGGGT CGCGCTGAGT ATTTCAGATG GTAAGAACGA TATCGCTGGA
TATGCACACG TTTCACAGCT TTCTGACGAA CGCGTGGAAA AGGTGGAGAA GAAGTTCAAA
ATCGGAAGAA GCGTAAGTGT TCGCGTCATC GGTCATCGTT TGCTCGATGG AGTGGTTAGC
GTCAGTTTGA AGTCATCTGT CATGGCTCAA CCTTTCTTTT CATTGGATGA ACTCACACCA
GGGATGCTTG TGAACGGCGA AGTTCTCGCC GTTGAGCATT ACGGAGCCAT AGTGAAACTT
GCCGAGGGCA TTAAAGCGCT GTGTCCCCCG CTTCACATTT CTGACATTGT CGGCCGAACG
ACTTCTGCAA AAGTCGCCCC TGGTGCCAAA TTGAAGTTCA GAGTATTGAA CGTGGATCGA
AATAGCCGGA GAGCGACGGT ATCGCATAAA AGAACGCTCA TCAAGTCCGA GCTCCCAGTA
ATTGGTCAGA TTGAAGACGC TGTGCCCGGA TCAATCACGC ACGGCGTGGT GACGGGCGTG
AATGAGTACG GTGTGTTCGT CTCCTTGTAC GGTGATTTGA AGGGTCTGGC TGGTTTGAAT
GACTTGGGTC TTCTGCGAGA TCAAAAACCG TCCGACGCGT TTGGTGTCGG ACAAGTTGTT
CGAGTACAGG TTGTTTCAGC CGACACGTCT GGTCGGTTAC GTCTTTCGCT CGCGTCTGGC
GACGCGGATG GAAACTCTGC GAGCATGATT ATTAATGCGT CCGCAGATGC CTTGAAGCCG
GGTCATGTCG TTGAAAAAGC AGTGGTCACG CACGTGGCGT CGGGCACAGG TAATGTCGAG
GTGGTTTTTT CTATGGAAGA AGGCAACATA CCAGGCGTCG TGCCGCTCGC CCATTTATCT
GACCATCCGC TGACGGCGCA AGGATTGAGC GCTGTTCTCA ATCCCGGTGA CGAGATTGGT
CCTTTAGTAG TTCTTGAAGG CAAATCAACT CGAGCAGTGA TGTCGCGCAA ACTTTCACTC
GTGGAAAGCT CGCGAGAAGG GAAGCTTCCA GCGACGGCGA AGGAAGCGAC GCTCGGCGCA
GTGTTCCCAG GCTACGTCGC ATCAGCCACC GCTGCGGGCG TTTTCGTTCG CTTTTTAGGT
CGACTCACCG GTCTTGCACC GCCTTCACAG CTCACAGACG GTACTACGGG AGACGTGCAC
GAAATGTTTC CGGTAGGTAA GACTGTCAAC GCATTAATAC TGTCTGTGGA TACGTCCACG
CCGACGCCGA GGTTGTCACT CTCATTGAAA GTTTCTGCCA CTTCGTCGCC TCTCAGCGAT
GCACCGTTGG TTCGCTCGTT TTTCCAGGAT ATTGAGTTTC TCGATGACAG AGATGTCGGA
GCCGAAGACG TGGGTATATC ACCTGAAACC GCAAAGTCGC TCAAGCCCGG TACGTGGATG
GATGTGTCAG TTAACGAAAC AAAGGATTAC GGCGTTTTGA TGGATGTTCC GATCGATTCC
AACGTCGTCG GTCTGGTGAC GCCTCATCAG ATACCAGTAG ACACGACGTT CACAGCGGGG
GATGAGGTAA AAGGTTACGT TTTAGACGTC AGCCGCCGAG AAGGCGTAGT TGACATCGGC
ATGCGGGATG GATTGGGCAA ATTCAAGCGA AACAAGACGT CGTCTGGCAA AAGTTTGAAG
AAGCTTAAGG TGGGAGATCA AGTCTCCGCT GAGGTCGAAC TCATAAAGGC TGAGTATGTG
GCACTTTCTT TACCAGAGCA TAACGGCTTG ATAGGTTTCG CTCCTGTGCA TCATTTGAAC
CTTCGTTACG AAGACGCGTC GGAACGCTTT ACGCCGACGC AATGCGTCAA GGCTGTCATC
GCACAGCTTC CAGAGGGTGA AATGGGGCGT CTTCTCCTGA CGGTTCCCGT TACTAAAGGA
ACCACAGCAA GCGGACGAAT TGCCGCCGGG ACGCTCGTCA AGGGTGTCGT GTCAGAGGTT
CAAAATCTGC AGGCATTGGT CGCTTTGCCA AATAACGCTC GGGGACGACT TTACATTAGC
GAGTTCAGCC CCGGTGAAGA TACCCCACTG GAGTCTATTT CAGTGGGTTC AACTGTTGAA
GCCACTGTGA TGGGTCTTGC TGGAGACCGT GGAGGACTTC TGGACCTGTC GATGCATAGG
AAATCCGCGT TTGTGCTTGA AGATGTCTCT GTTGGCGACG ACGTGAGCGC GTACGTCGTT
TCCGTAACGG ACGATGGGAT CAAGGTGACT ATCGCTCCCG GAATCACATC CTTCATTCCG
AAGATTGAAA CGTCGGACAA ATCATCTGAG CTCGCCATGA AGCTGAGCTC TCGCTTCACC
GTGGGAGAGC GCGTGTCCGC GATTATCGTT GGAGTTAAGG CGACCAAGAA GCGAGTCGAC
CTCAGCCTTC GAACGGACGG CGCATCCGGG TCGTCTCGCG TGTGCGTCGG GGCTAAAGTG
CAAGGTATTA TTACGCGAGT CGTGGAAAAC GTCGGTCTCA TGGTTCAACT CGGATCGCAT
TCCGTGGGAC GAGTACACTT GACAGACATG GCGGACGAGT ACGACGACGA TCCGTGCGCC
AAGTACGAAG CGGGACAAGT CGTGCAGGTG CGCGTGTTAA ACGCTTCTTC AAACGGAGAA
CTCGATTTAT CTATGCGCGC GTCTCGTTTG AGTAGCAAGC GAACCTCGCC GACGGATCCC
GAGATCACGG ATATCAGCAA CCTCGTTCCT GGTCAACGCG TAAAGGGATA CGTCAAGGCG
ACTTCAAAGA AAGGATGCTT CATCGCTCTT TCCCGCGGCA TCGACGCTAT GTGTAAGCTG
TCAAACCTCG CGGACAGTTT CATCGCGGAT CCAGCGAAAA CGTTTCCTCC TGGAAAACTT
GTCGAAGGAC GGATCGTGAG TGCCGATGCG GCTAAAGGAC GAGTTGAGCT CGCGTTCCGC
GAGACGGACG CCACGCAAGG AAATGCAGAT GTTTCGACGG TGAAGGTGGG AGACGTGCTC
ATTGGCACTG TTCGCCGCGT ACAACCGTAC GGAGTGTTCG TCAGTCTCGA TGGCACGAAG
TTATCTGGAC TCTGTCACAT CTCTATGTTC GCAGACGCTC GAATTAGCGA CGATTTGGCG
TCTCACGTGC GCCAAGGCGA AAGGGTGCGA ACGAAAGTGT TAGAAATCAA CACTGAGACG
AACAAGATAT CGCTCGGTAT CAAGGCTTCG CTCTTTGAAG ACGACGACGG CGACGGAGAC
GAAGAGATGG CCGACGTCAA CACAGCGCAC ACGTTTGATC CACTGATGGA TGTGGATGGT
GAAAACGACG GAGAGGACGA CGACAATGAT GGTGAATCCA GCGACGACGA CGACGACGAC
GACGACGACG GCTCCAGCGA CGAAGCAAAC GCCAGTGAAA GTACGGAGGC GAGTAGCGAA
GAAGGAGAAG AAGAAGAAGA AGAAGAAGAA GAAGAAGAAG AAGAAGAAGA GTCCTCAGAA
AGTGGCGAGT CAGACTCCGA CATTGACGAA GACGGACCGC TGCATGCAGA CGAGGGCGAA
TCGACAGACG AGGAATCTGA TCCATCTGAT TCAGAGGACG CGCCGATCGG CAACGATTTA
GGGTTCGATT GGGATGCCGA AAAAACGGAC GCCAGTATGA CCGACGTCGC TGATGAAAAG
GCGGGTAAGA AGGGTGCCGA CAAAGCGCCG TCAAAACGCG AAAAAAAACG ATTGAAAGAG
GCGAGGGAGC TCGAAATTTT ACAAAAAGAG CAAGAGATGA GAGATGGCGA TCATATTCCC
GAATCTGCGA TGGAGTTTGA AAAGTTACTC ATCGCATCGC CTCGCTCGTC GTTTCTTTGG
GTAAGATATA TGGCGTTTCA CGTCAGCTGT GGCGCGTACG ATGAGGCTAA AGAAGTCGCG
GAACGAGCTC TCGGAGCGAT ACCCGCCTCG GAAGAGGCTG AGCGCATGAA TGTGTGGGCG
GCGTATTTGA ACTTGGAAAA CAAATACGGC ACTCCGTCGC CGGAAGAAGC TGTGAAAAAG
CTCTTTACGC GCGCGGTTCA AATCGCCGAT GCCAAGCACA TGCACTTGAC GCTCGTATCG
ATGTATGAGC GAAACGCTCA AGAGGATGCG CTCGAAGAAA GCTTGAAGAA GGCGGCCAAA
AAGTTTTCGT ACAGTGCGAA AATCTGGCTC GCATACATAC GCTCTGCTGT GTTGAAAAAT
GATTCGGAAA AGGCGCGAAA ACTTTTGGAT CGCGCGACGC AGTCATTGCC GAAGCACAAA
CATATAAAGA TTCTCACGCG TACGGCTCTC CTCGAGATGA AAGAGGGAAA TCCGGAGCGC
GGCCGCACGA TGTTTGAGGG TATATTACGA AACTACCCGC GACGTACTGA TATTTGGTCA
GTGTACATTG ATCAAGAAAT CAAGCAAGGT GACATTCAAC GCATCAGAGC ATTATTCGAG
AGAGCGACGC ACCTCGATCT TAACGCAAAG AGCATGAAAT TTTTGTTCAA GCGTTACCTG
GACTTTGAAA GATCGGAGGG TGATGACGAA CGCATAGCGC ACGTGAAGCA AAGAGCGATG
GAATACGTTA GCAACAAGTT CGGCTCAGCC GCGGAATGA
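
The gene sequence above is printed in space-separated blocks of ten bases, so the spaces and line breaks have to be ignored before any computation. The sketch below shows one way to compute a GC percentage from text in this layout. It assumes only the letters A, C, G and T count as bases. The helper name gc_percent is hypothetical, and how the database derived the GC content field is not stated in the record.

```python
def gc_percent(seq_text: str) -> float:
    """Percentage of G and C among the A/C/G/T letters found in seq_text.

    Spaces, line breaks and any other characters are skipped, so text
    copied in 10-base blocks can be passed in unchanged.
    """
    bases = [ch for ch in seq_text.upper() if ch in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide letters found")
    gc_count = sum(1 for ch in bases if ch in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first printed line of the gene sequence only;
# pass the full sequence text to get a whole-gene figure.
first_line = "ATGGGCAAAC GAACCAGAGA CAAGGCGAAA GCCGAGGTAG AGGACGACGA CGATGACGTC"
print(round(gc_percent(first_line), 1))
```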
 
Protein sequence
MGKRTRDKAK AEVEDDDDDV DASAFPRGGA ASGGRGDEDA FPRGGGGGGD GDDAGRGRKR 
RSSQRGGDGD GSNDDDDDPF SRISRAAKGA SSRAVSSSGG GAKYVETLKY KSLRPGAKLL
GIISEVTARG LVMSLPDGLR GTVARAEVAG TSDEDDDDDD AEASLELLYE PGQVLRCAVV
SLEKGKTGGK RIELSLRLEK VCEGLTKESL TEGSVAPAVV QSVEDHGYIL SFGIADTSGF
LPKKNVASDL GEIRKGRIID VVITGAPKGN KGYFTVTSDQ KRIKTSVAHE TSATNVDTLL
PGMLVNSRIK QILSDGVSVS FMTYFSGTVD CFHTGALATS KGVSSAFKVG QRMRARIIFV
DSASKRVSLT LLPHLLDYAS IELPKLGKTF QTAKIERVDA GQGVALSISD GKNDIAGYAH
VSQLSDERVE KVEKKFKIGR SVSVRVIGHR LLDGVVSVSL KSSVMAQPFF SLDELTPGML
VNGEVLAVEH YGAIVKLAEG IKALCPPLHI SDIVGRTTSA KVAPGAKLKF RVLNVDRNSR
RATVSHKRTL IKSELPVIGQ IEDAVPGSIT HGVVTGVNEY GVFVSLYGDL KGLAGLNDLG
LLRDQKPSDA FGVGQVVRVQ VVSADTSGRL RLSLASGDAD GNSASMIINA SADALKPGHV
VEKAVVTHVA SGTGNVEVVF SMEEGNIPGV VPLAHLSDHP LTAQGLSAVL NPGDEIGPLV
VLEGKSTRAV MSRKLSLVES SREGKLPATA KEATLGAVFP GYVASATAAG VFVRFLGRLT
GLAPPSQLTD GTTGDVHEMF PVGKTVNALI LSVDTSTPTP RLSLSLKVSA TSSPLSDAPL
VRSFFQDIEF LDDRDVGAED VGISPETAKS LKPGTWMDVS VNETKDYGVL MDVPIDSNVV
GLVTPHQIPV DTTFTAGDEV KGYVLDVSRR EGVVDIGMRD GLGKFKRNKT SSGKSLKKLK
VGDQVSAEVE LIKAEYVALS LPEHNGLIGF APVHHLNLRY EDASERFTPT QCVKAVIAQL
PEGEMGRLLL TVPVTKGTTA SGRIAAGTLV KGVVSEVQNL QALVALPNNA RGRLYISEFS
PGEDTPLESI SVGSTVEATV MGLAGDRGGL LDLSMHRKSA FVLEDVSVGD DVSAYVVSVT
DDGIKVTIAP GITSFIPKIE TSDKSSELAM KLSSRFTVGE RVSAIIVGVK ATKKRVDLSL
RTDGASGSSR VCVGAKVQGI ITRVVENVGL MVQLGSHSVG RVHLTDMADE YDDDPCAKYE
AGQVVQVRVL NASSNGELDL SMRASRLSSK RTSPTDPEIT DISNLVPGQR VKGYVKATSK
KGCFIALSRG IDAMCKLSNL ADSFIADPAK TFPPGKLVEG RIVSADAAKG RVELAFRETD
ATQGNADVST VKVGDVLIGT VRRVQPYGVF VSLDGTKLSG LCHISMFADA RISDDLASHV
RQGERVRTKV LEINTETNKI SLGIKASLFE DDDGDGDEEM ADVNTAHTFD PLMDSSESGE
SDSDIDEDGP LHADEGESTD EESDPSDSED APIGNDLGFD WDAEKTDASM TDVADEKAGK
KGADKAPSKR EKKRLKEARE LEILQKEQEM RDGDHIPESA MEFEKLLIAS PRSSFLWVRY
MAFHVSCGAY DEAKEVAERA LGAIPASEEA ERMNVWAAYL NLENKYGTPS PEEAVKKLFT
RAVQIADAKH MHLTLVSMYE RNAQEDALEE SLKKAAKKFS YSAKIWLAYI RSAVLKNDSE
KARKLLDRAT QSLPKHKHIK ILTRTALLEM KEGNPERGRT MFEGILRNYP RRTDIWSVYI
DQEIKQGDIQ RIRALFERAT HLDLNAKSMK FLFKRYLDFE RSEGDDERIA HVKQRAMEYV
SNKFGSAAE