Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17572 |
Symbol | |
ID | 5004625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 228025 |
End bp | 231798 |
Gene Length | 3774 bp |
Protein Length | 1257 aa |
Translation table | |
GC content | 48% |
IMG OID | 640420046 |
Product | predicted protein |
Protein accession | XP_001420789 |
Protein GI | 145352935 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.632728 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAAA ATAAGTCTCT CGAGCATGGT TGCCTGGCGT GCAGCTTTGG ATTTGATTTC AAGCGACGGG GCAACTTGAC TCGAGTACGC GATTCAGTCG TGGCGTACAC TGTTGGAAAT CTCTTGACCT TGCGAGACGT GAAAACACAA AGGCAGACGT ACCTGCGGAG CGCGGAGGGG GGCAATATCG GCTTGGTCGC CGTGGATGGA GATGGCACGC ACATTGCGAT CGGCGAGAAT GGTGCGGCGC CCCGAATATC AATATATAGT TACCCTTCAC TTGAGCTCGT AAACACGATT CACGGCGGCG CCGCGGCGTG TTACTCGACT GGAAAATTTT CTCCAGACGG ACAGCTTCTC GCGAGCGTGA GTGGGCTCCC AGACTATTGG CTCACTTTGT GGGACTGGAA GACAGCAAAG ACAGTCCTTC GTGCCAAAGC GTATGGACAA GAAGTGTTTG ACTTATCCTT CCTAGGCGCT AACTCGGAGC GAATGATCAC TCACGGCGCT GGTCACATTC AGTTTTGGCA GATGGTCAAG ACGTTTACTG GATTGAAGTT ACAAGGTACT CTTGGAAAGT TCAACGGCGA ACCTCTGTCC AATATCACGT GCGTGTTGGA GTTACCCTTT AATGATATGA TTTTGACGAG CACAGAGTAT GGCGCTCTAC TAATCTGGGT CGATGGGCGC GTTCAGTACA AAATAATGCG AGCAAACGCG TCGAGCGACG ATGAAGACTG CCACGGCGGT TCGATCGAGA CGATGATTTA TATCAACAAC GGTTCAGTGA TTCTCACGGC GGGCGATGAT GGGTGGGTTC GAGAATGGAA TATTGACGAG ATTCGTCGCG TCATATCGTC GCAAAGCAAA GGGGTGGTTT GTATGAAACC CATCTCAGAG CGCTCAATCG ATGGAGCCAA TATCCGCCAC ATTTGCGCGC TTGAAAATAA GTTATTCATT CAAGACAGCG GGCGAGGGTT AGTGTGTTCT TTGGACGATG ATGATATGGA GCGTCCTGTT GTTTCTGCGC ACAGTGGTGT CATCACTGCA TGCGAGACTT CGCTGAACAG TACTCACTTG ATCACTTGCG ACAATATCGG AGAAGTGCAC TGCTATGATT ATTTATCCCG TGCGTTTCTC TACAAAAGTC GGTTCAGCTG TTCGGCGACG GCACTTACTA ACGTACCTTT GAATGTCAAT GAAGATGGGA GATGTATCGT CGTTGGATTT GCCGATGGTG TCGTTCGCGT TCTCGTGAGA AACACGTCCT CTTGGCGCTT GGCTCTCGCG CTCAAGCCAC ACGCGCATGC CGTCGAGCAG TTCACATGGA GTTCTGATGG TGAGTATTTG GCAATCGCTG CTGCTAATAA TACTGTGTTT ATTTTCAGTT CATCTTCCTT CGCATCGACG TTGCAATTTG AGCCAATTGG TTTCATACGC CTGAGTGCTC GCCCGAAGAA GATATTTTGG GATGGTGAAA GTATTTTTGG TGCTCAAATC GAGGACGAGA TGCACTGCTT TGCCGTCCAT CGTGACTTGG GTGAGTCTCG TAAAGATTCG TACGAAATCG ATGTCACTCC TCGTATTACC ACCGATGTAG AAACCAAGAA CGCCGTTGAT GAATCTTTCA GCGCGATAAC GTATGATAAT TCTTACTGCG TTAAAATCTG TAAATCGGGA TCTTTTGAAG TACACGCGTT AAGCGATGCT TCTCCATCAA ACACACACTC TTCAATCGAC GAGGCGCTGT GCATCTATCC AGAAGTCGAA GCGATCGACA CACTCTCCGC ACAGTGCGCG TCCATCGAAG AGGTACTGCA GCGAGAACGC GCGAAGGAAT TGAGCGAGGA AGAAGCGGCA GCACTTATAC GACTCTCGAG CTTGGTGGCG CTCTGTCGAG ATGAATTCAA TGTCATCATT GAAGAAAATG CAAAACTTCC TCGAGAGATG CGAATCTCTG GCGCAGAACT TCTCGTAGAC GATACACAAA TGCGCCGAAT TCGCGAGGAT TCAGTACACT GGTTAAAAGC GACTCACGAC AAGCTTCAAA GCTCAGTTAT GTGTGCTGAA TCTGTCTCAA ATCAACTGCA AAAAAGTTAT TACGACAAAA TAATGGCCCT TCCTCGTATG ATTCATTCAT CTAGCGAAGC TTTGACGTGC TCATCTTTTG CGTTGCGCGC TGGAAGCGAG GGTGATTGGC AGACAACTAC GGTCGCAGAG GCGACAGTTC ATGAATGCGA ACAAGAGATG CGTGTCGTTC ATGTGCAAAA TACTTGCATA AGTGAAAATG CGCACGAAAG ACAGAGCATC GAAGGCTCTG AAATGAGTTC AGAGGCTCGA CGGCGCTTCG AAAGATTACG GAGGAACGAA GAGTTGAGAA AATTGGAGAA GGCAGAGCCT TTGAAAGACC AAACAGACGA CAGCGAAGCG TTGAAACGCG ACAATCTTGG CGATGGGTTT CCATTGCGTT GCGTTCCGGA TTCACGTGTA CCACCTGAAG ACAAATTAAC GACGGAGGGT AAACTTTCCG AATTGAGATA TGTCGATGGT CAACTACGCG GCAACCAGAG CCGGTTCAAT CGAGAGTTTG ATTGCATCGA TGCGAAGGAC GCTGTCACCG ATGAAGATCT TATCTCGCTA TGCCATCTAA AAGCAGATAT CATCGCGCGA AGTAAAGTCA TTGAACTTTT TCGTATTGTA GTGCATCGCG AGCTGAGCGT CATGCCTGAA TATGACGTGA AGAACACCGA AATGCAAAAT GCTCAACGGC AAGCAAAGGA AGAGGTCGAA ACCGCTCGTG CGGCGGTAGA AAATGCCCAG ACGGCCGTAA ACGATCAACA GACCAAATTA GAGGGGCAGA ACGGAGTAAA AGCGGAAATC GAGGCGACAT TTGACAAATT CATGATTCCA GGCGTCCCGC GAGCGGCATT GCTCAAAGTT TTCCACAAAC GCGTCGTCAA TAAAACCGAA CAAAAGAGCG ACAAAGGTGA CTCCGATTCC GACTTAGACT CTGATTTTGA CGATGATTCG TACTATTCGT CGTCAGACGA AGAAAATGGC GACACTTGCC CAAAAGGTTG CGACAGTGCT GTCTACGAGC GCGTGTGCGA TCTTCGTTTG CAAAGAATCG AAGCCATGGA CGCAATAACT GAAATACAAC GGGCATTGGA TGGGAAGAAA AAGATTCACG ATGCCGCACT GAAAAAGTTG CGAAGTTTGG AGGAAAACGT ACGGGTACTC GAGAGAGATG CGGATGCTTT TGAGAAGGTA AAGCAGCAAA GTTTGAACGG GCTCAGGACT GCATTCGTGT TGCCGGTGAG CTGCGTCGAG ACGAGTCAAG GCGACATGGA CGACATGATT ATTTTCTCCA AGACTCGCTT AGCAGAACTT GAGAATAAAG TCGTAGAGTG GGATGCAGAA GTCTTAAAAT TGAAACGTCA GCAGCAGGAG TTAAAGCGAG AGCACTCCTC GCTCATCGCG CAACGTTCCG CTAAGGCGAA AGAATTTCAG AGTCTGCAAC GCAAATACTC TGAGACTCAG ATCCGAAAGT TTGGAAAGCT CATCGTTCTG GAAGATTTAG ACAAGGTTGT GAATAATGGT CAAGGCACAG AAGATTTACG CGACAAGCTC AAGGTACAAG AGCTGGAGAA TGCGCGTGAG CTGCGAGAGA TTAAAAAAGA CATCTCAAAT GTTGAAAGAG AAGTGTTGAA TCTCACGGAA GCGCATACCA CGTGTCTCAA TGAACTTCTC GCGCTGCGCT CAGTCAAAGT CTAA
|
Protein sequence | MAKNKSLEHG CLACSFGFDF KRRGNLTRVR DSVVAYTVGN LLTLRDVKTQ RQTYLRSAEG GNIGLVAVDG DGTHIAIGEN GAAPRISIYS YPSLELVNTI HGGAAACYST GKFSPDGQLL ASVSGLPDYW LTLWDWKTAK TVLRAKAYGQ EVFDLSFLGA NSERMITHGA GHIQFWQMVK TFTGLKLQGT LGKFNGEPLS NITCVLELPF NDMILTSTEY GALLIWVDGR VQYKIMRANA SSDDEDCHGG SIETMIYINN GSVILTAGDD GWVREWNIDE IRRVISSQSK GVVCMKPISE RSIDGANIRH ICALENKLFI QDSGRGLVCS LDDDDMERPV VSAHSGVITA CETSLNSTHL ITCDNIGEVH CYDYLSRAFL YKSRFSCSAT ALTNVPLNVN EDGRCIVVGF ADGVVRVLVR NTSSWRLALA LKPHAHAVEQ FTWSSDGEYL AIAAANNTVF IFSSSSFAST LQFEPIGFIR LSARPKKIFW DGESIFGAQI EDEMHCFAVH RDLGESRKDS YEIDVTPRIT TDVETKNAVD ESFSAITYDN SYCVKICKSG SFEVHALSDA SPSNTHSSID EALCIYPEVE AIDTLSAQCA SIEEVLQRER AKELSEEEAA ALIRLSSLVA LCRDEFNVII EENAKLPREM RISGAELLVD DTQMRRIRED SVHWLKATHD KLQSSVMCAE SVSNQLQKSY YDKIMALPRM IHSSSEALTC SSFALRAGSE GDWQTTTVAE ATVHECEQEM RVVHVQNTCI SENAHERQSI EGSEMSSEAR RRFERLRRNE ELRKLEKAEP LKDQTDDSEA LKRDNLGDGF PLRCVPDSRV PPEDKLTTEG KLSELRYVDG QLRGNQSRFN REFDCIDAKD AVTDEDLISL CHLKADIIAR SKVIELFRIV VHRELSVMPE YDVKNTEMQN AQRQAKEEVE TARAAVENAQ TAVNDQQTKL EGQNGVKAEI EATFDKFMIP GVPRAALLKV FHKRVVNKTE QKSDKGDSDS DLDSDFDDDS YYSSSDEENG DTCPKGCDSA VYERVCDLRL QRIEAMDAIT EIQRALDGKK KIHDAALKKL RSLEENVRVL ERDADAFEKV KQQSLNGLRT AFVLPVSCVE TSQGDMDDMI IFSKTRLAEL ENKVVEWDAE VLKLKRQQQE LKREHSSLIA QRSAKAKEFQ SLQRKYSETQ IRKFGKLIVL EDLDKVVNNG QGTEDLRDKL KVQELENARE LREIKKDISN VEREVLNLTE AHTTCLNELL ALRSVKV
|
| |