Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15541 |
Symbol | |
ID | 5001788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 657343 |
End bp | 660699 |
Gene Length | 3357 bp |
Protein Length | 1118 aa |
Translation table | |
GC content | 56% |
IMG OID | 640417209 |
Product | predicted protein |
Protein accession | XP_001418078 |
Protein GI | 145347232 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGACG AAGACGCGGC GGCGATCGAA CGCGCGCGCG CGGACGATGC GAGCGTCGCG TCGACGCGCT CGGGGGACGA TGCGGGCGCG ATGGCGCCGC GATGGCGCTG CGTGGCGTCG AGCGCGGATT TGCGAGCCGT GAGCTCTAAA CGCGTTCGGT TCACGTGCGT CGATGCCACG CGCGCGCTGG TGATTTTGGG GGCGAACACT GGGTCGGCGT ACGTGTTCGC GCGATCGAGG GCGCGCGATG ATGGCGCGGG AGGCGTGGAA CGACGCGCGA GATTCGTCGC GGTGGTGTCG CCGGAATTGG TCGAGCCGAG CGCGAGACGA CAGGGGAATG TGGGCGCGCG CGCGGCGCCG CAGAGCGTGC GGACGATCAG AGCGTCGCCA TGCGGACGAA TGTGCGCGCT GGGGTTCGCA GATGGACACG TGAGGGTGAT TGAGCTCGAT GGATTGACGC GGGAGGAGGC GCCGCGGAGA AGCGCGCTGG GATCGACGGT GGCGTTTCTG TCGAGCACGC ATGAAGGACG AGCTGTCACG GCGCTGTCGT GGTCGAGCGA TTCGCGGGTG TTGTATGCGG GAAGCGATCA AGGGGTGGTG ACGGTGACGT CGTGCGCAGC GTTCGTGGAG TGGTGCGACG GTGGCCGCAC GGGCGCGAGA CCGGCGCCGG TGAATAAGAC ATCGTACACA GACGTGAGTA GTGCGGTGCA TCAGCTCGAT GCGTCACCGA GCGGTCGTCA CGTGATAACA AGCGCACAAT CTAGCGCACA ACTCGTGATT GTGCAGGGAG CGCATAGCGG AACGACGTCG AATATAGGTA GCAAGCAGCG CGAAGGATCG TACGGAAGTT GCTTCCACGG GTACGCAAGT TTGAGCGTAG AAGATGACGA TCCCGTGGAA GAGGAAGACG AATGGGACGA AGAATTAGAA GGTGTGAGGG TAGTAGAGCA TGCTGTGGTG TCACGACCGG GGCGTAAGCT GTGGATTGCC AAAGTGGACA ATGTTGACTC TGAGCGAGCC GACGTGGAAA TCATGGCAAC CATCAAACCC GAAGTGCCAG TGCCTTCCAG CGTACCAGGA TGGGATCAAT CGAGTGAATC CGCGGATGCG GTGAAACGCG CGTCAAAGAA ACTAGAGTTT GGCCTCTTGC ATCGTTTAGG ACCTTGCGTG TTGTCGACGA CGGAGCGCGC AGTCGCCATC ATCGACGTCG CTACTCCAGC AATCGTTAGA TGGTATCCCC TAAAAGAGCC AGGGAGCGAA AGTTTGAGCG CTGGTTTCGT CGATGCATGC ACAGTTGATC ATCGAGCGTT TTTCCTGACG CCATCTGAAG ACAGTGGGAA TTCGGTGTGG TGTTTGGAAT CATTCGTCGA CGCCAAGGCG CTTGCGCATG ACGTCGTGAG CGAGACATCC TCTGCAAGTG CATTCATCCG TGCCCTAGAC ATTTGCCGCA AGACAAACTC GTACGACGAC GGCTTATTCA GAAGGGCGAA AGAGGCGTTG GATTCGTCGG AAGCGGACGC CGCCGGAGTA AATAGTAGGC TTGCGTTTCT CGTGCAATGG GGCGAACAAG TGGGCTCGAA GTTGGCGCCG GCGAAGAAGG ACACCGAAGT TGACCTGCGC GAGCTCGTGA CCGAGGAAGA AGTATCATCA CCGAAAGCAC CAAAGACGCC GGACTTCGAT CTCGGAAAAC CACCTCTCGG TAGAGATCGT TCGACGATTG ATCGACCATT GTCCGAATTG GCTGCCCCAG CTTCGAACGA GTTCCCGAAG GCGGAAACCG AAGGAGGCAT CTTCTTTTAC AATCCTCGTG GCGTTTCCAA GACATCCAAG GTAGATGGAG AATTCGCGCC GAAGGCAAAT GGAAAGGTGA AGAAGCGACA CGCGCTCATT CTGGACGATG TCGAATCAAA CGAGCCTGCA GTCATACTAC AAAGCAACGT GAAAGTAGTC TCGCCGACAA AATCCAAGAA TTACGGGGTT GATTTACCAA ACAATGACAT CGAATGGGAA GAGTGTGACG CGTTCGATGC TCAAGAGTGG CAAAAGGCGA TGGATAGCGT GAAGTGTTTG CTCCCAATTC GAGGGGCGCA TTTTGAGCAT TGGAGCTGTT ACGAAGACAT TAAGACGACG TCGACACTTG ACTCCACTGA CACTCCATTG CCCGAGCAAA ACGCTCGGAT TCAGTACAGA TTTATGACGA ATGTACGAGT CGCCGCGATC GTCGACTCCG TCACGAAACT TAAAGAGTGT CGAATGAGCC TCGACGCGTC GATTCTCCTG CCTTCGCTTC GGCGTTGGCG CAATATACGC GCCGAGTCGA TTGAACTTCT GACACGCTTT GAAAAGGGCG AAGATGCACC ACTCACCGCT AAGATAAAAC TGCAATGGAG TTCGCTGTTA GAAAACGTTG AGAAAGAGCT TGACGCAGTG TGTGATGAAC TCAAGCTCGA CGTCGCAGCG GTGAAAAAGC AAACGCCGCC AAAGAATCGT GAGACTCAAA CGTCTCTCGC GTCAGCGGAA CATTCGACGG ACTCAATGAC AATGGCAAGT GTAACTGCAG CGTTGACGGA AGGAGGTGCG ATTGATATGA GTCAATCTTT GGAGTCCGAA TGTGTGGCAT CGCTTCGATC AACCGATGTT GCAGAAGCGT CGGCTATCGT ATCAGAATGT CTTAGGCGGG CGCTTCTCCA GACTTTAGAG TCGCTTGACG GCGCTGACAC ATCTGCGGCT CTCGAGCACG GCGTATCGCA TCTTATGTTA GTCTCGCGCG TAGGTTCCGC CGCAGTTGGA GCGGCGGAAG TCATTCGAGC GCTTGCCAAG GCTTCAGAAG AGGAGAAGTT GTCGAAAGCG ATATCACCGA TGAACACGAA CGAATCCATA TCCAGAGCTC TCAGTCGTAT TTTCACAAAT GTCACCACAT TCTTGACTTC CGAAAATTCT GTGGATTTGG ACTTACGCCG TGGCGCTGCC AAGCTTCTCG AGCCACTTGA CGCACACTTA TCGCGGCCAC CCATGCGGCA GTTTGGAAGA TTTCCACAGC TCCAAGCCGC GCTCGCGGCG GAGATCGACG GTGTCGTGGA CCAATTACCG TTTGTATGTC GCTCGTCTAC CGATGCTGAT GACGGAGTGT CACAATTTAC TCTGAAAGTG CCTCACGAGA ACCCAATCGC GCCCGCCATC GAAGATTTTG GCGACTGGGG CATCAAGATG GATCTTCGTC GCTGCCCCGC GTGTAGTCAC TCCCTCCTGT GCCCCAACGA CGGCGAGCTC ATCACCTTTA TGTGCGCGCA CACGTATCAC AAAGCGTGTT GCGCGGCGTC TATGGCTTGT TTCGCGTGTT GCGCCGACTC CCGCTGA
|
Protein sequence | MPDEDAAAIE RARADDASVA STRSGDDAGA MAPRWRCVAS SADLRAVSSK RVRFTCVDAT RALVILGANT GSAYVFARSR ARDDGAGGVE RRARFVAVVS PELVEPSARR QGNVGARAAP QSVRTIRASP CGRMCALGFA DGHVRVIELD GLTREEAPRR SALGSTVAFL SSTHEGRAVT ALSWSSDSRV LYAGSDQGVV TVTSCAAFVE WCDGGRTGAR PAPVNKTSYT DVSSAVHQLD ASPSGRHVIT SAQSSAQLVI VQGAHSGTTS NIGSKQREGS YGSCFHGYAS LSVEDDDPVE EEDEWDEELE GVRVVEHAVV SRPGRKLWIA KVDNVDSERA DVEIMATIKP EVPVPSSVPG WDQSSESADA VKRASKKLEF GLLHRLGPCV LSTTERAVAI IDVATPAIVR WYPLKEPGSE SLSAGFVDAC TVDHRAFFLT PSEDSGNSVW CLESFVDAKA LAHDVVSETS SASAFIRALD ICRKTNSYDD GLFRRAKEAL DSSEADAAGV NSRLAFLVQW GEQVGSKLAP AKKDTEVDLR ELVTEEEVSS PKAPKTPDFD LGKPPLGRDR STIDRPLSEL AAPASNEFPK AETEGGIFFY NPRGVSKTSK VDGEFAPKAN GKVKKRHALI LDDVESNEPA VILQSNVKVV SPTKSKNYGV DLPNNDIEWE ECDAFDAQEW QKAMDSVKCL LPIRGAHFEH WSCYEDIKTT STLDSTDTPL PEQNARIQYR FMTNVRVAAI VDSVTKLKEC RMSLDASILL PSLRRWRNIR AESIELLTRF EKGEDAPLTA KIKLQWSSLL ENVEKELDAV CDELKLDVAA VKKQTPPKNR ETQTSLASAE HSTDSMTMAS VTAALTEGGA IDMSQSLESE CVASLRSTDV AEASAIVSEC LRRALLQTLE SLDGADTSAA LEHGVSHLML VSRVGSAAVG AAEVIRALAK ASEEEKLSKA ISPMNTNESI SRALSRIFTN VTTFLTSENS VDLDLRRGAA KLLEPLDAHL SRPPMRQFGR FPQLQAALAA EIDGVVDQLP FVCRSSTDAD DGVSQFTLKV PHENPIAPAI EDFGDWGIKM DLRRCPACSH SLLCPNDGEL ITFMCAHTYH KACCAASMAC FACCADSR
|
| |