Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31813 |
Symbol | |
ID | 5001762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 577672 |
End bp | 582417 |
Gene Length | 4746 bp |
Protein Length | 1581 aa |
Translation table | |
GC content | 54% |
IMG OID | 640417183 |
Product | predicted protein |
Protein accession | XP_001418053 |
Protein GI | 145347180 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.209863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATC CCGCGGATTG GGGAGTGTGG CCTTCGTCGC AGTTGAATCA GAATAATTTA ACCGTAGACG TGCACGGCGC GTCGTTCTTT TCGACGAACG TGTCGGCGTG TCGAGTCGGG GACTTTGTGG TGCCGCTCAC GCACGTCAGC TCGACGTTAG TGCGGTGCAC GGTGTCACCG AGTCTACGGT TGAGTAAAGG ATATGCCTAC GTCGAGGTTT CGATGAATGG ATTAGATTTT ACTAGCGATA GGATCACATT TACGCTGCAC GAGCCGGTGA AGCTCGAGCA AGTGTTTCCG TACGGATGGG ATATAGGTGG GGGCGGGGTG TTGCGAGTGA GCGGTTCAAA CTTTGTGCCC GGATCGAGCA AATGCGCGTT TGGGGCCGGA AGTACCGGCA CAGTGCCGGC TGTGAGCGGG TTAGCGCCCT CAGAGGTCGT TTCCAGTGCA TTTATGAAGT GCGAGACGCC GGCTTTTTCG ACAGTCGGGA TGACCGACAC GTCTTTGCGT TCCTGGGGAG ACTCGGGATC TCTCAGCGGT AGTCGGGCAT TCGAAGTTTG GCGAGAGCCA TGCTCGATAG AAATTTTCAC CACGTGGAAT TCTACCAAGG CTGAATACGG AGGCGTCGCC GTCGTGCTTC AGGGAAGTGG GAACGAAGAT CTCGTCGAGC CGAATAACAT GTTTGGTTAT TCATGTCATT TCGGCACAAT CACGGTAAGC GCGGTGATGG ACACGTTGAC GATCAATAGG ACTGTTACGT GCATCTCTCC GGCTTCGCAA CATCCGAGCA CGAGCGTTGA ACTTTGGACC GGGCCAAATA TAGACTCTTT CAAGCCTTGC ACCACGTTCC CGACATTTGC GTATCCCGCA GACAGCGAAG ACACGTACAC AGAAGTGGTT GGCGATGAAA CTACAGTGGT GTACACTGGG ACAAATATTT CTGCCGTACC GACATACGTC ACCACATCTG GAGGATCGGT GATATCACTC AGCGGTATCA GCCTCTCTAC TGCATCGTTA TGTAGAGTCG GCTCGGATGC ATCCGCTCCG GTTCACTTTG TATCTTCCTC TCTCATTCAA TGTGAGATTC CTCCGCATAC TGAAGGCGAG GAGTATCTTT ATAGTGCACC AAGCGCGAGT GTGGAAATCA CTAGCGTGTA TTTTACTGCA CTCGCGGAGA TTAGCTCCGT TTTACCGAGC ACGGGAGGCC TCGAGGGCGG GACTGCGGTC AGAGTTCGTG GGGCAAATTT CAAGGATACA GATGACTTGC TTTGTAGATA TGGATCAATA TCCGTGTTGG CAAGTTATTA CTCAACCACA CAAGTACAGT GCGTCACCCC AGCGCACATG ATAGGTAGTA TTCCCGTAGG TGTCGGGCGC CGCGATGGTA TATCGCATTC ATTTTGGGGT GATAAGTTGT TCGTTTACGA CTCAACCGAT ATTCTCGGCG CCGTCATGCC ATCTGTCGTG AACAACGTCG GATCAACATC TCTGAGCCTG GTCTGGTCGG TTGTCTACAG CGGCGGTACA GGGTGCAAAG TCGGTGGTGT TCTACTCGAC CCATGTACCG TTTCAGGTAG CACGACACCG GGTTTCGTTC AGGTTTACTC GTACACTTTT GCAAGCGACG AGCGCGTGAG CGAACCTGCG GCGTTTGCTT ACTACACGTC TCCAATTGTG TCAGGCTCGA TTCCCGTCGT TCTCCAGAAC TTCATTCCAA CTGTCGTTTA TATGCTAGGC TCAGACTTTG TCGACGAGGC GAATGTACTC TGCTCCTTTG GTGACAAAGC CGTAGACGCG ACTTTCGTGT CATCTGCTTT GGTCAAGTGC GCTTCCCATT TGGTGGGCGC CGTCGGCAAT ACGGATGCAC AAGTCGGGTT TGGTAGCGCG AGTGACGACG GCGTGTGGAG TCGAACGGTA ATGTCGCTGA GCGTGGTCGA CATACTCACG ATGAGTAGTG TCAGTCCGAC ACGAGGAGTA TTGGCTGGCG GCACCGTCGT CACCGTCACG GGAACTGGTT TTGAGGGCGG CGGCGTTGTC TACTGTCGCA TAGGTACGGT TTCGTACATC GAAGCACGAA CTATACACGA TAAGAAAGTG GAGTGTACCG CTCCGTCGTA CTACGATGAA ACGGTTGATA TTCAAGTCGC TATCCTGGGC AATGTGTACG CGGACACGGC GCAGTCATTC GTGTACAGCA CGGGTGTTGA CGTAGTCGCC ATCATTCCTC CTACAAGCCC TCTAGCTGGT GGTACTGCCA TCAGCCTCTT TGGTTTGGCT GGCAGTGTAG GCGATAGCTA TGATTGCGTG ATGAGTGGTG CTGCGGTAGC TGGTACGATT AGCCGCTTTG GTGAAGTTGA ATGCGCGAAT CCCGCCGGTG AAGAAGGCTT CGCCGCTGTC GGCATTGGGA GCATCATCGA TGACGAGATT GATCAGCAAA CCATCGAATA CGCGCGGGCA CCGGTGATAT CATCTGTTTA CCCTCTGAAC GGCCCAACGT CGGGTGGCAC GTTGATTTAC ACTTCAGGCT CACACATGCG TGACTCTGCA TACCTGTCTC TTGAAGCTAA CGCCGGCGCT TCGAGTCACT TTGTCAGCTC AGCGCTAGTC GTTACAGAGC TCCCAATTTC GACCGCAGCC GTGTTCAGCG CTAGTGTGAA GCAAAATGGC AACTTGGTGA GTAACGCGCT GACGTTTGCG TCACGCGCTG CCGTCACGCT GAGCTCTGTG ACTCCGACTG GGATTGCTAT TTCGGGCGGG AGCGTCGTGT ACGTGACGGG CTCGAACATG CCAAACGACA ACACTCTATA TTGTTCCTTC GGCACGATTC TTGTTTCGGC GCAGTGGTCA TCAAGCACAG CGGCAAACTG TGTGAGTCCC GCTCATCTGG TCGATAGCAC CAGTACGAAG TTTAGGGTGC ACGCCGATGG CCTATCATCG ACAACATCCA AAGACATAAC GTACGTGTCC ACTTCTGAAA TCACCGAGAC GCTGCCACCG AGCTTGAGTG CCACGGAGTT GCCAGCGTCG GTGACTATTC TCGGCGCTTG GTTGGCGAGC GCATCTTGCG ATGGGATCGC TCTCTCGCTC AACTCTACTT GGGCGTCCGA GTTTGCGTGC ACGCTCGACC CTGTTGGCGT TGGTTACACT GCCGTGAGCG TGATTTCGCG CGGCCAGACG ATGAGCGTTT CGTATTTGAT CAAAGAGACG CCACTATTGC TGAGCGTTTC CCCTCCTGGC GCGTCAACTA TGCCTGGTGA GCTATTCACG TTGACCGTGC AGCACTTTAT CGCTGATGAT GCCGATCAAT TCCACTGTTT ATTTGACGCC ACGAGCGCGG TTGCGCCACA CATAATCTCC TCTAGTTTGA TCCGGTGCGA GTCAGTCGCG ACGACAAAAG TGTCGACGCG CTTGACCATC GAGGGTGGCG ATGGCGCTTA CCCACTGTCT CGACAAGCTG CTCCGGTGGT CTCGAGCATC GCGCCATCGT CGAGCGGTGA TATTGGAGGG ACGTTGGTTA CGCTCACAGG CACGAACATT CCTCTCATAG ACAACTCTGC CGTGTGTTCG TTTGGTTCAA TTGGTCCCAT CGCTGCGCAG TACGCCACTA CTACGACAGT ACAGTGTGTG TCTCCTGCAG GTGTGGTGAG TTCATCGGCA AATATCTGTG TGAGCGTTTT CAGCACTGCA TCACCGTCCA GAAGCTGCAA TACTACACAC GCGCTTCAGC GATCGGACAT TCTCGGCGCC GTCATGCCGT CTGTCGTGAA CAACGTCGGA TCAACATCTC TGAGCCTGGT CTGGTCGGTT GTCTACAGCG GCGGTACAGG GTGCAAAGTC GGTGGTGTTC TACTCGACCC ATGTACCGTT TCAGGTAGCA CGACACCGGG TTTCGTTCAG GTTTACTCGT ACACTTTTGC AAGCGACGAG CGCGTGAGCG AACCTGCGGC GTTTGCTTAC TACACGTCTC CAATTGTGTC AGGCTCGATT CCCGTCGTTC TCCAGAACTT CATTCCAACT GTCGTTTATA TGCTAGGCTC AGACTTTGTC GACGAGGCGA ATGTACTCTG CTCCTTTGGT GACAAAGCCG TAGACGCGAC TTTCGTGTCA TCTGCTTTGG TCAAGTGCGC TTCCCATTTG GTGGGCGCCG TCGGCAATAC GGATGCACAA GTCGGGTTTG GTAGCGCGAG TGACGACGGC GTGTGGAGTC GAACGGTAAT GTCGCTGAGC GTGGTCGACA TACTCACGAT GAGTAGTGTC AGTCCGACAC GAGGAGTATT GGCTGGCGGC ACCGTCGTCA CCGTCACGGG AACTGGTTTT GAGGGCGGCG GCGTTGTCTA CTGTCGCATA GGTACGGTTT CGTACATCGA AGCACGAACT ATACACGATA AGAAAGTGGA GTGTACCGCT CCGTCGTACT ACGATGAAAC GGTTGATATT CAAGTCGCTA TCCTGGGCAA TGTGTACGCG GACACGGCGC AGTCATTCGT GTACAGCACG GGTGTTGACG TAGTCGCCAT CATTCCTCCT ACAAGCCCTC TAGCTGGTGG TACTGCCATC AGCCTCTTTG GTTTGGCTGG CAGTGTAGGC GATAGCTATG ATTGCGTGAT GAGTGGTGCT GCGGTAGCTG GTACGATTAG CCGCTTTGGT GAAGTTGAAT GCGCGAATCC CGCCGGTGAA GAAGGCTTCG CCGCTGTCGG CATGCACAGC GGCAAACTGT GTGAGTCCCG CTCATCTGGT CGATAG
|
Protein sequence | MLDPADWGVW PSSQLNQNNL TVDVHGASFF STNVSACRVG DFVVPLTHVS STLVRCTVSP SLRLSKGYAY VEVSMNGLDF TSDRITFTLH EPVKLEQVFP YGWDIGGGGV LRVSGSNFVP GSSKCAFGAG STGTVPAVSG LAPSEVVSSA FMKCETPAFS TVGMTDTSLR SWGDSGSLSG SRAFEVWREP CSIEIFTTWN STKAEYGGVA VVLQGSGNED LVEPNNMFGY SCHFGTITVS AVMDTLTINR TVTCISPASQ HPSTSVELWT GPNIDSFKPC TTFPTFAYPA DSEDTYTEVV GDETTVVYTG TNISAVPTYV TTSGGSVISL SGISLSTASL CRVGSDASAP VHFVSSSLIQ CEIPPHTEGE EYLYSAPSAS VEITSVYFTA LAEISSVLPS TGGLEGGTAV RVRGANFKDT DDLLCRYGSI SVLASYYSTT QVQCVTPAHM IGSIPVGVGR RDGISHSFWG DKLFVYDSTD ILGAVMPSVV NNVGSTSLSL VWSVVYSGGT GCKVGGVLLD PCTVSGSTTP GFVQVYSYTF ASDERVSEPA AFAYYTSPIV SGSIPVVLQN FIPTVVYMLG SDFVDEANVL CSFGDKAVDA TFVSSALVKC ASHLVGAVGN TDAQVGFGSA SDDGVWSRTV MSLSVVDILT MSSVSPTRGV LAGGTVVTVT GTGFEGGGVV YCRIGTVSYI EARTIHDKKV ECTAPSYYDE TVDIQVAILG NVYADTAQSF VYSTGVDVVA IIPPTSPLAG GTAISLFGLA GSVGDSYDCV MSGAAVAGTI SRFGEVECAN PAGEEGFAAV GIGSIIDDEI DQQTIEYARA PVISSVYPLN GPTSGGTLIY TSGSHMRDSA YLSLEANAGA SSHFVSSALV VTELPISTAA VFSASVKQNG NLVSNALTFA SRAAVTLSSV TPTGIAISGG SVVYVTGSNM PNDNTLYCSF GTILVSAQWS SSTAANCVSP AHLVDSTSTK FRVHADGLSS TTSKDITYVS TSEITETLPP SLSATELPAS VTILGAWLAS ASCDGIALSL NSTWASEFAC TLDPVGVGYT AVSVISRGQT MSVSYLIKET PLLLSVSPPG ASTMPGELFT LTVQHFIADD ADQFHCLFDA TSAVAPHIIS SSLIRCESVA TTKVSTRLTI EGGDGAYPLS RQAAPVVSSI APSSSGDIGG TLVTLTGTNI PLIDNSAVCS FGSIGPIAAQ YATTTTVQCV SPAGVVSSSA NICVSVFSTA SPSRSCNTTH ALQRSDILGA VMPSVVNNVG STSLSLVWSV VYSGGTGCKV GGVLLDPCTV SGSTTPGFVQ VYSYTFASDE RVSEPAAFAY YTSPIVSGSI PVVLQNFIPT VVYMLGSDFV DEANVLCSFG DKAVDATFVS SALVKCASHL VGAVGNTDAQ VGFGSASDDG VWSRTVMSLS VVDILTMSSV SPTRGVLAGG TVVTVTGTGF EGGGVVYCRI GTVSYIEART IHDKKVECTA PSYYDETVDI QVAILGNVYA DTAQSFVYST GVDVVAIIPP TSPLAGGTAI SLFGLAGSVG DSYDCVMSGA AVAGTISRFG EVECANPAGE EGFAAVGMHS GKLCESRSSG R
|
| |