Gene Information |
Locus tag | OSTLU_39452 |
Symbol | |
ID | 5004682 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 139145 |
End bp | 142864 |
Gene Length | 3720 bp |
Protein Length | 1077 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420103 |
Product | predicted protein |
Protein accession | XP_001420597 |
Protein GI | 145352536 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGCCCGGATG CGCCGTTTCT GGCGACGGGG ACGATGGCGG GGGCGATCGA TTTGTCGTTT AGCACGACGG CGTGTTTGGA AATCTTCTCG ACGGACTACG CGGATGGGGA GTTTGAGATG CCGACGCGCG GGAAGGCGGT GCCGTCGACG GAGAGGTTTC ATAGGTTGGT GTGGGGGAGG GGCGCGGCGT CGGAGGAGAC GCGGCTGGGG TTGATTGCCG GGGGGTTGGT GGACGGCACG GTGAACGTGT ACAATCCGGC GAAGATCGTG GACGGCGCCC AGAGCGGGGC GATAATCACA AAGTTGGCCA AGCATCAGGG TGCGGTGAGG GGCTTGGACT TTAACACGTT CTCACCCAAC CTGCTCGCGA GCGGGGCGGA GGATGGGGAG CTTTGTATTT GGGATTTGGC AAATCCGAAC AAGCCGTCGT TGTATCCCGC GCTCAAGTCT ACCTCGGGCG GCCCGAGCGC CGGGGAGGTT TCGTATTTGG CGTGGAATCA TAAAGTGCAG CACATTTTGG CGTCTTCTTC GCTCAACGGT ACCACGGTGG TTTGGGATTT GAAGCGTCAG CGTCCGGTGA TTTCGTTCAC CGATCCGAAT TCCCGCCGGC GGTGCTCGGC GTTGCAGTGG AACCCAGAGG TGGCGACCCA GCTTATCGTC GCGAGCGACG ACGATCGCTC GTGCTCGCTT CAGGTGTGGG ATTTGCGGAA CTCTATCTCT CCTGCGCGCG AATTCGTCGC CCATTCCAAA GGCGTGCTCG CCATGGCCTG GAACTTGCAA GATCCGTCCC TGCTGCTCAC GTGCGGCAAG GACAACAGGA CGCTTTGCTG GGACACCGAG GCCGGTGAGG TCATTTCCGA GCTTCCCGCG AGCGCGAACT GGAATTTCGA CGTTCAATGG AGCAAGACGA CGCCCGGCAT CTTGTCCACA TCTTCGTTCG ACGGCAAGAT CACCTTGCAC AACTTGCAAA AGGCGGGGGC GACGGCGCAA GGTTCAGCGG ACGCTCACGG CGTGAGCTCG GACTTTTCCG AACTCGCCCA CCAGCAAAGC GCCGGTCCAA CGATGCCGAT GAAACGCGCA CCGAACTGGT TGAAACGTCC TTGCGGGGCC ACTTTCGGAT TCGGTGGTAA GCTCGTGGCG CACGGAGCCA CGCTCCAGGG TGCCCCGGCG ACGCCGGGCG CCGTGACCGT GATTTCCGTC AAGAGCGAGA CGACGGATGG TTTGGTGGTG AAAGAAAACA GCTCCGAGTT TGAAGACGCC GTGAAGGGAG GAGACACGGA GGCGCTGATT AAATTTTGCG AGGCGAAGAA ACGCATGGCC AAGGGTGAAG AGCACGAAGC TTGGACGTTT ATGAGTATCT TATTCACCGA GGACGCCAGG CGAGAAATGT TACGCCACCT CGAGTTTGGT GACGCCTTGG AGGCGCGCGA GAAGTCCCTC GCGGCCTTGT CGATTAAGGA TACTGATGAT TCGCAAGCCG ATGCGTCGAC GCCGGTGAGT ACACCGACGT CGCCAGCGCT GCCTCCGGTC GAGGACTCTG ACGCATTCTT CGACAACCTT GGAGACGCGG CGCAACCTAC GTCGCCGAAG CACATTCCGT CGCCGAAAAA ACCGGCGCCG AAAAAACCGG CGCCGGTGGT TTCAAAGGCT CCACCGCCGA CGGCTGCTGA TTTTGAGATT CAACGCGCTT TGATTGTAGG CGATTACAAA GCCGCCGTGG ACGCGTGCAA GCGCGCCGAG CGCTACGCCG ACGCCCTCAT CCTCGCCGCG GCTGGTGGTC CGGAACTATG GGCTGAAACC 
CAAGCTGCGC ACATTGCGCG AGTTCCGCGC CCGTACATGC AAGTGGCCGC GGCGGTGGTC GGAAAGAACT TGTCGAATCT CGTCAAGGCG CGGCCGGTGT CGGCTTGGCG TGAGACGCTG GCGTTGTTGT GCACGTACGC CCCTGGTGAT GAATGGGGTC CATTAGCGGA AGTTCTCGCC GAGGGATTGG CAAAGAGCGG AGATCATAAG TCTGCGATGC TGTGCTACGT TTGTGCGGGT AATGTGGATG CGGCTATCAA GTACTGGCTG TCTTCACTGC CGTCGAGAAA CTTTAGTCCA AAGGATTTGT ACAGTGTAGT CGAGAAGGCT GTGATGATGA CTCGCGCGGC GGGACAATCG GAAGCTACGC AAGGTTTCAC GAGTTTGATT ACGAATTACG CGGAAATGTT GTCTTCCCAA GGTGATCTGG ACAGCGCGTT AGATTACTTG GGCATGGTTC CCGGGACCCC GGGTGAAGAA GTAAACGTCT TGCGCAACAG AATTATTCGC AGCAGTATTT CCAACAAGGC GACCACGAGT GCGCCAGTCG TCGCGTCTCC GGTCTCTGCG TCGGTGGTAC AGCCGTCGTA TGGCGCTCCA TCCACGATGT CTGCGTACCA GGCCTCGCCA CCTCCGGCAC AGCCATCGTC TTATGGATCC CAAGCCGCGT ACGGCGTCCC GCCGTCGATG ACTTCGGCGT ACAGCACCCC GGCGCCTCCG CCTCCGCCGA CGACTTTCAC GCCTGCGGCT ATGGCGCCAC CGCCAGTTTC GCAAGTTCCT CAACGAGCGC CGCAGCAACC GGCGCCGATG ATACCGACGA TGCCTCCGAT CGATATGCAG GGTGGGTACT TTTCGGGATC ATCGCCGACC CCTGCGATGC CCCCCGGGAC CGCAGCGCCC CGGCCGCCGC CACAACAGGC AGCTCCCACG GGACCGCCGC CGAGACAGAC CGCGTACGAC AGTGCGCCTG TTGAGAGCGC GTACGCCACC CCTAGTGGAT ACGGGGGAGG TATGAGTCAG CCAGCGCAAC CGTCTGTGCC TCCGTCGATG ACCGCGGCGT ACAACGTCGC CGCTCAAGTC GCCGCGGGCA ACATGGGCGC CCCCGCAGCA CCCCCGATCG GCGGTGGTTA CGGTGCGCCT CCGGCGCCGA TGGCTCCGCA GCATCACCAA GCGCCACCAC CAATGCCCGG AATGATGGTT CCGACTCCCA TGATGCCGCA ACAACAGCAG CAAATGGGCG GGCCGCCGCA ACAACAGCCC TCGCGCGCAC CGCCGCCGCC GCCTTCGTCC GTCGCCCCCG CTGCCTCGTT CAACCCTGCG CCCGCAACGC CGCCGCCGAT CATTCCGCAA CGCGTCGAGC AACCGGCGTA CGGTGCGAAT GCGCCTACAC CGCCACCTTC GGCGCAGGCG TACGGCGCGC CGCAGCACCA GCAGCAGAGC TTCCAGCCCG GCCCGCCGCA ACCGTCAACC TTTGCGCCGG CTCCGCCGCC GATGCAGCAG CAGCCGGCCG CAGCACCACC ACCGCCACCT GCGGCGCCGG CAAAGCCGCA ACCACCGGCG AACTGCTCCG TGGAAACCGT CGACACGAGC AAGATTTCAC CCGACCTCGC GCCCGTCGCG CAGTCGCTTC GCTCGCTCTA CGACGCGTGC GCCGCCGCCG CCGCCGCCCA TCCAGCGAAG CGTAAAGAGA TGGACGACAG CTCCAGACGA CTCGGTGTTC TCCTCTGGAA GCTCAACGCC GCCGAGGTGT CCCCATCCGT CGTCGCCAAG CTCAAATCTC TCGCCGAGGC GCTCGACACG GCGAATTTCA 
CCGCCGCGCA CGGCGTCCAA ATGGCCCTCA CCACCGGCGA TTGGGACGAG TGCAGCGCCT GGCTCACCGC GTTAAAGCGT CTCACGAAAT TTCGATCCAC GTTTCCATAG
|
Protein sequence | SPDAPFLATG TMAGAIDLSF STTACLEIFS TDYADGEFEM PTRGKAVPST ERFHRLVWGR GAASEETRLG LIAGGLVDGT VNVYNPAKIV DGAQSGAIIT KLAKHQGAVR GLDFNTFSPN LLASGAEDGE LCIWDLANPN KPSLYPALKS TSGGPSAGEV SYLAWNHKVQ HILASSSLNG TTVVWDLKRQ RPVISFTDPN SRRRCSALQW NPEVATQLIV ASDDDRSCSL QVWDLRNSIS PAREFVAHSK GVLAMAWNLQ DPSLLLTCGK DNRTLCWDTE AGEVISELPA SANWNFDVQW SKTTPGILST SSFDGKITLH NLQKAGATAQ GSADAHGVSS DFSELAHQQS AGPTMPMKRA PNWLKRPCGA TFGFGGKLVA HGATLQGAPA TPGAVTVISV KSETTDGLVV KENSSEFEDA VKGGDTEALI KFCEAKKRMA KGEEHEAWTF MSILFTEDAR REMLRHLEFG DALEAREKSL AALSIKDTDD SQADASTPVS TPTSPALPPV EDSDAFFDNL GDAAQPTSPK HIPSPKKPAP KKPAPVVSKA PPPTAADFEI QRALIVGDYK AAVDACKRAE RYADALILAA AGGPELWAET QAAHIARVPR PYMQVAAAVV GKNLSNLVKA RPVSAWRETL ALLCTYAPGD EWGPLAEVLA EGLAKSGDHK SAMLCYVCAG NVDAAIKYWL SSLPSRNFSP KDLYSVVEKA VMMTRAAGQS EATQGFTSLI TNYAEMLSSQ GDLDSALDYL GMVPGTPGEE VNVLRNRIIR SSISNKATTT QPSVPPSMTA AYNVAAQVAA GNMGAPAAPP IGGGYGAPPA PMAPQHHQAP PPMPGMMVPT PMMPQQQQQM GGPPQQQPSR APPPPPSSVA PAASFNPAPA TPPPIIPQRV EQPAYGANAP TPPPSAQAYG APQHQQQSFQ PGPPQPSTFA PAPPPMQQQP AAAPPPPPAA PAKPQPPANC SVETVDTSKI SPDLAPVAQS LRSLYDACAA AAAAHPAKRK EMDDSSRRLG VLLWKLNAAE VSPSVVAKLK SLAEALDTAN FTAAHGVQMA LTTGDWDECS AWLTALKRLT KFRSTFP
|
| |
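The coordinate and sequence fields in this record can be cross-checked with a few lines of code. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates (an assumption that matches the listed gene length of 3720 bp), and that the sequence fields are split into space-separated blocks of ten residues. The helper names `gene_length`, `clean_sequence`, and `gc_percent` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Coordinates taken from the Gene Information table above.
print(gene_length(139145, 142864))  # 3720

# First two blocks of the gene sequence, used only as a small demonstration.
print(clean_sequence("AGCCCGGATG CGCCGTTTCT"))  # AGCCCGGATGCGCCGTTTCT
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the listed GC content, and `len(clean_sequence(...))` can be compared with the Gene Length field.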