Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24216 |
Symbol | |
ID | 5000957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 330386 |
End bp | 332341 |
Gene Length | 1956 bp |
Protein Length | 614 aa |
Translation table | |
GC content | 52% |
IMG OID | 640416378 |
Product | predicted protein |
Protein accession | XP_001416625 |
Protein GI | 145344201 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0349] Ribonuclease D |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000105574 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.257182 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCTTTGGCT CTGAAGCACC GCGCGAACCG ACCCGGTCGT CGTGCGCGAA CGATGACTTA CTCACACATA CCTTTCGCGC ACTGATGCAA GCGCACTTTC CCTCAATCAC GAGCGAGCAC CGGCCAACTG CTGTTCGGAT TGGGCGCACA AAACACAAGG CGGCACGAAG GCAATCGTCG AGGTTCGTAC CGACAATGGC ACTCGACTCG ACGATAGATC CTATCTCGTT CGCCCCCGAT GAAGCTTTGA CGAAACTGTC CAACTTAATG ACCCAAGGAG TGAAATACCC GCAACTGAGA GCGGTGGTCG TGAGCTGGCT CGCCGAGGTG CGCGAACCAT CGACTCTCGT TGCGGCGTGC ATCGACGTCA TCTTCACCAC CCTGCGCGAT CAGTACGAGA TCGGACGCGA AGAAGACGAT GATGGCGAGT TACTGATTTG CATGTACTTG TTGTTGGAGT GCCTCGGTAG AACTCCCGGA ACGCGGGAAG GGCGAGAACG GAGACGCTTT ACCAGCACCG TCACGAGAGG CGTAGTTTTG TGTGATCGAA GTCGCGCACA CGTAGGTAAA TTGGTACATT CAGCGATGAC CACGACAAAT GATGTCATAC CAAGCCGTGC GATCGTTCGC TTGATCGAAA CATTTGGGCT TGAGTTTGAG GATTTGAACT TTACGAGCTG CGCAGCTGCT GCCGAAGGCG TCTCAGGCTT CGTTCGACAG CTTTTAGAAG ACGGCAAACA GAGCTCGGCC ATGGCGCTCA TTTTTCACTT CAACCTGGAA GAGTTCGCAA CAAATGAAAC ACTGCAAGGT TTGGCGCAGT GTAATGAATT TTCACTCGCT CTTGAGTTTG TTCAACTCGA ACCAACTCTG GCCAAAAATT GGATTGAGTT TTGCGTGCAC CAGGGCGTCC ACGAGCACCA TCACGCTGCT TTGCGACAAG CACACAAAGT TGTCACGATA TTTGAGATGG AAGCCGAGTT TCCGGACGTG CGCACGCAGT ATTACAAGTC GACTATTGCG AGAATGATTG CGAAAGGACA ATTTGAAGTC GCGCTCAAGC GCGCAGGCGC CGAAATCAAT CTTCAAGAGT TTGTTGTCGA ATCTCTTGCA GCGATCGAAC AATTCGAATA CGCTTTGGAG TTCGCAAATA GGTGTGGACT AGAGTTTGAC TGTGATCCAG TGGAGCTGGA GAAGTTAGTC GCACGCAGGA GAGCAACGTT TTTCCAGCTT CCAGAGCATC TCTCTCTCGA CGCGAACGTC GTCTTTGTTG ACGATGCAAA GAGCTTGCAT TACATCTCAG AACGGTACCT CGCAAATAAA AAAGACATTG GAATAGATAC TGAATGGGGT GCTGCGGTGG GTGAAGACGC GGACAAGGAA GACACGAGTC AGGTAGCTAC TCTACAACTT GCGTCAGAAG ACGGCGTAGC AATTTTAGAC TTGCCAGTTC TAGTTCAGAG CTGTCCCGAG GCGTTAGAGG CAACGATCGG GCGAATGTTT CAGGACGATA AGGTGTTAAA ATTAGGGTTC GCAGTTCAAG AAGACTTGCG ACGCTTGGCA AAATGTCACC CTGCGTCTTT CGGCAACGTA CGCAATGTGG CCGACTTGCA ATCGTTGTGG AAATTAGCTG TGTCAAAGGC GCGAATGACG AAAGAGACTC GTGATTTTCC ATGGGCAACC GACGAAGAAC TGTCACGATA TCAACCCGTT GGCTTATCGA CTATGGTCGC TGCCGTACTC GGGAAACCGT TGGACAAGAC GATGCGGATG TCTGATTGGT CGAAACGTCC GCTGACCGCA CAACAGCGAG TGTATGCGGC ACTCGACGCG TGGACTCTGG TCGAATCGCA TCGTTCGCTC CTCGCATCTC ACGCCGATCG ATACATCGCA CTAGTCGATC AAGTGAACAA ATCATATGAG TTCAAATAGC GCATAATCCC ATGGTTGTTG TATCAA
|
Protein sequence | MQAHFPSITS EHRPTAVRIG RTKHKAARRQ SSRFVPTMAL DSTIDPISFA PDEALTKLSN LMTQGVKYPQ LRAVVVSWLA EVREPSTLVA ACIDVIFTTL RDQYEIGREE DDDGELLICM YLLLECLGRT PGTREGRERR RFTSTVTRGV VLCDRSRAHV GKLVHSAMTT TNDVIPSRAI VRLIETFGLE FEDLNFTSCA AAAEGVSGFV RQLLEDGKQS SAMALIFHFN LEEFATNETL QGLAQCNEFS LALEFVQLEP TLAKNWIEFC VHQGVHEHHH AALRQAHKVV TIFEMEAEFP DVRTQYYKST IARMIAKGQF EVALKRAGAE INLQEFVVES LAAIEQFEYA LEFANRCGLE FDCDPVELEK LVARRRATFF QLPEHLSLDA NVVFVDDAKS LHYISERYLA NKKDIGIDTE WGAAVGEDAD KEDTSQVATL QLASEDGVAI LDLPVLVQSC PEALEATIGR MFQDDKVLKL GFAVQEDLRR LAKCHPASFG NVRNVADLQS LWKLAVSKAR MTKETRDFPW ATDEELSRYQ PVGLSTMVAA VLGKPLDKTM RMSDWSKRPL TAQQRVYAAL DAWTLVESHR SLLASHADRY IALVDQVNKS YEFK
|
| |