Gene Information |
Locus tag | OSTLU_31221 |
Symbol | |
ID | 5001336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 513616 |
End bp | 516029 |
Gene Length | 2414 bp |
Protein Length | 727 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416757 |
Product | predicted protein |
Protein accession | XP_001417267 |
Protein GI | 145345544 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
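
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed Gene Length, but is not stated in the record). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 513616
END_BP = 516029
PROTEIN_LENGTH_AA = 727


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def min_coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode a protein: 3 per residue plus one stop codon."""
    return 3 * protein_length_aa + 3


print(gene_length(START_BP, END_BP))         # 2414
print(min_coding_length(PROTEIN_LENGTH_AA))  # 2184
```

Under that assumption the computed span matches the listed 2414 bp. It is longer than the 2184 bp needed to encode 727 residues plus a stop codon, which suggests that the listed span includes some sequence outside the coding region.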
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.005519 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCGGATGGA TGTCGACGAC GCGCGCGACG GCGGCGACGA GGACCGTTCC GACGACGACG ATGACGCGCG CGAGGACGCG ATCGACGCGA GCGAAGGCTC GAGCGAGGAC GACGAGGACG AGGACGACGA CGAGGACGAC GACGAGGACG ACGATGATGT AGAATTGGAT GATGCGCAAT TGGACACGAT AATGACGCTC GAACAAGAGC TCGAGAGCGA CTCCTGGCGC GATTACGCGA AAGCGGGGAG ATTGATTGAT TTGTTGCGCG CGGCGAACCT GCGCGAGCGG TGCAGAACGG CGAGGGAACG ATATAACGAA GCGTTCGCGA TGGACGAGGC GCGCTGGTCG GCGTGGATTC AAGACGAATT GGCGAATAAG ACGACGGGGA CGAAACGCGA ACGGCACACG CGATGCGATG AGTTGTTCGC GAGGGCGATC GAAGAGTGTG GACGGACGAG CGTGCGGTTG CACATGGGAA GGTCGAGGAA CGCGATGGAG CTGGAGGCGG ATGAGAGCGA CCGACGCGCG CTTTACGAGA CGGCGACGGC GGGACCGGGG ATGAACTTTA ACGATGGGCA TTTGATTTGG CAGGCGTATC GAGCGTTTGA ACTGTCGTGG GCGGCGTCGG AAACGCAGAA AGTGCGGGTG AAGGCGCTGT ATTTGCGACA GTTGAAGATT CCACAAGCGC AGTCTCAGGC GACGCTCGAG GCGGCAAAGA CGTGGGCCGC CGACGCGGGT CTTCATAACG CAGACGCCGC GTTCCACGAA GCGTACGCGG TTGGTAATGC TGCAAAAGTA CTGCGCGAGC CTTACGAGGC GCGTCTGCTC GCCGTGACGA GTTCGCATGA AGGCGACGCG AAGCTGCTTC GAGCGTACGC AACCTACATT GATTTTGAAA TGGCGAGCGG ATCTCCGGAC AGAGTCGTTC ACTTGTACGA ACGCGCGCTT TCGTCGCTGC CGTATGTCGC AGAGCTCTGG CGAGACTACG TGTTGTACGT GTGGTCGATA TCGTTCAAAT CAGCAGAAGC AGCTTCGCGC ACTTTGATGC TTCGCGCCGT TCGCATGTGC CCGTCTAGCG TTCTGTGGAA GTCGGTGCTC GAGCTGGAGT CGTCGTACGA CTTGTACACG CATGCGCTGA GAACGAAGTT CAGAGATCCG AACGACTACG GTGCGGTTCT TACAAAAGTT CTCACACAGT GCGTACGTTT GGACGATTGG GCAAAAGCGT CCGGGTGCGT GACTTTTGGT TTCGAGCAGA TGGCCAAGGA TTATTCGCCC AACATCATGG CGGCGGCGGC GATCCATGTC TTGAAAGAAC TAGTAGACTA CGCGTTGATG AAAGATCACA AGACGACGGT CGCGTCATTC ATAGACGCTG TTTTTGCGCA ACTCGCGGAG CGGGCACCGT TCAAGACCGC CGCTGAATTT GTCATCCTTC GAACTGATAC GTCGCGCCTT ATAAAGAAGA CGCAAAAGGA AGTACTAGAC CTGTACGATG TAGCGTTACA GCGAGGAACC GTGGAACCCG TCGGCCTGTC GAAGAAGACG GTAGATGAAG AGACCCGTCG ATTGGCTGGA ACGGCTATGC TACTGAAAGC AAAGGCGCGA TATCTGTCGA CATTTGCACC AAAAAAGTAC GACGATTTCA ACGAAAAGAC CAACGCTATG GCCGCTGTGC GGCGGTACGA GTTAAAACTC GCGTGTGAGC GATTGGCTGC GAACGCGGCG GCGAAAGAGC GCATGAAAAG CGGTGGAAGC CGCCGCGAAC GCGCCGCCGC GCGTCGCGCC GGAGCCGAAC 
CCATTCCACG CAAACGAACT CGCGAAATTC CAGAGGGAGG CGACAAGACT GATATGAATC TTGCCGGAAT GGATCATGAC GCTCGTGTCA AGACGCTTTT CCCAACAAGA GACACGCAAA CAGCCTTTGT CAAAAATTTG TCCTGGGACG TTACGGACGC CGAGCTCATG GAGTTCTTCA CCGGCGCGGT GAGCTGTCGA ATCGTCAAGG ACAAAGCCAC TGGTCGTTCG CGAGGAATCG CGTACGTTGA CTTTGGAGAA GAAGCTGCTC TGAACGCTGC AATCATGCGA TCCGGTGAGG CGCTCAAGGG AAGACTGGTT GATATCGCGA AGAGTCGGCC TCCCGGTGAC GACGGACCCG ATGGTCGCGG TGGACGTGGC GGTGGACGTG GCAGTCGTGG CGGTGGTCGT GGTGGCGGGC GCGCCGCACC GTCTGTGGCT TCTGGACGTG GTCGCGGAGG CTTGGGCCTT ATGCCTCGCG CGATAACGGT GACGCGTACG GACAACGGCG AAGGCGCACA AGCAAAAACT AACGCAGACT TCAGGGCAAT GTTTGTGAAG GGATCATCGC AGTAGAGACG CTCGTAGTAG TGATGAAAAA AACGTTGTTG TATA
|
Protein sequence | MTLEQELESD SWRDYAKAGR LIDLLRAANL RERCRTARER YNEAFAMDEA RWSAWIQDEL ANKTTGTKRE RHTRCDELFA RAIEECGRTS VRLHMGRSRN AMELEADESD RRALYETATA GPGMNFNDGH LIWQAYRAFE LSWAASETQK VRVKALYLRQ LKIPQAQSQA TLEAAKTWAA DAGLHNADAA FHEAYAVGNA AKVLREPYEA RLLAVTSSHE GDAKLLRAYA TYIDFEMASG SPDRVVHLYE RALSSLPYVA ELWRDYVLYV WSISFKSAEA ASRTLMLRAV RMCPSSVLWK SVLELESSYD LYTHALRTKF RDPNDYGAVL TKVLTQCVRL DDWAKASGCV TFGFEQMAKD YSPNIMAAAA IHVLKELVDY ALMKDHKTTV ASFIDAVFAQ LAERAPFKTA AEFVILRTDT SRLIKKTQKE VLDLYDVALQ RGTVEPVGLS KKTVDEETRR LAGTAMLLKA KARYLSTFAP KKYDDFNEKT NAMAAVRRYE LKLACERLAA NAAAKERMKS GGSRRERAAA RRAGAEPIPR KRTREIPEGG DKTDMNLAGM DHDARVKTLF PTRDTQTAFV KNLSWDVTDA ELMEFFTGAV SCRIVKDKAT GRSRGIAYVD FGEEAALNAA IMRSGEALKG RLVDIAKSRP PGDDGPDGRG GRGGGRGSRG GGRGGGRAAP SVASGRGRGG LGLMPRAITV TRTDNGEGAQ AKTNADFRAM FVKGSSQ
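
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for removing that grouping and computing the length and GC fraction is shown below. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def ungroup(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence, used only as a small example.
sample = ungroup("TTCGGATGGA TGTCGACGAC")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.55
```

Applying the same two functions to the full Gene sequence field would allow the result to be compared with the Gene Length and GC content fields in the table.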