Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50098 |
Symbol | |
ID | 5002784 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 634425 |
End bp | 636786 |
Gene Length | 2362 bp |
Protein Length | 572 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418205 |
Product | predicted protein |
Protein accession | XP_001418998 |
Protein GI | 145349140 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | [TIGR01628] polyadenylate binding protein, human types 1, 2, 3, 4 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.442075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.11137 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCCG CGACGGCGAC GAACGTGACG AACGATGCGA AAGCGTCGAA CGCGACGGCG AACGCGGATG GGACGACGCC CGCGGCGCAA CAACCGGGGG CGGGGACGAG CTCGCTGTAC GTCGGGGACC TGGAGACCTC GGTGACCGAG GCGCAGCTGT ACGAAAAGTT CTCATCCATC GGACCGGTGG TGTCGATTCG GGTGTGCCGG GATTTGATCA CGCGACGATC GCTCGGGTAC GCGTACGTGA ACTTTCAATC GCCGAACGAT GCGGCGCACG CGATTGATGT GTTGAACTTT CAAGTCATCA ACGGGAAGCC GATTCGCGTG TTGTACTCGC AGCGAGATCC GGCGGTGCGA AGATCTGGCG TGGGGAACAT CTTCATCAAG AATCTCGATA AGGCGATCGA CAACAAGGCG TTGCTCGATA CCTTTGCGCA GTTCGGGACG ATCACGAGCG CCAAGGTGGC GATGGATGGC CAAGGGAACT CGAAGGGCTA CGGCTTCGTG CAATTCGAGA CCCAAGAGGC GGCGCAGGCG GCGATTGACA ACGTCAACGG TATGGAACTG AACGACAAGC AAGTGTACGT CGGTCCGTTC CAACGCCGTG CCGAGCGCTC GAACACCGGC GAAGCCAAGT TTAACAACGT CTATGTTAAG AACTTGAGCG AAAACTTGAG CGACGAAAAG TTGCGTGAAA AGTTTGCCGA GCACGGCGCG GTGACGAGCT GCGTCATCAT GCGCGACGAA GAAGGCAAGT CTAAGGGCTT CGGCTTCGTG TGCTACGAAG AACCTGAAGG CGCGGCGGCC GCGGTCGAAA AGCTCGACGG GTACACCGAG GATGAAAAGA CTTGGGTTGT CTGCCGAGCG CAAAAGAAGG CTGAGCGCGA AGCCGAATTG AAGGCCAAGT TCGACCAAGA ACGCCGTGAA CGCATGGAAA AGATGGCGGG CGCCAACCTC TACATCAAAA ATCTTGAGGA CGGCACCGAC GATGAAAAGC TTCGCGAATT GTTCAAGGAG TTTGGCACCA TCACCTCTTG CCGCGTCATG CGTGACGCCT CGGGCGTTTC TCGCGGTTCC GCGTTCGTCG CCTTCTCTTC TCCCGACGAA GCCACCCGCG CGGTGACGGA GATGAACGGT AAGATGGTCG GTGCCAAGCC GCTCTATGTC GCTCTCGCGC AACGCAAGGA AGAACGCCGC ATGCGTCTCC AAGCGCAATT CGCGCAGCGC ATGCCGGGCG CCGGTATGCC GGGTGGCATG GCTCCGTACA TGCCGCCGCC GGGCGTGCCA GGTGCTCCTA TGTACTACGG CCAACCGCCT CCGGGTATGA TGCCGCCGCA ACCGCAACCA GGCTTCGGTT TCCAACCGGT GATGCCGGGC GGACCGCGAC CGGGCATGCC GGGCATGCCG GGTTACGGCA TGCCGATGCC GCAACGCCAA GGCGTTCCGG GCGCCCAGCG TGGCCGTGGT GGCCGAGGTG GCCCGGCCGG TGGCCGTGGC CAGCGTCAAA ACATGCGATA CAACGCCGCC GCGGCTATGC CGATGCCGCC GATGCCGGCG GAGGCGGCTA ACCCGATGGC TATTCTCGCT TCTCAGCTCT CCGCCGCCGC TCCGGATCAA CAGCGCATGA TTCTTGGTGA AGCCCTCTAC CCGCTCATTG AGAGCAAGGA CGCCGCCAAC GCCGCAAAGA TCACGGGTAT GTTGCTCGAG ATGGACCAGT CGGAGGTCCT TCACTTGATC GAGTCTCCGG ACGCGCTCAC CTCCAAGGTT CAAGAAGCCC TCGCTGTGCT GAAGGCTGCC GCCGAGGAAG GTGCGTAATC GCTCCGTCGA TGAAGTATGC ACTTCAAACT CAGTGAGTGA GGATACGTTT GGTATGAGAA TTCTTTGGCG AGTCTTCTCG TGCGCGCGCA CGAGAAGAGG CACAACATTT TTAGCGAGCT GCGGGCCGCG CGAGCGCGAG CGAGCTGTCG CGATATTTCA TCGCGACGTC TCGTCGACGC TTTGACGCGG CCCGCGGCAA CGATACGATA GCACCAATAG GCGGGCGCCA TTGGGGGGTG CGAGACGGTA TTTCTTTTGT CTATGCGCTC AACCGCGACC ATGGTGTCCA CAGCCGCCGC GAGAATCACG AATGCTCTCG CCGCGAGCCG ACTGCGGACC AGAAACTAGC GTGATGGACT CCATCACGTG CATGAAAAAA ACGAAAAAAA AAGGGCGACG CTGTTGCGCT TTTGAGCCAA GAACAGGTCG CAGTATCAAG ACTTTAAGAC GCATACGTTG CTGTCTGCAC TTTTGACTCT CGAAATCCAC CAATACGAGC AGCAAATTTT CAAAATATCG ATACAAAAAA TACAACTCTC TA
|
Protein sequence | MSAATATNVT NDAKASNATA NADGTTPAAQ QPGAGTSSLY VGDLETSVTE AQLYEKFSSI GPVVSIRVCR DLITRRSLGY AYVNFQSPND AAHAIDVLNF QVINGKPIRV LYSQRDPAVR RSGVGNIFIK NLDKAIDNKA LLDTFAQFGT ITSAKVAMDG QGNSKGYGFV QFETQEAAQA AIDNVNGMEL NDKQVYVGPF QRRAERSNTG EAKFNNVYVK NLSENLSDEK LREKFAEHGA VTSCVIMRDE EGKSKGFGFV CYEEPEGAAA AVEKLDGYTE DEKTWVVCRA QKKAEREAEL KAKFDQERRE RMEKMAGANL YIKNLEDGTD DEKLRELFKE FGTITSCRVM RDASGVSRGS AFVAFSSPDE ATRAVTEMNG KMVGAKPLYV ALAQRKEERR MRLQAQFAQR MPGAGMPGGM APYMPPPGVP GAPMYYGQPP PAGHAGQGVP GAQRGRGGRG GPAGGRGQRQ NMRYNAAAAM PMPPMPAEAA NPMAILASQL SAAAPDQQRM ILGEALYPLI ESKDAANAAK ITGMLLEMDQ SEVLHLIESP DALTSKVQEA LAVLKAAAEE GA
|
| |