Gene Information |
Locus tag | OSTLU_24759 |
Symbol | |
ID | 5003046 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 127162 |
End bp | 128203 |
Gene Length | 1042 bp |
Protein Length | 332 aa |
Translation table | |
GC content | 71% |
IMG OID | 640418467 |
Product | predicted protein |
Protein accession | XP_001418844 |
Protein GI | 145348825 |
COG category | [A] RNA processing and modification |
COG ID | [COG5178] U5 snRNP spliceosome subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.429004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0101629 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GATGCCTCGC GCGCGATCCA CCGCCGCGCT CGTCGCCGTC GCGCTCGTCG CGCGCGTCGC GACGACCGCC GGCGCGGCGC TCGCGTCGTC GTGCGCGCCG CCTCGATTCT CGTGCGCGAG CGCGCGCGCG ACGGTGGACG ACGCGTCGAA GTGCCACGCG TCGTTTCGGT GCCGAATCGT CTGCTCGCCG AGCGCGTCGT GCGTCGAGAC GTTCACGATA CGCGCGGGCG AGGCGACGCT GCCGACGGGA GAGACGGTGA CGTTCAACGC GTCGACGCGG GAGAGATCGG CGCTGGGACC GACGACGTAC GGAGGGGACG CGGCGTGCAG CGCGGTGGGA TGCGACGCGT ACGCGACGCG ACTGACGACG CAAAACGCGG CGCTGACGGA GAGGTACGAC GGGAAGTTTT GCGAGGTGAG GGAGACGAGC GAGCGGTGCG CGCTGGAGGG GGCGCCGAAG GATGGGTCTT GGACGGAGTT TACGGTGACG ATGAAGTGTC GGAATAAGAT TTTGCCGTGC GGCGCGCGAG CGGTGGCGGA GATTGATTTC CTCGGCGTCG TCGGGTCGGT CGCGCCGGGC GCGCCGAGCC CGCCGACGCC GCCCGACGCG CCGCCGATGC CGCCGAAACC ACCGCCGCCG CCGCCGCCGC CGTCGACGTT CGCGAAGGGC GGCGCCGGTC GGGCGGCGCT TCGCGCGGCG CTCGTCGTCG CGTTGAGCGC CGTCGTCGCG TACGCCGCGC TCGGCGGCGC GTACGTCTGG GCGCTCGAAC GCGGCTGGGT CGAGGGCATC GCCGAACCGT GCTGGGAGCG CGACGGTGCG TTTTGCGAAT TGCTGTGGTG TTGGTGCGCG CCCGCGTATT GGCGCGCGAG CGACGAGGAG TGCCTCGCCG CGTACGAAGG TCGCGCCGCC GCCGAAACCG CGACGCGCGA GGACGTCGCG CAAACCACCG CGCCTTTATT AGGCAACGCG TCGTCCGACG ACGACGACGA CGACGCGTGA CCGACGCGCT CGCTCTTGTA AAACACGAAT TGTTTCCCGC GA
|
Protein sequence | MPRARSTAAL VAVALVARVA TTAGAALASS CAPPRFSCAS ARATVDDASK CHASFRCRIV CSPSASCVET FTIRAGEATL PTGETVTFNA STRERSALGP TTYGGDAACS AVGCDAYATR LTTQNAALTE RYDGKFCEVR ETSERCALEG APKDGSWTEF TVTMKCRNKI LPCGARAVAE IDFLGVVGSV APGAPSPPTP PDAPPMPPKP PPPPPPPSTF AKGGAGRAAL RAALVVALSA VVAYAALGGA YVWALERGWV EGIAEPCWER DGAFCELLWC WCAPAYWRAS DEECLAAYEG RAAAETATRE DVAQTTAPLL GNASSDDDDD DA
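
The length and GC fields in this record can be sanity-checked with a few lines of Python. This is a minimal sketch, not the database's own method: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates (the listed Gene Length of 1042 bp equals End bp - Start bp + 1) and that GC content is the fraction of G and C bases over the whole listed sequence.

```python
# Values copied from the record above.
START_BP = 127162
END_BP = 128203
PROTEIN_LENGTH_AA = 332


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_coding_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


print(span_length(START_BP, END_BP))         # 1042
print(min_coding_length(PROTEIN_LENGTH_AA))  # 999
```

To compare against the GC content field, pass the full text of the Gene sequence field to gc_fraction; the 10-base spacing used in the record does not need to be removed first.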