Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_38656 |
Symbol | |
ID | 5002148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 842502 |
End bp | 843530 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417569 |
Product | predicted protein |
Protein accession | XP_001418126 |
Protein GI | 145347330 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1097] RNA-binding protein Rrp4 and related proteins (contain S1 domain and KH domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 84 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGTGA GTTTTCGAGA CGCCGACGCG TCGACGACGC GCGCGTTCAA GCGCGCGAAG CGCGCGCTGG AGCAATGCGC GACGGCGAGC TCGTCCGCGC GCGCGTTCGT GAGCCCGGGC GAACGCATCC CAGGGATCGA TTCGTCTGAA GGTTTCTTGC GAGGACACGG CACGCGACCG ATCGCAGAGG ACAATGGAAC CGACGGTGAC GGCGACGATG ATCGTGGGTT AGTGGCCACG ACCGCGGGCG TGGTTGAACG CGTGAATAAA CTCGTCTCAG TGCGAGCGTT GAAAGCGCGC TATGCGCCAG AAACGGGTGA CGTCGTGCTT GGGCGCGTGA AAGAGATATC TGGTAAGCGT TGGATTTTAG ACGTGAACGC GCGACAGAAT GGGGTATTGC AGCTCAGTGC GGTGCATTTG CCCGGGAACG TGCAGAGACG ACGGAACGAT GTGGATGAGT TGAACATGCG CATGCTGTAC GCCGAAGACG ACGTCGTGAG CGCCGAAGTG CAGAGCGTGT ACGCCGATGG CGCCGCCGCG CTGCACACGC GAAGCTTGAA GTACGGTTGC TTGAAGAATG GTCAGCTCGT GCGAGTGACT GCGAATTTAG TGCGCCGATT GCCTCAGCAT TTTCACAGGC TTAAGATGGA CGAGTTTCAC GACGGCGTCG CCGCTGAAGC GAACGACGTG GAAATCTTAC TCGGGTGCAA CGGTTTCATT TGGGTTGGTG CGCCGAGCGG CGCCACCGCA CCGCGCGAGT CGGAGATTCG CCGCGAGCCG AGTGATGTCG TGGACGAGCT GCGCGAGCTT CACGGCGATG AAGTGTCTCC GGTTCAACGC GAAAATATAT CCCGCGTGGC AAATTCAGTG CGCGCGCTCG CCGAGCTGTT CCTTCCCATC TCGCCGCCGG CGGTCATGGA TGTTTTTAAA GCGTCGAGCG AGTGTGGGGT GGCGGTGAGG GATATGCTGA GCCAAGGATT CTTAACTCGT ATCCTCGAGC GAGAATTCGA AAAGCGCGTC GCCGACTGA
|
Protein sequence | MVVSFRDADA STTRAFKRAK RALEQCATAS SSARAFVSPG ERIPGIDSSE GFLRGHGTRP IAEDNGTDGD GDDDRGLVAT TAGVVERVNK LVSVRALKAR YAPETGDVVL GRVKEISGKR WILDVNARQN GVLQLSAVHL PGNVQRRRND VDELNMRMLY AEDDVVSAEV QSVYADGAAA LHTRSLKYGC LKNGQLVRVT ANLVRRLPQH FHRLKMDEFH DGVAAEANDV EILLGCNGFI WVGAPSGATA PRESEIRREP SDVVDELREL HGDEVSPVQR ENISRVANSV RALAELFLPI SPPAVMDVFK ASSECGVAVR DMLSQGFLTR ILEREFEKRV AD
|
| |