Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33078 |
Symbol | |
ID | 5003242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 357782 |
End bp | 358768 |
Gene Length | 987 bp |
Protein Length | 315 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418663 |
Product | predicted protein |
Protein accession | XP_001419355 |
Protein GI | 145349881 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.638325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCCACGCTC TCGCGATGTC CTCCGGCGGC GCGCGCGCCA CGGATGATGT CGTTCCAATG AACCCATTCC AGCGCTCCGA CGCCCCGAGC GCGCCGAGCC AGAGCGCAGA ACCGTTTCCC GCGATCGGCA ATCCCGTCTT CACCCCGCGT GCCGACGATG CGGAAACGAA CGAGCCGTGG TACTCGCGCT CACCCAAGCT GTTCACCATC ATGGGCATCG ACGTGCACGT GAGTCCGCTT TTATACACCT ACATCGTGTT CACGCTGGTG CTCGCCGCGT TTTCGGCGAG TTTCTTCCTT TTCGCGCTGC AGGCGTTCGC TAGCGTGTTG CTCTTCTTCA CGGTGCTGGT GCACGAGCTC GGACATTGCG CGGCGGCGCG CGCGGTAGGA GGACAGGTGA GTCATATTTT GCTCTGGCCG CTCGGGGGAC TAGCATACGT GAACGTCGAC GCGACGGACG CCAAGGGAGA CTTTTTTGTC TCGTTCGCGG GTCCATTGAC GCACGCGCCG ATGCTCGCCG GATGGGTGGC GAGTTTTGCG TTGGCCACTG GGAGCACGGA CATCAGCGAG TGGCCTCGAG ATAATTTTGT GGGAGCACTC GCGTACGAAG GATGTTGGCT GAACATTTTC TTGTTCTTAT TCAACGTGTG CATCCCGGCG TATCCATTAG ATGGCTGCCG CATGCTGATG GCGCTCTTGG CCATGTGCTC TGTGTCTTTG ACTACGACGG CGACGACCAT CATTTGCCTC TCCACCGTGA TGAGTTTGGG AGTGATAGCT TATGGATTCT GGCTCGTTCG GGTTATGCCA GTTTTTGTGG GTGCGTTTAC GCTGGCAGAG ACGTATAAGC TGTACACGCT TCTCAAGAGC GGCGCGTTAG AGGAACATCC GAGCTTTGCG AAGTACAATG CTATGAGTGG GAGGCGCAAC ACGAACACGT CGTGGAACGT GCAGGCCGTA TAAAGTCGAC TAAACAGAAT TTGTATG
|
Protein sequence | MSSGGARATD DVVPMNPFQR SDAPSAPSQS AEPFPAIGNP VFTPRADDAE TNEPWYSRSP KLFTIMGIDV HVSPLLYTYI VFTLVLAAFS ASFFLFALQA FASVLLFFTV LVHELGHCAA ARAVGGQVSH ILLWPLGGLA YVNVDATDAK GDFFVSFAGP LTHAPMLAGW VASFALATGS TDISEWPRDN FVGALAYEGC WLNIFLFLFN VCIPAYPLDG CRMLMALLAM CSVSLTTTAT TIICLSTVMS LGVIAYGFWL VRVMPVFVGA FTLAETYKLY TLLKSGALEE HPSFAKYNAM SGRRNTNTSW NVQAV
|
| |