Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_93242 |
Symbol | |
ID | 5003614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 320209 |
End bp | 321678 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419035 |
Product | predicted protein |
Protein accession | XP_001419773 |
Protein GI | 145350774 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.432514 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.15533 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGGC GCGGCGGTAC CGCCCCTGGC GCGCGCGCCA GCGCGGCGCA CGATGACGAC GAGTTTGCCT TCCCGACATC GACGTCGGCG CCGCGCGAGA GCGACGACAC GCCGTCGCCG TCTCGGCCAT TCAATTTCAA CGAGAATTTA AGCCCGAGCC CGATATTCGC CCCTGGACAC CACACGGCGA GCGCGCGAGA CGTCACGGTG ATCACCATGG GTGTGGCCGT CATCGCGAGC GCGTGCGCCT GTTGGTTGCT CCAGGTGCTG GCGTATTTGC TGAGACCGTT GGTGACGGCG ACCGGAGCGG CGCTGCTGCT GCGACCGCTA GTTGATTTCG TGAGCGACGC GGCGTATCAG CGCGAAACAT TGCGAAGGAA TCGATTGCGA TTGTGCTGTC CGCGAGTGTT CACGGTACCG AGGATCGTGG CGATCGTGCT GGCGCTAGTG GTTGTGTGCG CGGTGTTCGG GACGTTGGTG TGGGCGTTAT ATCTCGGGCG CGTGTACTTC TTGGTGCACT GGAACGACCC CGACTGGAAT TCGAGGTTCA CGAGTCGAAT TAATGACTTA GGACGAAGCA TGAGCGAGAT GGCGAAGCGC GTGTTCAAGA AAGATGACGT TGCGGAGGAG GCATGGGCGC GTTTGTTGGC GAACATCGAA GCCACGCTGA AGGATGAAGA GTTTTGGGAG ACGCTCGGCG TAGGAGTGTT TCACTACATG CAAGATATCA TGCTCATGCT GTTCTACACT TTGTTCTTGC TCGCACCGCG CCGAGAGTTA TCGAGGGCGC GTTCGCTCGT GGCGAGACGA ATTCACAAGG CGACGCGGCG GTTCGTTTCC ATCATGGTGC AAATTGCCGT CGTGCGGGCG CTTTCCGTGG GATCGTTGCT GGGACTGTGC GGTGTGCCGT GGCCGCTGGC GATCGCCGTG GCGATCGTTT CGTTCTGGCT CTTCTTCATT CCGACTTTAG GATCCATCGT GGCGTCGATC CTGCCTTTGC CAATGATCGT GCTGCTGCCG GATTTGACGG AACGCCAGCG CGTCGCAGGA TCGATCATTC CAAACTTGGT TTCGTTTTTA GTCGGAGATT TGATAGGGCC GGTGGTATAT CGCAAAGGAC TAGATTTGAA TGAAGTCACG GTCTTGCTCG CGCTTGTGTT CTGGTACTCG GTTTGGGGAG CCGCTGGGGC GGTGCTGGGT GTGCCCTTGA CCTGCGCGTT AAAGATTGTC TTGGAGGAGA TGCCTCACGA GGGCGCGCAC GCGCTCGCGG CGGTGATGGC GCCGGTGCAA AACGACACGA CCGTAAACGG AGACACCGCG TCCGCTGTTC GTGTTCCGAT GACATTGACG TTCATTGCTT GGTTGAAGCG CACGTGCGGT CGGCTGTTCA GGCCCTCGGG AACGACGCAA CCAAGACGCC GCGCCGACGA GGAAGCGCTC CTCGCCGAGT CGTCTTCGGA CAGTGAATAA
|
Protein sequence | MPRRGGTAPG ARASAAHDDD EFAFPTSTSA PRESDDTPSP SRPFNFNENL SPSPIFAPGH HTASARDVTV ITMGVAVIAS ACACWLLQVL AYLLRPLVTA TGAALLLRPL VDFVSDAAYQ RETLRRNRLR LCCPRVFTVP RIVAIVLALV VVCAVFGTLV WALYLGRVYF LVHWNDPDWN SRFTSRINDL GRSMSEMAKR VFKKDDVAEE AWARLLANIE ATLKDEEFWE TLGVGVFHYM QDIMLMLFYT LFLLAPRREL SRARSLVARR IHKATRRFVS IMVQIAVVRA LSVGSLLGLC GVPWPLAIAV AIVSFWLFFI PTLGSIVASI LPLPMIVLLP DLTERQRVAG SIIPNLVSFL VGDLIGPVVY RKGLDLNEVT VLLALVFWYS VWGAAGAVLG VPLTCALKIV LEEMPHEGAH ALAAVMAPVQ NDTTVNGDTA SAVRVPMTLT FIAWLKRTCG RLFRPSGTTQ PRRRADEEAL LAESSSDSE
|
| |