Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27410 |
Symbol | |
ID | 5005356 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 217067 |
End bp | 219912 |
Gene Length | 2846 bp |
Protein Length | 794 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420777 |
Product | predicted protein |
Protein accession | XP_001421236 |
Protein GI | 145353899 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.294058 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGCGCGAC GCCGCCGCGC GACGCCGCAC ACGCGCGCGC GCCGCGCGCG GACCTCGCGC GCGACGTTCC TCGCGCGCGC GGCGTCCCGA AGCGCGCGCG CGATCGCGCG ACGCGACGCG ACGGCGACCG AACGCGATCG GTTCGCGCTT TGAATCGTCG CGACGACGAC GATGTCGACG TCGCCGACGC GACGATCCTC GCGACGAGGC GCGCCGACGC CCGACGGCGA GACGATGGAC GATGGAGGCG GAATCGCGCG CGACGCGCGC GCGGCGGCGT CGCGGGCGAA CGTGAAGGTG CGATTCGATT TCGGAAACCT TTTGAGAATT TGAATTTCGG ACGGCGACCG ACGCCGAGCG GAGGCGCGCG CGGATCGAGG ATCGAGCGAG GCGCGCGATC GGCGCGCGGC GGAGACGGCG GTGGACGACG GGCGCGAAAG CGCGGGCGCG ATGGGGCGGC GCGCGGGAGG GATCGGACGG GCGGGGAAGA CTGACGGCGA CGCGCGCTCG AAATCGACAG GTCGAGACGG AGGAGGGGAA GGAGGACGCG CTGGGACGAG GACGGCGGGT GCGAACGGCG AAGCTCGAGA CGGGGGACGA TGGAAGGAAC GGGAGCTCGA CGCGCGCGGG GTCGTCGACG CCGAAACGGG GGAGTTCGGT GGAGAACGAG CCGGCGTCGA AGAGCGACGG GACGACGACG GTTTCGGCGT TGAGCGTGCT CGGCGTGCAG GCGCCGCGAG TCGTGGGTAA GCGAACGCCC AGGAAGCCGT CGAGCGCGTA CGTTAACTTT GAAAATGCGG CGGAGGTTCT AGTTTCGCTG TCGCCGCAGA TCAACCGCGA AGCGGTTCGC GCGGCGCAGC AAAATAGCCC GAGTAGGAGC TACGCGACGA AGAGTCAAAT CGCGGACTTG GCGTTGGGTA ACGTGACGAC GCCCAGACGA AAGCGCGCGC GGGCGTTGTT CCCGGATGAA GTGTTCACGA CGAGCACGGC GAACGGTTCA TCGCCCGCTA GAAGGAAGAA GCCGATGTCG GCGGCGGATG AAGACACGCT CTACGGCGCT CTAGACGGAT TGATGACGCT CGCAGAGAAC GTCGCGGCTC CATCGCCGCA ACAGACGGAT AGCATGAAGA AGAAACGTAA GCAAGGTAGT GGACGACCGC GCGGCTCGTA CGGGTCACCT TCACGCCACC TGCCGCCGTT ACCGAAGTAT GCGCTCAAGC CGACGCCGAT GCCGGTCGGG AAGAAGATGT TGAAGATAAA CAGGCCTCGG CGCCACGTTT CGCGCGAGAT CGAATCTGTT GGTCTCGGCG CGGCGTCCTC ACTCTTTCAC GATCATCGTC TGAGCGAAAC GGCTGAGGCG ACGACGACGA CGCAAGGAGA CAACCGGGCG CGTTTGCTCA TCGGTGCGCA CGATCCGTTG ACGAGACGCT GGGCAAACGC CAACTTCTTC ACCGCTGGTA CGGACAAGGG CTGGTTTGAA GACAGTGGCT TCTCACGATG GTTGCAGCAC ATTGGTAAAG GCGATATGCG CGTCGCTACG CGCGAAGAAT GGCAAAAAGT CCGTCGAAAA TTACCAAAGA CGAGACGCCT TTCGCTCAAA TTTTTGAAGG ACGAGCGTGT GGACTTGGAG TATTTCCGAC ACGCCGCGCG CGAGATGACA AACTTGAAAT TGCACGGTAC GGTTCTAACG GATGAGCTTA AGGCCTTAAT GCTCAAGTGG ACGGGTGGCG TTCCGGTCCC GTCGCCGCTC GAAGTCGGGC AAACGGTGTT GGCGGTGCAT CCCCGATTCC ATTCGCCGTA CATAGGAAAC ATATTGATTG TCGAGCGCGC CACCTGTCGC GTACAATTTG CACGTCCAGA ACTCGGTGTC GAGCTCGTCC GAGACATCGA TATCATGCCA GTAGACGCCA ACCAAGAAGA GATGGATCTG ATTGCTGCGG GGACCGCGGA ACAAATCGAA AACGAAGCCT TCAACGCTGG TTTCCGAGGA ATTTACGATC CGCTGCCGGT GGTCGGTGGC GGTGGCCACG CCTACGGCGC CGGCGTCGCA GTTGCCGCAC AAATGCGTGA GATGGACGTC CGTTTGCTCA ACGAAGCTCA ACAAGCTCTC GAGCGCAAGC GCGAGCTCGT CGATGCGTTG CGGATGAAGA ATGACGCCGC GGAGGAGTTC AAAAAGAAAC CAGAACGCGT CAAGCAAGCG GCGGAGACGA ACGGCGAAGC GCTTAAATTC CAACGAGAAT ACGCCGCCAT CGTTCTCGCC TTGCGCGGCG CCAACGCGGA GTTGGAGGGT GCTTTAGTTC GCTTGCGGCA GCAACAGGGA TATCACGACA AACCCCTCGG GTTGTGGCGG AAAATCAAGA GTCAGAATTT GGGCGACGGT CATCGCGCTC TCTCTCGATT CGCCCCACCA TCCGCGACGC CGAGCTCGGA CGAACTCTAC GCTTCGACGG CGGAAGACAT CGTCGCCACC GCCGGCATGG GCGCTCGTCG CATCGTTTAC CACGTGCAAA CCACCACCAA CGCCGGCGCC GAGAGCACGG CGACGCTCAA TCCGAGTCTC CAGGAACCTC TCGGAACCGG CGCCACTAAG TCCGATCTCC CTCCCACCGC CGCCGACGTC GACGCCGAAA AGCTCAAAGT CACCCAGCTC GTCACCGCCA TCGTCCAGGC CGCGCTCACC ATCAAAGCCT GCGCCGACCG CGGCGCCAGC GCGAGCGTCC TCGACGCCTG CCTCCGCCGC GTTTCCGATT CCCTTCGCCC CGTCGCCGTG AGCAATCGCG CCGCTTTCGA CGCCGTCGAG CGCGCTCTCA AAGACCTTCG CGACGTCTTT CTCATCCGTT GACGCT
|
Protein sequence | MDDGGGIARD ARAAASRANV KVETEEGKED ALGRGRRVRT AKLETGDDGR NGSSTRAGSS TPKRGSSVEN EPASKSDGTT TVSALSVLGV QAPRVVGKRT PRKPSSAYVN FENAAEVLVS LSPQINREAV RAAQQNSPSR SYATKSQIAD LALGNVTTPR RKRARALFPD EVFTTSTANG SSPARRKKPM SAADEDTLYG ALDGLMTLAE NVAAPSPQQT DSMKKKRKQG SGRPRGSYGS PSRHLPPLPK YALKPTPMPV GKKMLKINRP RRHVSREIES VGLGAASSLF HDHRLSETAE ATTTTQGDNR ARLLIGAHDP LTRRWANANF FTAGTDKGWF EDSGFSRWLQ HIGKGDMRVA TREEWQKVRR KLPKTRRLSL KFLKDERVDL EYFRHAAREM TNLKLHGTVL TDELKALMLK WTGGVPVPSP LEVGQTVLAV HPRFHSPYIG NILIVERATC RVQFARPELG VELVRDIDIM PVDANQEEMD LIAAGTAEQI ENEAFNAGFR GIYDPLPVVG GGGHAYGAGV AVAAQMREMD VRLLNEAQQA LERKRELVDA LRMKNDAAEE FKKKPERVKQ AAETNGEALK FQREYAAIVL ALRGANAELE GALVRLRQQQ GYHDKPLGLW RKIKSQNLGD GHRALSRFAP PSATPSSDEL YASTAEDIVA TAGMGARRIV YHVQTTTNAG AESTATLNPS LQEPLGTGAT KSDLPPTAAD VDAEKLKVTQ LVTAIVQAAL TIKACADRGA SASVLDACLR RVSDSLRPVA VSNRAAFDAV ERALKDLRDV FLIR
|
| |