Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50104 |
Symbol | |
ID | 5002923 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 71692 |
End bp | 76035 |
Gene Length | 4344 bp |
Protein Length | 1386 aa |
Translation table | |
GC content | 60% |
IMG OID | 640418344 |
Product | predicted protein |
Protein accession | XP_001418827 |
Protein GI | 145348791 |
COG category | [A] RNA processing and modification |
COG ID | [COG5161] Pre-mRNA cleavage and polyadenylation specificity factor |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.544218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.543331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCACG CGGTGCACCG CGAGGTGCAC CCGCCCACGG GCGTCGACCA CGCGGTGACG GCGTACTTCA CGCGCCCCGT GGGCGACGGC GGGGACCCGA ATCTGATCGT CGCGAGCGCG AATCGCATCA CCGTGTACGC CGTCAATCGG CGCGGTGACG AAGAATCGCT CGACGTGTGC GCCGAGTTCG ACGCGCAGGG CGCGATAGGG TCGATGAGCG TCCTGAGACG ACGGTTCGGG GCGCCGAGGA ATCAGCGAGA CGCGCTGTTG ATCGCGATTC GCGAGCGAAA GCTGAGCGTG GTGGAGTACG ACGCGGCGAC CGGGGACGTG TGCTGCTCGT CGATGCACTC GTTCGAGAGT GCGCTGGGGT GTAATCCACT GGGAACGACG CTGCGGATGT CGAGAGAGGC GCCGCTGGTG GTGAGCGATC CCGAGGGTCG GTGCGCGGCG GTGGTGCTGC GGGAGGACGG CGTCGCGGGC AAGGTGCGAG TGCTGCCGAG CGTGGACGGG GGGTTGGGGC TGGTGGCGAA CGACGACGAG GGACGCGTGC GAGGACCCGC GGCGAGCGTG CGCGAGTCGT TCCCGCTGCA CCTGCCGGGG GTGCGATTGA TTCGCGACGC GTGTTTTTTG CACGGATACG GCGAGCCGGC GCTGGCGGTT TTGTACGAAA AGACGCCGAC GTGGGCTGGG CGGTATAACC TGAGCAAAGA TACGTGTGAG ATCGTGGCGC TGAGCGTGGA CGTTGACAAG CAAAAGGGGA CGGTGATTTG GCGTCGGCAG AACTTGCCGT CGAGCTCGTA CAAGTTGACG GCGCTCTTGC CGCCGCTCGG CGGCGCGTTG GTGTTTTCGC AAGACTTTTT GCTGCACGAA TCCCAAGAGA GCTCGTCCGT GCTCGGGTTG AATACTTTCG GTCACGGTGG GCCGCAAGAA GGGAACGACG CCGAAGTCGC CGCGCGCGCG GCGGGAATGG GGGAGAACGC CATGGCGAAC CCGCCGCCGG CGTGCGCGGC GCGCGCCGTC GATTGCGGAT TAGAGATCAC GTTAGATGGC GCTCAGGCGT CTGTTGTGTC GGAAGATCGC GTCTTGGTGA CAACAAAAAC CGGTGCGTTG TTGCTCTTAG CTCTTCACAC CGATGGCCGT AGTTTACGTC GCATGATGCT TCAGCGCGCC GGCGGCGCTG TGCTCTCTTC GGGCATGTGT CTGCTGTCGA GAGATTTGTT GTTCCTAGGA AGTCGCATTG GTGACTCGCT TTTGGTGAAG TTCACGCCTA AAGAAGAACC GACGGCACCT CTCATGCTTC CCGACGCCGA AGACGAGAGC GAGGATGAGG CGACGGAGAA GTCGAAGGGT AAGCGCTCTA AATCGGGCGG CGCGGCGAAT CGCAAACGCG CCAAAACCGC CGAGGCGCCG CCACCAGCGC CGTCGACGCC GAGTCCCGAA GATGACGATG ACGAGCTCGA GGCGTTGCTC TACGGCACGA CGAAAACGGA GACTGTGCAG ACCGACGCCG TGCAAACGGA GAAGAAGCGC GAAGGCTTGG CCGGCATCAT CCCAGGCTTG AAGGTTGCTG GTTACGATTT GAAGGTCAAG GACTCTTTGC TCGGCGTCGC GCCTGTGGTG GATATCGCCG TCGGCGCGAG CGCGCCCATG GGTTCGAACA AGAACGAGCG CACCGAGCTC ATCACCGCGT GCGGTCAAGG GAAGAACGGT GCGCTGGCAA TTTTGACTCG TGGCGTCCAG CCTGAACTCG TCACCGAAGT CGAATCCGGT ACGCTGCCCA ATTTACAAGG TTTGTGGACG TTGCACTATC GCAAAGAAGG TTCGAAAGAA GAACGCGAAC CTTTCCATCA TCATTTGTTA CTGAGCATGA AGTCGTCGAC GATGATCATG GAAACCGGCG AGGAGCTTCA AGAAGTGAGC GCCTCGCTCG AGTTCATCAC GAATCAAGCC ACGTTGGCGG CGTCGAATAT TTTCGGACAT TACTGCTCGG TCCAAGTCAC GGGAACGGGC ATCCGCGTGT TAAAGGGAGG CGTGAAGGTA CAAGACGTCG GTTTGCAGGA CATGGACGCA CCGAAGGGAG CCGCGATCGC GTCCGCACAG ATTTTAGATC CTTACATCAT CGTTCGGCTA TCGGATGGTT CGATAAGATT GTTGTCCGGA GACGAAAAAC AGATGAGTGT TTCGTTGATG GAAACGGGGG CGATCCCAAC ATCTTCAGTA ACGGCGTTCG CTCTGGTGGA TGATTCTGTT GAAGCCGCAG ACGCGGCGGG CGGGGGCGAA CGCAAGAGCG GATGGATTCA TCGAGCTGCG ACGAATGGCA CCATTACGGG TTTGGAAGGG AACAAGAAGA GCGGCGCGTG CAACAACAGC GAGGCCATCG TCGCGCTGAC GCGCGAGGGT GGAAGTCTGG AATTGTTTTC GTTGCCCAGC TGCACGCGCA TCTGGTGCGC CGATGGGCTG TCCGAGGGCA TGCGCGTATT GAGTCCGCAA ACACCCGTCA ACGCTGAGTC CAGTGTCCCT GAGATTGTCG ATATTCGCAT CGATTCATTC CAAGATGCGC ACGAGCGCCC TTTGCTCACC GCAGTGCGTG GCGATGGCAC GTTGCTCTTG TATAAGGGTT TCATTGTTCC CGCCGGGACG ACGTACGAAG GACAAGACGA ACCGCTCGAA AAGAATGAGT TGCGATTTTC TCGCGTCAAC GTCGACGTTG AAGGTTCTGG TTTGAATGTC GCTGGCATAG GCGCCGCAGG TCAGCTTAGA GACTCCCTAG CGGGCGCGCG ATTGACGCGC ATCGGTAACG TCGGCGAAGG ACAAGGCGTG CAAGGCATCT TTGTCGCAGG CCCAAACCCG TTGTGGCTCA TCGTTCGCAG GTCTCGCGTA TTAGCTCTCC CGACGCGTGG TGAAGGCGAG GTCGTCGCAT TCACGGTATT CCACAACGTC AACTGTCCGC ACGGTTTCAT TCTAGGCACC GCGTTGGGTG GCGTGCGCAT TTGTCAGATG CCTAGTAAAA TGCACTACGA AGCTGCGTGG CCGGTGCGCA AGGTGGCGCT CAAGTGTACG CCGCACACGA TTACGTACCT GCCAGATTTC AAGCTCTACG CACTGGTTAC ATCAGCTCCT GTGCCTTGGG TCGAACGGGA AATAGAGCAA GATAATGTCC ACGGTATCGC CTTGGCAAAA GTGCGACGCG AGCGCGCGAA AGCGAACGAT GACATGGAGT TACAATACTC GGTGCGACTC CTCGTTCCTG GATCGCTCGA TAGCGCGTGG CAACACGCGC TCGAACCGGG CGAGCACGTG CAGTGCGTTC GAAATGTCCA ATTGAGAGAC ATTAACACTG GGGCACTCCT TTCACTTCTT GCCGTCGGTA CGGCGATGCC TGGAGGAGAA GACACGCCGT GTCGCGGTCG CGTCATTTTA TTTCAAATGG TGTGGGAGCG CGACGCCGAA TCCATGGATG GGTACAGATG GAAAGGACAA GTGTGCTGCG TGCGCGAAGC GAAGATGGCG TGCACCGCGT TATCGGCGCT CGACGGTCAC CTGATTGTTG CGGTCGGTAC CAAGCTCACC GTGCACACGT GGGATGGCGT CGAATTGAAT AGTGTCGCTT TCTTCGACAC CCCAATTCAC ACCGTGAGCA TCAACGTCGT GAAGAATTTC ATCCTGGTGG GCGATTTAGA GAAGGGCTTG CACTTTTTCC GCTGGAAGGC GAACGGCTTC GAGAAGTCGA TCATTCAGCT CAGCAAGGAT TTCGATCGCA TGGACGTCGT GAGCACAGAG TTCTTGATCG ACGGCGCCAC TCTGAGTTTG CTCGGGTCCG ACATGAGCGG CAACGCGCGC ATCTTTGGCT ACGATCCAAA ATCGCTCGAG TCGTGGAAAG GACAGAAACT CCTCGTGCGT TCGGCGTACC ACGTCGGTTC GCCCATCTCT CGCATGGTGC GTTTTAACGT GGAAGGTACG ACCGCGAAAG CCGCGCCGGG AGAACGCCCC AAAGGCACCA ATCGACACGC CGTCTTCTTC GGCACGCTCG ACGGCGCTTT GGGCATCTTC ATGCCCACCG ACGAGCCGAC GTACGCCAAG CTCCACGCCC TTCAACGCGA GTTGAACACC ACGGTGCGCT CACCGATCGG TTGCAACCCG CGCACGTTTC GCACCCCCAA AGTCTTCGAA GGCAAGCATG TGCAATTACT CGCCCCCCTC GACGTCCTCG ACGGCGGTTT ACTCTCCAAA TTCGAGACGC TCACCTTCAC CGAGCAGCGC GCCGTCGCCG AGCGAAGCGG CGTCGACCGC GATTTGGCCC TCGGTTTGAT CCAGCACCTC AGCGCCAGCA ACGCGTTCGT GTGA
|
Protein sequence | MSHAVHREVH PPTGVDHAVT AYFTRPVGDG GDPNLIVASA NRITVYAVNR RGDEESLDVC AEFDAQGAIG SMSVLRRRFG APRNQRDALL IAIRERKLSV VEYDAATGDV CCSSMHSFES ALGCNPLGTT LRMSREAPLV VSDPEGRCAA VVLREDGVAG KVRVLPSVDG GLGLVANDDE GRVRGPAASV RESFPLHLPG VRLIRDACFL HGYGEPALAV LYEKTPTWAG RYNLSKDTCE IVALSVDVDK QKGTVIWRRQ NLPSSSYKLT ALLPPLGGAL VFSQDFLLHE SQESSSVLGL NTFGHGGPQE GNDAEITLDG AQASVVSEDR VLVTTKTGAL LLLALHTDGR SLRRMMLQRA GGAVLSSGMC LLSRDLLFLG SRIGDSLLVK FTPKEEPTAP LMLPDAEDES EDEATEKSKD DDDELEALLY GTTKTETVQT DAVQTEKKRE GLAGIIPGLK VAGYDLKVKD SLLGVAPVVD IAVGASAPMG SNKNERTELI TACGQGKNGA LAILTRGVQP ELVTEVESGT LPNLQGLWTL HYRKEGSKEE REPFHHHLLL SMKSSTMIME TGEELQEVSA SLEFITNQAT LAASNIFGHY CSVQVTGTGI RVLKGGVKVQ DVGLQDMDAP KGAAIASAQI LDPYIIVRLS DGSIRLLSGD EKQMSVSLME TGAIPTSSVT AFALVDDSVE AADAAGGGER KSGWIHRAAT NGTITGLEGN KKSGACNNSE AIVALTREGG SLELFSLPSC TRIWCADGLS EGMRVLSPQT PVNAESSVPE IVDIRIDSFQ DAHERPLLTA VRGDGTLLLY KGFIVPAGTT YEGQDEPLEK NELRFSRVNV DVEGSGLNVA GIGAAGQLRD SLAGARLTRI GNVGEGQGVQ GIFVAGPNPL WLIVRRSRVL ALPTRGEGEV VAFTVFHNVN CPHGFILGTA LGGVRICQMP SKMHYEAAWP VRKVALKCTP HTITYLPDFK LYALVTSAPV PWVEREIEQD NVHGIALAKV RRERAKANDD MELQYSVRLL VPGSLDSAWQ HALEPGEHVQ CVRNVQLRDI NTGALLSLLA VGTAMPGGED TPCRGRVILF QMVWERDAES MDGYRWKGQV CCVREAKMAC TALSALDGHL IVAVGTKLTV HTWDGVELNS VAFFDTPIHT VSINVVKNFI LVGDLEKGLH FFRWKANGFE KSIIQLSKDF DRMDVVSTEF LIDGATLSLL GSDMSGNARI FGYDPKSLES WKGQKLLVRS AYHVGSPISR MVRFNVEGTT AKAAPGERPK GTNRHAVFFG TLDGALGIFM PTDEPTYAKL HALQRELNTT VRSPIGCNPR TFRTPKVFEG KHVQLLAPLD VLDGGLLSKF ETLTFTEQRA VAERSGVDRD LALGLIQHLS ASNAFV
|
| |