Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50795 |
Symbol | |
ID | 5004280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 52705 |
End bp | 59716 |
Gene Length | 7012 bp |
Protein Length | 2320 aa |
Translation table | |
GC content | 52% |
IMG OID | 640419701 |
Product | predicted protein |
Protein accession | XP_001420401 |
Protein GI | 145352111 |
COG category | [A] RNA processing and modification |
COG ID | [COG5178] U5 snRNP spliceosome subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0224006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCG GTGTGTCGGA GGAAGTGATG GAGAAGGCGC GGAAGTGGCG AGCGCTGAAC GCGAAGCGGT ACGGCGCGCG GCGCAAGTTT GGGTACCAGG AACCGCCGAA GGAGGAGATG CCGCCGGAGC ACGTGCGGAA GATTATCAAG GATCACGGCG ACATGTCGAG CCGAAAGTTT CGACACGATA AGAGGGTGTA CTTGGGAGCG TTGAAATTCG TGCCGCACGC GGTGTATAAA CTGTTGGAGA ACATGCCGAT GCCGTGGGAA CAGGTGCGGC ACTGCGAGGT GATTTACCAC ATCACGGGGG CGATTACGTT TGTGAACGAG ACGCCGAGGG TGATCGAGCC GGTGTTCATC GCGCAGTGGG GGACGATGTG GATCATGATG CGACGGGAGA AGAGGGATAG GAAACATTTT AAGCGCATGC GGTTTCCACC GTTTGACGAC GAAGAACCGC CGTTGGATTA CGCCGACAAC TTGCTCGACG TGGACCCGCT GGAGGCGATC GCGCTGGAGT TAGACGAAGA GGAAGACGGC GCGGTGGCGG AGTGGTTCTA CGATCATAAC CCGCTGAAGT GGACGAAATT CGTCAACGGA CCGAGTTACC GCAAGTGGCA GTTGCCGCTA CCGGTGATGG CGAACCTGCA CAGACTGTCG TCGCAGCTCC TGAGCGACTT GACGGATAAA AATTACTTTT ACCTCTTTGA TCACAAATCT TTCTTCACCG CCAAGGCTTT GGGCATGTGC ATCCCGGGTG GACCAAAGTT TGAGCCGTTA TTTCGAGACA TGGATCGCGC GGACGAGGAT TGGAATGAGT TCAACGACAT CAACAAGCTC ATCATTCGCA GCGCGCTGCG CACCGAGTAC AAGGTGGCGT TTCCGTACTT GTACAACAAC CGGCCGCGCA AAGTGTCGCT CGCGGTGTAT CACACGCCCA TGGTGATGTT CATCAAGACG GAAGATCCCG ATTTACCGGC GTATTACTAC GACCCATTGA TTCATCCGAT TGCGTTTTAT AGGTCAAACA AGCAGAAGAC GACGCCGCGA GAAGAAGACG ACGGCGAGGA CGACTTTCAG TTGCCGGATG GTATCGAGCC GTTTTTAGAA GACACCCCGC TTTACACGGA CAACACTGCG GGTGGGATCG CACTTCTCTA TGCACCTCGA CCGTTCAACC TGAGGAGCGG GAAGACGCGT CGCGCGATGG ACGTGCCCTT GGTGAACAAC TGGTTTCACG AGCACTGCCC GTCGGGCTAC CCGGTCAAGG TTCGCGTGTC GTATCAAAAG CTGTTGAAAG GTTTCGTGTT GAACCAATTG CATAAGCGCC CGCCGAAGCC GATGAAGCGC CGAAACTTGT TCGTCGCGCT GAAGAAGACG AAATTCTTCC AGTGCACCGA GCTCGACTGG GTCGAAGTCG GTTTGCAAGT GTGTAGGCAA GGTTACAACA TGCTCAATCT CTTGATTCAT CGAAAGAACT TGAACTACTT GCACTTGGAC TACAACTTCA ACTTGAAGCC CGTGAAAACG CTGACGACAA AGGAGCGTAA GAAGAGTCGA TTTGGGAACG CGTTCCACCT CTGCCGCGAA GTCTTGCGTC TCACCAAGCT CGTGGTGGAT TCCAACGTGC AGTTCCGTCT CGGTAACATC GACGCGTATC AACTCGCCGA TGGATTGCAA TACGTGTTCT CGCACGTCGG TCAATTGACT GGGATGTACA GATACAAATA CAAGCTCATG CGTCAGATTA GAATGTGCAA GGATCTCAAG CACCTGATTT ACTATCGTTT CAATACGGGT CCGGTGGGTA AAGGGCCGGG CGTGGGCTTC TGGGCCCCCA TGTGGCGCGT GTGGCTCTTC TTCTTGCGCG GCATTGTGCC TTTACTCGAA CGATGGCTCG GCAACTTGCT CGCGCGTCAA TTTGAAGGTC GTAACAACAA GGGCATCGCC AAGACTGTGA CGAAGCAACG CATCGAGTCG CACTTTGATC TTGAATTACG CGCAGCCGTG ATGCACGACA TCTTAGATTC CATGCCAGAG GGCGTCAAGC AAAACAAAGC TCGCACGATT CTCGCGCATC TTTCAGAGGC GTGGAGATGC TTCAAGGCGA ACATCCCGTG GAAGGTTCCG GGCATGCCGA CACCTATAGA AAACATGATC TTGAGATACG TCAAGAGCAA AGCTGATTGG TGGACCAACG TCGCGCACTA CAACCGCGAG CGCATTCGCC GCGGCGCGAC AGTGGACAAG ACGGTGTGCC GCAAGAATCT CGGCCGTTTG ACGCGTTTAT ACTTGAAAGC GGAACAAGAG CGACAGCACA ACTACCTGAA AGACGGTCCT TACGTCACTC CAGAAGAAGC AGTCGCCATT TACACCACCA CGGTGCACTG GCTTGAATCT CGCAGATTCT CGCCGATTCC TTTCCCGCCG TTGTCGTACA AGCACGATCG CAAGCTGCTC ATCTTGGCGC TCGAGCGTTT GAAAGAGAAC TACTCAGTGA ACGCTCGCTT GAATCAAAAT CAGCGTGAAG AGCTCGGCTT AATTGAGCAA GCCTTCGACA ACCCGCACGA GGCGCTCAGC CGAATCAAGC GTCACTTGCT AACTCAACGC GCGTTCAAGG AGGTGAACAT CGAGTTCATG GATTTGTACT CTCACTTGAT ACCGGTGTAC GAAATCGAAC CTTTGGAAAA GATTTCTGAC GCGTATCTCG ATCAGTACAT CTGGTACGAA GCGGACAAAC GCCAACTGTT CCCCAACTGG ATCAAGCCTT CAGACACCGA GCCGGCGCCG CTCTTGGTGT ACAAGTGGTG CCAAGGCATC AACAACTTGA CGGATGTTTG GGACACGAAC GAGGGCGAAT GTGTGGTAAT GTTGCAAACG CGCTTCGAGA AAATGTTTGA AAAGGTTGAC CTCACCCTGC TCAACCGTTT GATGCGTTTG ATTGTCGATC ACAACATCGC TGATTATTGC ACGGCAAAGA ACAACGTGGT GATTTCATAC AAGGACATGA GTCACACGAA CAGCTACGGT ATGATCCGTG GCTTGCAATT CGCATCGTTC ATGACGCAGT ACTACGGTTT AGTTTTAGAT TTGTTGCTGC TCGGCTTAAC TCGCGCCTCA GAGATTGCTG GTCCGGCGAA CATGCCGAAT GAATTCATTT CATATCGAGA CGTGGAGACA GAGACGCGAC ACCCGATTCG TTTGTACTCT CGTTACATCG ATCGAGTGCA CGTTCTGTTC AGATTCACCG CTGATGAGTC AAAAGACTTG ATTCAGCGTT ATCTCACCGA GCATCCCGAT CCAAACAACG AAAACATGGT TGGTTACAAC AACAAGAAGT GCTGGCCTCG CGATGCGCGC ATGCGCTTGA TGAAGCACGA CGTCAACTTG GGTCGCGCAG TATTCTGGGA TATCAAAAAC CGTCTTCCGC GCTCGTTAAC GACGTTGGAA TGGGATAATG GTTTCGTCAG CGTGTACTCT CGCGACAACC CCAACTTGCT CTTTGCCATG AGCGGCTTTG AAGTGCGTAT TTTGCCAAAG ATTCGCATGG CGACTGAGTA CTTTGCCAAC AAGGACGGCG TCTGGAACTT GCACAACGAG CAAACCAAGG AGCGAACGGC GCAGGCATTC TTGCGTGTCG ATGATGAAGC GCTTAAGTCA TTCGAAAACC GTATCCGGCA AGTGCTCATG TCATCCGGTG CGACGACGTT TTCCAAGATT GTCAACAAGT GGAACACGGC ACTCATCGGC TTGATGACGT ACTACCGAGA GGCAACAATT CACACCTCGG AGTTGCTTGA TTTGTTGGTG AAGTGTGAGA ATAAGATTCA GACTCGCGTG AAAATTGGTT TGAATTCGAA GATGCCGAGC CGTTTCCCGC CGTGCGTGTT CGCCGCACCG AAAGAAATCG GTGGTTTGGG TATGCTCTCG ATGGGTCACA TTCTCATCCC CCAGAGCGAC TTGCGGTACA GCGTGCAGAC CGACTCTGGC ATCACGCACT TCCGCTCGGG AATGACGCAC GAAGAAGATC AACTCATTCC GAACCTGTTC CGCTACATCC AACCTTGGGA GGCTGAATTT AACGATTCCC AACGCGTGTG GGCTGAGTAC GCGTTGAAGC GTCAAGAAGC GCAGGCGCAG AATCGTCGTC TCACATTGGA GGACTTGGAA GACTCGTGGG ATCGCGGTAT TCCGCGCATC AACACCTTGT TCCAAAAGGA TCGACACACG CTCGCGTACG ATAAGGGTTG GCGCGTGCGA TTGACTTTCA AAGAGTACAA TCTCACCAGA CAAAATCCGT TCTGGTGGAC GCATCAGCGA CACGACGGCA AACTCTGGAA CTTGAACAAC TACCGCACCG ACGTCATTCA GGCGCTCGGT GGCGTCGAGG GTATCCTCGA GCACACGCTC TTCAAGGGAA CGTACTTTCC CACTTGGGAA GGTTTGTTCT GGGAGAAAGC AAGTGGATTC GAGCAATCGA TGCAGTACAA GAAATTGACG AACGCGCAGC GCAGTGGCTT AAACCAAATT CCGAATCGTC GATTCACGCT CTGGTGGTCG CCTACGATCA ACCGCGCTAA TGTCTACGTC GGATTCCAAG TCCAACTTGA CTTGACAGGC ATCTTCATGC ACGGTAAAAT TCCTACGCTG AAGATTTCGT TGATTCAAAT CTTCCGAGCG CACTTGTGGC AAAAGATCCA CGAGTCCGTC GTCATGGATA TGTGCAACGT TTTCGACCAA GAACTCGACG CGCTAGAGAT TGAAACGGTT CAAAAGGAAA CGATTCACCC GAGAAAGTCG TACAAGATGA ACAGCTCTTG CGCCGACATT TTGCTCTTCG CCGCGTACAA GTGGTCGATT TGCAAGCCGT CGCTCATGGG CGAGACAAAC GATTCTTTCG ATCAAAAGAG TTCGAACAAG TTCTGGGTCG ACGTGCAGCT TCGCTGGGGT GACTTTGATT CTCATGACAT TGAGCGTTAC ACGCGCGCAA AGTTTTTGGA TTACACCACG GACAACATGT CCATTTACCC GTCGCCGACT GGGGTGATGA TAGGCATCGA CTTGGCGTAC AACTTGCACT CAGCCTACGG TAACTGGTTC CCTGGGTGCA AACCTTTGGT GCAACAAGCG ATGGCGAAGA TCATGAAAGC TAACCCGGCG CTTTACGTAT TGCGCGAGCG CATTCGTAAG GGTTTGCAGT TGTACTCCTC TGAGCCCACC GAACCGTATC TCAACTCTCA AAACTACGGC GAGTTGTTCA GCAACCAAAC AATTTGGTTC GTGGACGATA CGAATGTGTA CCGCGTGACG ATTCACAAAA CGTTCGAGGG TAACTTGGTG ACAAAACCAA TCAACGGTGC TATTTTCATC TTCAACCCGC GCACTGGTCA ACTCTTTTTG AAGATCATTC ACACGAGCGT GTGGGCTGGG CAGAAGCGTC TCGCGCAGTT GGCCAAGTGG AAGACAGCAG AAGAAGTCGC GGCCTTGATT CGCTCTCTCC CGATCGAAGA GCAACCCAAA CAAATCATCG TGACGAGAAA GGGTATGCTC GACCCGTTGG AGACGCACAT GCTCGATTAC CCGAACATCG TCATCAAGGG AAGCGAGTTG CAACTGCCGT TCCAAGCGTG CATGAAGATC GAAAAGTTTG GCGATCTCAT CTTGAAAGCC ACTGAGCCGC AAATGGTGTT GTTCAACATC TACGACGATT GGTTGAAGAC AATTTCGAGT TACACAGCCT TTAGTCGTTT GATTTTGATT TTACGAGCGT TGCACGTCAA CAACGAAAAG GCAAAGATGA TGTTGCGTCC TGACAAGAGT GTCGTGACGC TACCGCATCA CGTGTGGCCA GATCTCACCG ATGAGCAATG GATCAAGGTT GAAATCGCTT TGAAAGACTT GATCTTGGCG GATTATTCTG CGAAGAACAA CGTCAACGTC TCGGCGTTGA CGCAAAGCGA GGTTCGCGAC ATCATTCTTG GCGCTGAAAT CACGCCGCCG TCTGTGCAGC GACAAGAGAT TGCCGAAATC GAAAAACGTG GGCAAGATGC CAACCAGCAG ATTGCCGTCA CGACCAAGAC TACGAATGTG CACGGAGACG AGCTCATCGT CACCACTACG AGTCCGTACG AACAATCAAC GTTCGGTAGT AAGACCGATT GGCGTATTCG AGCCATCAGC GCGACAAATT TACATTTGCG CGTGAACCAC ATTTACGTCA ACAGCGACGA CTTGAAGGAG ACTGGATACA CATACATCAT GCCCAAGAAC GTGTTGAAAA AGTTCATCAC TATCGCCGAT TTGCGTACGC AGATTGCGGG TTACATGTAC GGCGTTTCGC CGCCGGACAA CCCTCAAGTG AAGGAGATCC GTTGCGTCGT GATGCCGCCT CAGTGGGGCA ACCACAGCTC GGTGAATTTA CCGTCGACGC TCCCGGAGCA CGACTACTTG AGCGATCTCG AGCCGTTGGG GTGGATTCAC ACGCAACCGA ACGAGAGCTC TCAATTGCAA CCGCAGGATT GCACGCAGCA CGCCAAGATT CTCGAGCAAA ATTCGTCCTG GGATGGAGAA AAGAGCATCA TCTTGACGTG CTCGTTCACC CCGGGTTCGT GCTCGTTGAC GGCGTACAAG ATCACGCCCG CAGGGTACGA ATGGGGCCGC GCGAACAAGG ACATGACGAG CACAAATCCG CAAGGATATG GACCCGGACA TTTCGAAAAG GTTCAAATGT TACTCTCCGA TCGTTTCTTG GGTTACTACA TGGTTCCCGA CGGCGGGTCG TGGAACTATT CTTTCCAAGG CGTCAAGCAC TCGGCGGGCA TGAAGTACGC ACTCAAGCTC GGGAATCCGC TCGAATTTTA CCACGAGCGA CACAGACCGA CGCACTTCCT CGAGTTCGCC GCGCTCGAGG CGGAGAAACC GGAAGAAACG GCACCGATGG ATCGGGAAGA CGTCTTTTCT TGATCGAGAC TTACAAGACC GTCCGACGAC GAAAGATTTG TAAAACAATA AT
|
Protein sequence | MDVGVSEEVM EKARKWRALN AKRYGARRKF GYQEPPKEEM PPEHVRKIIK DHGDMSSRKF RHDKRVYLGA LKFVPHAVYK LLENMPMPWE QVRHCEVIYH ITGAITFVNE TPRVIEPVFI AQWGTMWIMM RREKRDRKHF KRMRFPPFDD EEPPLDYADN LLDVDPLEAI ALELDEEEDG AVAEWFYDHN PLKWTKFVNG PSYRKWQLPL PVMANLHRLS SQLLSDLTDK NYFYLFDHKS FFTAKALGMC IPGGPKFEPL FRDMDRADED WNEFNDINKL IIRSALRTEY KVAFPYLYNN RPRKVSLAVY HTPMVMFIKT EDPDLPAYYY DPLIHPIAFY RSNKQKTTPR EEDDGEDDFQ LPDGIEPFLE DTPLYTDNTA GGIALLYAPR PFNLRSGKTR RAMDVPLVNN WFHEHCPSGY PVKVRVSYQK LLKGFVLNQL HKRPPKPMKR RNLFVALKKT KFFQCTELDW VEVGLQVCRQ GYNMLNLLIH RKNLNYLHLD YNFNLKPVKT LTTKERKKSR FGNAFHLCRE VLRLTKLVVD SNVQFRLGNI DAYQLADGLQ YVFSHVGQLT GMYRYKYKLM RQIRMCKDLK HLIYYRFNTG PVGKGPGVGF WAPMWRVWLF FLRGIVPLLE RWLGNLLARQ FEGRNNKGIA KTVTKQRIES HFDLELRAAV MHDILDSMPE GVKQNKARTI LAHLSEAWRC FKANIPWKVP GMPTPIENMI LRYVKSKADW WTNVAHYNRE RIRRGATVDK TVCRKNLGRL TRLYLKAEQE RQHNYLKDGP YVTPEEAVAI YTTTVHWLES RRFSPIPFPP LSYKHDRKLL ILALERLKEN YSVNARLNQN QREELGLIEQ AFDNPHEALS RIKRHLLTQR AFKEVNIEFM DLYSHLIPVY EIEPLEKISD AYLDQYIWYE ADKRQLFPNW IKPSDTEPAP LLVYKWCQGI NNLTDVWDTN EGECVVMLQT RFEKMFEKVD LTLLNRLMRL IVDHNIADYC TAKNNVVISY KDMSHTNSYG MIRGLQFASF MTQYYGLVLD LLLLGLTRAS EIAGPANMPN EFISYRDVET ETRHPIRLYS RYIDRVHVLF RFTADESKDL IQRYLTEHPD PNNENMVGYN NKKCWPRDAR MRLMKHDVNL GRAVFWDIKN RLPRSLTTLE WDNGFVSVYS RDNPNLLFAM SGFEVRILPK IRMATEYFAN KDGVWNLHNE QTKERTAQAF LRVDDEALKS FENRIRQVLM SSGATTFSKI VNKWNTALIG LMTYYREATI HTSELLDLLV KCENKIQTRV KIGLNSKMPS RFPPCVFAAP KEIGGLGMLS MGHILIPQSD LRYSVQTDSG ITHFRSGMTH EEDQLIPNLF RYIQPWEAEF NDSQRVWAEY ALKRQEAQAQ NRRLTLEDLE DSWDRGIPRI NTLFQKDRHT LAYDKGWRVR LTFKEYNLTR QNPFWWTHQR HDGKLWNLNN YRTDVIQALG GVEGILEHTL FKGTYFPTWE GLFWEKASGF EQSMQYKKLT NAQRSGLNQI PNRRFTLWWS PTINRANVYV GFQVQLDLTG IFMHGKIPTL KISLIQIFRA HLWQKIHESV VMDMCNVFDQ ELDALEIETV QKETIHPRKS YKMNSSCADI LLFAAYKWSI CKPSLMGETN DSFDQKSSNK FWVDVQLRWG DFDSHDIERY TRAKFLDYTT DNMSIYPSPT GVMIGIDLAY NLHSAYGNWF PGCKPLVQQA MAKIMKANPA LYVLRERIRK GLQLYSSEPT EPYLNSQNYG ELFSNQTIWF VDDTNVYRVT IHKTFEGNLV TKPINGAIFI FNPRTGQLFL KIIHTSVWAG QKRLAQLAKW KTAEEVAALI RSLPIEEQPK QIIVTRKGML DPLETHMLDY PNIVIKGSEL QLPFQACMKI EKFGDLILKA TEPQMVLFNI YDDWLKTISS YTAFSRLILI LRALHVNNEK AKMMLRPDKS VVTLPHHVWP DLTDEQWIKV EIALKDLILA DYSAKNNVNV SALTQSEVRD IILGAEITPP SVQRQEIAEI EKRGQDANQQ IAVTTKTTNV HGDELIVTTT SPYEQSTFGS KTDWRIRAIS ATNLHLRVNH IYVNSDDLKE TGYTYIMPKN VLKKFITIAD LRTQIAGYMY GVSPPDNPQV KEIRCVVMPP QWGNHSSVNL PSTLPEHDYL SDLEPLGWIH TQPNESSQLQ PQDCTQHAKI LEQNSSWDGE KSIILTCSFT PGSCSLTAYK ITPAGYEWGR ANKDMTSTNP QGYGPGHFEK VQMLLSDRFL GYYMVPDGGS WNYSFQGVKH SAGMKYALKL GNPLEFYHER HRPTHFLEFA ALEAEKPEET APMDREDVFS
|
| |