Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27506 |
Symbol | |
ID | 5005590 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 404501 |
End bp | 409211 |
Gene Length | 4711 bp |
Protein Length | 1558 aa |
Translation table | |
GC content | 47% |
IMG OID | 640421011 |
Product | predicted protein |
Protein accession | XP_001421283 |
Protein GI | 145353997 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAATT TCATTATGAA ATTCGACGTG CCTCTTGTGA GCGTCGGACT CTCATTTGAA AGATTGTTTC GCAACCAGAT GCTCTTCTTG CTTCGATTGG AAAAAACGAA GTTGAGTTGT ACGAAAACGT CGAAATTGAA TCTGTCGTTA GATTCTTCGC TGTCTCTCCT GCATAGAAAT ACGCGACTGG GATACAGTCT AGAACCCATT GTGGCGCCGT GGAAGTTTTC GGTTCTGTTG GATGACTCAA AAAATGATGT TTCGGGCGTG AATCCCGCGG GGTTGGACAT GTTCATTGAA AGTAAAAAGA AGATTGAGAT AATCGTCAAT CAAGCCTCCA TAGTTGACGC AGTGGATGCC GTGATGCAGT TTAAGAAACC ACGCGACAAA AACTCAAAAA TTGCACATTT GTTGGCTTAC GAAGCCGTGC AAAACAACGT CATTGTGAAT TCCACGGGAC GCGACTTATG GGTTCGCGAA CTGAATGGCA ATATACAAGA AGTACCGCCT GGAAATAGCA TTTCGCATTT GGTACATGCC GCTGAAACCG AAGCAAATTC TGGATTCGAG AGTGCATCAA ATGAGCAGAC GATGCGCGGT ATGACTGACG TCGTTGATGA CATGAATCAT CACGGTGACG AACAGCTCAT TATCGCATAC CTGATGATGG ATATCGTGGC AATTGACTCG TTGGATGACT GGCTTTACCC CCGCATAACT TTCAAGATTT GGGATGAGGC TTTGCAGGGT CATCGTACCG ATCCATTACC CATCGGCGTT ACGCGCGGCT CAGCAAAGTT ACCGATCCCT CATCCTTTTT TCCATCGCGA TAGTGATAGC GAGTTCATCA ACAATGACGA TATCACCATT ATCGTCGAAA TTTCTCATAT TGGGCCAAAT GGCGAAACAT TGAGCTTCAG TGGTGACTTT GAAGCTCCAA TCGAACAACA ACGCGGTGGA TGGATTGGCG GTAAAACTGG AAGCTGGATA GACTGTGCTG CAGGCAATGG TGACTCCGTG AGAATTAAGC TGCGCGCTCG AGTGGTGGAA GGTTTGAATG GCGACGCCAG TCACAGAAAC GGTAGAGTTG TATCAACTCT TAAAGAGTCC GCAGAAAACG ATTATCAAGA GTAAATTACA AGTTGAATCA CCACCTCTCA TCGTGTGTCG TACGAATGAC ATGGTGAAGC AATGGTGGAC AAGAGATCAT CAGCAAAGTA AAAGTGCCCT AAGTGCTTGG AAGCCGCACG TCCCATCGAC GTATGAACTC GAGACGAGAT ATAAGTTTTG TTCGAGTAGA GCTGAAAACG AGTATAAACT CGTACCTTTT GGGACCATCC TCTTGCCCGG CCTCATCGCG CCTCGCAGTG CGCTCATGGC AGTTGTCTTA GAACATAATT CAGATCAGCG GGCACCGACT GCTTTTCCCT CGGGGTTCGA AAATATTTGG ACGAGCGATA AAAATGATGT CTCATTTTGG AAGCCCATTG CACCAGACGG TTACGTTGCA GTTGGTAACG TGGTTGCGGC GAGCGCCGAA ATGCCTTCCA CCGATTGCGT CGTTTGTGTT CGCGAGGATT TGACAAAACT TGCAGAAGTG CCTCCGGATG TTGCATGGAA ATCGAACAGA CGATTTCGCA ATAATGATGA AAGAGAATTG GCGCTTATTC AAAACGGCCA ACTGAGTACT GGTGGATGGT CGTTCTTGAA GGAGAAAATC AAGCGCACGA GTCCAGGCGT GATTTCGAAC GTCATTCCAT CTATGCATAG ACTTAATGAC AGGATGTGCA TTCATTCTTT GGATGACGAA GGTATTCGCG AGTTTTGGAT CGATAACCGC ACTCATATAG ATCATATAAA GCAAGAATTA CAAGCTGGAG AGTCAATTCT CGCCATTTCG TTTTCTCCTT CCGGGCCGTT TGAGCAAGTG CGTCTTAATC TAGGCGCTTC ACACGTTGTT CGTCACAAGA CTCACGATTT AGTCTTTGAC AGTCAAAAGG GGCATCTTTT GAGCTATGTT TCGTGCGAAA ACGAAACAGA TTCTGATATA TTCGTTGATG TTCATCCAGC TATGAAGAGA GAACTCACGA CTTCTAATTT GGACATATCT CTGAGTGAGA AGTATGAGTC GCCGAGCGAT GTCGCGGATA TAGAAGTATT TGAATCGGAA CGATATTACC CGTTGAGAGG ATGGCGTGCT CCAAAAGACG CTATCGTAAA AGCTCGTTAT AGCAGGCATA AAAGTGGACG AGGTTCGCAA AAACGGTTTC CAGACGTCAA AGCTCCAAAA GGTTTCAGAT TTGAAGGACC GTGGGAGCTC GACAAACCAC CAAATCTGGT GAATGACGAC GGTTGGGCGT ACGGTGCTAT TTGGCTAAGG AAGTGGCCAC CGCCTCGTGG ATCAGACAAA AGTAAAGGTC GTGCAACCCG TCGCCGACGA TGGTTTCGTC GTGTCGTTCG ATTGAAATCA GCTGAAGAAA TTGAATCGCG CGTTTCCCTT CATAACATGA AGCGAGATTT CATCGAAGAT TGGCACGGCA CGTTGGCGGT ACACGGACGT CTTACGCTAC CTTCTTGTTC ACGAACCGAG TCCTACATGT TGGCGTTACG CACGGGGGCT GACGTCGCAT CAATTACCGC TCCGTCGTTT ACCGAGAGAG ATGACGCGCT GGATATTCGT GACGTGTTAT CAGAGCAAGG CAACTGTGCG CTTGCTTACA ACAACCTGCA CGATCACTTT CTTTTGGTCA AAGAAGAATT GTCGAATGAT GTGCAAAGTA GCGCGATAGG TGCTATTCCA GGCGTCAAGA TTCGTGTCGT TGCCCCTGTT CACGTTGTCT CATTGATGGA TTGTCCGTCG CAAGTTGTTT TGTACGCCAA TGGCGAAGAA AGAATTGCCA TGCATCTTGC CGCGAACGAA ACGAAACGTA TCACGGCGAT CGATGCAAAC TGTGATACAA TAGAGGTGCT CGTGTCTTTG TTCGAAGATT ATCTCAGCCT TAGTAACGAC AGGCCGCTGC GTCTAGATGT CACCCCGGCT GGAAGTCCAC CGGCGCCACA GGTAAACTGG TTGAACATCA AAGGCTATCC GTATGTGGTT TCTATGCTGT TGATGGAGAC GTGTAGAACT AGAATGGCGG GTGATAAATT CAACGATGAT CCTTCGCAGC ATCTCTTGAG TCTGACGTAT CGCAATGTGC TGATGGTTAC AAATGCATCA CCGATCGACA TTAAGTGTTT GGCGTGGGTC GCTGGCACAA GTACGACAAA CGCGCACATA GATGCATACG TTGGGGTACC CAAAGGCGAG TCGCAACCTG CCATGTTTCT GAAATCGAGT GCGAAAACCC TCGCCAAAGT TTCGAGCAAA GAATTTGTGC TCGGTTTTTG TATCGATTCG TCTCCAGTAC TAGTGAAAAT TTCACCAAGT TCTGAGCCCG TGATGTTTAC GCTTATGACA GCAAGTGGGG ACGAACTTGC CTTTCAATGT TCCGTAGTAT TATGGGAGGG TCAAAACACC AACGCCGCGA CATGGCCAAC GCTTGAAGTT GTTTTACAAC CCGTGCTCGT GGCGATCAAC TTGACTCGAA TTCCTCTCGC GATTCGCTCA AACACTATGG AACACAAGAT ACTCGAGCCA GGTTCATATC CCACGCCGTT TTCTTTGGAT ATTAAGTCTT CGAAAAGCGT AGACGAAGTT GAAGCCGTGC TTAATCAAGC CGCGATTCAA ATAGCGGATA TGTCGTGTCA TTCGGAAGGA AAAGATATAC TTTGGAGTTC GCCGCTGCAG AATTTGGTGC GCTATTCTGG AGCGATGTGG AGCATCCCAC AAGGTGCCTC CCCCTTGGCG GATGAAATTA ATGCATTTAG AAGACTTGGA GGTATCGACG AGTCTGGCGA AAACGTAAAT TTCAAGGCAC CTATCATCGT TCAGTTTCTT GTCGAGCGCG GTGAGCACGG TTGCTTCAGG GTCTTTTTTA GGGGGGGCGA CGGAGCATCT CTGCAGAAGG CGCCAATCAT GATCAAAAAC CATTTGCACG AACCTTTACT CATTCGACAA GTGAAAGCGA ACGACTTTGA GGACGTAGAA AAGAAGTCGG TGCAAAAGTC CACCGTGCGA CTACTGCAAG GTATTAAGAA AGCCTCACAC GGCGTAGGAC AGTCTTTTGC AATTCGCCCG TTCTCAGCGA TATCTTGGGC GTGGAGCGTC CCAGGTATTC CTATGAGATT GGACAACTTG AAAAAACAGG AGCTCGAGGC AAACGCCGCG GAGCTCGCAT TTCGAAAGCG GAAATTTTTG GAGATATCGA ACAATACAGG GGATCACGTC ATCATAGAGA TAAACGCGTT CAGCTCGCGA TTCAAGGGTT TCATGCACCT CGGCGACATG CAATTTGGAG AACTAGGCAA GCGCACGTTA CACGCAACGA TTATTGGTGA ATGGCGTGAC GCATCGTTCG TGATCTTCGT GGTTGATAAA ACGATAACGT TTGACAACTT TTCTTTCACG ACTGAAATGA CGAATCAAGC GAAAGCATTG ACGTTGTACG CGCGGCAGCA AGAAACGCAC ATCTCTGTCA ATTTGGCTGG CTTCTCCGTA ACAATCATCG ATCGCCTCCG CGACTCATTC GTCGAGTTAT TCCATCTCAC GATCGACGAC GTGATGATCA GGCACGGAAT CAACTTGAAT CAACAGCAGT TCTGTCTATA G
|
Protein sequence | MPNFIMKFDV PLVSVGLSFE RLFRNQMLFL LRLEKTKLSC TKTSKLNLSL DSSLSLLHRN TRLGYSLEPI VAPWKFSVLL DDSKNDVSGV NPAGLDMFIE SKKKIEIIVN QASIVDAVDA VMQFKKPRDK NSKIAHLLAY EAVQNNVIVN STGRDLWVRE LNGNIQEVPP GNSISHLVHA AETEANSGFE SASNEQTMRG MTDVVDDMNH HGDEQLIIAY LMMDIVAIDS LDDWLYPRIT FKIWDEALQG HRTDPLPIGV TRGSAKLPIP HPFFHRDSDS EFINNDDITI IVEISHIGPN GETLSFSGDF EAPIEQQRGG WIGGKTGSWI DCAAGNGDSV RIKLRARVVE ELYQLLKSPQ KTIIKSKLQV ESPPLIVCRT NDMVKQWWTR DHQQSKSALS AWKPHVPSTY ELETRYKFCS SRAENEYKLV PFGTILLPGL IAPRSALMAV VLEHNSDQRA PTAFPSGFEN IWTSDKNDVS FWKPIAPDGY VAVGNVVAAS AEMPSTDCVV CVREDLTKLA EVPPDVAWKS NRRFRNNDER ELALIQNGQL STGGWSFLKE KIKRTSPGVI SNVIPSMHRL NDRMCIHSLD DEGIREFWID NRTHIDHIKQ ELQAGESILA ISFSPSGPFE QVRLNLGASH VVRHKTHDLV FDSQKGHLLS YVSCENETDS DIFVDVHPAM KRELTTSNLD ISLSEKYESP SDVADIEVFE SERYYPLRGW RAPKDAIVKA RYSRHKSGRG SQKRFPDVKA PKGFRFEGPW ELDKPPNLVN DDGWAYGAIW LRKWPPPRGS DKSKGRATRR RRWFRRVVRL KSAEEIESRV SLHNMKRDFI EDWHGTLAVH GRLTLPSCSR TESYMLALRT GADVASITAP SFTERDDALD IRDVLSEQGN CALAYNNLHD HFLLVKEELS NDVQSSAIGA IPGVKIRVVA PVHVVSLMDC PSQVVLYANG EERIAMHLAA NETKRITAID ANCDTIEVLV SLFEDYLSLS NDRPLRLDVT PAGSPPAPQV NWLNIKGYPY VVSMLLMETC RTRMAGDKFN DDPSQHLLSL TYRNVLMVTN ASPIDIKCLA WVAGTSTTNA HIDAYVGVPK GESQPAMFLK SSAKTLAKVS SKEFVLGFCI DSSPVLVKIS PSSEPVMFTL MTASGDELAF QCSVVLWEGQ NTNAATWPTL EVVLQPVLVA INLTRIPLAI RSNTMEHKIL EPGSYPTPFS LDIKSSKSVD EVEAVLNQAA IQIADMSCHS EGKDILWSSP LQNLVRYSGA MWSIPQGASP LADEINAFRR LGGIDESGEN VNFKAPIIVQ FLVERGEHGC FRVFFRGGDG ASLQKAPIMI KNHLHEPLLI RQVKANDFED VEKKSVQKST VRLLQGIKKA SHGVGQSFAI RPFSAISWAW SVPGIPMRLD NLKKQELEAN AAELAFRKRK FLEISNNTGD HVIIEINAFS SRFKGFMHLG DMQFGELGKR TLHATIIGEW RDASFVIFVV DKTITFDNFS FTTEMTNQAK ALTLYARQQE THISVNLAGF SVTIIDRLRD SFVELFHLTI DDVMIRHGIN LNQQQFCL
|
| |