Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24634 |
Symbol | |
ID | 5002240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 66252 |
End bp | 69281 |
Gene Length | 3030 bp |
Protein Length | 974 aa |
Translation table | |
GC content | 57% |
IMG OID | 640417661 |
Product | predicted protein |
Protein accession | XP_001418363 |
Protein GI | 145347829 |
COG category | [R] General function prediction only |
COG ID | [COG5038] Ca2+-dependent lipid-binding protein, contains C2 domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0466541 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCG GCGCGGCGCG AGGGACGAAC GAAAAGGGTC GCGCGCGCGC GCCGACGACG CCGCGCGCGC GCGACGGCGA CGGCGCGCGC GACGCCGCGC GAACTGACGC CGGAGACCGA CCGGATGCCG AACGTCGCGA CGACGACGAC GACGACGACG TCGAGCGCGC GCGGAAGCGA CGGGCGACGG CGCGAGCGGC GCGATTGGGA TTTTTCGGCG CGCTCCGTAA GGCGGTCGAG CGAAACGCGC TGATGATGGA GAGCGACTCG AGCGATGATT ACGATTCGGA CGAGGACGCC GCGCCGGAGC GCGCGCGCGC GGAGCGACGG CAGAGCGCGT GGGCGCGCGC GGTGGGAGAA TCGGACGCGG ATGAGCTGGA GAAGATGATG TCCGACCTGT CGGGGGCGAG GGAGATGGCG ATGCGGATGC GGTCGGACGT CGGGGGCGGC GGCGACGCGG AGGAGGACGC GACGACGGCG GAGCGCGAAC GCGGGGCGAA GCGCGAGGTC GAGGCGACGC CGGCGACGTC GGAGACGAAG CGAGGCGACG AGGACGACGG TTCGGGGGGG ATGGCGATCG CCAAGAGAGT CGCGTACGAG GTCGGCGCGA CGGGGGCGGC GATGTTTTCG TCGGCGGGCG AGGCGACGAA CGAGGACGAG GACGGTTCCG CGGGTTCTAT TTTCGTCGAC GTTCGCATGA TCCAGTGCTT GAGCGGCTTA TTCATACGAG TGAACGTCGA ATCCGGGAGC GGTTTGCTCG CCATGGACGC GGGCGAGACG TCGGACCCGT ACGTGAAGGT GTCTATCGTC AACAGCGCCG GGCTCGCCGT CGACGGTCAA GTACACCGAA CGGGGTACCG CCCGAAAACT ACGAATCCGG TGTGGAACGA GTCGTTTTAC ATGGGTAGCG AAAAACTGAG ATATAAGGAC TGCTCGTTGA AGTTTGAAGT GTACGACTTC GATCTCATGT CAGCCGACGA CGCCATGGGC TCGGCGACGT TGCCTCTGAC GCATTTCGCG AAAATGAAAG CCAACTTTAA AGACGTAAAG TCAGTGAAGG ACACCGACGC AAAGGTGATC GATCCATCGC CTTCACTCGC CGACGCGCTT TCAAAGTCAC CATCGCAAGT GCCCGCGGCG GCGTTACATT TGCACAAAGT AAAGAACACG CCTGAAAACC GGGCACCGAC GCACTTTGAC GACGGATACT TCGTCGAACG CAACTCAAAC GAGCTGGTCG TGCACTTTAA GCTGCGACCT CCGCGGACGT CGTCGTTCAT ATTCGAAGAA GCCAAAGCGA TGTTGTCCTT CGCGAAGCAG CTGGTCGACC CACGAAACTT TGGAAAGAGC TTCAACCAAA CTGGAGTCGG TCTTTGGCTA AATAGCAAAG TCGAACAAGC CGTCGACGTT GGAAAGCGCC GTGCTTTGAA AGTCATCGAT GCGACGATTG AGGAAAAGAA GAATAAGGTT TGTGAACTCG CCGTGGCGGA TCGCGATATG CCGTCGGTGA TCCGAAACTA TCTCGCAGGC GTTGTGAACA TGTACATCTC GGATATTCAG CAGGGGCTGA TGAGCGACTT GGGGATTCGT TTGAAAATTC TAGACAGCGC TAAATCGCAA GGATCCGCTC AAGCAGATCA AAATGACGCA GAGCAGGTTA AACGCAGGAG CGTTCCACAG AGCTTCAGCA TGAAGCGTTC GGTGTTTGGA TTATTTGAAT CTATTCGGTG CTGGTACCTT TACAACGAAG TTCCGGGTGA TCTCTCCATC TTCGGTAAGG TGCGAAATCC TTATTGGTGG TTGTTCCTCG TTTCAAAGCT CTACTTTGGT CTCGGTCTCC AAGCGTTTGT GTTCTTCATT CGTCTGGCTC TCGTGGATAG GCGAGACGAG TGGCAGATGT TCGAGTACAT CATGGTTTTC AAGGGAATTC AGTTCATTTC GGGAACGATT TCCGTCTTCG CCGGCGTGTT CGCGTTCATC CAGTGTGCCG GCGTACGTGA CGCTGGCACC ACGCACACGT GTGACACCAC AGGACCATGG GTAACAGAGA TGCAGTCGTG TGCGATTGAT TCGAGCACTT GTGTCGCGCT CAATGTCGGC GCGTACCTCG TACGTATCAT GATGTGCTGG TACGCGTTTC ACAAGCTCCG TCGTTCGTTC GCGTTTGGGC GCGCGATCGC GAGCGATCAC AGATTGGTTG GTGCTAGAAT TCGAATCACA ACCGTTGGAA TCGGTTCACG GTATAAGACC TACGGCGCGT TGAAGAAGGC TTTGAAGTAT ATTTTCGTTA AGCGCAAAGA AACGCCGCTC GAACGGTTCC GTCGAATCGT GAATTTGCAG CTTGAACGTC TGGCGGCGGC CAAAGGTATT CGTGGTGGTA CGGCACGCGC GCGAAATGCG TTCAAATCAT TTCATTACGT TCACCGCATC GCCAAAGTCA AGGACTACGA TGTTGCGAGC GGAATGCACA CGTTGTACTA CAAGGATGAC CTTACGCACC AGCGACATCA AATCGATCTC GGCACGGTGA CGTATACGGT TATGAAGCTC AAGCACATCC AACCGCGCCG CGTACAGCGC ATTCTGTGGA TCTACGAACT TGTCACCTTT GCGGTCACCG TTGGTTGCTC CATCCGATTT CTGGCGTGGG TCGACTGGGG TCGCGGCGAA ACCTGGCAAC TTTACGGCGT CGCGTTTTGG GGGCAGACTT TGTACAACCT TCTCGCGTTT CCATTCATTT TGATGGTCAT CCCGGGCGTG AACAAGCTCA TCTGTCACGC CCCGAAGACT GGATACGACA GAAATGGCAA CTTGAAACGC TTCAGAAAGC GTGCGACGTT CGAAGAAGAC ATCGAAGACG AGCCCAACCC GCGTTCGCGG TGCGCGCCGG CGTGTTACCC ATTCGTCAAG CCCTGGCGCG TATGACGCGC GCGACATCGT CGACCCATAT TTATCTCTCA TCAATAACCA ACATCATCGA TAACAATAAC CAACATCATC GGTACTGTAT CGCGTACTGT AATAATCATA
|
Protein sequence | MARGAARGTN EKGRARAPTT PRARDGDGAR DAARTDAGDR PDAERRDDDD DDDVERARKR RATARAARLG FFGALRKAVE RNALMMESDS SDDYDSDEDA APERARAERR QSAWARAVGE SDADELEKMM SDLSGAREMA MRMRSDVGGG GDAEEDATTA ERERGAKREV EATPATSETK RGDEDDGSGG MAIAKRVAYE VGATGAAMFS SAGEATNEDE DGSAGSIFVD VRMIQCLSGL FIRVNVESGS GLLAMDAGET SDPYVKVSIV NSAGLAVDGQ VHRTGYRPKT TNPVWNESFY MGSEKLRYKD CSLKFEVYDF DLMSADDAMG SATLPLTHFA KMKANFKDVK SVKDTDAKVI DPSPSLADAL SKSPSQVPAA ALHLHKVKNT PENRAPTHFD DGYFVERNSN ELVVHFKLRP PRTSSFIFEE AKAMLSFAKQ LVDPRNFGKS FNQTGVGLWL NSKVEQAVDV GKRRALKVID ATIEEKKNKV CELAVADRDM PSVIRNYLAG VVNMYISDIQ QGLMSDLGIR LKILDSAKSQ GSAQADQNDA EQVKRRSVPQ SFSMKRSVFG LFESIRCWYL YNEVPGDLSI FGKVRNPYWW LFLVSKLYFG LGLQAFVFFI RLALVDRRDE WQMFEYIMVF KGIQFISGTI SVFAGVFAFI QCAGVRDAGT THTCDTTGPW VTEMQSCAID SSTCVALNVG AYLVRIMMCW YAFHKLRRSF AFGRAIASDH RLVGARIRIT TVGIGSRYKT YGALKKALKY IFVKRKETPL ERFRRIVNLQ LERLAAAKGI RGGTARARNA FKSFHYVHRI AKVKDYDVAS GMHTLYYKDD LTHQRHQIDL GTVTYTVMKL KHIQPRRVQR ILWIYELVTF AVTVGCSIRF LAWVDWGRGE TWQLYGVAFW GQTLYNLLAF PFILMVIPGV NKLICHAPKT GYDRNGNLKR FRKRATFEED IEDEPNPRSR CAPACYPFVK PWRV
|
| |