Gene Information |
Locus tag | OSTLU_87084 |
Symbol | |
ID | 5001241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 694677 |
End bp | 697670 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | |
GC content | 60% |
IMG OID | 640416662 |
Product | predicted protein |
Protein accession | XP_001417570 |
Protein GI | 145346178 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase |
TIGRFAM ID | |
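The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the gene is a stop codon (the listed gene sequence ends in TAA). The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 694677
end_bp = 697670

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced (Is gene spliced: No) and its last codon
# is a stop codon, which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (2994 bp) and Protein Length (997 aa) fields.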
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.019106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGAGCG CGGAAGCGCC GTCGCCGCCG CCGCCGCCGC CACCGCCACC GCCACCGCAA GTCGAACCGA TCAAAGTCAC CGAAGACACG AGTTTCGCGG CGCTGAACCA GTGGCGCGGT CAAGAAGTGA AATTACAAAC GCACAAAGGC GGCGGCGGCG AACGTAACCT TCAGTGGAAC ACGGATAATC TCGCCGACGC CGCGCGCCGT ATCGTCGAGG GCGATCGCGA CTCTTCTTCA TGGAGACAGA AACTGCAAAT GGTTGAGAAG CTAATTTGCG ATGACGGCGC CGACGTTGAC TCTTTGGCGT ACGCTACTGT GTATTTATTT TGGATATCTG TCGGCGCCAT CGCGTGCGTC GAGGACGGTA CGCACTACCG TCCAAACCAC CACGCCGGTT CGGCTGAACG TATGTACGGC GCAATCGAGG CTGCGGAACG TTTCGCGAAC GATGTTGCGA GTGGTGGTGA TATTTACCGC GCGCGTGAGC TTCGTGCGTT GATTCGCCGT CTGCATCCAC GACTCCCCGC GTTCACCGCC GAGTTCACGC AAAGCGTGCC GCTGACGCGC ATTCGCGACA TCGCGCACGG CAAGGGTGAT CAACATGGGA AATGTCGCGA AGTTCGACAA GAAATCAAGC ACACGATTCA GAACAAACTC CATCGCTGCG CTGGTCCCGA GGATCTAGTT GCGACGGAAT CTATGCTCGC CAAACTCACC GCTCCGGGAA CCGACTACCC CGAAGAGTTT GTCAACGAGT TTAAGATTTT CTATCGCGAG CTCAAGGAAT TCTTCAACGC CTCCTCTGTG GCCGATCGCA TCGATCGTAT CGCGAACGAA AATGGCGCCC CTGGCCGTGC GGCCGATAGC TCAAAGAAGT TTCTCTCTGC GAAGGCGACC GTGGATGCGC TTCCTGCGAG CGACCGCGTG GGCGATCAAA CGACTATGAG CGCGCTCGTC GCGTGCTTGC GTGCTATTCA CGACGCGCGA ACGGACATCA CCGCCGCCTT GGAATCGGGC GGGGATTTGG GTCAAGCTGA ATCATCGACG CGTCAACAGT GGCGCTTGGC CGAGGTGAGC ATGGAAGATT ACGCGTTCGT GTTGTTGAGC CGATTGCTCA ACGCGCTCGG CGCAGAGTCT GAGCCGCCGC GAAACGACAT CAGCGCGAGC GAAGTAAAAC TCACTCTTGA GGCGCTGGCT TTGACTTCTC GCACCATGGC GCTGAGTGCA GGAGGCGATA ACGAGCTAGA GGCAATCGCG TCCGAAGCCG AAGCACTGGC TCGTAATGGT TTGCCCGCGG GCGAGGAAGG CGGTTTGCGC GTCCAAGCCG TCGCCGAACG CGCTCGACGC GGCGCCGTCG ACTTTTGTTC GCTGTTGGAA TCTTTATTCG ATGGACGGGC GTCGAGTCTC GGCAATGCGC TCGGTATTGA CCACGGCTCA ATCAGTGTGT TCACAGAAGG TCAGATTCGC GCGAGCGTAG TGTTCCAATC CGCCAAGATC GCTTCCTTGT TGCTGCGAGT CAGCCGGCAA ATCACCGGCG CCGCTGGATG GGATTGCGTC GTGCAAGGCG AGGCTATCGG CGCGCTCAAG TGCGTCGAAA GGCTCACGCC CGAAGAGTGC GCGCAGTTCA CCGAGCCAGT AATCGTGCTC GTTGCTAGTG CTGATGGTGA TGAAGAAGTG TCGACGTGCG GCCCGAACGT GCGCGGCGTG GTGCTGTGTC ACGCGTTGCC GCATCTCAGT CATTTGGCGC TTCGCGCGCG TCAAGCCAAA GTGCCCCTCA TCGCCGTCGA AGACGACAAG 
CTTGTCGACT ACGCGCGTTC TTTGGCGAAT GAACCTGCCG TGAAGCTCAG TGCGGAAACC ACTGGAATTA AGCTCGAGCC AACGACGGCT CCGGCGTCTG TTGCGGCTGC TTCAAGCGAA GCGGGGCCAC AGGCGACGAA ACCCGTGATC CGCCTCGACA CTGATCTCTC AAGAGCGGGA ACCGTGTTCG ATCTCGTCGC GCTGGACAAG CGTGGACTCG AAAAGTCGAT TCGCATCGCC GGTACGAAGT CCGCTATGTG CGCACGATTG AGCACTATCG CCGAAAACTC TTCTGGATCG GCGGCGTTCG CCGCGCCCGC CGGTGTCGTC ATTCCATTCG GCGCAATGGA ATTCGCGTGC GCGAGCATCA GCAAGCTCGA ACATCTCGAC AGTTTGCTCC TCGAACTCAA CCAGTACGCG GACGACCCAG TGAGGATGCG ACACACGTGC GAAGCCATCC AGAACCTCGT TCGTTCGCTC AAGCCGTCCG CGAGCGCGCT GCAATCCGTC GCTGAAAAGT TTGGCCCGAA TGCGCGCGTC ATGGTTCGAA GTAGCGCCAA CGTTGAAGAT CTCGAGGGGA TGTCCGCGGC TGGTTTGTAC GATTCCATCC CAAATGTCGA CCCGAACTCG GAAGACGCAT TCAGTCGCGC TGTTGGCGAG GTATGGGCGT CCCTGTACAC CACTCGCGCC GTGGCTTCTC GCGCCGCCGC CGGCGTCGAT CAACTCGAGG CGCACATGTG CGTCCTCGTC CAAGAGATGC TCTCGCCCGA GGTCAGTTTC GTTCTACACA CGAAGCACCC GCTCACAAAT GATAATAACG AAGCGTACGT CGAGTTTGCG CTCGGTTTGG GCGAGACTTT GGCGTCGGGC GCGGTTCGAG GATCGCCCTG CCGCGTGAGC GTCGACAAGC GATCCGGCAA AGCGACGGTG AATGCGTTCG CCTCGTTCGG AACCGCCCTC GTCCGCGATG ACGACTCGGC AACCGGAATG AAATCTGTCG CCGCGGATTA CGCATCCCAC TGGCTTCACA ACGACGTCGC GAAGCGCGAC GAAATCGCCA CCAAACTTCT CGCCATCGGC TCTGAGCTCG AGCGCGAGTT GAGTCCGCGC GGCGAGACGC TCCCGCAAGA CGTCGAAGGC TGCATCCTTC CCTCTGGGGA AATTTGCATC GTCCAAGCGC GCCCGCAGCC CTAA
|
Protein sequence | VPSAEAPSPP PPPPPPPPPQ VEPIKVTEDT SFAALNQWRG QEVKLQTHKG GGGERNLQWN TDNLADAARR IVEGDRDSSS WRQKLQMVEK LICDDGADVD SLAYATVYLF WISVGAIACV EDGTHYRPNH HAGSAERMYG AIEAAERFAN DVASGGDIYR ARELRALIRR LHPRLPAFTA EFTQSVPLTR IRDIAHGKGD QHGKCREVRQ EIKHTIQNKL HRCAGPEDLV ATESMLAKLT APGTDYPEEF VNEFKIFYRE LKEFFNASSV ADRIDRIANE NGAPGRAADS SKKFLSAKAT VDALPASDRV GDQTTMSALV ACLRAIHDAR TDITAALESG GDLGQAESST RQQWRLAEVS MEDYAFVLLS RLLNALGAES EPPRNDISAS EVKLTLEALA LTSRTMALSA GGDNELEAIA SEAEALARNG LPAGEEGGLR VQAVAERARR GAVDFCSLLE SLFDGRASSL GNALGIDHGS ISVFTEGQIR ASVVFQSAKI ASLLLRVSRQ ITGAAGWDCV VQGEAIGALK CVERLTPEEC AQFTEPVIVL VASADGDEEV STCGPNVRGV VLCHALPHLS HLALRARQAK VPLIAVEDDK LVDYARSLAN EPAVKLSAET TGIKLEPTTA PASVAAASSE AGPQATKPVI RLDTDLSRAG TVFDLVALDK RGLEKSIRIA GTKSAMCARL STIAENSSGS AAFAAPAGVV IPFGAMEFAC ASISKLEHLD SLLLELNQYA DDPVRMRHTC EAIQNLVRSL KPSASALQSV AEKFGPNARV MVRSSANVED LEGMSAAGLY DSIPNVDPNS EDAFSRAVGE VWASLYTTRA VASRAAAGVD QLEAHMCVLV QEMLSPEVSF VLHTKHPLTN DNNEAYVEFA LGLGETLASG AVRGSPCRVS VDKRSGKATV NAFASFGTAL VRDDDSATGM KSVAADYASH WLHNDVAKRD EIATKLLAIG SELERELSPR GETLPQDVEG CILPSGEICI VQARPQP