Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_51006 |
Symbol | |
ID | 5004725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 480016 |
End bp | 483049 |
Gene Length | 3034 bp |
Protein Length | 1007 aa |
Translation table | |
GC content | 55% |
IMG OID | 640420146 |
Product | predicted protein |
Protein accession | XP_001420862 |
Protein GI | 145353090 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.381291 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0264006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAGG CGGTGTTCAG TGGGAAGGAT AAGGCGAAGA TAACGTCGCA TAAACGGACG GGGTCGTTGT TCGCGAGCGA GGAGGGCGAG GCGCTGGACG CCCTGGCGCG GTCGTCGAGT TATCTGAGCG GTAGGGGAGA GACGAAGGAG TTCAACGCGC ACGACGTGAT TGAGGAGTGT GATGAACTGC TTCGGACGAT CTTCTTCGCC GTGGTGAGGG AGACGGCGGG AGATAAGTTT TTGGGGCAGT TGAAGAGCGT GTACGAGGCG AGCGAAAAGT TTGGCAGCTC GCACGACCCG AAGGATTTCG ACGCGATGCA GGCGATGTTG GAGACGATGG AAGTGGATGA ATCGTTGCAG TTCGCGTCGG CGTACTCGAA TCTGTTGAAT CTGCACAATA TTTCCGAGCA AGTGGCGAAC GCGATGGAGG AAAGACACAG ACGTTTGGAC GATATTCCGC GAGGGCCGGC AAAGACGACG AACGGCGCGA TTAAAGGTTT GCTTCGAGCC GGGAAGTCGA CGGAGGAAAT CTACAGCGCT TTAGCGGTGC AGCACGTTGA TTTAGTGCTC ACCGCCCACC CCACGCAAGC CTTGAGACGG TCGATGTTGA AATCTTTCGG AATAATTCGT GAAAAGCTGT TGCAGCTGCA ACGATTTCGC CTTTCGCGAT ACGAGCGTGC AGAAGTTTTG GACGAGATTC GATCGAAGGT CGCCTCGGCG TGGCGCACAG ATGAGATCCG CCGCACGCCA CCGAAGCCTC AAGACGAAAT GCGCGCGGGT CTGACGTACT TTCAGCAAAC CATCTGGGAT GGCATCCCAA CTTTCATGCG TCGCGTGGAC ACGTCGCTCT TGGCAAACGG ATGCCCTCGT TTGCCGCTCG ACAGATCCAT CGTTACGTTC GGGTCGTGGA TGGGTGGTGA TCGTGACGGC AACCCATACG TCACGGCATC ATGTACGCGC GACGTCGTTC TCTTGGCCCG TGTGCAAGGG GTTAACCTAC TCTTCCGAGC GATTCAGCGT CTGATTTTCG ATTTGTCCAT GTGGCGATGC AACGACGCCG TCAAGGCGTT GGCAAAGGAC ATCTTGGAGA ACTCGGAAAC GGACAACTTT ACGATCTTCG AAGAGCGCAA GAAGCGAAAC TACGATGACT TTTGGAAGGC CATTCCCGAG CACGAGCCTT ACCGCGTCAT CTTGGCTGAA CTTCGAGACA AGCTTTACAA CACGCGAGAG GCGTTGCAGC GTTGCATCGC GGACAACGAC GTCAACATCG ACATGAACGA CGAGACTATC ATTCGTTCCA AGGACGAACT GTTCGCACCC TTGGTGGTGT GCTACGAGTC TCTCATCGAA GTCGGTGACG CTCAAATTGC CAACGCCTAT CTTCTCGACG TCATTCGCCA AGTGCAGTGC TTTGGGTTGG GGCTGGTCAA ACTCGATATC CGGCAAGAAT CTGATCGCCA CGCCGAGGCT TTGGACGCGG TGACCAGGTA CATCGGCCTC GGATCGTACC TCGAATGGAG TGAGGAACAG AAAATTGAGT TCCTGACACG CGAGCTTGAG AGCAAGCGGC CACTTTTACC TTCCGACCTC GAATGCTCGG ACGACGTTCG AGAAGTCCTG GACACGTGTA AGATGATTGC GCACTTGCAA CAAACGTGCC CGGGTGCCCT CGGAACGTAC GTTATATCCA TGGCGACAAG TGCCAGCGAT GTTTTGGCAG TTGTATTATT ACAGCGCGAG TGTGGCTGTC GCAAACAGGA TCTTCTTCGC GTTGCACCGC TCTTTGAGCG ACTCGACGAT CTGAACGACG CCCCGCGCGT GTTGCGCCAG CTCTTCTCCG TGAAGTGGTA TCACGACCAC ATCGCGGGAT TCCAAGAGGT TATGATCGGG TATTCGGATA GTGGAAAAGA CGCTGGTCGC ATGGCTGCAG CATGGGCGCT GTACGATGGT CAAGAACGCG TCGTGGCGGC AGGGAAGGAG TTTGACGTTG CGCTCACCCT GTTTCACGGT CGCGGAGGCA CCGTAGGTCG CGGGGGAGGT CCAGCGCACA TTGCCATGTT GTCTCAGCCT CCAGGCACTG TGAATGGTAG CATCAGAGTC ACCGTGCAAG GAGAAGTGAT CGAAACCGAC TTCGGTGAAA AAGAAAACTG TTTTCACACT CTGGATTTGT ACACGGCGTC CGTTCTGGAG CACACGTTGA AGCCGCCGGC GCATCCTCGC GATGAATGGC GCCGAGTCAT GGATAGAATG AGCGAATACT CATGCGCGCA TTATCGTAAG ACTGTGTTCG AAACCCCTGA TTTCGTCGGA TACTTTGCAC AGGCCACGCC TGGAGCCGAG CTTGGATCGC TCAACATCGG TTCTCGACCG GCCAAGCGTA AACCGAGTGC GGGCGTCACC GCTCTTCGAG CAATTCCGTG GATTTTCGCG TGGACGCAGT CGCGATTCCA CTTGCCAGTC TGGCTCGGAA TTTCCACATC ATTCAGACGC TTGATCGACG AAGGCGAACT GGAAACGCTG AGGGACATGT ACAAAAGTTG GCCTTTCTTT GAGGTGACGA TCGATTTAGT GGAGATGGTG CTCGCCAAGG CGGATCCCGT CGTCGTGGCG TATTACGAAA GAGCCCTTGT CGATCCAAAA TTGCACGACT TCGGCGCCAG TCTGCGCGGC GAGCTTCAAG AAAGCATCGA TTGCATCCTC GCCGTCTCCG AGCATATCGG TTTGCTCGCC AAACCAGAAA AAGTCGAGGC GAACGAGGCG GTTCAGGTGC ATAAAAAGCT CGCTCACAAA CTTCACAAGC GGTCGCTGTA CATCACGCCG CTCAACGTGT GTCAAGTGCG ATATCTCATC GCCGCTCGCG CGCTTGAAAA TGAAGAAGAT GGCGATAAGC TAAGCATGCA AAAAGTCAAG ATCACCTTGC TCGAGGGTTA CCCGTTCCAA GATTACAATT ACAAGGGCGC GGTGAACGAC GTTTTAAAAA TCACGATGAA GGGCATCGCA GCCGGGATGC AAAACACTGG GTGACAACAT CGAA
|
Protein sequence | MLKAVFSGKD KAKITSHKRT GSLFASEEGE ALDALARSSS YLSGRGETKE FNAHDVIEEC DELLRTIFFA VVRETAGDKF LGQLKSVYEA SEKFGSSHDP KDFDAMQAML ETMEVDESLQ FASAYSNLLN LHNISEQVAN AMEERHRRLD DIPRGPAKTT NGAIKGLLRA GKSTEEIYSA LAVQHVDLVL TAHPTQALRR SMLKSFGIIR EKLLQLQRFR LSRYERAEVL DEIRSKVASA WRTDEIRRTP PKPQDEMRAG LTYFQQTIWD GIPTFMRRVD TSLLANGCPR LPLDRSIVTF GSWMGGDRDG NPYVTASCTR DVVLLARVQG VNLLFRAIQR LIFDLSMWRC NDAVKALAKD ILENSETDNF TIFEERKKRN YDDFWKAIPE HEPYRVILAE LRDKLYNTRE ALQRCIADND VNIDMNDETI IRSKDELFAP LVVCYESLIE VGDAQIANAY LLDVIRQVQC FGLGLVKLDI RQESDRHAEA LDAVTRYIGL GSYLEWSEEQ KIEFLTRELE SKRPLLPSDL ECSDDVREVL DTCKMIAHLQ QTCPGALGTY VISMATSASD VLAVVLLQRE CGCRKQDLLR VAPLFERLDD LNDAPRVLRQ LFSVKWYHDH IAGFQEVMIG YSDSGKDAGR MAAAWALYDG QERVVAAGKE FDVALTLFHG RGGTVGRGGG PAHIAMLSQP PGTVNGSIRV TVQGEVIETD FGEKENCFHT LDLYTASVLE HTLKPPAHPR DEWRRVMDRM SEYSCAHYRK TVFETPDFVG YFAQATPGAE LGSLNIGSRP AKRKPSAGVT ALRAIPWIFA WTQSRFHLPV WLGISTSFRR LIDEGELETL RDMYKSWPFF EVTIDLVEMV LAKADPVVVA YYERALVDPK LHDFGASLRG ELQESIDCIL AVSEHIGLLA KPEKVEANEA VQVHKKLAHK LHKRSLYITP LNVCQVRYLI AARALENEED GDKLSMQKVK ITLLEGYPFQ DYNYKGAVND VLKITMKGIA AGMQNTG
|
| |